hanchenye-llvm-project/libclc
Jan Vesely 999b1d9426 amdgcn: rewrite barrier() using fence and clang __builtin_amdgcn_s_barrier
Specs require using fences when barrier() is invoked:
"The barrier function will either flush any variables stored in local memory
or queue a memory fence to ensure correct ordering of memory operations to local memory."
and
"The barrier function will queue a memory fence to ensure correct ordering
of memory operations to global memory."

Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu>
Reviewed-by: Aaron Watry <awatry@gmail.com>
Tested-by: Aaron Watry <awatry@gmail.com>
llvm-svn: 311022
2017-08-16 17:09:00 +00:00
..
amdgcn/lib amdgcn: rewrite barrier() using fence and clang __builtin_amdgcn_s_barrier 2017-08-16 17:09:00 +00:00
amdgcn-amdhsa/lib amdgcn-amdhsa: Add get_num_groups implementation 2016-09-16 22:43:31 +00:00
amdgpu/lib Replace nextafter implementation 2016-09-08 16:37:56 +00:00
build configure.py: Make python3 friendly 2017-08-02 15:00:59 +00:00
generic amdgcn: Implement {read_,write_,}mem_fence builtin 2017-08-16 17:08:56 +00:00
ptx/lib
ptx-nvidiacl/lib AMDGPU: Implement get_global_offset builtin 2016-07-22 17:24:24 +00:00
r600/lib amdgcn: Fix return type of get_num_groups 2016-08-25 07:31:40 +00:00
test
utils Move BufferPtr into the block where it it being used 2017-02-12 21:33:49 +00:00
www Update page to list supported targets 2016-02-13 01:02:06 +00:00
.gitignore .gitignore: Ignore amdgcn-mesa object directory 2017-02-24 20:32:18 +00:00
CREDITS.TXT
LICENSE.TXT Update copyright year to 2016. 2016-03-30 22:39:03 +00:00
README.TXT
compile-test.sh
configure.py configure.py: Drop explicit import of int builtin 2017-08-15 22:24:05 +00:00

README.TXT

This file contains invisible Unicode characters

This file contains invisible Unicode characters that are indistinguishable to humans but may be processed differently by a computer. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

libclc
------

libclc is an open source, BSD licensed implementation of the library
requirements of the OpenCL C programming language, as specified by the
OpenCL 1.1 Specification. The following sections of the specification
impose library requirements:

  * 6.1: Supported Data Types
  * 6.2.3: Explicit Conversions
  * 6.2.4.2: Reinterpreting Types Using as_type() and as_typen()
  * 6.9: Preprocessor Directives and Macros
  * 6.11: Built-in Functions
  * 9.3: Double Precision Floating-Point
  * 9.4: 64-bit Atomics
  * 9.5: Writing to 3D image memory objects
  * 9.6: Half Precision Floating-Point

libclc is intended to be used with the Clang compiler's OpenCL frontend.

libclc is designed to be portable and extensible. To this end, it provides
generic implementations of most library requirements, allowing the target
to override the generic implementation at the granularity of individual
functions.

libclc currently only supports the PTX target, but support for more
targets is welcome.

Compiling and installing with Make
----------------------------------

$ ./configure.py --with-llvm-config=/path/to/llvm-config && make
$ make install

Note you can use the DESTDIR Makefile variable to do staged installs.

$ make install DESTDIR=/path/for/staged/install

Compiling and installing with Ninja
-----------------------------------

$ ./configure.py -g ninja --with-llvm-config=/path/to/llvm-config && ninja
$ ninja install

Note you can use the DESTDIR environment variable to do staged installs.

$ DESTDIR=/path/for/staged/install ninja install

Website
-------

http://www.pcc.me.uk/~peter/libclc/