hanchenye-llvm-project/libclc
Aaron Watry 0bf96b1712 relational: Implement shuffle2 builtin
This was added in CL 1.1

Tested with a Radeon HD 7850 (Pitcairn) using the CL CTS via:
test_conformance/relationals/test_relationals shuffle_built_in_dual_input

v2: Add half support to shuffle2
    Move shuffle2 to misc/

Signed-off-by: Aaron Watry <awatry@gmail.com>
Reviewed-by: Jan Vesely <jan.vesely@rutgers.edu>
llvm-svn: 312404
2017-09-02 02:23:28 +00:00
..
amdgcn/lib amdgcn: rewrite barrier() using fence and clang __builtin_amdgcn_s_barrier 2017-08-16 17:09:00 +00:00
amdgcn-amdhsa/lib amdgcn-amdhsa: Add get_num_groups implementation 2016-09-16 22:43:31 +00:00
amdgpu/lib Replace nextafter implementation 2016-09-08 16:37:56 +00:00
build configure.py: Make python3 friendly 2017-08-02 15:00:59 +00:00
generic relational: Implement shuffle2 builtin 2017-09-02 02:23:28 +00:00
ptx/lib
ptx-nvidiacl/lib AMDGPU: Implement get_global_offset builtin 2016-07-22 17:24:24 +00:00
r600/lib amdgcn: Fix return type of get_num_groups 2016-08-25 07:31:40 +00:00
test
utils Move BufferPtr into the block where it it being used 2017-02-12 21:33:49 +00:00
www
.gitignore .gitignore: Ignore amdgcn-mesa object directory 2017-02-24 20:32:18 +00:00
CREDITS.TXT
LICENSE.TXT
README.TXT
compile-test.sh
configure.py configure.py: Drop explicit import of int builtin 2017-08-15 22:24:05 +00:00

README.TXT

This file contains invisible Unicode characters

This file contains invisible Unicode characters that are indistinguishable to humans but may be processed differently by a computer. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

libclc
------

libclc is an open source, BSD licensed implementation of the library
requirements of the OpenCL C programming language, as specified by the
OpenCL 1.1 Specification. The following sections of the specification
impose library requirements:

  * 6.1: Supported Data Types
  * 6.2.3: Explicit Conversions
  * 6.2.4.2: Reinterpreting Types Using as_type() and as_typen()
  * 6.9: Preprocessor Directives and Macros
  * 6.11: Built-in Functions
  * 9.3: Double Precision Floating-Point
  * 9.4: 64-bit Atomics
  * 9.5: Writing to 3D image memory objects
  * 9.6: Half Precision Floating-Point

libclc is intended to be used with the Clang compiler's OpenCL frontend.

libclc is designed to be portable and extensible. To this end, it provides
generic implementations of most library requirements, allowing the target
to override the generic implementation at the granularity of individual
functions.

libclc currently only supports the PTX target, but support for more
targets is welcome.

Compiling and installing with Make
----------------------------------

$ ./configure.py --with-llvm-config=/path/to/llvm-config && make
$ make install

Note you can use the DESTDIR Makefile variable to do staged installs.

$ make install DESTDIR=/path/for/staged/install

Compiling and installing with Ninja
-----------------------------------

$ ./configure.py -g ninja --with-llvm-config=/path/to/llvm-config && ninja
$ ninja install

Note you can use the DESTDIR environment variable to do staged installs.

$ DESTDIR=/path/for/staged/install ninja install

Website
-------

http://www.pcc.me.uk/~peter/libclc/