hanchenye-llvm-project

History

Jeroen Ketema 1364d268a4 Implement mem_fence on ptx PTX does not differentiate between read and write fences. Hence, these a lowered to a mem_fence call. The mem_fence function compiles to the “member.cta” instruction, which commits all outstanding reads and writes of a thread such that these become visible to all other threads in the same CTA (i.e., work-group). The instruction does not differentiate between global and local memory. Hence, the flags parameter is ignored, except for deciding whether a “member.cta” instruction should be issued at all. Reviewed-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 315235		2017-10-09 19:43:04 +00:00
..
mem_fence	Implement mem_fence on ptx	2017-10-09 19:43:04 +00:00
synchronization	…
workitem	…
SOURCES	Implement mem_fence on ptx	2017-10-09 19:43:04 +00:00