hanchenye-llvm-project/clang/lib/CodeGen
Francesco Petrogalli 67e4330fac [sve][acle] Implement some of the C intrinsics for brain float.
Summary:
The following intrinsics have been extended to support brain float types:

svbfloat16_t svclasta[_bf16](svbool_t pg, svbfloat16_t fallback, svbfloat16_t data)
bfloat16_t svclasta[_n_bf16](svbool_t pg, bfloat16_t fallback, svbfloat16_t data)
bfloat16_t svlasta[_bf16](svbool_t pg, svbfloat16_t op)

svbfloat16_t svclastb[_bf16](svbool_t pg, svbfloat16_t fallback, svbfloat16_t data)
bfloat16_t svclastb[_n_bf16](svbool_t pg, bfloat16_t fallback, svbfloat16_t data)
bfloat16_t svlastb[_bf16](svbool_t pg, svbfloat16_t op)

svbfloat16_t svdup[_n]_bf16(bfloat16_t op)
svbfloat16_t svdup[_n]_bf16_m(svbfloat16_t inactive, svbool_t pg, bfloat16_t op)
svbfloat16_t svdup[_n]_bf16_x(svbool_t pg, bfloat16_t op)
svbfloat16_t svdup[_n]_bf16_z(svbool_t pg, bfloat16_t op)

svbfloat16_t svdupq[_n]_bf16(bfloat16_t x0, bfloat16_t x1, bfloat16_t x2, bfloat16_t x3, bfloat16_t x4, bfloat16_t x5, bfloat16_t x6, bfloat16_t x7)
svbfloat16_t svdupq_lane[_bf16](svbfloat16_t data, uint64_t index)

svbfloat16_t svinsr[_n_bf16](svbfloat16_t op1, bfloat16_t op2)

Reviewers: sdesmalen, kmclaughlin, c-rhodes, ctetreau, efriedma

Subscribers: tschuett, hiraditya, rkruppe, psnobl, cfe-commits, llvm-commits

Tags: #clang, #llvm

Differential Revision: https://reviews.llvm.org/D82345
2020-06-29 16:09:08 +00:00
..
ABIInfo.h [ARM] Add __bf16 as new Bfloat16 C Type 2020-06-05 10:32:43 +01:00
Address.h
BackendUtil.cpp [SanitizeCoverage] Rename -fsanitize-coverage-{white,black}list to -fsanitize-coverage-{allow,block}list 2020-06-19 22:22:47 -07:00
CGAtomic.cpp Clean up usages of asserting vector getters in Type 2020-04-13 13:01:40 -07:00
CGBlocks.cpp [CodeGen] Simplify the way lifetime of block captures is extended 2020-06-11 16:06:22 -07:00
CGBlocks.h [CodeGen] Simplify the way lifetime of block captures is extended 2020-06-11 16:06:22 -07:00
CGBuilder.h Reapply "[IRBuilder] Virtualize IRBuilder" 2020-02-17 19:04:11 +01:00
CGBuiltin.cpp [sve][acle] Implement some of the C intrinsics for brain float. 2020-06-29 16:09:08 +00:00
CGCUDANV.cpp [cuda][hip] Fix `RegisterVar` function prototype. 2020-04-03 12:57:09 -04:00
CGCUDARuntime.cpp
CGCUDARuntime.h Fix GCC warning on enum class bitfield. NFC. 2020-03-28 10:20:34 -04:00
CGCXX.cpp [Alignment][NFC] Use Align with CreateAlignedLoad 2020-01-27 10:58:36 +01:00
CGCXXABI.cpp [clang] Add an API to retrieve implicit constructor arguments. 2020-05-19 09:21:26 +02:00
CGCXXABI.h [AIX][Frontend] Static init implementation for AIX considering no priority 2020-06-19 08:27:07 -04:00
CGCall.cpp [ARM][Clang] Removing lowering of half-precision FP arguments and returns from Clang's CodeGen 2020-06-18 13:17:07 +01:00
CGCall.h [CodeGen] Emit destructor calls to destruct non-trivial C struct objects 2020-03-20 18:34:22 -07:00
CGClass.cpp [clang/llvm] As part of using inclusive language within 2020-06-20 16:03:58 -07:00
CGCleanup.cpp [CodeGen] Simplify the way lifetime of block captures is extended 2020-06-11 16:06:22 -07:00
CGCleanup.h Remove clang::Codegen::EHPadEndScope as unused 2020-06-23 15:18:49 -07:00
CGCoroutine.cpp [Coroutines] Do not evaluate InitListExpr of a co_return 2020-03-16 12:42:44 +08:00
CGDebugInfo.cpp Make ASTFileSignature an array of 20 uint8_t instead of 5 uint32_t 2020-06-11 09:12:29 +01:00
CGDebugInfo.h Reland (again) D80966 [codeview] Put !heapallocsite on calls to operator new 2020-06-09 09:27:32 -07:00
CGDecl.cpp [Clang][OpenMP][OMPBuilder] Moving OMP allocation and cache creation code to OMPBuilderCBHelpers 2020-06-28 19:04:20 -04:00
CGDeclCXX.cpp [AIX][Frontend] Static init implementation for AIX considering no priority 2020-06-19 08:27:07 -04:00
CGException.cpp Remove clang::Codegen::EHPadEndScope as unused 2020-06-23 15:18:49 -07:00
CGExpr.cpp [Clang][OpenMP][OMPBuilder] Moving OMP allocation and cache creation code to OMPBuilderCBHelpers 2020-06-28 19:04:20 -04:00
CGExprAgg.cpp attempt to fix failing buildbots after 3bab88b7ba 2020-06-15 12:58:37 +02:00
CGExprCXX.cpp Change debuginfo check for addHeapAllocSiteMetadata 2020-06-09 11:01:06 -07:00
CGExprComplex.cpp attempt to fix failing buildbots after 3bab88b7ba 2020-06-15 12:58:37 +02:00
CGExprConstant.cpp attempt to fix failing buildbots after 3bab88b7ba 2020-06-15 12:58:37 +02:00
CGExprScalar.cpp [CodeGen] Use the common semantic for fixed-point codegen, not the result semantic. 2020-06-29 16:22:29 +02:00
CGGPUBuiltin.cpp [Alignment][NFC] Use Align with CreateAlignedStore 2020-01-23 17:34:32 +01:00
CGLoopInfo.cpp [Clang] Add llvm.loop.unroll.disable to loops with -fno-unroll-loops. 2020-04-07 14:01:55 +01:00
CGLoopInfo.h [Clang] Add llvm.loop.unroll.disable to loops with -fno-unroll-loops. 2020-04-07 14:01:55 +01:00
CGNonTrivialStruct.cpp [NFC] Silence compiler warning [-Wmissing-braces]. 2020-06-17 13:01:53 -07:00
CGObjC.cpp Reland D81869 "Modify FPFeatures to use delta not absolute settings" 2020-06-27 01:34:57 -07:00
CGObjCGNU.cpp [NFC] Refactoring PropertyAttributeKind for ObjCPropertyDecl and ObjCDeclSpec. 2020-04-23 17:21:25 -04:00
CGObjCMac.cpp [ADT/STLExtras.h] - Add llvm::is_sorted wrapper and update callers. 2020-04-14 14:11:02 +03:00
CGObjCRuntime.cpp [clang] CodeGen: Make getOrEmitProtocol public for Swift 2020-04-01 08:55:56 -07:00
CGObjCRuntime.h [clang] CodeGen: Make getOrEmitProtocol public for Swift 2020-04-01 08:55:56 -07:00
CGOpenCLRuntime.cpp
CGOpenCLRuntime.h
CGOpenMPRuntime.cpp [AIX][Frontend] Static init implementation for AIX considering no priority 2020-06-19 08:27:07 -04:00
CGOpenMPRuntime.h [OPENMP50]Initial codegen for 'affinity' clauses. 2020-06-02 10:50:08 -04:00
CGOpenMPRuntimeNVPTX.cpp [OPENMP]Dynamic globalization for parallel target regions. 2020-06-25 08:25:24 -04:00
CGOpenMPRuntimeNVPTX.h [OpenMP] `omp begin/end declare variant` - part 2, sema ("+CG") 2020-03-27 02:30:58 -05:00
CGRecordLayout.h
CGRecordLayoutBuilder.cpp [CodeGen] Increase applicability of ffine-grained-bitfield-accesses for targets with limited native integer widths 2020-06-12 10:33:47 +01:00
CGStmt.cpp attempt to fix failing buildbots after 3bab88b7ba 2020-06-15 12:58:37 +02:00
CGStmtOpenMP.cpp [Clang][OpenMP][OMPBuilder] Moving OMP allocation and cache creation code to OMPBuilderCBHelpers 2020-06-28 19:04:20 -04:00
CGVTT.cpp
CGVTables.cpp [clang] Frontend components for the relative vtables ABI (round 2) 2020-06-11 11:17:08 -07:00
CGVTables.h [clang] Frontend components for the relative vtables ABI (round 2) 2020-06-11 11:17:08 -07:00
CGValue.h [Matrix] Implement matrix index expressions ([][]). 2020-06-01 20:08:49 +01:00
CMakeLists.txt Update compiler extension integration into the build system 2020-04-24 09:40:14 +02:00
CodeGenABITypes.cpp [clang] Add an API to retrieve implicit constructor arguments. 2020-05-19 09:21:26 +02:00
CodeGenAction.cpp Expose IRGen API to add the default IR attributes to a function definition. 2020-05-16 14:44:54 -04:00
CodeGenFunction.cpp Reland D81869 "Modify FPFeatures to use delta not absolute settings" 2020-06-27 01:34:57 -07:00
CodeGenFunction.h [Clang][OpenMP][OMPBuilder] Moving OMP allocation and cache creation code to OMPBuilderCBHelpers 2020-06-28 19:04:20 -04:00
CodeGenModule.cpp Fix crash on XCore on unused inline in EmitTargetMetadata 2020-06-24 12:48:17 -07:00
CodeGenModule.h Fix crash on XCore on unused inline in EmitTargetMetadata 2020-06-24 12:48:17 -07:00
CodeGenPGO.cpp [PGO] Fix computation of function Hash 2020-05-27 09:15:21 +02:00
CodeGenPGO.h [CodeGenPGO] Fix shadow variable warning. NFC. 2020-03-02 15:06:34 +00:00
CodeGenTBAA.cpp Reland Implement _ExtInt as an extended int type specifier. 2020-04-17 10:45:48 -07:00
CodeGenTBAA.h
CodeGenTypeCache.h [ARM] Add __bf16 as new Bfloat16 C Type 2020-06-05 10:32:43 +01:00
CodeGenTypes.cpp [AArch64][SVE] ACLE: Add bfloat16 to struct load/stores. 2020-06-23 12:12:35 +01:00
CodeGenTypes.h CodeGenTypes::CGRecordLayouts: Use unique_ptr to simplify memory management 2020-04-28 22:31:16 -07:00
ConstantEmitter.h attempt to fix failing buildbots after 3bab88b7ba 2020-06-15 12:58:37 +02:00
ConstantInitBuilder.cpp Fix ConstantAggregateBuilderBase::getRelativeOffset 2020-06-15 12:23:20 -07:00
CoverageMappingGen.cpp Avoid including FileManager.h from SourceManager.h 2020-03-11 13:53:12 -07:00
CoverageMappingGen.h Reland: [Coverage] Revise format to reduce binary size 2020-02-28 18:12:04 -08:00
EHScopeStack.h [CodeGen] Simplify the way lifetime of block captures is extended 2020-06-11 16:06:22 -07:00
ItaniumCXXABI.cpp [AIX] Default AIX to using -fno-use-cxa-atexit 2020-06-19 08:27:07 -04:00
MacroPPCallbacks.cpp
MacroPPCallbacks.h
MicrosoftCXXABI.cpp [AIX][Frontend] Static init implementation for AIX considering no priority 2020-06-19 08:27:07 -04:00
ModuleBuilder.cpp
ObjectFilePCHContainerOperations.cpp Make ASTFileSignature an array of 20 uint8_t instead of 5 uint32_t 2020-06-11 09:12:29 +01:00
PatternInit.cpp Clean up usages of asserting vector getters in Type 2020-04-13 13:01:40 -07:00
PatternInit.h
README.txt
SanitizerMetadata.cpp [Analysis/Transforms/Sanitizers] As part of using inclusive language 2020-06-20 00:42:26 -07:00
SanitizerMetadata.h [Analysis/Transforms/Sanitizers] As part of using inclusive language 2020-06-20 00:42:26 -07:00
SwiftCallingConv.cpp [SVE] Eliminate calls to default-false VectorType::get() from Clang 2020-06-01 10:02:14 -07:00
TargetInfo.cpp Fix crash on XCore on unused inline in EmitTargetMetadata 2020-06-24 12:48:17 -07:00
TargetInfo.h Fix crash on XCore on unused inline in EmitTargetMetadata 2020-06-24 12:48:17 -07:00
VarBypassDetector.cpp
VarBypassDetector.h

README.txt

IRgen optimization opportunities.

//===---------------------------------------------------------------------===//

The common pattern of
--
short x; // or char, etc
(x == 10)
--
generates an zext/sext of x which can easily be avoided.

//===---------------------------------------------------------------------===//

Bitfields accesses can be shifted to simplify masking and sign
extension. For example, if the bitfield width is 8 and it is
appropriately aligned then is is a lot shorter to just load the char
directly.

//===---------------------------------------------------------------------===//

It may be worth avoiding creation of alloca's for formal arguments
for the common situation where the argument is never written to or has
its address taken. The idea would be to begin generating code by using
the argument directly and if its address is taken or it is stored to
then generate the alloca and patch up the existing code.

In theory, the same optimization could be a win for block local
variables as long as the declaration dominates all statements in the
block.

NOTE: The main case we care about this for is for -O0 -g compile time
performance, and in that scenario we will need to emit the alloca
anyway currently to emit proper debug info. So this is blocked by
being able to emit debug information which refers to an LLVM
temporary, not an alloca.

//===---------------------------------------------------------------------===//

We should try and avoid generating basic blocks which only contain
jumps. At -O0, this penalizes us all the way from IRgen (malloc &
instruction overhead), all the way down through code generation and
assembly time.

On 176.gcc:expr.ll, it looks like over 12% of basic blocks are just
direct branches!

//===---------------------------------------------------------------------===//