hanchenye-llvm-project

Commit Graph

Author	SHA1	Message	Date
Chandler Carruth	48cc6c623a	[x86] Refactor the bit shift code the same as I just did the byte shift code. While this didn't have the miscompile (it used MatchLeft consistently) it missed some cases where it could use right shifts. I've added a test case Craig Topper came up with to exercise the right shift matching. This code is really identical between the two. I'm going to merge them next so that we don't keep two copies of all of this logic. llvm-svn: 229655	2015-02-18 09:19:58 +00:00
Ulrich Weigand	7db6918e2b	[SystemZ] Support all TLS access models - CodeGen part The current SystemZ back-end only supports the local-exec TLS access model. This patch adds all required CodeGen support for the other TLS models, which means in particular: - Expand initial-exec TLS accesses by loading TLS offsets from the GOT using @indntpoff relocations. - Expand general-dynamic and local-dynamic accesses by generating the appropriate calls to __tls_get_offset. Note that this routine has a non-standard ABI and requires loading the GOT pointer into %r12, so the patch also adds support for the GLOBAL_OFFSET_TABLE ISD node. - Add a new platform-specific optimization pass to remove redundant __tls_get_offset calls in the local-dynamic model (modeled after the corresponding X86 pass). - Add test cases verifying all access models and optimizations. llvm-svn: 229654	2015-02-18 09:13:27 +00:00
Ulrich Weigand	7bdd7c2346	[SystemZ] Support all TLS access models - MC part The current SystemZ back-end only supports the local-exec TLS access model. This patch adds all required MC support for the other TLS models, which means in particular: - Support additional relocation types for Initial-exec model: R_390_TLS_IEENT Local-dynamic-model: R_390_TLS_LDO32, R_390_TLS_LDO64, R_390_TLS_LDM32, R_390_TLS_LDM64, R_390_TLS_LDCALL General-dynamic model: R_390_TLS_GD32, R_390_TLS_GD64, R_390_TLS_GDCALL - Support assembler syntax to generate additional relocations for use with __tls_get_offset calls: :tls_gdcall: :tls_ldcall: The patch also adds a new test to verify fixups and relocations, and removes the (already unused) FK_390_PLT16DBL/FK_390_PLT32DBL fixup kinds. llvm-svn: 229652	2015-02-18 09:11:36 +00:00
Elena Demikhovsky	714f23bcdb	AVX-512: Added support for FP instructions with embedded rounding mode. By Asaf Badouh <asaf.badouh@intel.com> llvm-svn: 229645	2015-02-18 07:59:20 +00:00
Chandler Carruth	55553f5299	[x86] Rewrite the byte shift detection to not use boolean variables to track state. I didn't like this in the code review because the pattern tends to be error prone, but I didn't see a clear way to rewrite it. Turns out that there were bugs here, I found them when fuzz testing our shuffle lowering for correctness on x86. The core of the problem is that we need to consistently test all our preconditions for the same directionality of shift and the same input vector. Instead, formulate this as two predicates (one doesn't depend on the input in any way), pass things like the directionality and input vector as inputs, and loop over the alternatives. This fixes a pattern of very rare miscompiles coming out of this code. Turned up roughly 4 out of every 1 million v8 shuffles in my fuzz testing. The new code is over half a million test runs with no failures yet. I've also fuzzed every other function in the lowering code with over 3.5 million test cases and not discovered any other miscompiles. llvm-svn: 229642	2015-02-18 07:13:48 +00:00
Craig Topper	b324e43aed	[X86] Remove AVX2 and SSE2 pslldq and psrldq intrinsics. We can represent them in IR with vector shuffles now. All their uses have been removed from clang in favor of shuffles. llvm-svn: 229640	2015-02-18 06:24:44 +00:00
Matt Arsenault	0ba644b66b	R600/SI: Rename dst encoding field to be consistent with docs The docs call this vdst instead of just dst. llvm-svn: 229614	2015-02-18 02:15:37 +00:00
Matt Arsenault	e3dbcf6656	R600/SI: Consistently capitalize encoding field names Some formats capitalized these, but most didn't. Change them all to be consistently lowercase. Now, non-encoding fields and convenience bits are capitalized. Also remove weird looking empty line in some of the formats. llvm-svn: 229613	2015-02-18 02:15:35 +00:00
Matt Arsenault	1ecac06a6f	R600/SI: Set noNamedPositionallyEncodedOperands llvm-svn: 229612	2015-02-18 02:15:32 +00:00
Matt Arsenault	096ec1e10c	R600/SI: Fix src1_modifiers for class instructions src1 doesn't have modifiers, but the operand was missing resulting in an encoding build error when all fields are required.' llvm-svn: 229611	2015-02-18 02:15:30 +00:00
Matt Arsenault	65fa1c425d	R600/SI: Fix not setting clamp / omod for v_cndmask_b32_e64 Rename the multiclass since it now applies to the output modifiers as well. llvm-svn: 229610	2015-02-18 02:15:27 +00:00
Matt Arsenault	284d7dfb53	R600: Fix operand encoding error llvm-svn: 229609	2015-02-18 02:10:42 +00:00
Matt Arsenault	1991f5e40b	R600/SI: Fix encoding error from glc bit on VI SMRD instructions llvm-svn: 229608	2015-02-18 02:10:40 +00:00
Matt Arsenault	e6c5241814	R600/SI: Fix operand encoding for flat instructions llvm-svn: 229607	2015-02-18 02:10:37 +00:00
Matt Arsenault	07e3bb153f	R600/SI: Fix error from vdst on no return atomics Set the ignored field to 0 so we can enable noNamedPositionallyEncodedOperands. llvm-svn: 229606	2015-02-18 02:10:35 +00:00
Matt Arsenault	caa1288fff	R600/SI: Add missing offset operand to buffer bothen llvm-svn: 229605	2015-02-18 02:04:38 +00:00
Matt Arsenault	2ad8bab7ee	R600/SI: Add missing soffset operand to global atomics llvm-svn: 229604	2015-02-18 02:04:35 +00:00
Matt Arsenault	3c34ae293c	R600/SI: Fix brace identation llvm-svn: 229603	2015-02-18 02:04:31 +00:00
Eric Christopher	8af49b3214	Make the Mips AsmPrinter independent of global subtarget initialization. Initialize the subtarget once per function and migrate EmitStartOfAsmFile to either use calls on the TargetMachine or get information from the subtarget we'd use for assembling. The top-level-ness of the MIPS attribute output for assembly is, by nature, contrary to how we'd want to do this for an LTO situation where we have multiple cpu architectures so this solution is good enough for now. llvm-svn: 229596	2015-02-18 01:01:57 +00:00
Eric Christopher	bbe6ff50f3	Unify selectMipsCPU implementations. llvm-svn: 229595	2015-02-18 00:55:06 +00:00
Andrea Di Biagio	e7b58ee555	[X86][FastIsel] Teach how to select scalar integer to float/double conversions. This patch teaches fast-isel how to select a (V)CVTSI2SSrr for an integer to float conversion, and how to select a (V)CVTSI2SDrr for an integer to double conversion. Added test 'fast-isel-int-float-conversion.ll'. Differential Revision: http://reviews.llvm.org/D7698 llvm-svn: 229589	2015-02-17 23:40:58 +00:00
Rafael Espindola	df19519800	Add r228939 back with a fix. The problem in the original patch was not switching back to .text after printing an eh table. Original message: On ELF, put PIC jump tables in a non executable section. Fixes PR22558. llvm-svn: 229586	2015-02-17 23:34:51 +00:00
Sanjay Patel	e951a3839a	rename variables again because these tables also deal with stores; NFC Suggestion by Simon Pilgrim llvm-svn: 229574	2015-02-17 22:38:06 +00:00
Simon Pilgrim	1d89a02abb	[X86][SSE] Generalised unpckl/unpckh shuffle matching Added commuted unpckl/unpckh shuffle matching patterns as many cases containing undefined lanes fail to commute by themselves. Differential Revision: http://reviews.llvm.org/D7564 llvm-svn: 229571	2015-02-17 22:24:32 +00:00
Sanjay Patel	1a20fdf36f	Add comment to explain a non-obvious setting; NFC. This is paraphrased from Simon Pilgrim's comment in: http://reviews.llvm.org/D7492 llvm-svn: 229566	2015-02-17 22:09:54 +00:00
Sanjay Patel	203ee500e9	remove function names from comments; NFC llvm-svn: 229558	2015-02-17 21:55:20 +00:00
Sanjay Patel	52f9f7c0f3	replace meaningless variable names; NFCI llvm-svn: 229549	2015-02-17 21:37:28 +00:00
Tom Stellard	7b3aa88ac1	R600/SI: Fix asam errors in SIFoldOperands We were trying to fold into implicit uses, which led to out of bounds access of the MCInstrDesc::OpInfo arrray. llvm-svn: 229533	2015-02-17 20:11:54 +00:00
Sanjay Patel	b811c1d6a5	prevent folding a scalar FP load into a packed logical FP instruction (PR22371) Change the memory operands in sse12_fp_packed_scalar_logical_alias from scalars to vectors. That's what the hardware packed logical FP instructions define: 128-bit memory operands. There are no scalar versions of these instructions...because this is x86. Generating the wrong code (folding a scalar load into a 128-bit load) is still possible using the peephole optimization pass and the load folding tables. We won't completely solve this bug until we either fix the lowering in fabs/fneg/fcopysign and any other places where scalar FP logic is created or fix the load folding in foldMemoryOperandImpl() to make sure it isn't changing the size of the load. Differential Revision: http://reviews.llvm.org/D7474 llvm-svn: 229531	2015-02-17 20:08:21 +00:00
Eric Christopher	a49d68e078	Make the ARM AsmPrinter independent of global subtarget initialization. Initialize the subtarget once per function and migrate Emit{Start\|End}OfAsmFile to either use attributes on the TargetMachine or get information from the subtarget we'd use for assembling. One bit (getISAEncoding) touched the general AsmPrinter and the debug output. Handle this one by passing the function for the subprogram down and updating all callers and users. The top-level-ness of the ARM attribute output for assembly is, by nature, contrary to how we'd want to do this for an LTO situation where we have multiple cpu architectures so this solution is good enough for now. llvm-svn: 229528	2015-02-17 20:02:32 +00:00
Tom Stellard	bc3776803b	R600/SI: Extend private extload pattern to include zext loads llvm-svn: 229507	2015-02-17 16:36:00 +00:00
Benjamin Kramer	6cd780ff21	Prefer SmallVector::append/insert over push_back loops. Same functionality, but hoists the vector growth out of the loop. llvm-svn: 229500	2015-02-17 15:29:18 +00:00
Andrea Di Biagio	eb97f92489	[X86] Silence -Wsign-compare warnings. GCC 4.8 reported two new warnings due to comparisons between signed and unsigned integer expressions. The new warnings were accidentally introduced by revision 229480. Added explicit casts to silence the warnings. No functional change intended. llvm-svn: 229488	2015-02-17 11:20:11 +00:00
Elena Demikhovsky	ba84672519	AVX-512: changes in intel_ocl_bi calling conventions - added mask types v8i1 and v16i1 to possible function parameters - enabled passing 512-bit vectors in standard CC - added a test for KNL intel_ocl_bi conventions llvm-svn: 229482	2015-02-17 09:20:12 +00:00
Michael Kuperstein	ff5acaf50c	[X86] Combine vector anyext + and into a vector zext Vector zext tends to get legalized into a vector anyext, represented as a vector shuffle with an undef vector + a bitcast, that gets ANDed with a mask that zeroes the undef elements. Combine this into an explicit shuffle with a zero vector instead. This allows shuffle lowering to match it as a zext, instead of matching it as an anyext and emitting an explicit AND. This combine only covers a subset of the cases, but it's a start. Differential Revision: http://reviews.llvm.org/D7666 llvm-svn: 229480	2015-02-17 08:22:51 +00:00
Eric Christopher	5c0e009d3a	Make the PowerPC AsmPrinter independent of global subtarget initialization. Initialize the subtarget once per function and migrate EmitStartOfAsmFile to either use attributes on the TargetMachine or get information from all of the various subtargets. llvm-svn: 229475	2015-02-17 07:21:21 +00:00
Eric Christopher	75dc3904a5	Add a FIXME to move IsLittleEndian to the target machine. llvm-svn: 229472	2015-02-17 06:45:17 +00:00
Eric Christopher	fee6aaf683	Move ABI handling and 64-bitness to the PowerPC target machine. This required changing how the computation of the ABI is handled and how some of the checks for ABI/target are done. llvm-svn: 229471	2015-02-17 06:45:15 +00:00
Chandler Carruth	55db07016e	[x86] Teach the unpack lowering to try wider element unpacks. This allows it to match still more places where previously we would have to fall back on floating point shuffles or other more complex lowering strategies. I'm hoping to replace some of the hand-rolled unpack matching with this routine is it gets more and more clever. llvm-svn: 229463	2015-02-17 02:12:24 +00:00
Hal Finkel	5cedafb8cd	[PowerPC] Support non-direct-sub/superclass VSX copies Our register allocation has become better recently, it seems, and is now starting to generate cross-block copies into inflated register classes. These copies are not transformed into subregister insertions/extractions by the PPCVSXCopy class, and so need to be handled directly by PPCInstrInfo::copyPhysReg. The code to do this was almost there, but not quite (it was unnecessarily restricting itself to only the direct sub/super-register-class case (not copying between, for example, something in VRRC and the lower-half of VSRC which are super-registers of F8RC). Triggering this behavior manually is difficult; I'm including two bugpoint-reduced test cases from the test suite. llvm-svn: 229457	2015-02-16 23:46:30 +00:00
Simon Atanasyan	79ba8407d2	[Mips] Add .MIPS.options section descriptor kinds enumeration No functional changes. llvm-svn: 229452	2015-02-16 22:59:29 +00:00
Ahmed Bougacha	bf2b90e92d	[ARM] Remove unused declaration. NFC. GlobalMerge was moved to lib/CodeGen a while ago, and is no longer called "ARMGlobalMerge". llvm-svn: 229448	2015-02-16 22:30:08 +00:00
Cameron McInally	c5764cbe4e	[AVX512] Make 512b vector floating point rounds legal on AVX512. llvm-svn: 229445	2015-02-16 22:15:42 +00:00
Simon Pilgrim	b2c00f3286	[X86][SSE] Add SSE MOVQ instructions to SSEPackedInt domain Patch to explicitly add the SSE MOVQ (rr,mr,rm) instructions to SSEPackedInt domain - prevents a number of costly domain switches. Differential Revision: http://reviews.llvm.org/D7600 llvm-svn: 229439	2015-02-16 21:50:56 +00:00
Craig Topper	49df44e2e2	[X86] Remove the multiply by 8 that goes into the shift constant for X86ISD::VSHLDQ and X86ISD::VSRLDQ. This simplifies the pattern matching in isel and allows these nodes to become the patterns embedded in the instruction. llvm-svn: 229431	2015-02-16 20:52:07 +00:00
Craig Topper	44026efa88	[X86] Remove x86.avx2.psll.dq.bs and x86.avx2.psrl.dq.bs intrinsics. llvm-svn: 229430	2015-02-16 20:51:59 +00:00
Matthias Braun	d6b108e445	ARM: Transfer kill flag when lowering VSTMQIA to VSTMDIA. llvm-svn: 229425	2015-02-16 19:34:30 +00:00
Aaron Ballman	da9501b25c	We require MSVC 1800 as our minimum, so these checks can safely go away; NFC. (It seems this code has been copy/pasted around, unfortunately.) llvm-svn: 229417	2015-02-16 18:34:57 +00:00
Andrew Trick	05938a5481	AArch64: Safely handle the incoming sret call argument. This adds a safe interface to the machine independent InputArg struct for accessing the index of the original (IR-level) argument. When a non-native return type is lowered, we generate the hidden machine-level sret argument on-the-fly. Before this fix, we were representing this argument as OrigArgIndex == 0, which is an outright lie. In particular this crashed in the AArch64 backend where we actually try to access the type of the original argument. Now we use a sentinel value for machine arguments that have no original argument index. AArch64, ARM, Mips, and PPC now check for this case before accessing the original argument. Fixes <rdar://19792160> Null pointer assertion in AArch64TargetLowering llvm-svn: 229413	2015-02-16 18:10:47 +00:00
Chandler Carruth	1e57e2deb8	[x86] Add a generic unpack-targeted lowering technique. This can be used to generically lower blends and is particularly nice because it is available frome SSE2 onward. This removes a lot of the remaining domain crossing blends in SSE2 code. I'm hoping to replace some of the "interleaved" lowering hacks with something closer to this which should be more principled. First, this needs to learn how to detect and use other interleavings besides that of the natural type provided. That will be a follow-up patch though. llvm-svn: 229378	2015-02-16 12:28:18 +00:00
Chandler Carruth	c802085b3a	[x86] Add initial basic support for forming blends of v16i8 vectors. This blend instruction is ... really lame. The register usage is insane. As a consequence this is probably only barely better than 2 pshufbs followed by a por, and that mostly because it only has to read from a single memory location. However, this doesn't fix as much as I kind of expected, so more to go. Pretty sure that the ordering and delegation of v16i8 is just really, really bad. llvm-svn: 229373	2015-02-16 10:58:23 +00:00
Chandler Carruth	e63bbd97a7	[x86] Switch my usage of VariadicFunction to a "normal" variadic template now that we can use them. This is, of course, horribly ugly because of the required recursive formulation. Suggestions for making it less ugly welcome. llvm-svn: 229367	2015-02-16 09:59:48 +00:00
Craig Topper	7e8dcef094	[X86] Add support for lowering shuffles to 256-bit PALIGNR instruction. llvm-svn: 229359	2015-02-16 06:29:06 +00:00
Chandler Carruth	87e580a659	[x86] Teach the 128-bit vector shuffle lowering routines to take advantage of the existence of a reasonable blend instruction. The 256-bit vector shuffle lowering has leveraged the general technique of decomposed shuffles and blends for quite some time, but this never made it back into the 128-bit code, and there are a large number of patterns where this is substantially better. For example, this removes almost all domain crossing in vector shuffles that involve some blend and some permutation with SSE4.1 and later. See the massive reduction in 'shufps' for integer test cases in this commit. This isn't perfect yet for a few reasons: 1) The v8i16 shuffle lowering continues to plague me. We don't always form an unpack-based blend when that would be better. But the wins pretty drastically outstrip the losses here. 2) The v16i8 shuffle lowering is just a disaster here. I never went and implemented blend support here for some terrible reason. I'll do that next probably. I've not updated it for now. More variations on this technique are coming as well -- we don't shuffle-into-unpack or shuffle-into-palignr, both of which would also be profitable. Note that some test cases grow significantly in the number of instructions, but I expect to actually be faster. We use pshufd+pshufd+blendw instead of a single shufps, but the pshufd's are very likely to pipeline well (two ports on most modern intel chips) and the blend is a very fast instruction. The domain switch penalty will essentially always be more than a blend instruction, which is the only increase in tree height. llvm-svn: 229350	2015-02-16 01:52:02 +00:00
Aaron Ballman	f9a1897c72	Removing LLVM_DELETED_FUNCTION, as MSVC 2012 was the last reason for requiring the macro. NFC; LLVM edition. llvm-svn: 229340	2015-02-15 22:54:22 +00:00
Aaron Ballman	b46962fe5d	Removing LLVM_EXPLICIT, as MSVC 2012 was the last reason for requiring the macro. NFC; LLVM edition. llvm-svn: 229335	2015-02-15 22:00:20 +00:00
Simon Pilgrim	2a7bedb73e	Coding style fixes to recent patches. NFC. llvm-svn: 229312	2015-02-15 14:19:29 +00:00
Simon Pilgrim	00bd79d794	[X86][AVX2] vpslldq/vpsrldq byte shifts for AVX2 This patch refactors the existing lowerVectorShuffleAsByteShift function to add support for 256-bit vectors on AVX2 targets. It also fixes a tablegen issue that prevented the lowering of vpslldq/vpsrldq vec256 instructions. Differential Revision: http://reviews.llvm.org/D7596 llvm-svn: 229311	2015-02-15 13:19:52 +00:00
Chandler Carruth	bf0fb06e0d	[x86] Teach the decomposed shuffle/blend lowering to use an early blend when that will allow it to lower with a single permute instead of multiple permutes. It tries to detect when it will only have to do a single permute in either case to maximize folding of loads and such. This cuts a lot of the avx2 shuffle permute counts in half. =] llvm-svn: 229309	2015-02-15 12:42:15 +00:00
Chandler Carruth	75d9a97569	[x86] Teach the shuffle mask equivalence test to look through build vectors and detect equivalent inputs. This lets the code match unpck-style instructions when only one of the inputs are lined up but the other input is a splat and so which lanes we pull from doesn't matter. Today, this doesn't really happen, but just by accident. I have a patch that normalizes how we shuffle splats, and with that patch this will be necessary for a lot of the mask equivalence tests to work. I don't really know how to write a test case for this specific change until the other change lands though. llvm-svn: 229307	2015-02-15 12:07:55 +00:00
Chandler Carruth	4fe214b1f2	[x86] Tweak the ordering of unpack matching vs. element insertion, and don't try to do element insertion for non-zero-index floating point vectors. We don't have any useful patterns or lowering for element insertion into high elements of a floating point vector, and the generic shuffle lowering will end up being better -- namely it will fall back to unpck. But we should try to handle other forms of element insertion before matching unpck patterns. While this doesn't matter much right now, I'm working on a patch that makes unpck matching much more powerful, and that patch will break without this re-ordering. llvm-svn: 229306	2015-02-15 12:01:14 +00:00
Chandler Carruth	56e0ceda0d	[x86] Stop shuffling zero vectors. =] I was somewhat surprised this pattern really came up, but it does. It seems better to just directly handle it than try to special case every place where we end up forming a shuffle that devolves to a shuffle of a zero vector. llvm-svn: 229301	2015-02-15 10:34:52 +00:00
Chandler Carruth	3d272daaed	[x86] Use a more helpful parenthesizing of these comparisons. Silences a -Wparentheses complaint from GCC. llvm-svn: 229300	2015-02-15 10:15:20 +00:00
Chandler Carruth	62558c1d4d	[x86] When splitting 256-bit vectors into 128-bit vectors, don't extract subvectors from buildvectors. That doesn't really make any sense and it breaks all of the down-stream matching of buildvectors to cleverly lower shuffles. With this, we now get the shift-based lowering of 256-bit vector shuffles with AVX1 when we split them into 128-bit vectors. We also do much better on the zero-extension patterns, although there remains quite a bit of room for improvement here. llvm-svn: 229299	2015-02-15 10:12:02 +00:00
Chandler Carruth	a6f8a3661c	[x86] Make computing the zeroable elements slightly more powerful, at least in theory. I don't actually have a test case that benefits from this, but theoretically, it could come up, and I don't want to try to think about whether this is the culprit or something else is, so I'd rather just make this code powerful. =/ Makes me sad that I can't really test it though. llvm-svn: 229298	2015-02-15 09:33:36 +00:00
Chandler Carruth	0ddfe0c7c5	[x86] Add a slight variation on some of the other generic shuffle lowerings -- one which decomposes into an initial blend followed by a permute. Particularly on newer chips, blends are handled independently of shuffles and so this is much less bottlenecked on the single port that floating point shuffles are executed with on Intel. I'll be adding this lowering to a bunch of other code paths in subsequent commits to handle still more places where we can effectively leverage blends when they're available in the ISA. llvm-svn: 229292	2015-02-15 08:26:30 +00:00
Craig Topper	78c424dfca	[X86] Add assembly parser support for mnemonic aliases for AVX-512 vpcmp instructions. llvm-svn: 229287	2015-02-15 07:13:48 +00:00
Craig Topper	f02ad93270	[X86] Add assembler predicates for the rest of the AVX512 feature flags. This makes the assembly matching consistent across all AVX512 instructions. Without this we were allowing some AVX512 instructions to be parsed always, but not the foundation instructions. llvm-svn: 229280	2015-02-15 04:54:55 +00:00
Craig Topper	a3776de242	[X86] Add the remaining 11 possible exact ModRM formats. This makes their encodings linear which can then be used to simplify some other code. llvm-svn: 229279	2015-02-15 04:16:44 +00:00
Simon Pilgrim	31457d54f7	[X86][XOP] Enable commutation for XOP instructions Patch to allow XOP instructions (integer comparison and integer multiply-add) to be commuted. The comparison instructions sometimes require the compare mode to be flipped but the remaining instructions can use default commutation modes. This patch also sets the SSE domains of all the XOP instructions. Differential Revision: http://reviews.llvm.org/D7646 llvm-svn: 229267	2015-02-14 22:40:46 +00:00
Craig Topper	43860838dc	[X86] Improve parsing support AVX/SSE floating point compare instruction mnemonic aliases. They'll now print with the alias the parser received instead of converting to the explicit immediate form. llvm-svn: 229266	2015-02-14 21:54:03 +00:00
Duncan P. N. Exon Smith	025c0ad74c	Target: Canonicalize access to function attributes, NFC Canonicalize access to function attributes to use the simpler API. getAttributes().getAttribute(AttributeSet::FunctionIndex, Kind) => getFnAttribute(Kind) getAttributes().hasAttribute(AttributeSet::FunctionIndex, Kind) => hasFnAttribute(Kind) llvm-svn: 229261	2015-02-14 15:36:52 +00:00
Duncan P. N. Exon Smith	b5054333ec	NVPTX: Canonicalize access to function attributes, NFC Canonicalize access to function attributes to use the simpler API. getAttributes().getAttribute(AttributeSet::FunctionIndex, Kind) => getFnAttribute(Kind) getAttributes().hasAttribute(AttributeSet::FunctionIndex, Kind) => hasFnAttribute(Kind) llvm-svn: 229260	2015-02-14 15:35:43 +00:00
Simon Pilgrim	b0bac23fcd	Line ending fix. NFC. llvm-svn: 229256	2015-02-14 13:27:53 +00:00
Chandler Carruth	003ed332bf	Remove a variable only used in an assert and sink its initializer into the assert. Fixes -Wunused-variable on non-asserts builds. llvm-svn: 229250	2015-02-14 09:14:44 +00:00
Matt Arsenault	0bbcd8ba2f	R600/SI: Implement correct f64 fdiv This version passes the OpenCL conformance test. llvm-svn: 229239	2015-02-14 04:30:08 +00:00
Matt Arsenault	044f1d19cf	R600/SI: Use complex operand folding for div_scale llvm-svn: 229238	2015-02-14 04:24:28 +00:00
Matt Arsenault	1bc9d95047	R600/SI: Fix implicit vcc operand to v_div_fmas_* This should allow finally fixing the f64 fdiv implementation. Test is disabled for VI since there seems to be a problem with one of the buffer load instructions on it. llvm-svn: 229236	2015-02-14 04:22:00 +00:00
Matt Arsenault	6e26b8d854	R600/SI: Fix schedule model for v_div_scale_{f32\|f64} llvm-svn: 229235	2015-02-14 04:03:18 +00:00
Matt Arsenault	35733e2dec	R600/SI: Really fix size of VReg_1 llvm-svn: 229234	2015-02-14 03:54:32 +00:00
Matt Arsenault	1bcc8cba5a	R600/SI: Rename encoding field to match docs for VOP3b llvm-svn: 229233	2015-02-14 03:54:29 +00:00
Matt Arsenault	31ec598a2a	R600/SI: Fix not encoding src2 for v_div_scale_{f32\|f64} This apparently got lost in the VI changes. llvm-svn: 229230	2015-02-14 03:40:35 +00:00
Matt Arsenault	692acf1438	R600/SI: Fix VOP3b encoding on VI llvm-svn: 229228	2015-02-14 03:02:23 +00:00
Matt Arsenault	95546b46ab	R600/SI: Fix phys reg copies in SIFoldOperands llvm-svn: 229227	2015-02-14 02:55:57 +00:00
Matt Arsenault	9998168982	R600/SI: Fix copies from SGPR to VCC This shows up without optimizations when vcc is required to be used. llvm-svn: 229226	2015-02-14 02:55:56 +00:00
Matt Arsenault	834b1aa806	R600/SI: Add hack to copy from a VGPR to VCC This hopefully should be fixed when VReg_1 is removed. llvm-svn: 229225	2015-02-14 02:55:54 +00:00
Duncan P. N. Exon Smith	5bedaf934f	PowerPC: Canonicalize access to function attributes, NFC Canonicalize access to function attributes to use the simpler API. getAttributes().getAttribute(AttributeSet::FunctionIndex, Kind) => getFnAttribute(Kind) getAttributes().hasAttribute(AttributeSet::FunctionIndex, Kind) => hasFnAttribute(Kind) llvm-svn: 229224	2015-02-14 02:54:07 +00:00
Matt Arsenault	f417ff8f2a	R600/SI: Fix size of VReg_1 This is really a 32-bit register, if we try to check the size of it, we want 32-bits. llvm-svn: 229223	2015-02-14 02:51:44 +00:00
Duncan P. N. Exon Smith	8480c87ce6	R600: Canonicalize access to function attributes, NFC Canonicalize access to function attributes to use the simpler API. getAttributes().getAttribute(AttributeSet::FunctionIndex, Kind) => getFnAttribute(Kind) getAttributes().hasAttribute(AttributeSet::FunctionIndex, Kind) => hasFnAttribute(Kind) llvm-svn: 229222	2015-02-14 02:45:45 +00:00
Duncan P. N. Exon Smith	2e75314352	Mips: Canonicalize access to function attributes, NFC Canonicalize access to function attributes to use the simpler API. getAttributes().getAttribute(AttributeSet::FunctionIndex, Kind) => getFnAttribute(Kind) getAttributes().hasAttribute(AttributeSet::FunctionIndex, Kind) => hasFnAttribute(Kind) llvm-svn: 229221	2015-02-14 02:37:48 +00:00
Duncan P. N. Exon Smith	2cff9e19a2	ARM: Canonicalize access to function attributes, NFC Canonicalize access to function attributes to use the simpler API. getAttributes().getAttribute(AttributeSet::FunctionIndex, Kind) => getFnAttribute(Kind) getAttributes().hasAttribute(AttributeSet::FunctionIndex, Kind) => hasFnAttribute(Kind) llvm-svn: 229220	2015-02-14 02:24:44 +00:00
Duncan P. N. Exon Smith	003bb7d96e	AArch64: Canonicalize access to function attributes, NFC Canonicalize access to function attributes to use the simpler API. getAttributes().getAttribute(AttributeSet::FunctionIndex, Kind) => getFnAttribute(Kind) getAttributes().hasAttribute(AttributeSet::FunctionIndex, Kind) => hasFnAttribute(Kind) llvm-svn: 229218	2015-02-14 02:09:06 +00:00
Duncan P. N. Exon Smith	5975a703e6	X86: Canonicalize access to function attributes, NFC Canonicalize access to function attributes to use the simpler API. getAttributes().getAttribute(AttributeSet::FunctionIndex, Kind) => getFnAttribute(Kind) getAttributes().hasAttribute(AttributeSet::FunctionIndex, Kind) => hasFnAttribute(Kind) llvm-svn: 229214	2015-02-14 01:59:52 +00:00
Ahmed Bougacha	8f2b4f0be8	[X86] Factor out the CMOV pseudo definitions. NFCI. llvm-svn: 229206	2015-02-14 01:36:53 +00:00
Matthias Braun	33cc10724d	Revert "On ELF, put PIC jump tables in a non executable section." This reverts commit r228939. The commit broke something in the output of exception handling tables on darwin x86-64. llvm-svn: 229203	2015-02-14 01:16:54 +00:00
Eric Christopher	b2a5fa98e4	Use the template method to grab the target specific subtarget. llvm-svn: 229191	2015-02-14 00:09:46 +00:00
Eric Christopher	fcd3d87ad8	The base pointer save offset can be computed at initialization time, do so and fix up the calls. llvm-svn: 229169	2015-02-13 22:48:53 +00:00
Eric Christopher	a10d58dba8	Move the target machine variable so that it's initialized early enough we can use it to initialize frame lowering. llvm-svn: 229168	2015-02-13 22:48:51 +00:00
Eric Christopher	e8dbfe1cf8	Stash the TargetMachine on the subtarget so we can access it later. Clean up a subtarget function that has it passed in while we're at it. llvm-svn: 229164	2015-02-13 22:23:04 +00:00
Eric Christopher	a4ae213193	PPC LinkageSize can be computed at initialization time, do so. llvm-svn: 229163	2015-02-13 22:22:57 +00:00

1 2 3 4 5 ...

32053 Commits