hanchenye-llvm-project

Commit Graph

Author	SHA1	Message	Date
Saleem Abdulrasool	c4e00289a7	ARM: correct WoA __builtin_alloca handling on O0 When performing a dynamic stack adjustment without optimisations, we would mark SP as def and R4 as kill. This occurred as part of the expansion of a WIN__CHKSTK SDNode which indicated the proper handling of SP and R4. The result would be that we would double define SP as part of an operation, which is obviously incorrect. Furthermore, the VTList for the chain had an incorrect parameter type of i32 instead of Other. Correct these to permit proper lowering of __builtin_alloca at -O0. llvm-svn: 213442	2014-07-19 01:29:51 +00:00
David Peixotto	ae5ba76221	MC: support different sized constants in constant pools On AArch64 the pseudo instruction ldr <reg>, =... supports both 32-bit and 64-bit constants. Add support for 64 bit constants for the pools to support the pseudo instruction fully. Changes the AArch64 ldr-pseudo tests to use 32-bit registers and adds tests with 64-bit registers. Patch by Janne Grunau! Differential Revision: http://reviews.llvm.org/D4279 llvm-svn: 213387	2014-07-18 16:05:14 +00:00
Tim Northover	4e80b584fe	ARM: support legalisation of "fptrunc ... to half" operations. llvm-svn: 213373	2014-07-18 13:01:19 +00:00
Renato Golin	e48d9dc15e	Suppress 'not handled in switch' warning llvm-svn: 213371	2014-07-18 12:13:04 +00:00
Tilmann Scheller	0fc933d6b8	[ARM] Add earlyclobber constraint to pre/post-indexed ARM STR instructions. The post-indexed instructions were missing the constraint, causing unpredictable STR instructions to be emitted. The earlyclobber constraint on the pre-indexed STR instructions is not strictly necessary, as the instruction selection for pre-indexed STR instructions goes through an additional layer of pseudo instructions which have the constraint defined, however it doesn't hurt to specify the constraint directly on the pre-indexed instructions as well, since at some point someone might create instances of them programmatically and then the constraint is definitely needed. This fixes PR20323. llvm-svn: 213369	2014-07-18 12:05:49 +00:00
Renato Golin	c17a07b36a	Refactor ARM subarchitecture parsing Re-commit of a patch to rework the triple parsing on ARM to a more sane model. Patch by Gabor Ballabas. llvm-svn: 213367	2014-07-18 12:00:48 +00:00
Tim Northover	53f3bcf0e9	ARM: support direct f16 <-> f64 conversions ARMv8 has instructions to handle it, otherwise a libcall is needed. llvm-svn: 213254	2014-07-17 11:27:04 +00:00
Tim Northover	fd7e424935	CodeGen: extend f16 conversions to permit types > float. This makes the two intrinsics @llvm.convert.from.f16 and @llvm.convert.to.f16 accept types other than simple "float". This is only strictly needed for the truncate operation, since otherwise double rounding occurs and there's no way to represent the strict IEEE conversion. However, for symmetry we allow larger types in the extend too. During legalization, we can expand an "fp16_to_double" operation into two extends for convenience, but abort when the truncate isn't legal. A new libcall is probably needed here. Even after this commit, various target tweaks are needed to actually use the extended intrinsics. I've put these into separate commits for clarity, so there are no actual tests of f64 conversion here. llvm-svn: 213248	2014-07-17 10:51:23 +00:00
Chris Bieneman	df4b763be5	[RegisterCoalescer] Moving the RegisterCoalescer subtarget hook onto the TargetRegisterInfo instead of the TargetSubtargetInfo. llvm-svn: 213188	2014-07-16 20:13:31 +00:00
Chris Bieneman	80a866a316	Added documentation for SizeMultiplier in the ARM subtarget hook for register coalescing. Also fixed some 80 col violations. No functional code changes. llvm-svn: 213169	2014-07-16 16:27:31 +00:00
Sanjay Patel	a2f658d69d	Move Post RA Scheduling flag bit into SchedMachineModel Refactoring; no functional changes intended Removed PostRAScheduler bits from subtargets (X86, ARM). Added PostRAScheduler bit to MCSchedModel class. This bit is set by a CPU's scheduling model (if it exists). Removed enablePostRAScheduler() function from TargetSubtargetInfo and subclasses. Fixed the existing enablePostMachineScheduler() method to use the MCSchedModel (was just returning false!). Added methods to TargetSubtargetInfo to allow overrides for AntiDepBreakMode, CriticalPathRCs, and OptLevel for PostRAScheduling. Added enablePostRAScheduler() function to PostRAScheduler class which queries the subtarget for the above values. Preserved existing scheduler behavior for ARM, MIPS, PPC, and X86: a. ARM overrides the CPU's postRA settings by enabling postRA for any non-Thumb or Thumb2 subtarget. b. MIPS overrides the CPU's postRA settings by enabling postRA for everything. c. PPC overrides the CPU's postRA settings by enabling postRA for everything. d. X86 is the only target that actually has postRA specified via sched model info. Differential Revision: http://reviews.llvm.org/D4217 llvm-svn: 213101	2014-07-15 22:39:58 +00:00
Chris Bieneman	03695ab57e	[RegisterCoalescer] Add new subtarget hook allowing targets to opt-out of coalescing. The coalescer is very aggressive at propagating constraints on the register classes, and the register allocator doesn’t know how to split sub-registers later to recover. This patch provides an escape valve for targets that encounter this problem to limit coalescing. This patch also implements such for ARM to lower register pressure when using lots of large register classes. This works around PR18825. llvm-svn: 213078	2014-07-15 17:18:41 +00:00
Renato Golin	b8a86c43c0	Revert "Refactor ARM subarchitecture parsing" This reverts commit 7b4a6882467e7fef4516a0cbc418cbfce0fc6f6d. llvm-svn: 212521	2014-07-08 10:06:16 +00:00
Renato Golin	1e9c282cd1	Refactor ARM subarchitecture parsing According to a FIXME in ARMMCTargetDesc.cpp the ARM version parsing should be in the Triple helper class. Patch by: Gabor Ballabas llvm-svn: 212479	2014-07-07 20:01:11 +00:00
Saleem Abdulrasool	763f9a50a5	ARM: properly lower dllimport'ed global values This completes the handling for DLL import storage symbols when lowering instructions. A DLL import storage symbol must have an additional load performed prior to use. This is applicable to variables and functions. This is particularly important for non-function symbols as it is possible to handle function references by emitting a thunk which performs the translation from the unprefixed __imp_ symbol to the proper symbol (although, this is a non-optimal lowering). For a variable symbol, no such thunk can be accommodated. llvm-svn: 212431	2014-07-07 05:18:35 +00:00
Saleem Abdulrasool	220a044888	ARM: correctly mangle dllimport symbols Add support for tracking DLLImport storage class information on a per symbol basis in the ARM instruction selection. Use that information to correctly mangle the symbol (dllimport symbols are referenced via *__imp_<name>). llvm-svn: 212430	2014-07-07 05:18:30 +00:00
Saleem Abdulrasool	1eb4a28b44	ARM: unify symbol name retrieval Ensure that all paths that retrieve the symbol name go through GetARMGVSymbol rather than getSymbol. This is desirable so that any global symbol mangling can be centralised to this function. The motivation for this is handling of symbols that are marked as having dll import dll storage. Such a symbol requires an extra load that is currently handled in the backend and a __imp_ prefix on the symbol name. llvm-svn: 212429	2014-07-07 05:18:22 +00:00
Tim Northover	1bc367a41b	ARM: when falling back to scattered relocs, keep the type. The linker relies on relocation type info (e.g. is it a branch?) to perform the correct actions, so we should keep that even when we end up using a scattered relocation for whatever reason. rdar://problem/17553104 llvm-svn: 212333	2014-07-04 10:58:05 +00:00
Eric Christopher	c1058df66f	Move function dependent resetting of a subtarget variable out of the subtarget. This involved having the movt predicate take the current function - since we care about size in instruction selection for whether or not to use movw/movt take the function so we can check the attributes. This required adding the current MachineFunction to FastISel and propagating through. llvm-svn: 212309	2014-07-04 01:55:26 +00:00
Eric Christopher	2f991c9ee1	Remove caching of the target machine and initialization of the subtarget from ARMISelDAGtoDAG. The former is unnecessary and the latter is initialized on each runOnMachineFunction. llvm-svn: 212297	2014-07-03 22:24:49 +00:00
Yi Kong	93e52da641	[ARM] Implement ISB memory barrier intrinsic Adds support for __builtin_arm_isb. Also corrects DSB and ISB instructions modelling by adding has-side-effects property. llvm-svn: 212276	2014-07-03 16:00:41 +00:00
Juergen Ributzka	3bd03c7099	[DAG] Pass the argument list to the CallLoweringInfo via move semantics. NFCI. The argument list vector is never used after it has been passed to the CallLoweringInfo and moving it to the CallLoweringInfo is cleaner and pretty much as cheap as keeping a pointer to it. llvm-svn: 212135	2014-07-01 22:01:54 +00:00
Scott Douglass	7650a9b871	ARM: take care not to set the ThumbFunc bit on TLS data symbols This fixes LNT SingleSource/UnitTests/Threads with -mthumb. Differential Revision: http://reviews.llvm.org/D4324 llvm-svn: 212029	2014-06-30 09:37:24 +00:00
Saleem Abdulrasool	4a1e409a4d	ARM: use symbolic name for constant This just changes the constant value to the symbolic name corresponding to it. NFC. llvm-svn: 212011	2014-06-30 03:11:14 +00:00
Eric Christopher	83e0723457	Remove extraneous includes from the target machines. llvm-svn: 211800	2014-06-26 19:30:05 +00:00
Eric Christopher	80b24ef429	Move all of the ARM subtarget features down onto the subtarget rather than the target machine. llvm-svn: 211799	2014-06-26 19:30:02 +00:00
Eric Christopher	45fb7b6397	Move the frame lowering constructors out of line to avoid circular includes. llvm-svn: 211798	2014-06-26 19:29:59 +00:00
Renato Golin	ac561c3ac7	Added parsing co-processor names starting with "cr" Additional compliant GAS names for coprocessor register name are enabled for all instruction with parameter MCK_CoprocReg: LDC,LDC2,STC,STC2,CDP,CDP2,MCR,MCR2,MCRR,MCRR2,MRC,MRC2,MRRC,MRRC2 Patch by Andrey Kuharev. llvm-svn: 211776	2014-06-26 13:10:53 +00:00
Rafael Espindola	e2c6624475	Move expression visitation logic up to MCStreamer. Remove the duplicate from MCRecordStreamer. No functionality change. llvm-svn: 211714	2014-06-25 15:45:33 +00:00
Rafael Espindola	2be1281d43	Simplify the visitation of target expressions. No functionality change. llvm-svn: 211707	2014-06-25 15:29:54 +00:00
Christian Pirker	c6308f59b2	ARM: Fix TPsoft for Thumb mode Reviewed at http://reviews.llvm.org/D4230 llvm-svn: 211601	2014-06-24 15:45:59 +00:00
Christian Pirker	6f81e75dab	ARMEB: Vector extend operations Reviewed at http://reviews.llvm.org/D4043 llvm-svn: 211520	2014-06-23 18:05:53 +00:00
Tim Northover	2099862a50	ARM: mark UBFX as not allowing PC. Strictly, it's unpredictable. But we don't quite model that yet and an error is better than ignoring the issue. This one somehow got left out before though. rdar://problem/15997748 llvm-svn: 211490	2014-06-23 09:20:02 +00:00
Rafael Espindola	1fc003e6c5	Allow a target to create a null streamer. Targets can assume that a target streamer is present, so they have to be able to construct a null streamer in order to set the target streamer in it to. Fixes a crash when using the null streamer with arm. llvm-svn: 211358	2014-06-20 13:11:28 +00:00
Oliver Stannard	5dc2934ba2	Emit the ARM build attributes ABI_PCS_wchar_t and ABI_enum_size. Emit the ARM build attributes ABI_PCS_wchar_t and ABI_enum_size based on module flags metadata. llvm-svn: 211349	2014-06-20 10:08:11 +00:00
Karthik Bhat	e03a25da70	Add Support to Recognize and Vectorize NON SIMD instructions in SLPVectorizer. This patch adds support to recognize patterns such as fadd,fsub,fadd,fsub.../add,sub,add,sub... and vectorizes them as vector shuffles if they are profitable. These patterns of vector shuffle can later be converted to instructions such as addsubpd etc on X86. Thanks to Arnold and Hal for the reviews. http://reviews.llvm.org/D4015 llvm-svn: 211339	2014-06-20 04:32:48 +00:00
Eric Christopher	c40e5edbbc	Add a new subtarget hook for whether or not we'd like to enable the atomic load linked expander pass to run for a particular subtarget. This requires a check of the subtarget and so save the TargetMachine rather than only TargetLoweringInfo and update all callers. llvm-svn: 211314	2014-06-19 21:03:04 +00:00
Alp Toker	1d099d9339	Fix typos llvm-svn: 211304	2014-06-19 19:41:26 +00:00
Craig Topper	35b2f75733	Convert some assert(0) to llvm_unreachable or fold an 'if' condition into the assert. llvm-svn: 211254	2014-06-19 06:10:58 +00:00
Eric Christopher	3d19f1388f	Move ARMJITInfo off of the TargetMachine and down onto the subtarget. This required untangling a mess of headers that included around. This is a recommit of r210953 with a fix for the removed accessor for JITInfo. llvm-svn: 211233	2014-06-18 22:48:09 +00:00
Weiming Zhao	8c89973462	[ARM] [MC] Refactor the constant pool classes ARMTargetStreamer implements ConstantPool and AssemblerConstantPools to keep track of assembler-generated constant pools that are used for ldr-pseudo. When implementing ldr-pseudo for AArch64, these two classes can be reused. So this patch factors them out from ARM target to the general MC lib. llvm-svn: 211198	2014-06-18 18:17:25 +00:00
Craig Topper	2a30d7889f	Replace some assert(0)'s with llvm_unreachable. llvm-svn: 211141	2014-06-18 05:05:13 +00:00
James Molloy	c1fd09ba2c	Fix memory leak of RegScavenger accidentally added in r211037. llvm-svn: 211097	2014-06-17 12:31:41 +00:00
Jim Grosbach	07393ba31b	ARM: intrinsic support for rbit. We already have an ARMISD node. Create an intrinsic to map to it so we can add support for the frontend __rbit() intrinsic. rdar://9283021 llvm-svn: 211057	2014-06-16 21:55:30 +00:00
Eric Christopher	daca3cc54a	Since the DataLayout is always found off of the subtarget go ahead and query the base target machine implementation for it. llvm-svn: 211055	2014-06-16 21:18:27 +00:00
Tim Northover	b45c3b74b4	ARM: implement correct atomic operations on v7M ARM v7M has ldrex/strex but not ldrexd/strexd. This means 32-bit operations should work as normal, but 64-bit ones are almost certainly doomed. Patch by Phoebe Buckheister. llvm-svn: 211042	2014-06-16 18:49:36 +00:00
James Molloy	f6419cfb14	Refactor the disabling of Thumb-1 LDM/STM generation Originally I switched the LD/ST optimizer off in TargetMachine as it was previously, but Eric has suggested he'd prefer that it be short-circuited in the pass itself. No functionality change. llvm-svn: 211037	2014-06-16 16:42:53 +00:00
Christian Pirker	2cc1cf0d4b	ARMEB: Fix trunc store for vector types Reviewed at http://reviews.llvm.org/D4135 llvm-svn: 211010	2014-06-16 09:17:30 +00:00
Eric Christopher	f6db93ab81	Temporarily revert r210953 in an attempt to bring the ARM buildbots back. llvm-svn: 210996	2014-06-15 19:55:14 +00:00
Eric Christopher	fb0c26c696	Remove InstrItineraryData off of the TargetMachine - it's already on the subtarget and just forward the accessor. llvm-svn: 210955	2014-06-13 23:11:13 +00:00
Eric Christopher	a0cdc005dd	Move ARMJITInfo off of the TargetMachine and down onto the subtarget. This required untangling a mess of headers that included around. llvm-svn: 210953	2014-06-13 23:04:46 +00:00
Eric Christopher	f047bfd115	The hazard recognizer only needs a subtarget, not a target machine so make it take one. Fix up all users accordingly. llvm-svn: 210948	2014-06-13 22:38:52 +00:00
Oliver Stannard	b5e596f7c3	ARM: Fix fastcc calling convention for Thumb1 When targeting Thumb1 on a processor which has a VFP unit (which is not accessible from Thumb1), we were converting the fastcc calling convention to AAPCS-VFP, which is not possible. llvm-svn: 210889	2014-06-13 08:33:03 +00:00
Eric Christopher	030294e4c5	Move ARMSelectionDAGInfo from the TargetMachine to the subtarget. llvm-svn: 210862	2014-06-13 00:20:39 +00:00
Eric Christopher	a47f6804d2	Move to a private function to initialize subtarget dependencies so we can use initializer lists for the ARMSubtarget and then use this to initialize a moved DataLayout on the subtarget from the TargetMachine. llvm-svn: 210861	2014-06-13 00:20:35 +00:00
Eric Christopher	70e005a171	Have ARMSelectionDAGInfo take a DataLayout as its argument as the DAG has access to the subtarget and TargetSelectionDAGInfo only needs a DataLayout. llvm-svn: 210859	2014-06-12 23:39:49 +00:00
Saleem Abdulrasool	65ca57a418	CodeGen: enable mov.w/mov.t pairs with minsize for WoA Windows on ARM uses COFF/PE which is intrinsically position independent. For the case of 32-bit immediates, use a pair-wise relocation as otherwise we may exceed the range of operators. This fixes a code generation crash when using -Oz when targeting Windows on ARM. llvm-svn: 210814	2014-06-12 20:06:33 +00:00
James Molloy	1417b0be3e	Disable the load/store optimization pass for Thumb-1. Moritz's changes have improved codegen a lot, but further testing showed significant correctness problems. Disable by default until these have been worked out. Patch by Moritz Roth! llvm-svn: 210789	2014-06-12 15:18:33 +00:00
Jim Grosbach	7a930bf9ef	ARM: honor hex immediate formatting for ldr/str i12 offsets. Previously we would always print the offset as decimal, regardless of the formatting requested. Now we use the formatImm() helper so the value is printed as the client (LLDB in the motivating example) requested. Before: ldr.w r8, [sp, #180] @ always After: ldr.w r8, [sp, #0xb4] @ when printing hex immediates ldr.w r8, [sp, #0180] @ when printing decimal immediates rdar://17237103 llvm-svn: 210701	2014-06-11 20:26:45 +00:00
Renato Golin	65eea557ae	Fix a bug in the Thumb1 ARM Load/Store optimizer Previously, the basic block was searched for future uses of the base register, and if necessary any writeback to the base register was reset using a SUB instruction (e.g. before calling a function) just before such a use. However, this step happened before the merged LDM/STM instruction was built. So if there was (e.g.) a function call directly after the not-yet-formed LDM/STM, the pass would first insert a SUB instruction to reset the base register, and then (at the same location, incorrectly) insert the LDM/STM itself. This patch fixes PR19972. Patch by Moritz Roth. llvm-svn: 210542	2014-06-10 16:39:21 +00:00
Saleem Abdulrasool	abac6e92a0	ARM: add VLA extension for WoA Itanium ABI The armv7-windows-itanium environment is nearly identical to the MSVC ABI. It has a few divergences, mostly revolving around the use of the Itanium ABI for C++. VLA support is one of the extensions that are amongst the set of the extensions. This adds support for proper VLA emission for this environment. This is somewhat similar to the handling for __chkstk emission on X86 and the large stack frame emission for ARM. The invocation style for chkstk is still controlled via the -mcmodel flag to clang. Make an explicit note that this is an extension. llvm-svn: 210489	2014-06-09 20:18:42 +00:00
David Blaikie	960ea3f018	AsmMatchers: Use unique_ptr to manage ownership of MCParsedAsmOperand I saw at least a memory leak or two from inspection (on probably untested error paths) and r206991, which was the original inspiration for this change. I ran this idea by Jim Grosbach a few weeks ago & he was OK with it. Since it's a basically mechanical patch that seemed sufficient - usual post-commit review, revert, etc, as needed. llvm-svn: 210427	2014-06-08 16:18:35 +00:00
Saleem Abdulrasool	90386ad60a	ARM: correct assertion for long-calls on WoA COFF/PE, so the relocation model is never static. Loosen the assertion accordingly. The relocation can still be emitted properly, as it will be converted to an IMAGE_REL_ARM_ADDR32 which will be resolved by the loader taking the base relocation into account. This is necessary to permit the emission of long calls which can be controlled via the -mlong-calls option in the driver. llvm-svn: 210399	2014-06-07 20:29:27 +00:00
Eric Christopher	0dd8d486b3	Have TargetSelectionDAGInfo take a DataLayout initializer rather than a TargetMachine since the only thing it wants is DataLayout. llvm-svn: 210366	2014-06-06 19:04:48 +00:00
Tom Roeder	44cb65fff1	Add a new attribute called 'jumptable' that creates jump-instruction tables for functions marked with this attribute. It includes a pass that rewrites all indirect calls to jumptable functions to pass through these tables. This also adds backend support for generating the jump-instruction tables on ARM and X86. Note that since the jumptable attribute creates a second function pointer for a function, any function marked with jumptable must also be marked with unnamed_addr. llvm-svn: 210280	2014-06-05 19:29:43 +00:00
Andrew Trick	8d2ee37f31	Add a subtarget hook: enablePostMachineScheduler. As requested by AArch64 subtargets. Note that this will have no effect until the AArch64 target actually enables the pass like this: substitutePass(&PostRASchedulerID, &PostMachineSchedulerID); As soon as armv7 switches over, PostMachineScheduler will become the default postRA scheduler, so this won't be necessary any more. Targets using the old postRA schedule would then do: substitutePass(&PostMachineSchedulerID, &PostRASchedulerID); llvm-svn: 210167	2014-06-04 07:06:27 +00:00
Christian Pirker	762b2c624f	ARMEB: Fix function return type f64 Reviewed at http://reviews.llvm.org/D3968 llvm-svn: 209990	2014-06-01 09:30:52 +00:00
Alp Toker	5f83e477c3	Update a couple of header inclusion guards llvm-svn: 209980	2014-05-31 21:26:09 +00:00
Eric Christopher	8995833a34	Have the TLOF creation take a Triple rather than needing a subtarget. llvm-svn: 209937	2014-05-31 00:07:32 +00:00
Tim Northover	86f60b7266	ARM: use AAPCS-style prologues for embedded MachO. Darwin prologues save their GPRs in two stages: a narrow push of r0-r7 & lr, followed by a wide push of the remaining registers if there are any. AAPCS uses a single push.w instruction. It turns out that, on average, enough registers get pushed that code is smaller in the AAPCS prologue, which is a nice property for M-class programmers. They also have other options available for back-traces, so can hopefully deal with the fact that FP & LR aren't adjacent in memory. rdar://problem/15909583 llvm-svn: 209895	2014-05-30 13:23:06 +00:00
Tim Northover	b4ddc0845a	ARM & AArch64: make use of common cmpxchg idioms after expansion The C and C++ semantics for compare_exchange require it to return a bool indicating success. This gets mapped to LLVM IR which follows each cmpxchg with an icmp of the value loaded against the desired value. When lowered to ldxr/stxr loops, this extra comparison is redundant: its results are implicit in the control-flow of the function. This commit makes two changes: it replaces that icmp with appropriate PHI nodes, and then makes sure earlyCSE is called after expansion to actually make use of the opportunities revealed. I've also added -{arm,aarch64}-enable-atomic-tidy options, so that existing fragile tests aren't perturbed too much by the change. Many of them either rely on undef/unreachable too pervasively to be restored to something well-defined (particularly while making sure they test the same obscure assert from many years ago), or depend on a particular CFG shape, which is disrupted by SimplifyCFG. rdar://problem/16227836 llvm-svn: 209883	2014-05-30 10:09:59 +00:00
Amara Emerson	ceeb1c4830	[ARM] Emit correct build attributes for the relocation models. Patch by Asiri Rathnayake. llvm-svn: 209656	2014-05-27 13:30:21 +00:00
Tim Northover	4f1909f1da	ARM: teach AAPCS-VFP to deal with Cortex-M4. Cortex-M4 only has single-precision floating point support, so any LLVM "double" type will have been split into 2 i32s by now. Fortunately, the consecutive-register framework turns out to be precisely what's needed to reconstruct the double and follow AAPCS-VFP correctly! rdar://problem/17012966 llvm-svn: 209650	2014-05-27 10:43:38 +00:00
Tim Northover	f9e798ba6a	Segmented stacks: omit __morestack call when there's no frame. Patch by Florian Zeitz llvm-svn: 209436	2014-05-22 13:03:43 +00:00
Saleem Abdulrasool	2bd1262a32	ARM: introduce llvm.arm.undefined intrinsic This intrinsic permits the emission of platform specific undefined sequences. ARM has reserved the 0xde opcode which takes a single integer parameter (ignored by the CPU). This permits the operating system to implement custom behaviour on this trap. The llvm.arm.undefined intrinsic is meant to provide a means for generating the target specific behaviour from the frontend. This is particularly useful for Windows on ARM which has made use of a series of these special opcodes. llvm-svn: 209390	2014-05-22 04:46:46 +00:00
Eric Christopher	0e6e7cf385	Override runOnMachineFunction for ARMISelDAGToDAG so that we can reset the subtarget on each function. llvm-svn: 209386	2014-05-22 02:00:27 +00:00
Eric Christopher	89f18805f4	Fix typo. llvm-svn: 209377	2014-05-22 01:21:44 +00:00
Saleem Abdulrasool	0bd31835ea	MC: correct IMAGE_REL_ARM_MOV32T relocation emission This corrects the emission of IMAGE_REL_ARM_MOV32T relocations. Previously, we were avoiding the high portion of the relocation too early. If there was a section-relative relocation with an offset greater than 16-bits (65535), you would end up truncating the high order bits of the offset. Allow the current relocation representation to flow through out the MC layer to the object writer. Use the new ability to restrict recorded relocations to avoid emitting the relocation into the final object. llvm-svn: 209337	2014-05-21 23:17:56 +00:00
Saleem Abdulrasool	8d60fdc50d	ARM: correct bundle generation for MOV32T relocations Although the previous code would construct a bundle and add the correct elements to it, it would not finalise the bundle. This resulted in the InternalRead markers not being added to the MachineOperands nor, more importantly, the externally visible defs to the bundle itself. So, although the bundle was not exposing the def, the generated code would be correct because there was no optimisations being performed. When optimisations were enabled, the post register allocator would kick in, and the hazard recognizer would reorder operations around the load which would define the value being operated upon. Rather than manually constructing the bundle, simply construct and finalise the bundle via the finaliseBundle call after both MIs have been emitted. This improves the code generation with optimisations where IMAGE_REL_ARM_MOV32T relocations are emitted. The changes to the other tests are the result of the bundle generation preventing the scheduler from hoisting the moves across the loads. The net effect of the generated code is equivalent, but, is much more identical to what is actually being lowered. llvm-svn: 209267	2014-05-21 01:25:24 +00:00
Christian Pirker	875629f713	ARMEB: Additional test files for ARM fixups llvm-svn: 209200	2014-05-20 09:24:37 +00:00
Benjamin Kramer	f3ad23551d	SDAG: Legalize vector BSWAP into a shuffle if the shuffle is legal but the bswap not. - On ARM/ARM64 we get a vrev because the shuffle matching code is really smart. We still unroll anything that's not v4i32 though. - On X86 we get a pshufb with SSSE3. Required more cleverness in isShuffleMaskLegal. - On PPC we get a vperm for v8i16 and v4i32. v2i64 is unrolled. llvm-svn: 209123	2014-05-19 13:12:38 +00:00
Saleem Abdulrasool	8bfb192ecd	ARM: make libcall setup more table driven Rather than create a series of function calls to setup the library calls, create a table with the information and just use the table to drive the configuration of the library calls. This makes it easier to both inspect the list as well as to modify it. NFC. llvm-svn: 209089	2014-05-18 16:39:11 +00:00
Saleem Abdulrasool	a521845381	ARM: improve WoA ABI conformance for frame register Windows on ARM uses R11 for the frame pointer even though the environment is a pure Thumb-2, thumb-only environment. Replicate this behaviour to improve Windows ABI compatibility. This register is used for fast stack walking, and thus is part of the Windows ABI. llvm-svn: 209085	2014-05-18 04:12:52 +00:00
Saleem Abdulrasool	f11f4b4e20	ARM: consolidate frame pointer register knowledge Use the ARMBaseRegisterInfo to query the frame register. The base register info is aware of the frame register that is used for the frame pointer. Use that to determine the frame register rather than duplicating the knowledge. Although, the code path is slightly different in that it may return SP, that can only occur if the frame pointer has been omitted in the machine function, which is supposed to contain the desired value in that case. llvm-svn: 209084	2014-05-18 03:18:09 +00:00
Saleem Abdulrasool	f3a5a5c546	Target: remove old constructors for CallLoweringInfo This is mostly a mechanical change changing all the call sites to the newer chained-function construction pattern. This removes the horrible 15-parameter constructor for the CallLoweringInfo in favour of setting properties of the call via chained functions. No functional change beyond the removal of the old constructors are intended. llvm-svn: 209082	2014-05-17 21:50:17 +00:00
Saleem Abdulrasool	6d11b7cd7a	ARM: whitespace Remove some whitespace. NFC. llvm-svn: 209079	2014-05-17 21:49:54 +00:00
Saleem Abdulrasool	46fed305db	ARM: use the proper target object format for WoA WoA uses COFF, not ELF. ARMISelLowering::createTLOF would previously return ELF for any non-MachO platform. This was a missed site when the original change for target format support for Windows on ARM was done. llvm-svn: 209057	2014-05-17 04:28:08 +00:00
James Molloy	a70697e10e	Re-enable inline memcpy expansion for Thumb1. Patch by Moritz Roth! llvm-svn: 208994	2014-05-16 14:24:22 +00:00
James Molloy	556763d2ef	Fix the Load/Store optimization pass to work with Thumb1. Patch by Moritz Roth! llvm-svn: 208992	2014-05-16 14:14:30 +00:00
James Molloy	92a15078f1	Enable the Load/Store optimization pass for Thumb1 but make it return immediately for now. Patch by Moritz Roth! llvm-svn: 208991	2014-05-16 14:11:38 +00:00
James Molloy	bb73c23ffa	Fix a few comment typos and style issues. Patch by Moritz Roth! llvm-svn: 208990	2014-05-16 14:08:46 +00:00
Saleem Abdulrasool	056fc3da4a	ARM: add some integer/floating point conversion libcalls Add some Windows on ARM specific library calls. These are provided by msvcrt, and can be used to perform integer to floating-point conversions (and vice-versa) mirroring similar functions in the RTABI. llvm-svn: 208949	2014-05-16 05:41:33 +00:00
Tim Northover	d8d65a69cf	TableGen/ARM64: print aliases even if they have syntax variants. To get at least one use of the change (and some actual tests) in with its commit, I've enabled the AArch64 & ARM64 NEON mov aliases. llvm-svn: 208867	2014-05-15 11:16:32 +00:00
Jonathan Roelofs	4971b40d7e	Fix some dyslexia in an assert message llvm-svn: 208842	2014-05-15 02:24:50 +00:00
Jay Foad	a0653a3e6c	Rename ComputeMaskedBits to computeKnownBits. "Masked" has been inappropriate since it lost its Mask parameter in r154011. llvm-svn: 208811	2014-05-14 21:14:37 +00:00
Christian Pirker	6692e7c116	ARM-BE: test files for vector argument passing Reviewed at http://reviews.llvm.org/D3766 llvm-svn: 208793	2014-05-14 16:59:44 +00:00
Saleem Abdulrasool	27351f2022	ARM: implement support for the UDF mnemonic The UDF instruction is a reserved undefined instruction space. The assembler mnemonic was introduced with ARM ARM rev C.a. The instruction is not predicated and the immediate constant is ignored by the CPU. Add support for the three encodings for this instruction. The changes to the invalid instruction test is due to the fact that the invalid instructions actually overlap with the undefined instruction. Introduction of the new instruction results in a partial decode as an undefined sequence. Drop the tests as they are invalid instruction patterns anyways. llvm-svn: 208751	2014-05-14 03:47:39 +00:00
Christian Pirker	39db7ec81f	ARMEB: Fix byte order of EH frame unwinding instructions, with modified test file This commit was already committed as revision rL208689 and discussed in Phabricator revision D3704. But the test file was crashing on OS X and Windows. I fixed the test file in the same way as in rL208340. llvm-svn: 208711	2014-05-13 16:44:30 +00:00
Rafael Espindola	2e7eceb317	Revert "ARMEB: Fix byte order of EH frame unwinding instructions" This reverts commit r208689. The test was crashing on OS X and windows. llvm-svn: 208704	2014-05-13 15:19:56 +00:00
Christian Pirker	ea3514ecdb	ARMEB: Fix byte order of EH frame unwinding instructions llvm-svn: 208689	2014-05-13 11:41:49 +00:00


7572 Commits