hanchenye-llvm-project

Commit Graph

Author	SHA1	Message	Date
NAKAMURA Takumi	d985d76040	Revert r169456, "change MCContext to work on the doInitialization/doFinalization model" It broke many builders. llvm-svn: 169462	2012-12-06 02:00:13 +00:00
Chad Rosier	9f5c68af4c	[arm fast-isel] Make the fast-isel implementation of memcpy respect alignment. rdar://12821569 llvm-svn: 169460	2012-12-06 01:34:31 +00:00
Evan Cheng	5213139f48	Let targets provide hooks that compute known zero and ones for any_extend and extload's. If they are implemented as zero-extend, or implicitly zero-extend, then this can enable more demanded bits optimizations. e.g. define void @foo(i16* %ptr, i32 %a) nounwind { entry: %tmp1 = icmp ult i32 %a, 100 br i1 %tmp1, label %bb1, label %bb2 bb1: %tmp2 = load i16* %ptr, align 2 br label %bb2 bb2: %tmp3 = phi i16 [ 0, %entry ], [ %tmp2, %bb1 ] %cmp = icmp ult i16 %tmp3, 24 br i1 %cmp, label %bb3, label %exit bb3: call void @bar() nounwind br label %exit exit: ret void } This compiles to the followings before: push {lr} mov r2, #0 cmp r1, #99 bhi LBB0_2 @ BB#1: @ %bb1 ldrh r2, [r0] LBB0_2: @ %bb2 uxth r0, r2 cmp r0, #23 bhi LBB0_4 @ BB#3: @ %bb3 bl _bar LBB0_4: @ %exit pop {lr} bx lr The uxth is not needed since ldrh implicitly zero-extend the high bits. With this change it's eliminated. rdar://12771555 llvm-svn: 169459	2012-12-06 01:28:01 +00:00
Pedro Artigas	bf7d3bab26	change MCContext to work on the doInitialization/doFinalization model reviewed by Evan Cheng <evan.cheng@apple.com> llvm-svn: 169456	2012-12-06 00:50:55 +00:00
Bill Wendling	ab417b644c	Set the 'MadeChange' variable if we are deleting blocks. llvm-svn: 169455	2012-12-06 00:30:20 +00:00
Michael Ilseman	0f12837be0	Have CannotBeNegativeZero() be aware of the nsz fast-math flag llvm-svn: 169452	2012-12-06 00:07:09 +00:00
Andrew Trick	d3226eee03	RegPressureTracker::dump(): Remove unnecessary argument. llvm-svn: 169443	2012-12-05 23:05:22 +00:00
Eli Bendersky	02631c4e31	Change std::vector to SmallVector<4> and remove some unused methods. This is more consistent with other vectors in this code. In addition, I ran some tests compiling a large program and >96% of fragments have 4 or less fixups, so SmallVector<4> is a good optimization. llvm-svn: 169433	2012-12-05 22:11:02 +00:00
Jyotsna Verma	d3746e6895	Define new-value store instructions with base+immediate addressing mode using multiclass. llvm-svn: 169432	2012-12-05 22:02:56 +00:00
Bill Wendling	fcf6a22b01	Fix name. The array is unboundED. llvm-svn: 169428	2012-12-05 21:43:30 +00:00
Andrew Trick	fda7a8832d	RegisterPressureTracker: fix findUseBetween to handle DebugValue llvm-svn: 169427	2012-12-05 21:37:50 +00:00
Andrew Trick	7bbcad7bcd	RegisterPressureTracker: unify virtual registers and physical regunits. Now that live register units are tracked individually, the code can be simplified. llvm-svn: 169426	2012-12-05 21:37:47 +00:00
Andrew Trick	7f7cee39ab	RegisterPresssureTracker: Track live physical register by unit. This is much simpler to reason about, more efficient, and fixes some corner cases involving implicit super-register defs. Fixed rdar://12797931. llvm-svn: 169425	2012-12-05 21:37:42 +00:00
Nadav Rotem	0a471ea66c	Cost Model: change the default cost of control flow instructions (br / ret / ...) to zero. llvm-svn: 169423	2012-12-05 21:21:26 +00:00
David Sehr	05176cad21	Correct ARM NOP encoding The encoding of NOP in ARMAsmBackend.cpp is missing a trailing zero, which causes the emission of a coprocessor instruction rather than "mov r0, r0" as indicated in the comment. The test also checks for the wrong encoding. http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20121203/157919.html llvm-svn: 169420	2012-12-05 21:01:27 +00:00
Justin Holewinski	fb711156ae	[NVPTX] Fix crash with unnamed struct arguments Patch by Eric Holk llvm-svn: 169418	2012-12-05 20:50:28 +00:00
Jyotsna Verma	90295156d8	Use multiclass to define store instructions with base+immediate offset addressing mode and immediate stored value. llvm-svn: 169408	2012-12-05 19:32:03 +00:00
Bob Wilson	50a62525cd	Adjust JIT target triple on OS X to match the current architecture. For OS X builds, we generate one version of config.h but then build for multiple architectures. This means that the LLVM_HOSTTRIPLE setting may have the wrong architecture. Adjust it dynamically to match the current architecture. <rdar://problem/12715470> llvm-svn: 169405	2012-12-05 19:09:13 +00:00
Matthew Curtis	cd8c881c9f	Fix misplaced closing brace. llvm-svn: 169404	2012-12-05 19:00:34 +00:00
Benjamin Kramer	507aca835e	Try to unbreak the build on hosts that don't transitively pull in a definition for int64_t. Also use the portable (ugly) format string macros, for MSVC compatibility. llvm-svn: 169396	2012-12-05 18:31:11 +00:00
Jakob Stoklund Olesen	a97cec790f	Remove unused MachineInstr constructors. A MachineInstr can only ever be constructed by CreateMachineInstr() and CloneMachineInstr(), and those factories don't use the removed constructors. llvm-svn: 169395	2012-12-05 18:27:39 +00:00
Kevin Enderby	168ffb36a5	Added a option to the disassembler to print immediates as hex. This is for the lldb team so most of but not all of the values are to be printed as hex with this option. Some small values like the scale in an X86 address were requested to printed in decimal without the leading 0x. There may be some tweaks need to places that may still be in decimal that they want in hex. Specially for arm. I made my best guess. Any tweaks from here should be simple. I also did the best I know now with help from the C++ gurus creating the cleanest formatImm() utility function and containing the changes. But if someone has a better idea to make something cleaner I'm all ears and game for changing the implementation. rdar://8109283 llvm-svn: 169393	2012-12-05 18:13:19 +00:00
Pedro Artigas	41b98843e8	- Added calls to doInitialization/doFinalization to immutable passes - fixed ordering of calls to doFinalization to be the reverse of the pass run order due to potential dependencies - fixed machine module info to operate in the doInitialization/doFinalization model, also fixes some FIXMEs reviewed by Evan Cheng <evan.cheng@apple.com> llvm-svn: 169391	2012-12-05 17:12:22 +00:00
Evgeniy Stepanov	8b51bab495	[msan] Instrument bswap intrinsic. llvm-svn: 169383	2012-12-05 14:39:55 +00:00
Evgeniy Stepanov	94b257df3c	[msan] Initialize callbacks in runOnFunction as opposed to doInitialization. This mirrors the change in ASan & TSan done in r168864. llvm-svn: 169378	2012-12-05 13:14:33 +00:00
Evgeniy Stepanov	474cb3b3b5	[msan] Change linkage type of __msan_track_origins. LinkOnceODRLinkage globals may be removed in GlobalOpt if not used in the current module. llvm-svn: 169377	2012-12-05 12:49:41 +00:00
Elena Demikhovsky	cd3c1c4a16	Simplified BLEND pattern matching for shuffles. Generate VPBLENDD for AVX2 and VPBLENDW for v16i16 type on AVX2. llvm-svn: 169366	2012-12-05 09:24:57 +00:00
Andrew Trick	d52ab339cb	Added RegisterPressureTracker::dump() for debugging. llvm-svn: 169359	2012-12-05 06:47:08 +00:00
Michael J. Spencer	41ee041d4f	Copy clang/Driver/<Option parsing stuff> to llvm. llvm-svn: 169344	2012-12-05 00:29:32 +00:00
Evan Cheng	d31802c1f6	Add x86 isel lowering logic to form bit test with inverted condition. e.g. x ^ -1. Patch by David Majnemer. rdar://12755626 llvm-svn: 169339	2012-12-05 00:10:38 +00:00
Matt Beaumont-Gay	50f61b662f	Appease GCC's -Wparentheses. (TIL that Clang's -Wparentheses ignores 'x \|\| y && "foo"' on purpose. Neat.) llvm-svn: 169337	2012-12-04 23:54:02 +00:00
Bill Wendling	34c2eb2f99	Split up the ParseOptionalAttrs method into three different methods for each class of attributes. This makes it much easier to check for errors and to reuse the code. llvm-svn: 169336	2012-12-04 23:40:58 +00:00
Nadav Rotem	a8f026e2d4	LoopVectorizer: Increase the number of pointers that can be tested at runtime. If we cant prove statically that the pointers are disjoint then we add the runtime check. llvm-svn: 169334	2012-12-04 23:25:24 +00:00
Nadav Rotem	87fc988c5d	Enable if-conversion during vectorization. llvm-svn: 169331	2012-12-04 22:59:52 +00:00
Evan Cheng	b4eae1361c	ARM custom lower ctpop for vector types. Patch by Pete Couperus. llvm-svn: 169325	2012-12-04 22:41:50 +00:00
Nadav Rotem	93fa5ef957	Fix a bug in vectorization of if-converted reduction variables. If the reduction variable is not used outside the loop then we ran into an endless loop. This change checks if we found the original PHI. llvm-svn: 169324	2012-12-04 22:40:22 +00:00
Jakob Stoklund Olesen	3cb2cb800f	Speed up the AllocationOrder class a bit. Allow the central functions to be inlined, and use the argumentless isHint() function when possible. llvm-svn: 169319	2012-12-04 22:25:16 +00:00
Shuxin Yang	73285933c9	For rdar://12329730, last piece. This change attempts to simplify (X^Y) -> X or Y in the user's context if we know that only bits from X or Y are demanded. A minimized case is provided bellow. This change will simplify "t>>16" into "var1 >>16". ============================================================= unsigned foo (unsigned val1, unsigned val2) { unsigned t = val1 ^ 1234; return (t >> 16) \| t; // NOTE: t is used more than once. } ============================================================= Note that if the "t" were used only once, the expression would be finally optimized as well. However, with with this change, the optimization will take place earlier. Reviewed by Nadav, Thanks a lot! llvm-svn: 169317	2012-12-04 22:15:32 +00:00
David Blaikie	67cb31ebdd	Comment change made in r169304 as requested by Eric Christopher. llvm-svn: 169315	2012-12-04 22:02:33 +00:00
Jyotsna Verma	4da904c8f8	Define store instructions with base+register offset addressing mode using multiclass. llvm-svn: 169314	2012-12-04 21:58:25 +00:00
Bill Wendling	d7767125d5	Use the 'count' attribute to calculate the upper bound of an array. The count attribute is more accurate with regards to the size of an array. It also obviates the upper bound attribute in the subrange. We can also better handle an unbound array by setting the count to -1 instead of the lower bound to 1 and upper bound to 0. llvm-svn: 169312	2012-12-04 21:34:03 +00:00
David Blaikie	5a773bb601	Reapply r160148 (reverted in r163570) fixing spurious breakpoints in modern GDB This reapplies the fix for PR13303 now with more justification. Based on my execution of the GDB 7.5 test suite this results in: expected passes: 16101 -> 20890 (+30%) unexpected failures: 4826 -> 637 (-77%) There are 23 checks that used to pass and now fail. They are all in gdb.reverse. Investigating a few looks like they were accidentally passing due to extra breakpoints being set by this bug. They're generally due to the difference in end location between gcc and clang, the test suite is trying to set breakpoints on the closing '}' that clang doesn't associate with any instructions. llvm-svn: 169304	2012-12-04 21:05:36 +00:00
Eli Bendersky	abe546368b	Make NaCl naming consistent. The triple OSType is called NaCl and is represented textually as NativeClient. Also added a link to the native client project for readers unfamiliar with it. A Clang patch will follow shortly. llvm-svn: 169291	2012-12-04 18:37:26 +00:00
Nadav Rotem	a10b311aec	Add support for reduction variables when IF-conversion is enabled. llvm-svn: 169288	2012-12-04 18:17:33 +00:00
Jyotsna Verma	dfd779e108	Add patterns to define 'combine', 'tstbit', 'ct0/cl0' (count trailing/leading zeros) instructions. llvm-svn: 169287	2012-12-04 18:05:01 +00:00
Jyotsna Verma	22d61dd4ce	Add constant extender support to ALU32 instructions for V2. llvm-svn: 169284	2012-12-04 17:12:00 +00:00
Bill Schmidt	ca4a0c9dbd	This patch introduces initial-exec model support for thread-local storage on 64-bit PowerPC ELF. The patch includes code to handle external assembly and MC output with the integrated assembler. It intentionally does not support the "old" JIT. For the initial-exec TLS model, the ABI requires the following to calculate the address of external thread-local variable x: Code sequence Relocation Symbol ld 9,x@got@tprel(2) R_PPC64_GOT_TPREL16_DS x add 9,9,x@tls R_PPC64_TLS x The register 9 is arbitrary here. The linker will replace x@got@tprel with the offset relative to the thread pointer to the generated GOT entry for symbol x. It will replace x@tls with the thread-pointer register (13). The two test cases verify correct assembly output and relocation output as just described. PowerPC-specific selection node variants are added for the two instructions above: LD_GOT_TPREL and ADD_TLS. These are inserted when an initial-exec global variable is encountered by PPCTargetLowering::LowerGlobalTLSAddress(), and later lowered to machine instructions LDgotTPREL and ADD8TLS. LDgotTPREL is a pseudo that uses the same LDrs support added for medium code model's LDtocL, with a different relocation type. The rest of the processing is straightforward. llvm-svn: 169281	2012-12-04 16:18:08 +00:00
Chandler Carruth	802d755533	Sort includes for all of the .h files under the 'lib' tree. These were missed in the first pass because the script didn't yet handle include guards. Note that the script is now able to handle all of these headers without manual edits. =] llvm-svn: 169224	2012-12-04 07:12:27 +00:00
Nadav Rotem	07674cb566	Give scalar if-converted blocks half the score because they are not always executed due to CF. llvm-svn: 169223	2012-12-04 07:11:52 +00:00
Chandler Carruth	dd7ca93abc	Add a comment about the requirement that the Windows.h header be last. This comment has the dual effect of blocking reorderings with the sort_include script. llvm-svn: 169221	2012-12-04 07:04:57 +00:00

1 2 3 4 5 ...

57772 Commits