hanchenye-llvm-project

Commit Graph

Author	SHA1	Message	Date
Tobias Grosser	336734aca6	Add new RegionInfo pass. The RegionInfo pass detects single entry single exit regions in a function, where a region is defined as any subgraph that is connected to the remaining graph at only two spots. Furthermore an hierarchical region tree is built. Use it by calling "opt -regions analyze" or "opt -view-regions". llvm-svn: 109089	2010-07-22 07:46:31 +00:00
Dan Gohman	00ef93258a	Remove interprocedural-basic-aa and associated code. The AliasAnalysis interface needs implementations to be consistent, so any code which wants to support different semantics must use a different interface. It's not currently worthwhile to add a new interface for this new concept. Document that AliasAnalysis doesn't support cross-function queries. llvm-svn: 107776	2010-07-07 14:27:09 +00:00
Dan Gohman	84f90a387d	Remove context sensitivity concerns from interprocedural-basic-aa, and make it more aggressive in cases where both pointers are known to live in the same function. llvm-svn: 107420	2010-07-01 20:08:40 +00:00
Dan Gohman	c0cca7fdda	Revert the part of r107257 which introduced new logic for using nsw and nuw flags from IR Instructions. On further consideration, this isn't valid. llvm-svn: 107298	2010-06-30 17:27:11 +00:00
Dan Gohman	725ed0364b	Add a testcase for scev-aa's new capability. llvm-svn: 107258	2010-06-30 07:17:47 +00:00
Dan Gohman	9bbd007f15	Add a few more interesting testcases. llvm-svn: 107177	2010-06-29 18:17:11 +00:00
Dan Gohman	0824affeff	Add an Intraprocedural form of BasicAliasAnalysis, which aims to properly handles instructions and arguments defined in different functions, or across recursive function iterations. llvm-svn: 107109	2010-06-29 00:50:39 +00:00
Dan Gohman	7c34ece501	Fix Value::stripPointerCasts and BasicAA to avoid trouble on code in unreachable blocks, which have have use-def cycles. This fixes PR7514. llvm-svn: 107071	2010-06-28 21:16:52 +00:00
Dan Gohman	f820bd327d	Allow "exhaustive" trip count evaluation on phi nodes with all constant operands. llvm-svn: 106537	2010-06-22 13:15:46 +00:00
Dan Gohman	866971ed3d	Fix ScalarEvolution's "exhaustive" trip count evaluation code to avoid assuming that loops are in canonical form, as ScalarEvolution doesn't depend on LoopSimplify itself. Also, with indirectbr not all loops can be simplified. This fixes PR7416. llvm-svn: 106389	2010-06-19 14:17:24 +00:00
Dan Gohman	24ceda8eb0	Revert r106304 (105548 and friends), which are the SCEVComplexityCompare optimizations. There is still some nondeterminism remaining. llvm-svn: 106306	2010-06-18 19:54:20 +00:00
Dan Gohman	4c807fca97	Reapply 105540, 105542, and 105548, and revert r105732. llvm-svn: 106304	2010-06-18 19:26:04 +00:00
Daniel Dunbar	e16d569932	Workaround SCEV non-determinism on this test, for now, to get buildbots back to green. Dan, please revert this once the real problem is fixed. llvm-svn: 105732	2010-06-09 17:54:40 +00:00
Dan Gohman	70910a6ab6	Optimize ScalarEvolution's SCEVComplexityCompare predicate: don't go scrounging through SCEVUnknown contents and SCEVNAryExpr operands; instead just do a simple deterministic comparison of the precomputed hash data. Also, since this is more precise, it eliminates the need for the slow N^2 duplicate detection code. llvm-svn: 105540	2010-06-07 19:06:13 +00:00
Dan Gohman	d07d2f9774	Add a comment to this test. llvm-svn: 102387	2010-04-26 21:37:43 +00:00
Dan Gohman	f33bac3afe	ScalarEvolution support for <= and >= loops. Also, generalize ScalarEvolutions's min and max recognition to handle some new forms of min and max that this change makes more common. llvm-svn: 102234	2010-04-24 03:09:42 +00:00
Chris Lattner	126a58e084	fix some failures my callgraph dump format change broke. llvm-svn: 102197	2010-04-23 18:38:40 +00:00
Dan Gohman	acd700a24b	Don't attempt to analyze values which are obviously undef. This fixes some assertion failures in extreme cases. llvm-svn: 102042	2010-04-22 01:35:11 +00:00
Dan Gohman	6635bb26a6	Generalize ScalarEvolution's PHI analysis to handle loops that don't have preheaders or dedicated exit blocks, as clients may not otherwise need to run LoopSimplify. llvm-svn: 101030	2010-04-12 07:49:36 +00:00
Dan Gohman	cb45bd9cb3	Pointers to zero-sized objects don't point to overlapping objects. llvm-svn: 100789	2010-04-08 18:11:50 +00:00
Chris Lattner	3ae2dd2ba5	add newlines at the end of files. llvm-svn: 100705	2010-04-07 22:53:17 +00:00
Mon P Wang	c576ee9040	Reapply address space patch after fixing an issue in MemCopyOptimizer. Added support for address spaces and added a isVolatile field to memcpy, memmove, and memset, e.g., llvm.memcpy.i32(i8, i8, i32, i32) -> llvm.memcpy.p0i8.p0i8.i32(i8, i8, i32, i32, i1) llvm-svn: 100304	2010-04-04 03:10:48 +00:00
Mon P Wang	999c1b927b	Revert r100191 since it breaks objc in clang llvm-svn: 100199	2010-04-02 18:43:02 +00:00
Mon P Wang	a972ab8564	Reapply address space patch after fixing an issue in MemCopyOptimizer. Added support for address spaces and added a isVolatile field to memcpy, memmove, and memset, e.g., llvm.memcpy.i32(i8, i8, i32, i32) -> llvm.memcpy.p0i8.p0i8.i32(i8, i8, i32, i32, i1) llvm-svn: 100191	2010-04-02 18:04:15 +00:00
Bob Wilson	6f7fd28824	Revert Mon Ping's change 99928, since it broke all the llvm-gcc buildbots. llvm-svn: 99948	2010-03-30 22:27:04 +00:00
Mon P Wang	7460571381	Added support for address spaces and added a isVolatile field to memcpy, memmove, and memset, e.g., llvm.memcpy.i32(i8, i8, i32, i32) -> llvm.memcpy.p0i8.p0i8.i32(i8, i8, i32, i32, i1) A update of langref will occur in a subsequent checkin. llvm-svn: 99928	2010-03-30 20:55:56 +00:00
Dan Gohman	69451a0950	Avoid analyzing instructions in blocks not reachable from the entry block. They are lots of trouble, and they don't matter. This fixes PR6559. llvm-svn: 98103	2010-03-09 23:46:50 +00:00
Chris Lattner	7d2c1592f3	remove andersen's tests. llvm-svn: 97490	2010-03-01 20:23:15 +00:00
Dan Gohman	6b1e2a829d	Teach ScalarEvolution how to compute a tripcount for a loop with true or false as its exit condition. These are usually eliminated by SimplifyCFG, but the may be left around during a pass which wishes to preserve the CFG. llvm-svn: 96683	2010-02-19 18:12:07 +00:00
Dan Gohman	80386c10d4	-disable-output is no longer needed with -analyze. llvm-svn: 94574	2010-01-26 19:25:59 +00:00
Dan Gohman	51aaf02821	Fix the the ceiling-division used in computing the MaxBECount so that it doesn't have trouble with an intermediate add overflowing. Also, be more conservative about the case where the induction variable in an SLT loop exit can step past the RHS of the SLT and overflow in a single step. Make getSignedRange more aggressive, to recover for some common cases which the above fixes pessimized. This addresses rdar://7561161. llvm-svn: 94512	2010-01-26 04:40:18 +00:00
Tobias Grosser	b478d3e0fc	Fix PR6047 Nodes that had children outside of the post dominator tree (infinite loops) where removed from the post dominator tree. This seems to be wrong. Leave them in the tree. llvm-svn: 93633	2010-01-16 13:38:07 +00:00
Dan Gohman	bc694918cc	Use WriteAsOperand instead of getName() to print loop header names, so that unnamed blocks are handled. llvm-svn: 93059	2010-01-09 18:17:45 +00:00
Dan Gohman	fb4193625a	Delete useless trailing semicolons. llvm-svn: 92740	2010-01-05 17:55:26 +00:00
Chris Lattner	850a3cd905	gvn is optimizing this better now. llvm-svn: 90696	2009-12-06 04:16:05 +00:00
Dan Gohman	03f90ab0a9	Add a comment about A[i+(j+1)]. llvm-svn: 90185	2009-12-01 01:38:10 +00:00
Chris Lattner	5fe97e7aca	@test9 is a testcase for r89958. Before 89958, we misanalyzed the first expression as P+4+4i which we considered to possibly alias P+4j. Now we correctly analyze the former one as P+1+4i. @test10 is a sanity test that verfies that we know that P+4+4i != P+4*i. llvm-svn: 89960	2009-11-26 19:25:46 +00:00
Chris Lattner	1bf7ff704a	Implement PR1143 (at -m64) by making basicaa look through extensions. We previously already handled it at -m32 because there were no i32->i64 extensions for addressing. llvm-svn: 89959	2009-11-26 18:53:33 +00:00
Chris Lattner	631c5b2cb9	teach GetLinearExpression to be a bit more aggressive. llvm-svn: 89955	2009-11-26 17:00:01 +00:00
Chris Lattner	ba0014a44c	update status of this. basicaa is much improved now, only missing the one form (in this testcase). Dan, do you consider this example to be important? llvm-svn: 89953	2009-11-26 16:42:00 +00:00
Chris Lattner	29bc8a91d3	Teach basicaa that x\|c == x+c when the c bits of x are clear. This allows us to compile the example in readme.txt into: LBB1_1: ## %bb movl 4(%rdx,%rax), %ecx movl %ecx, %esi imull (%rdx,%rax), %esi imull %esi, %ecx movl %esi, 8(%rdx,%rax) imull %ecx, %esi movl %ecx, 12(%rdx,%rax) movl %esi, 16(%rdx,%rax) imull %ecx, %esi movl %esi, 20(%rdx,%rax) addq $16, %rax cmpq $4000, %rax jne LBB1_1 instead of: LBB1_1: movl (%rdx,%rax), %ecx imull 4(%rdx,%rax), %ecx movl %ecx, 8(%rdx,%rax) imull 4(%rdx,%rax), %ecx movl %ecx, 12(%rdx,%rax) imull 8(%rdx,%rax), %ecx movl %ecx, 16(%rdx,%rax) imull 12(%rdx,%rax), %ecx movl %ecx, 20(%rdx,%rax) addq $16, %rax cmpq $4000, %rax jne LBB1_1 GCC (4.2) doesn't seem to be able to eliminate the loads in this testcase either, it generates: L2: movl (%rdx), %eax imull 4(%rdx), %eax movl %eax, 8(%rdx) imull 4(%rdx), %eax movl %eax, 12(%rdx) imull 8(%rdx), %eax movl %eax, 16(%rdx) imull 12(%rdx), %eax movl %eax, 20(%rdx) addl $4, %ecx addq $16, %rdx cmpl $1002, %ecx jne L2 llvm-svn: 89952	2009-11-26 16:26:43 +00:00
Chris Lattner	12dacdd359	teach basicaa that A[i] != A[i+1]. llvm-svn: 89951	2009-11-26 16:18:10 +00:00
Chris Lattner	453751031a	rename test llvm-svn: 89950	2009-11-26 16:08:41 +00:00
Chris Lattner	7a5b56aca9	Change the other half of aliasGEP (which handles GEP differencing) to use DecomposeGEPExpression. This dramatically simplifies and shrinks the code by eliminating the horrible CheckGEPInstructions method, fixes a miscompilation (@test3 ) and makes the code more aggressive. In particular, we now handle the @test4 case, which is reduced from the SmallPtrSet constructor. Missing this caused us to emit a variable length memset instead of a fixed size one. llvm-svn: 89922	2009-11-26 02:17:34 +00:00
Chris Lattner	0d23076adf	add a new random feature test llvm-svn: 89921	2009-11-26 02:16:28 +00:00
Chris Lattner	db1e9f1290	remove a silly condition that doesn't make a lot of sense anymore. llvm-svn: 89601	2009-11-22 16:15:59 +00:00
Victor Hernandez	fcc77b1c02	Update computeArraySize() to use ComputeMultiple() to determine the array size associated with a malloc; also extend PerformHeapAllocSRoA() to check if the optimized malloc's arg had its highest bit set, so that it is safe for ComputeMultiple() to look through sext instructions while determining the optimized malloc's array size llvm-svn: 86676	2009-11-10 08:32:25 +00:00
Victor Hernandez	f3db915294	Re-commit r86077 now that r86290 fixes the 179.art and 175.vpr ARM regressions. Here is the original commit message: This commit updates malloc optimizations to operate on malloc calls that have constant int size arguments. Update CreateMalloc so that its callers specify the size to allocate: MallocInst-autoupgrade users use non-TargetData-computed allocation sizes. Optimization uses use TargetData to compute the allocation size. Now that malloc calls can have constant sizes, update isArrayMallocHelper() to use TargetData to determine the size of the malloced type and the size of malloced arrays. Extend getMallocType() to support malloc calls that have non-bitcast uses. Update OptimizeGlobalAddressOfMalloc() to optimize malloc calls that have non-bitcast uses. The bitcast use of a malloc call has to be treated specially here because the uses of the bitcast need to be replaced and the bitcast needs to be erased (just like the malloc call) for OptimizeGlobalAddressOfMalloc() to work correctly. Update PerformHeapAllocSRoA() to optimize malloc calls that have non-bitcast uses. The bitcast use of the malloc is not handled specially here because ReplaceUsesOfMallocWithGlobal replaces through the bitcast use. Update OptimizeOnceStoredGlobal() to not care about the malloc calls' bitcast use. Update all globalopt malloc tests to not rely on autoupgraded-MallocInsts, but instead use explicit malloc calls with correct allocation sizes. llvm-svn: 86311	2009-11-07 00:16:28 +00:00
Victor Hernandez	b9f5899779	Revert r86077 because it caused crashes in 179.art and 175.vpr on ARM llvm-svn: 86213	2009-11-06 01:33:24 +00:00
Victor Hernandez	492ed30a32	Update CreateMalloc so that its callers specify the size to allocate: MallocInst-autoupgrade users use non-TargetData-computed allocation sizes. Optimization uses use TargetData to compute the allocation size. Now that malloc calls can have constant sizes, update isArrayMallocHelper() to use TargetData to determine the size of the malloced type and the size of malloced arrays. Extend getMallocType() to support malloc calls that have non-bitcast uses. Update OptimizeGlobalAddressOfMalloc() to optimize malloc calls that have non-bitcast uses. The bitcast use of a malloc call has to be treated specially here because the uses of the bitcast need to be replaced and the bitcast needs to be erased (just like the malloc call) for OptimizeGlobalAddressOfMalloc() to work correctly. Update PerformHeapAllocSRoA() to optimize malloc calls that have non-bitcast uses. The bitcast use of the malloc is not handled specially here because ReplaceUsesOfMallocWithGlobal replaces through the bitcast use. Update OptimizeOnceStoredGlobal() to not care about the malloc calls' bitcast use. Update all globalopt malloc tests to not rely on autoupgraded-MallocInsts, but instead use explicit malloc calls with correct allocation sizes. llvm-svn: 86077	2009-11-05 00:03:03 +00:00

1 2 3 4 5 ...

262 Commits