hanchenye-llvm-project

Commit Graph

Author	SHA1	Message	Date
Chris Lattner	950fc785ae	whoops, don't accidentally lose variable names llvm-svn: 8955	2003-10-07 22:58:41 +00:00
Chris Lattner	75b4d1deec	Fix bug: InstCombine/cast.ll:test11 / PR#7 llvm-svn: 8954	2003-10-07 22:54:13 +00:00
Chris Lattner	aec3d948cf	Refactor code a bit llvm-svn: 8952	2003-10-07 22:32:43 +00:00
Chris Lattner	815ae2bfe3	Add a sanity check for constant expression casts llvm-svn: 8951	2003-10-07 22:19:19 +00:00
Chris Lattner	f8492537eb	Fix bugzilla bug #5 llvm-svn: 8930	2003-10-07 19:33:31 +00:00
Chris Lattner	ed922162e1	Bill contributed this major rewrite of the -lowerswitch pass to make it generate logarithmic conditional branch sequences instead of linear sequences. Thanks Bill! llvm-svn: 8928	2003-10-07 18:46:23 +00:00
Chris Lattner	800aaaf207	Fix bug in previous checkin llvm-svn: 8922	2003-10-07 15:17:02 +00:00
Chris Lattner	a220b04ce4	Actually _PASS IN_ NO_RESERVE if we have it. Thanks to Brian for fixing this obvious oops llvm-svn: 8899	2003-10-06 19:07:41 +00:00
Misha Brukman	c27da9dce0	ToolRunner.h has been moved from include/Support to include/llvm/Support. llvm-svn: 8898	2003-10-06 18:37:24 +00:00
Chris Lattner	794a7490f8	Users can never be null llvm-svn: 8895	2003-10-06 17:33:39 +00:00
Chris Lattner	e8ed4ef039	Minor speedups for the instcombine pass llvm-svn: 8894	2003-10-06 17:11:01 +00:00
Chris Lattner	6dc0ae2d18	Speed up the predicate used to decide when to inline by caching the size of callees between executions. On eon, in release mode, this changes the inliner from taking 11.5712s to taking 2.2066s. In debug mode, it went from taking 14.4148s to taking 7.0745s. In release mode, this is a 24.7% speedup of gccas, in debug mode, it's a total speedup of 11.7%. This also makes it slightly more aggressive. This could be because we are not judging the size of the functions quite as accurately as before. When we start looking at the performance of the generated code, this can be investigated further. llvm-svn: 8893	2003-10-06 15:52:43 +00:00
Brian Gaeke	4547ab19b0	Add # of printed instructions statistic to both the SPARC and X86 LLC backends. llvm-svn: 8892	2003-10-06 15:41:21 +00:00
Chris Lattner	6aa34b0d0b	Avoid doing pointless work. Amazingly, this makes us go faster. Running the inliner on 252.eon used to take 48.4763s, now it takes 14.4148s. In release mode, it went from taking 25.8741s to taking 11.5712s. This also fixes a FIXME. llvm-svn: 8890	2003-10-06 15:23:43 +00:00
Chris Lattner	8bfda656d9	Implement the NamedRegionTimer class llvm-svn: 8889	2003-10-06 15:02:31 +00:00
Brian Gaeke	82f64531f6	All of our supported operating systems (so far) and FreeBSD technically want you to include <sys/stat.h> for fstat(), struct stat, and friends. llvm-svn: 8887	2003-10-06 03:30:28 +00:00
Chris Lattner	c30f22f57c	This changes the PromoteMemToReg function to create "pruned" SSA form, not "minimal" SSA form (in other words, it doesn't insert dead PHIs). This speeds up the mem2reg pass very significantly because it doesn't have to do a lot of frivolous work in many common cases. In the 252.eon function I have been playing with, this doesn't even insert the 120 PHI nodes that it used to which were trivially dead (in the process of promoting 356 alloca instructions overall). This speeds up the mem2reg pass from 1.2459s to 0.1284s. More significantly, the DCE pass used to take 2.4138s to remove the 120 dead PHI nodes that mem2reg constructed, now it takes 0.0134s (which is the time to scan the function and decide that there is nothing dead). So overall, on this one function, we speed things up a total of 3.5179s, which is a 24.8x speedup! :) This change is tested by the Mem2Reg/2003-10-05-DeadPHIInsertion.ll test, which now passes. llvm-svn: 8884	2003-10-05 22:19:20 +00:00
Chris Lattner	a906bacfdd	Change the interface to PromoteMemToReg to also take a DominatorTree llvm-svn: 8883	2003-10-05 21:20:13 +00:00
Chris Lattner	8047152977	Speed up the mem2reg transform for allocas which are only read/written in a single basic block. This is amazingly common in code generated by the C/C++ front-ends. This change makes it not have to insert ANY phi nodes, whereas before it would insert a ton of dead ones which DCE would have to clean up. Thus, this fix improves compile-time performance of these trivial allocas in two ways: 1. It doesn't have to do the walking and book-keeping for renaming 2. It does not insert dead phi nodes for them which would have to subsequently be cleaned up. On my favorite testcase from 252.eon, this special case handles 305 out of 356 promoted allocas in the function. It speeds up the mem2reg pass from 7.5256s to 1.2505s. It inserts 677 fewer dead PHI nodes, which speeds up a subsequent -dce pass from 18.7524s to 2.4806s. There are still 120 trivially dead PHI nodes being inserted for variables used in multiple basic blocks, but they are not handled by this patch. llvm-svn: 8881	2003-10-05 20:54:03 +00:00
Chris Lattner	5f0c08e9cf	Move support/lib/Support back to lib/Support llvm-svn: 8874	2003-10-05 19:24:38 +00:00
Chris Lattner	fb25374170	Add support for the Invoke instruction by using the LowerInvoke pass llvm-svn: 8872	2003-10-05 19:16:09 +00:00
Chris Lattner	37225efa97	Instead of hacking in custom support for Invoke/Unwind, use the LowerInvoke pass llvm-svn: 8871	2003-10-05 19:15:47 +00:00
Chris Lattner	a43b8f4b2f	Initial checkin of the LLVM->LLVM transform to support code generators which do not support stack unwinding yet llvm-svn: 8869	2003-10-05 19:14:42 +00:00
Chris Lattner	5ed281d7d7	simplify-cfg is really a function pass llvm-svn: 8868	2003-10-05 19:14:16 +00:00
Chris Lattner	df9779c5c2	Be more careful handling PHI nodes, which might be of potentially high degree. This reduces the time to verify a function from eon with a large number of large PHI nodes from 22996s (6.38 hours) to 10.5499s llvm-svn: 8866	2003-10-05 17:44:18 +00:00
Chris Lattner	a5721d3d03	The first PHI node may be null, scan for the first non-null one llvm-svn: 8865	2003-10-05 05:34:39 +00:00
Chris Lattner	203bc011e5	The VersionNumbers vector is only used during PHI placement. Turn it into an argument, allowing us to get rid of the vector. llvm-svn: 8864	2003-10-05 04:33:22 +00:00
Chris Lattner	7d9692df22	* Update file header comment *** Revamp the code which handled unreachable code in the function. Now the code is much more efficient for high-degree basic blocks, such as those that occur in the 252.eon SPEC benchmark. For the interested, the time to promote a SINGLE alloca in _ZN7mrScene4ReadERSi function used to be > 3.5s. Now it is < .075s. The function has a LOT of allocas in it, so it appeared to be infinite looping, this should make it much nicer. :) llvm-svn: 8863	2003-10-05 04:26:39 +00:00
Chris Lattner	db1f81bcb5	Simplify the loop a bit llvm-svn: 8862	2003-10-05 03:45:44 +00:00
Chris Lattner	2093012a03	There is no need for separate WriteSets and PhiNodeBlocks lists. It is just a work-list of value definitions. This allows elimination of the explicit 'iterative' step of the algorithm, and also reuses temporary memory better. llvm-svn: 8861	2003-10-05 03:39:10 +00:00
Chris Lattner	9408b1e8b9	The PhiNodes 2D vector is only used during PHI node placement. It doesn't need to be an instance variable! llvm-svn: 8860	2003-10-05 03:26:25 +00:00
Chris Lattner	417bc173ee	* Document instance vars better * Fuse two parallel loops * Use a more specific type for AllocaLookup llvm-svn: 8859	2003-10-05 03:16:07 +00:00
Chris Lattner	c360f135be	Two small cleanups/speedups: * Do not insert a new entry into NewPhiNodes during the rename pass if there are no PHIs in a block. * Do not compute WriteSets in parallel llvm-svn: 8858	2003-10-05 02:37:36 +00:00
Chris Lattner	c3d57b5923	* Minor cleanups * Eliminate the KillList instance variable, instead, just delete loads and stores as they are "renamed", and delete allocas when they are done * Make the 'visited' set an instance variable to avoid passing it on the stack. llvm-svn: 8857	2003-10-05 01:52:53 +00:00
Chris Lattner	6e7ae58260	A couple of minor code cleanups. Print literal doubles using ftostr instead of <<, because it yields higher precision numbers. llvm-svn: 8855	2003-10-05 00:40:51 +00:00
Chris Lattner	b50d135710	Type tables are now AbstractTypeUsers. This allows them to merge together constants as necessary due to type resolution. With this change, the following spec benchmarks now link: 176.gcc, 177.mesa, 252.eon, 253.perlbmk, & 300.twolf. IOW, all SPEC INT and FP benchmarks now link. llvm-svn: 8853	2003-10-05 00:17:43 +00:00
Chris Lattner	cc593fb6c6	Rename AbstractModuleProvider -> ModuleProvider, to match the header file name, and because, while the class used by the interface is abstract, the actual concept is not. llvm-svn: 8850	2003-10-04 20:14:59 +00:00
Chris Lattner	069d6493c9	Transform two methods to return pointers directly instead of returning them as 'by reference' arguments. llvm-svn: 8849	2003-10-04 20:00:03 +00:00
Chris Lattner	4318c4b549	Use V for values, not D. llvm-svn: 8848	2003-10-04 19:29:21 +00:00
Chris Lattner	a0d15e4d9e	Do not leak the ModuleProvider if releaseModule() throws. llvm-svn: 8847	2003-10-04 19:19:37 +00:00
Chris Lattner	3e88ef9f93	Minor random speedup: make Constant::getNullValue only call ConstantFOO::get once! llvm-svn: 8845	2003-10-03 19:34:51 +00:00
Chris Lattner	2c6cecf321	Explicit copy ctors are no longer needed llvm-svn: 8844	2003-10-03 19:02:41 +00:00
Chris Lattner	7f4cfda3ba	Minor cleanups llvm-svn: 8843	2003-10-03 18:57:54 +00:00
Chris Lattner	0f21ab75fa	This checkin basically amounts to a complete rewrite of the type-resolution machinery. This dramatically simplifies how things works, removes irritating little corner cases, and overall improves speed and reliability. Highlights of this change are: 1. The exponential algorithm built into the code is now gone. For example the time to disassemble one bytecode file from the mesa benchmark went from taking 12.5s to taking 0.16s. 2. The linker bugs should be dramatically reduced. The one remaining bug has to do with constant handling, which I actually introduced in "union-find" checkins. 3. The code is much easier to follow, as a result of fewer special cases. It's probably also smaller. yaay. llvm-svn: 8842	2003-10-03 18:46:24 +00:00
Chris Lattner	29dc65a324	These methods are dead, remove them. Because the bodies will soon be ressurected, just ifdef them out for now. llvm-svn: 8840	2003-10-03 18:39:57 +00:00
Chris Lattner	a72b78967f	Dramatically simplify DerivedType::refineAbstractTypeToInternal This makes use of the new PATypeHolder's to keep types from being deleted prematurely, instead of the wierd "self reference" garbage. This is easier to understand and more efficient as well. llvm-svn: 8834	2003-10-03 04:48:21 +00:00
Chris Lattner	016418b8c2	Make the PATypeHolder use a simple union-find implementation to handle merging of types. This makes it MUCH more efficient than before, also making things simpler. llvm-svn: 8833	2003-10-02 23:35:57 +00:00
Chris Lattner	c7caffab32	There is no need for BytecodeParser to be an AbstractTypeUser. Instead, it can just use PATypeHolders llvm-svn: 8832	2003-10-02 20:26:18 +00:00
Chris Lattner	22e4ca8863	There is no reason for Value to be an AbstractTypeUser. This just makes things significantly more complete. Instead, just make DerivedType's AbstractTypeUser's, and make Value contain a PATypeHolder. llvm-svn: 8828	2003-10-02 19:44:40 +00:00
Chris Lattner	5c02edbf5c	The objects mapped are really PATypeHolders, not PATypeHandles llvm-svn: 8822	2003-10-02 19:00:34 +00:00

1 2 3 4 5 ...

4350 Commits