hanchenye-llvm-project

Commit Graph

Author	SHA1	Message	Date
Ted Kremenek	8d6c828728	Don't crash on empty PTH files. This fixes <rdar://problem/6512714>. llvm-svn: 62673	2009-01-21 07:34:28 +00:00
Chris Lattner	c950296006	really we only need one Read24! llvm-svn: 62672	2009-01-21 07:28:57 +00:00
Chris Lattner	47def9787e	revert my previous patch, it assumed endianness. llvm-svn: 62671	2009-01-21 07:21:56 +00:00
Chris Lattner	a74f7cbb9d	minor cleanups: now that tokens are 4-byte aligned in a PTH file, just load them directly as ints. llvm-svn: 62668	2009-01-21 07:06:08 +00:00
Ted Kremenek	52f73cad4a	Fix: <rdar://problem/6510344> [pth] PTH slows down regular lexer considerably (when it has substantial work) Changes to IdentifierTable: - High-level summary: StringMap never owns IdentifierInfos. It just references them. - The string map now has StringMapEntry<IdentifierInfo*> instead of StringMapEntry<IdentifierInfo>. The IdentifierInfo object is allocated using the same bump pointer allocator as used by the StringMap. Changes to IdentifierInfo: - Added an extra pointer to point to the StringMapEntry<IdentifierInfo*> in the string map. This pointer will be null if the IdentifierInfo* is only used by the PTHLexer (that is, it isn't in the StringMap). Algorithmic changes: - Non-PTH case: IdentifierInfo::get() will always consult the StringMap first to see if we have an IdentifierInfo object. If that StringMapEntry references a null pointer, we allocate a new one from the BumpPtrAllocator and update the reference in the StringMapEntry. - PTH case: We do the same lookup as with the non-PTH case, but if we don't get a hit in the StringMap we do a secondary lookup in the PTHManager for the IdentifierInfo. If we don't find an IdentifierInfo we create a new one as in the non-PTH case. If we do find an IdentifierInfo in the PTHManager, we update the StringMapEntry to refer to it so that the IdentifierInfo will be found on the next StringMap lookup. This way we only do a binary search in the PTH file at most once for a given IdentifierInfo. This greatly speeds things up for source files containing a non-trivial amount of code. Performance impact: While these changes do add some extra indirection in IdentifierTable to access an IdentifierInfo, I saw speedups even in the non-PTH case. Non-PTH: For -fsyntax-only on Cocoa.h, we see a 6% speedup. PTH (with Cocoa.h in token cache): 11% speedup. I also did an experiment where we did -fsyntax-only on a source file including a large header and Cocoa.h, but the token cache did not contain the larger header. For this file, we were seeing a performance regression when using PTH of 3% over non-PTH. Now we are seeing a performance improvement of 9%! Tests: The serialization tests are now failing. I looked at this extensively, and my belief is that this change is unmasking a bug rather than introducing a new one. I have disabled the serialization tests for now. llvm-svn: 62636	2009-01-20 23:28:34 +00:00
Ted Kremenek	8433f0b400	PTH: Emitted tokens now consist of 12 bytes that are loaded using three 32-bit loads. This reduces user time but increases system time because of the slightly larger PTH file. Although there is no performance win on Cocoa.h and -Eonly, overall this seems like a good step. llvm-svn: 62542	2009-01-19 23:13:15 +00:00
Chris Lattner	4fd8b958be	do not use SourceManager::getFileCharacteristic(FileID), it is not safe because a #line can change the file characteristic on a per-loc basis. llvm-svn: 62502	2009-01-19 08:01:53 +00:00
Chris Lattner	c033416639	do not use SourceManager::getFileCharacteristic(FileID), it is not safe because a #line can change the file characteristic on a per-loc basis. llvm-svn: 62501	2009-01-19 07:59:15 +00:00
Chris Lattner	cbc35ecb04	Rename SourceManager::getCanonicalFileID -> getFileID. There is no longer such thing as a non-canonical FileID. llvm-svn: 62499	2009-01-19 07:46:45 +00:00
Ted Kremenek	8c3b812148	Run destructors of MacroInfo objects to free memory they allocate. This addresses <rdar://problem/6506035>. llvm-svn: 62498	2009-01-19 07:45:44 +00:00
Chris Lattner	02495d80ef	Make some enums in SourceLocation private, remove a useless assertion from ScratchBuffer. llvm-svn: 62492	2009-01-19 06:57:37 +00:00
Chris Lattner	29a2a191f2	Make SourceLocation::getFileLoc private to reduce the API exposure of SourceLocation. This requires making some cleanups to token pasting and _Pragma expansion. llvm-svn: 62490	2009-01-19 06:46:35 +00:00
Chris Lattner	fc014f80e5	fix rdar://6505352 - Bogus warning with -WUndef, a case Anders noticed. llvm-svn: 62472	2009-01-18 21:18:58 +00:00
Chris Lattner	144aacd19e	rearrange GetIdentifierInfo so that the fast path can be partially inlined into PTHLexer::Lex. This speeds up the user time of PTH -Eonly by another 2ms (4.4%) llvm-svn: 62454	2009-01-18 02:57:21 +00:00
Chris Lattner	18fc6ceb56	rename some variables, only set a token's IdentifierInfo if non-null. llvm-svn: 62450	2009-01-18 02:34:01 +00:00
Chris Lattner	9cdd877436	On i386 and x86-64, just do unaligned loads instead of assembling from bytes. This speeds up -Eonly PTH reading of cocoa.h by about 2ms, which is 4.2%. llvm-svn: 62447	2009-01-18 02:19:16 +00:00
Chris Lattner	137d6492a8	switch PTHLexer to use Read32 and friends instead of lots of inlined copies. I verified that this causes no performance change in PTH. llvm-svn: 62445	2009-01-18 02:10:31 +00:00
Chris Lattner	eb09754a9d	switch PTH lexer from using "const char"s to "const unsigned char"s internally. This is just a cleanup that reduces the need to cast to unsigned char before assembling a larger integer. llvm-svn: 62442	2009-01-18 01:57:14 +00:00
Chris Lattner	71dc14b9f0	Rename SourceLocation::getFileID to getChunkID, because it returns the chunk ID not the file ID. This exposes problems in TextDiagnosticPrinter where it should have been using the canonical file ID but wasn't. Fix these along the way. llvm-svn: 62427	2009-01-17 08:45:21 +00:00
Chris Lattner	5509d533f6	simplify some lookups. llvm-svn: 62426	2009-01-17 08:30:10 +00:00
Chris Lattner	757169b60f	Change the Lexer ctor used to lex _Pragma directives into a static factory method. This lets us clean up the interface and make it more obvious that this method is really really _Pragma specific. Note that _Pragma handling uglifies the Lexer in the critical path. It would be very interesting to consider making _Pragma remapping be a new special lexer class of its own. llvm-svn: 62425	2009-01-17 08:27:52 +00:00
Chris Lattner	ab1d4b8abd	simplify PTHManager::CreateLexer llvm-svn: 62424	2009-01-17 08:06:50 +00:00
Chris Lattner	c809089b26	Change the Lexer ctor used in the non _Pragma case to take a FileID instead of a SourceLocation. This should speed it up and definitely simplifies it. llvm-svn: 62422	2009-01-17 08:03:42 +00:00
Chris Lattner	8ddb5cf0cf	in Preprocessor::AdvanceToTokenCharacter, don't actually bother creating a whole lexer when we just want one static method. llvm-svn: 62420	2009-01-17 07:57:25 +00:00
Chris Lattner	5965a28a4b	More simplifications to the lexer ctors. llvm-svn: 62419	2009-01-17 07:56:59 +00:00
Chris Lattner	fcf6452eb4	make the verbose raw-lexer ctor fully explicit instead of having embedded magic. llvm-svn: 62417	2009-01-17 07:42:27 +00:00
Chris Lattner	08354fef13	add a simplified lexer ctor that sets up the lexer to raw-lex an entire file. llvm-svn: 62414	2009-01-17 07:35:14 +00:00
Chris Lattner	f76b92092e	refactor some common initialization code out of the two lexer ctors. llvm-svn: 62411	2009-01-17 06:55:17 +00:00
Chris Lattner	3793bba26f	suck the call to "getSpellingLoc" that all clients do into the implementation of PTHManager::getSpelling. llvm-svn: 62408	2009-01-17 06:29:33 +00:00
Chris Lattner	d32480d3db	this massive patch introduces a simple new abstraction: it makes "FileID" a concept that is now enforced by the compiler's type checker instead of yet-another-random-unsigned floating around. This is an important distinction from the "FileID" currently tracked by SourceLocation. That FileID may refer to the start of a file or to a chunk within it. The new FileID only refers to the file (and its #include stack and eventually #line data), it cannot refer to a chunk. FileID is a completely opaque datatype to all clients, only SourceManager is allowed to poke and prod it. llvm-svn: 62407	2009-01-17 06:22:33 +00:00
Chris Lattner	1abd20901b	Instead of iterating over FileID's, have PTH generation iterate over the content cache directly. Content cache has a 1-1 mapping with fileentries, whereas multiple FileIDs can be the same FileEntry. llvm-svn: 62401	2009-01-17 03:48:08 +00:00
Chris Lattner	5882771102	Fix PR2477 - clang misparses "//*" in C89 mode llvm-svn: 62368	2009-01-16 22:39:25 +00:00
Chris Lattner	5244f34e75	As a performance optimization, don't bother calling MacroInfo::isIdenticalTo if warnings in system headers are disabled. isIdenticalTo can end up calling the expensive getSpelling method, and other bad stuff and is completely unneeded if the warning will be discarded anyway. rdar://6502956 llvm-svn: 62347	2009-01-16 19:50:11 +00:00
Chris Lattner	f49775dc81	only notify callbacks if they exist. llvm-svn: 62334	2009-01-16 19:01:46 +00:00
Chris Lattner	262d4e31b9	Improve #pragma comment support by building the string argument and notifying PPCallbacks about it. llvm-svn: 62333	2009-01-16 18:59:23 +00:00
Chris Lattner	8a24e588d7	minor cleanups to StringLiteralParser: no need to pass target info into its ctor. Also, make it handle validity checking of pascal strings instead of making clients do it. llvm-svn: 62332	2009-01-16 18:51:42 +00:00
Chris Lattner	2ff698df60	Implement basic support for parsing #pragma comment, a microsoft extension documented here: http://msdn.microsoft.com/en-us/library/7f0aews7(VS.80).aspx This is according to my understanding reading the docs, I don't know if it really agrees fully with what VC++ allows. llvm-svn: 62317	2009-01-16 08:21:25 +00:00
Chris Lattner	8a42586c54	more SourceLocation lexicon change: instead of referring to the "logical" location, refer to the "instantiation" location. llvm-svn: 62316	2009-01-16 07:36:28 +00:00
Chris Lattner	7c8556e7bc	remove obsolete comment which happened to go over 80 cols. llvm-svn: 62313	2009-01-16 07:04:11 +00:00
Chris Lattner	15af77f679	remove an unneeded const_cast. llvm-svn: 62311	2009-01-16 07:02:14 +00:00
Chris Lattner	53e384f633	Change some terminology in SourceLocation: instead of referring to the "physical" location of tokens, refer to the "spelling" location. This is more concrete and useful, tokens aren't really physical objects! llvm-svn: 62309	2009-01-16 07:00:02 +00:00
Ted Kremenek	4bbb79a642	PTH: Fix termination condition in binary search. llvm-svn: 62277	2009-01-15 19:28:38 +00:00
Ted Kremenek	a705b04d7f	IdentifierInfo: - IdentifierInfo can now (optionally) have its string data not be co-located with itself. This is for use with PTH. This aspect is a little gross, as getName() and getLength() now make assumptions about a possible alternate representation of IdentifierInfo. Perhaps we should make IdentifierInfo have virtual methods? IdentifierTable: - Added class "IdentifierInfoLookup" that can be used by IdentifierTable to perform "string -> IdentifierInfo" lookups using an auxiliary data structure. This is used by PTH. - Performance tests show that IdentifierTable::get() does not slow down because of the extra check for the IdentifierInfoLookup object (the regular StringMap lookup does enough work to mitigate the impact of an extra null pointer check). - The upshot is that some IdentifierInfo objects might now be owned by the IdentifierInfoLookup object. This should be reviewed. PTH: - Modified PTHManager::GetIdentifierInfo to not insert entries in IdentifierTable's string map, and instead create IdentifierInfo objects on the fly when mapping from persistent IDs to IdentifierInfos. This saves a ton of work with string copies, hashing, and StringMap lookup and resizing. This change was motivated because when processing source files in the PTH cache we don't need to do any string -> IdentifierInfo lookups. - PTHManager now subclasses IdentifierInfoLookup, allowing clients of IdentifierTable to transparently use IdentifierInfo objects managed by the PTH file. PTHManager resolves "string -> IdentifierInfo" queries by doing a binary search over a sorted table of identifier strings in the PTH file (the exact algorithm we use can be changed as needed). These changes lead to the following performance changes when using PTH on Cocoa.h: -fsyntax-only: 10% performance improvement; -Eonly: 30% performance improvement llvm-svn: 62273	2009-01-15 18:47:46 +00:00
Ted Kremenek	bef9fc2240	PTH: Embed a persistentID side-table in the PTH file that is sorted in the lexical order of the corresponding identifier strings. This will be used for a forthcoming optimization. This slows down PTH generation time by 7%. We can revert this change if the optimization proves to not be valuable. llvm-svn: 62248	2009-01-15 01:26:25 +00:00
Ted Kremenek	e9814186ac	PTH: - Use canonical FileID when using getSpelling() caching. This addresses some cache misses we were seeing with -fsyntax-only on Cocoa.h - Added Preprocessor::getPhysicalCharacterAt() utility method for clients to grab the first character at a specified sourcelocation. This uses the PTH spelling cache. - Modified Sema::ActOnNumericConstant() to use Preprocessor::getPhysicalCharacterAt() instead of SourceManager::getCharacterData() (to get PTH hits). These changes cause -fsyntax-only to not page in any sources from Cocoa.h. We see a speedup of 27%. llvm-svn: 62193	2009-01-13 23:19:12 +00:00
Ted Kremenek	7cbdcc25d4	Fix corner cases in PTH getSpelling() binary search. llvm-svn: 62187	2009-01-13 22:16:45 +00:00
Ted Kremenek	b0b4f74b6b	PTH: Fix remaining cases where the spelling cache in the PTH file was being missed when it shouldn't. This shaves another 7% off PTH time for -Eonly on Cocoa.h llvm-svn: 62186	2009-01-13 22:05:50 +00:00
Ted Kremenek	47b8cf6deb	Enhance PTH 'getSpelling' caching: - Refactor caching logic into a helper class PTHSpellingSearch - Allow "random accesses" in the spelling cache, thus catching the remaining cases where 'getSpelling' wasn't hitting the PTH cache For -Eonly, PTH, Cocoa.h: - This reduces wall time by 3% (user time unchanged, sys time reduced) - This reduces the amount of paged source by 1112K. The remaining 1112K still being paged in is from somewhere else (investigating). llvm-svn: 62009	2009-01-09 22:05:30 +00:00
Ted Kremenek	8ae06625b5	Invert assertion condition. llvm-svn: 61961	2009-01-09 00:36:11 +00:00
Ted Kremenek	d5e6e16d0d	PTH: Hook up getSpelling() caching in PTHLexer. This results in a nice performance gain. Here's what we see for -Eonly on Cocoa.h (using PTH): - wall time decreases by 21% (26% speedup overall) - system time decreases by 35% - user time decreases by 6% These reductions are due to not paging source files just to get spellings for literals. The solution in place doesn't appear to be 100% complete yet, as we still see some of the pages for source files getting mapped in. Using -print-stats, we see that SourceManager maps in 7179K fewer bytes of source text (reduction of 75%). Will investigate why the remaining 25% are getting paged in. With these changes, here's how PTH compares to non-PTH on Cocoa.h: -Eonly: PTH takes 64% of the time as non-PTH (54% speedup) -fsyntax-only: PTH takes 89% of the time as non-PTH (11% speedup) llvm-svn: 61913	2009-01-08 04:30:32 +00:00
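
The lookup scheme described in the messages for 52f73cad4a and a705b04d7f (a hash map that only references identifier records, with a one-time fallback to a binary search over a sorted table) can be sketched roughly as follows. This is an illustration only, not clang's code: Ident, SortedCache and IdentTable are hypothetical stand-ins for IdentifierInfo, PTHManager and IdentifierTable, and std::unordered_map with std::unique_ptr stands in for StringMap and the bump pointer allocator.

```cpp
#include <algorithm>
#include <cassert>
#include <memory>
#include <string>
#include <unordered_map>
#include <vector>

// Hypothetical stand-in for an identifier record.
struct Ident {
  std::string name;
  bool fromCache;  // true if the record lives in the precomputed table
};

// Hypothetical secondary source: a sorted table queried by binary search.
class SortedCache {
public:
  explicit SortedCache(std::vector<std::string> names) {
    std::sort(names.begin(), names.end());
    for (const auto &n : names) entries_.push_back(Ident{n, true});
  }

  // Returns the matching record, or nullptr if the name is not in the table.
  Ident *find(const std::string &name) {
    ++searches;  // counts binary searches, for demonstration only
    auto it = std::lower_bound(
        entries_.begin(), entries_.end(), name,
        [](const Ident &e, const std::string &key) { return e.name < key; });
    return (it != entries_.end() && it->name == name) ? &*it : nullptr;
  }

  int searches = 0;

private:
  std::vector<Ident> entries_;  // never resized after construction
};

// The map stores pointers only; records are owned elsewhere.
class IdentTable {
public:
  explicit IdentTable(SortedCache *cache = nullptr) : cache_(cache) {}

  Ident &get(const std::string &name) {
    Ident *&slot = map_[name];  // creates a null slot on first lookup
    if (slot) return *slot;     // fast path: already resolved
    if (cache_) slot = cache_->find(name);  // secondary lookup, at most once
    if (!slot) {                // not cached: allocate a fresh record
      owned_.push_back(std::make_unique<Ident>(Ident{name, false}));
      slot = owned_.back().get();
    }
    return *slot;
  }

private:
  std::unordered_map<std::string, Ident *> map_;
  std::vector<std::unique_ptr<Ident>> owned_;
  SortedCache *cache_;
};
```

Because the slot in the map is filled in after the first miss, a repeated get() for the same name never reaches the sorted table again, which is the "at most once" property the commit message describes.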

242 Commits