Commit Graph

263131 Commits

Author SHA1 Message Date
Martell Malone 8fd2d37b0b libcxx: fix bootstrapping for mingw-w64
Differential Revision: https://reviews.llvm.org/D33388

llvm-svn: 303928
2017-05-25 22:37:15 +00:00
David Blaikie 9f8669461d Fix test to handle running on platforms which don't enable pubnames at all
Check that the pub sections contain no entries: they may be either absent
entirely or present but empty.

llvm-svn: 303927
2017-05-25 22:10:51 +00:00
Rui Ueyama 2f106b4690 When uncompressing sections, remove SHF_COMPRESSED bits. NFC.
This way the contents and the flag are always consistent, which I think is
better than removing the bit when input sections reach the Writer.

llvm-svn: 303926
2017-05-25 22:00:36 +00:00
Rafael Espindola 1c2baad6dd Revert "Simplify a variable type by using StringRef instead of CachedHashStringRef."
This reverts commit r303787.

It caused a slowdown in fast links. That is, links with no debug info
or optimizations.

llvm-svn: 303925
2017-05-25 21:53:02 +00:00
Craig Topper d4039f7283 [InstCombine] Add an InstCombine specific wrapper around isKnownToBeAPowerOfTwo to shorten code. NFC
We have wrappers for several other ValueTracking methods that take care of passing all of the analysis and assumption cache parameters. This extends it to isKnownToBeAPowerOfTwo.

llvm-svn: 303924
2017-05-25 21:51:12 +00:00
Wei Mi fd257fa7bf [GVN] Add phi-translate support in scalarpre.
Right now scalarpre doesn't have phi-translate support, so it will miss some
simple PRE opportunities. In the following testcase, for example, current
scalarpre cannot recognize that the last "a * b" is fully redundant, because
the a and b used by the last "a * b" expr are both defined by phis.

  long a[100], b[100], g1, g2, g3;
  __attribute__((pure)) long goo();

  void foo(long a, long b, long c, long d) {
    g1 = a * b;
    if (__builtin_expect(g2 > 3, 0)) {
      a = c;
      b = d;
      g2 = a * b;
    }
    g3 = a * b;      // fully redundant.
  }

The patch adds phi-translate support in scalarpre. This is only a temporary
solution before the newpre based on newgvn is available.

Differential Revision: https://reviews.llvm.org/D32252

llvm-svn: 303923
2017-05-25 21:49:02 +00:00
Andrew Kaylor f466001eef Add constrained intrinsics for some libm-equivalent operations
Differential Revision: https://reviews.llvm.org/D32319

llvm-svn: 303922
2017-05-25 21:31:00 +00:00
Matthias Braun 1527baab0c CodeGen: Rename DEBUG_TYPE to match passnames
Rename the DEBUG_TYPE to match the names of corresponding passes where
it makes sense. Also establish the pattern of simply referencing
DEBUG_TYPE instead of repeating the passname where possible.

llvm-svn: 303921
2017-05-25 21:26:32 +00:00
Zachary Turner 2897e0306e [lld] Fix a bug where we continually re-follow type servers.
Originally this was intended to be set up so that when linking
a PDB which refers to a type server, it would only visit the
PDB once, and on subsequent visitations it would just skip it
since all the records had already been added.

Due to some C++ scoping issues, this was not occurring and it
was revisiting the type server every time, which caused every
record to end up being thrown away on all subsequent visitations.
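
The intended behavior is essentially a long-lived visited set, along the lines
of this standalone sketch (illustrative names only; the real code keys off the
type server's identity inside the PDB type-merging code):

  #include <map>
  #include <string>

  // Sketch of "visit each type server once": the cache must outlive the
  // per-object-file visitation, otherwise (as in the bug) it gets recreated
  // and every type server is re-followed.
  class TypeServerCache {
    std::map<std::string, bool> Visited;

  public:
    // Returns true only the first time a given type-server PDB is seen;
    // later references skip the merge since its records were already added.
    bool shouldVisit(const std::string &PdbPath) {
      bool &Seen = Visited[PdbPath];
      if (Seen)
        return false;
      Seen = true;
      return true;
    }
  };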

This doesn't affect the performance of linking clang-cl generated
object files because we don't use type servers, but when linking
object files and libraries generated with /Zi via MSVC, this means
only 1 object file has to be linked instead of N object files, so
the speedup is quite large.

llvm-svn: 303920
2017-05-25 21:16:03 +00:00
Zachary Turner 7f97c362a4 [CodeView Type Merging] Don't keep re-allocating temp serializer.
Previously, every time we wanted to serialize a field list record, we
would create a new copy of FieldListRecordBuilder, which would in turn
create a temporary instance of TypeSerializer, which itself had a
std::vector<> that was about 128K in size. So this 128K allocation was
happening every time. We can re-use the same instance over and over; we just
have to clear its internal hash table and seen-records list between runs. This
saves us from the constant re-allocations.
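
A minimal standalone sketch of the reuse pattern (illustrative names, not the
actual FieldListRecordBuilder/TypeSerializer code):

  #include <cstddef>
  #include <cstdint>
  #include <unordered_set>
  #include <vector>

  // The expensive part of the temporary serializer is its large scratch
  // buffer; keep one instance alive and clear its logical state between
  // records instead of constructing a fresh one each time.
  class ReusableSerializer {
    std::vector<uint8_t> Buffer;              // ~128K, allocated once
    std::unordered_set<uint64_t> SeenHashes;  // stand-in for the seen-records list

  public:
    ReusableSerializer() { Buffer.reserve(128 * 1024); }

    // clear() keeps the vector's capacity, so the next run pays no allocation.
    void reset() {
      Buffer.clear();
      SeenHashes.clear();
    }

    void writeBytes(const uint8_t *Data, std::size_t Len) {
      Buffer.insert(Buffer.end(), Data, Data + Len);
    }
  };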

This is worth an ~18.5% speed increase (3.75s -> 3.05s) in my tests.

Differential Revision: https://reviews.llvm.org/D33506

llvm-svn: 303919
2017-05-25 21:15:37 +00:00
Zachary Turner 95c625ecc9 Make BinaryStreamReader::readCString a bit faster.
Previously it would do a character by character search for a null
terminator, to account for the fact that an arbitrary stream need not
store its data contiguously so you couldn't just do a memchr. However, the
stream API has a function which will return the longest contiguous chunk
without doing a copy, and by using this function we can do a memchr on the
individual chunks. For certain types of streams like data from object
files etc, this is guaranteed to find the null terminator with only a
single memchr, but even with discontiguous streams such as
MappedBlockStream, it's rare that any given string will cross a block
boundary, so even those will almost always be satisfied with a single
memchr.
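
In outline, the loop looks something like this (a standalone sketch with an
assumed chunk interface, not the actual BinaryStreamReader code):

  #include <cstddef>
  #include <cstring>
  #include <string>

  // Assumed interface: returns a pointer/length pair covering the longest run
  // of bytes stored contiguously starting at Offset.
  struct ChunkedStream {
    virtual bool longestContiguousChunk(std::size_t Offset, const char *&Data,
                                        std::size_t &Len) = 0;
    virtual ~ChunkedStream() = default;
  };

  // Scan chunk by chunk with memchr instead of byte by byte.  For contiguous
  // streams the first memchr finds the terminator; for block-based streams a
  // string rarely crosses a block boundary, so one call is still the common case.
  bool readCString(ChunkedStream &Stream, std::size_t Offset, std::string &Result) {
    for (;;) {
      const char *Data;
      std::size_t Len;
      if (!Stream.longestContiguousChunk(Offset, Data, Len) || Len == 0)
        return false;  // ran off the end without finding a null terminator
      if (const void *Null = std::memchr(Data, '\0', Len)) {
        Result.append(Data, std::size_t(static_cast<const char *>(Null) - Data));
        return true;
      }
      Result.append(Data, Len);  // terminator is in a later chunk
      Offset += Len;
    }
  }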

This optimization is worth a 10-12% reduction in link time (4.2 seconds ->
3.75 seconds).

Differential Revision: https://reviews.llvm.org/D33503

llvm-svn: 303918
2017-05-25 21:12:27 +00:00
Bob Haarman 55256ada25 [pdb] pad source file name buffer at the end instead of the beginning
Summary:
DbiStreamBuilder calculated the offset of the source file names inside
the file info substream as the size of the file info substream minus
the size of the file names. Since the file info substream is padded to
a multiple of 4 bytes, this caused the first file name to be aligned
on a 4-byte boundary. By contrast, DbiModuleList would read the file
names immediately after the file name offset table, without skipping
to the next 4-byte boundary. This change makes it so that the file
names are written to the location where DbiModuleList expects them,
and puts any necessary padding for the file info substream after the
file names instead of before it.

Reviewers: amccarth, rnk, zturner

Reviewed By: amccarth, zturner

Subscribers: llvm-commits

Differential Revision: https://reviews.llvm.org/D33475

llvm-svn: 303917
2017-05-25 21:12:15 +00:00
Zachary Turner c4e4b7e31e Fix a bug in MappedBlockStream.
It was using the number of blocks of the entire PDB file as the number
of blocks of each stream that was created. This was only an issue in
the readLongestContiguousChunk function, which had never been called before.
The bug surfaced when I updated an algorithm to use this function and
the algorithm broke.

llvm-svn: 303916
2017-05-25 21:12:00 +00:00
Sam Clegg 1c154a6107 [WebAssembly] MC: Include unnamed data when writing wasm files
Also, include global entries for all data symbols, not
just external ones, since these are referenced by the
relocation records.

Add a test case that includes unnamed data.

Differential Revision: https://reviews.llvm.org/D33079

llvm-svn: 303915
2017-05-25 21:08:07 +00:00
Zachary Turner dda25b128c [CodeView Type Merging] Avoid record deserialization when possible.
A profile shows that the majority of type-merging time is spent deserializing
records from sequences of bytes into friendly C++ structures whose members we
can easily access in order to find the type indices to re-write.

Records are prefixed with their length, however, and most records have
type indices that appear at fixed offsets in the record. For these
records, we can save some cycles by just looking at the right place in the
byte sequence and re-writing the value, then skipping the record in the
type stream. This saves us from the costly deserialization of examining
every field, including potentially null terminated strings which are the
slowest, even though it was unnecessary to begin with.

In addition, we apply another optimization. Previously, after
deserializing a record and re-writing its type indices, we would
unconditionally re-serialize it in order to compute the hash of the
re-written record. This would result in an alloc and memcpy for every
record. If no type indices were re-written, however, this was an
unnecessary allocation. In this patch, re-writing is made two-phase. The
first phase discovers the indices that need to be rewritten and their new
values. This information is passed through to the de-duplication code,
which only copies and re-writes type indices in the serialized byte
sequence if at least one type index is different.

Some records have type indices which only appear after variable-length
strings, or which have lists of type indices, or present various other
situations that make this optimization tricky. While I'm not giving up on
optimizing these cases as well, for now we can get the easy cases out of the
way and lay the groundwork for the more complicated ones later.
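
The second phase then reduces to something like the following sketch
(illustrative types; the real code operates on CodeView records and handles
serialization details such as endianness):

  #include <cstddef>
  #include <cstdint>
  #include <cstring>
  #include <vector>

  // One discovered rewrite: a 32-bit type index at a fixed byte offset in the
  // serialized record, plus whether its value actually changes.
  struct IndexRewrite {
    std::size_t Offset;
    uint32_t NewIndex;
    bool Changed;  // set by the discovery phase
  };

  // Copy and patch the record only if some index changed; otherwise the caller
  // keeps using the original bytes and skips the alloc + memcpy entirely.
  bool remapRecord(const uint8_t *Record, std::size_t Size,
                   const std::vector<IndexRewrite> &Rewrites,
                   std::vector<uint8_t> &Patched) {
    bool AnyChanged = false;
    for (const IndexRewrite &R : Rewrites)
      AnyChanged |= R.Changed;
    if (!AnyChanged)
      return false;  // serialized bytes are reused as-is

    Patched.assign(Record, Record + Size);
    for (const IndexRewrite &R : Rewrites)
      if (R.Changed)
        std::memcpy(Patched.data() + R.Offset, &R.NewIndex, sizeof(R.NewIndex));
    return true;
  }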

This patch yields another 50% speedup on top of the already large speedups
submitted over the past 2 days. In two tests I have run, I went from 9
seconds to 3 seconds, and from 16 seconds to 8 seconds.

Differential Revision: https://reviews.llvm.org/D33480

llvm-svn: 303914
2017-05-25 21:06:28 +00:00
Aaron Ballman c674a056e9 Update the getting started documentation to match the corresponding LLVM commit in r303912.
llvm-svn: 303913
2017-05-25 21:02:49 +00:00
Aaron Ballman 472278a52e Update the documentation and CMake file for Visual Studio generators.
By default, CMake uses a 32-bit toolchain, even when on a 64-bit platform targeting a 64-bit build. However, due to the size of the binaries involved, this can cause linker instabilities (such as the linker running out of memory). Guide people to the correct solution to get CMake to use the native toolchain.
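
For example, with a sufficiently recent CMake the 64-bit host toolchain can be
requested explicitly (shown as a typical invocation, not necessarily the exact
wording added to the docs):

  cmake -G "Visual Studio 15 2017 Win64" -Thost=x64 <path-to-llvm-source>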

llvm-svn: 303912
2017-05-25 21:01:30 +00:00
Kostya Serebryany f3509b6d9b [asan] relax sanbox_read_proc_self_maps_test to pass even if unshare() fails.
llvm-svn: 303911
2017-05-25 20:50:36 +00:00
Hans Wennborg ca7a3947f5 Make test/Driver/baremetal.cpp pass on Windows
llvm-svn: 303910
2017-05-25 20:39:52 +00:00
Erich Keane a37711b1e2 Earlier revert introduced an extra space, remove it.
llvm-svn: 303909
2017-05-25 20:29:17 +00:00
Reid Kleckner 581a6c5d56 Revert "[AMDGPU] add __builtin_amdgcn_s_getpc"
This reverts commit r303861; the LLVM intrinsic was reverted.

llvm-svn: 303908
2017-05-25 20:28:26 +00:00
Kamil Rytarowski 8855c2ca19 Fix bug #28898
lldb: libedit produces garbled, unusable input on Linux

Apply patch from Christos Zoulas, upstream libedit developer.
It has been tested on NetBSD/amd64.

New code supports the combination of wide libedit and disabled
LLDB_EDITLINE_USE_WCHAR, which was the popular case on Linux
systems.

llvm-svn: 303907
2017-05-25 20:12:30 +00:00
Francis Ricci 3bfbd70840 Fix typo in tls patch
llvm-svn: 303906
2017-05-25 19:55:44 +00:00
Rui Ueyama 5012858e26 Accept not only --reproduce <foo> but also --reproduce=<foo>.
llvm-svn: 303905
2017-05-25 19:49:54 +00:00
Kyle Butt 13379d7c99 PPC: Correct Size for GETtlsADDR
PPC::GETtlsADDR is lowered to a branch and a nop by the assembly
printer. Its size was incorrectly marked as 4; correct it to 8. The
incorrect size can cause incorrect branch relaxation in
PPCBranchSelector under the right conditions.

llvm-svn: 303904
2017-05-25 19:37:41 +00:00
Adrian Prantl efd2b8f824 Add a test for PR33166.
This tests optimized code where a variable is allocated on the
stack for some part of the function.

llvm-svn: 303903
2017-05-25 19:33:16 +00:00
Nico Weber b3d83a092a Revert r303859, CodeGen/AMDGPU/llvm.amdgcn.s.getpc.ll fails on bots.
llvm-svn: 303902
2017-05-25 19:19:29 +00:00
Manoj Gupta d536180fdc [AArch64]: add 'a' inline asm operand modifier.
Summary:
This is used in the Linux kernel, and effectively just means "print an
address". This brings back r193593.

Reviewed by: Renato Golin

Reviewers: t.p.northover, rengolin, richard.barton.arm, kristof.beyls

Subscribers: aemerson, javed.absar, llvm-commits, eraman

Differential Revision: https://reviews.llvm.org/D33558

llvm-svn: 303901
2017-05-25 19:07:57 +00:00
Eric Fiselier 51056aef65 Update more coroutine_handle signatures to reflect N4663.
Thanks to Casey Carter for pointing out the out-of-date tests and
implementation.

llvm-svn: 303900
2017-05-25 19:04:55 +00:00
Billy Robert O'Neal III 4deabc97a1 Add asserts that the nullptr is maintained in string erase operations.
llvm-svn: 303899
2017-05-25 19:01:14 +00:00
Jonathan Roelofs 9b22df691b Appease more buildbots about r303873
llvm-svn: 303898
2017-05-25 18:55:22 +00:00
Adrian Prantl f062192632 Fix SelectionDAGBuilder::getDbgValue to not expect DW_OP_deref on FI vars
This fixes an oversight in r300522, which changed alloca
dbg.values to no longer emit a DW_OP_deref.

The array.ll testcase was regenerated from source.

Fixes PR33166:
https://bugs.llvm.org/show_bug.cgi?id=33166

llvm-svn: 303897
2017-05-25 18:54:10 +00:00
Adrian Prantl 14bd244398 Delete an obsolete paragraph in LangRef.
llvm-svn: 303896
2017-05-25 18:54:06 +00:00
Eric Fiselier 2944c5a3fa Update coroutine_handle<P>::promise to reflect N4663.
This patch updates the promise() member to match the current spec.
Specifically, it removes the non-const overload and makes the return
type of the const overload non-const.

This patch also makes the ASSERT_NOT_NOEXCEPT tests libc++ specific,
since other implementations may be free to strengthen the specification.

llvm-svn: 303895
2017-05-25 18:52:34 +00:00
David Blaikie b3cee2fb42 DebugInfo: Produce debug_{gnu_}pub{names,types} entries when explicitly requested, even in -gmlt or when empty
Turns out gold doesn't use the DW_AT_GNU_pubnames to decide whether to
parse the rest of the DIEs when building gdb-index. This causes gold to
trip over LLVM's output when DW_FORM_ref_addr forms are present.

Gold does use the presence of a debug_gnu_pub{names,types} entry for the
CU to skip parsing the debug_info portion, so make sure that's included
even when empty (technically, when empty there couldn't be any ref_addr
anyway - it only came up when gmlt didn't produce any (even non-empty)
pubnames - but given what that reveals about gold's implementation, this
seems like a good thing to do for consistency).

llvm-svn: 303894
2017-05-25 18:50:28 +00:00
Rui Ueyama 236e781011 Use MD5::hash(). NFC.
llvm-svn: 303893
2017-05-25 18:17:43 +00:00
Vitaly Buka 40d54d408b [compiler-rt] Make print_module_map description consistent with the rest.
Reviewers: eugenis

Subscribers: llvm-commits, kubamracek

Differential Revision: https://reviews.llvm.org/D33160

llvm-svn: 303892
2017-05-25 18:07:48 +00:00
Bob Haarman ea91fafd33 [llvm-pdbdump] [yaml2pdb] always include object file name in module info
Summary:
Previously, the yaml2pdb subcommand of llvm-pdbdump only
included object file names in module info if a module info stream was
present. This change makes it so that we include the object file name
even if there is no module info stream for the module. As a result, running

  llvm-pdbdump pdb2yaml -dbi-module-info original.pdb > original.yaml &&
  llvm-pdbdump yaml2pdb -pdb=new.pdb original.yaml &&
  llvm-pdbdump pdb2yaml -dbi-module-info new.pdb > new.yaml

now produces identical original.yaml and new.yaml files.

Reviewers: amccarth, zturner

Reviewed By: zturner

Subscribers: fhahn, llvm-commits

Differential Revision: https://reviews.llvm.org/D33463

llvm-svn: 303891
2017-05-25 18:04:17 +00:00
Rui Ueyama 69ae29b1d1 Do not allow delay-importing data symbols.
If you pass /delayload:<dllname> to the COFF linker, it creates thunks
so that DLLs are loaded the first time they are used instead of at
load time.

This mechanism does not work for data symbols, as there's no way to trap
accesses to data imported from DLLs. (Technically, I think that if we did not
initially map dllimport tables in memory, we could actually trap accesses
and delay-load data symbols, but that's not what Windows does.)
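
As a rough illustration of why functions can be delay-loaded but data cannot
(a simplified model, not actual linker output): a function import goes through
a patchable slot whose first call can be intercepted, whereas a data import is
dereferenced directly, leaving nothing to hook.

  #include <cstdio>

  using FuncPtr = int (*)(int);

  // Stands in for the symbol exported from the DLL.
  static int realImportedFunc(int Arg) { return Arg * 2; }

  static int firstCallThunk(int Arg);
  static FuncPtr ImportSlot = &firstCallThunk;  // analogue of the __imp_ slot

  // The thunk "loads the DLL" on first use, patches the slot, and forwards
  // the call.
  static int firstCallThunk(int Arg) {
    std::puts("first call: load DLL, resolve symbol, patch slot");
    ImportSlot = &realImportedFunc;
    return ImportSlot(Arg);
  }

  int main() {
    std::printf("%d\n", ImportSlot(21));  // triggers the "load"
    std::printf("%d\n", ImportSlot(21));  // goes straight to the real function
  }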

This patch reports an error when you try to delay-load data symbols.

Fixes https://bugs.llvm.org/show_bug.cgi?id=33106

Differential Revision: https://reviews.llvm.org/D33557

llvm-svn: 303890
2017-05-25 18:03:34 +00:00
Casey Carter b4f3924dd5 [test] Remove workaround for C1XX conversion-to-nullptr bug
VSO#391542 "Types can't be convertible to nullptr_t"

Also put internal bug numbers on the workarounds in test_workarounds.h for correlation.

Differential Revision: https://reviews.llvm.org/D33290

llvm-svn: 303889
2017-05-25 17:42:21 +00:00
Casey Carter 895db0a5f6 [test] Workaround C1XX bug in uses_allocator_types.hpp
VSO#109062 "Explicit template argument specification with empty template parameter pack expansion does not imply further empty pack expansion"

Differential Revision: https://reviews.llvm.org/D33214

llvm-svn: 303888
2017-05-25 17:42:17 +00:00
Francis Ricci 86e070f7e9 Implement tls scanning for darwin LSan
Summary:
This is required for any users who call exit() after creating
thread-specific data, as tls destructors are only called when
pthread_exit() or pthread_cancel() is used. This should also
match tls behavior on Linux.

Getting the base address of the tls section is straightforward,
as it's stored as a section offset in %gs. The size is a bit trickier
to work out, as there doesn't appear to be any official documentation
or source code referring to it. The size used in this patch was determined
by taking the difference between the base address and the address of the
subsequent memory region returned by vm_region_recurse_64, which was
1024 * sizeof(uptr) on all threads except the main thread, where it was
larger. Since the section must be the same size on all of the threads,
1024 * sizeof(uptr) seemed to be a reasonable size to use, barring
a more programmatic way to get the size.

1024 seems like a reasonable number, given that PTHREAD_KEYS_MAX
is 512 on darwin, so pthread keys will fit inside the region while
leaving space for other tls data. A larger size would overflow the
memory region returned by vm_region_recurse_64, and a smaller size
wouldn't leave room for all the pthread keys. In addition, the
stress test added here passes, which means that we are scanning at
least the full set of possible pthread keys, and probably
the full tls section.

Reviewers: alekseyshl, kubamracek

Subscribers: krytarowski, llvm-commits

Differential Revision: https://reviews.llvm.org/D33215

llvm-svn: 303887
2017-05-25 17:41:13 +00:00
Francis Ricci 75ca300f2b Don't require ThreadState to be contained within tls on all platforms
The existing implementation ran CHECKs to assert that the thread state
was stored inside the tls. However, the mac implementation of tsan doesn't
store the thread state in tls, so these checks fail once darwin tls support
is added to the sanitizers. Only run these checks on platforms where
the thread state is expected to be contained in the tls.

llvm-svn: 303886
2017-05-25 17:41:10 +00:00
Adam Nemet 14205b4a76 Disable two more flaky ASan wait* tests temporarily on Darwin
llvm-svn: 303885
2017-05-25 17:24:54 +00:00
Eugene Zelenko eed5f028ee [Documentation] Mention hicpp check group in Clang-tidy main document.
llvm-svn: 303884
2017-05-25 17:22:29 +00:00
Kostya Kortchinsky 5d0ecbc8d9 [sanitizer] Revert rL303879 as it breaks Windows
Summary:
Apparently Windows's `UnmapOrDie` doesn't support partial unmapping, which
makes the new region allocation technique not Windows compliant.

Reviewers: alekseyshl, dvyukov

Reviewed By: alekseyshl

Subscribers: llvm-commits, kubamracek

Differential Revision: https://reviews.llvm.org/D33554

llvm-svn: 303883
2017-05-25 16:54:44 +00:00
Erich Keane d46083cc3b Revert MSVC CXXOperatorNames patch due to issues with Chromium
llvm-svn: 303882
2017-05-25 16:24:49 +00:00
Erich Keane ea0f630055 Revert 303872/303877 since the patch that caused these issues
is also being reverted.

llvm-svn: 303881
2017-05-25 16:23:00 +00:00
Jonathan Roelofs 8d765ef92d Relax testcase to appease buildbots
When lld isn't built, the tests as previously written were too picky about
the path to the linker.

llvm-svn: 303880
2017-05-25 16:20:51 +00:00
Kostya Kortchinsky 0dd40cf28d [sanitizer] Change the 32-bit Primary AllocateRegion to reduce fragmentation
Summary:
Currently, AllocateRegion has a tendency to fragment memory: it allocates
`2*kRegionSize`, and if the memory is aligned, will unmap `kRegionSize` bytes,
thus creating a hole, which can't itself be reused for another region. This
is exacerbated by the fact that if 2 regions get allocated one after another
without any `mmap` in between, the second will be aligned due to mappings 
generally being contiguous.

An idea, suggested by @alekseyshl, to prevent such a behavior is to have a
stash of regions: if the `2*kRegionSize` allocation is properly aligned, split
it in two, and stash the second part to be returned next time a region is
requested.
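
The idea boils down to something like the following POSIX sketch (a
single-entry stash for simplicity; the actual allocator code differs and must
also handle thread safety and mmap failure):

  #include <cstdint>
  #include <sys/mman.h>

  static const uintptr_t kRegionSize = 1 << 20;  // illustrative size
  static uintptr_t StashedRegion = 0;            // at most one spare region

  static uintptr_t MapBytes(uintptr_t Size) {
    // Error handling (MAP_FAILED) omitted for brevity.
    return reinterpret_cast<uintptr_t>(mmap(nullptr, Size, PROT_READ | PROT_WRITE,
                                            MAP_PRIVATE | MAP_ANONYMOUS, -1, 0));
  }

  uintptr_t AllocateRegion() {
    if (StashedRegion) {  // serve the half split off by a previous call
      uintptr_t Res = StashedRegion;
      StashedRegion = 0;
      return Res;
    }
    uintptr_t Map = MapBytes(2 * kRegionSize);
    if ((Map & (kRegionSize - 1)) == 0) {
      // Already aligned: keep the first half and stash the second instead of
      // unmapping it and leaving an unusable hole.
      StashedRegion = Map + kRegionSize;
      return Map;
    }
    // Unaligned: trim to an aligned kRegionSize region, as before.
    uintptr_t Res = (Map + kRegionSize - 1) & ~(kRegionSize - 1);
    munmap(reinterpret_cast<void *>(Map), Res - Map);
    munmap(reinterpret_cast<void *>(Res + kRegionSize), Map + kRegionSize - Res);
    return Res;
  }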

At this point, I thought about a couple of ways to implement this:
 - either an `IntrusiveList` of region candidates, storing `next` at the
   beginning of the region;
 - a small array of region candidates existing in the Primary.

While the second option is more constrained in terms of size, it offers several
advantages:
 - security wise, with the intrusive list a pointer stored in a region
   candidate could be overflowed into and abused when popping an element; the
   array keeps no pointers inside the regions;
 - we do not dirty the first page of the region by storing something in it;
 - unless several threads request regions simultaneously from different size
   classes, the stash rarely goes above 1 entry.

I am not certain about the Windows impact of this change, as `sanitizer_win.cc`
has its own version of MmapAlignedOrDie; maybe someone could chime in on this.

MmapAlignedOrDie is effectively unused after this change and could be removed
at a later point. I didn't notice any sizeable performance gain, even though we
are saving a few `mmap`/`munmap` syscalls.

Reviewers: alekseyshl, kcc, dvyukov

Reviewed By: alekseyshl

Subscribers: llvm-commits, kubamracek

Differential Revision: https://reviews.llvm.org/D33454

llvm-svn: 303879
2017-05-25 16:19:57 +00:00