hanchenye-llvm-project

Commit Graph

Author	SHA1	Message	Date
Matt Arsenault	1d555c4e91	R600: Remove AMDILISelLowering llvm-svn: 211519	2014-06-23 18:00:55 +00:00
Matt Arsenault	d5f91fd883	R600: Select is not expensive. llvm-svn: 211518	2014-06-23 18:00:52 +00:00
Matt Arsenault	c4d3d3a16e	R600: Move add/sub with overflow out of AMDILISelLowering Add more tests for these. llvm-svn: 211517	2014-06-23 18:00:49 +00:00
Matt Arsenault	e54e1c3a21	R600: Move more out of AMDILISelLowering llvm-svn: 211516	2014-06-23 18:00:44 +00:00
Matt Arsenault	72573adbf2	R600: Don't set fp_round_inreg action. There's no point in setting this since it seems to only be created in 1 place for ppcf128 llvm-svn: 211515	2014-06-23 18:00:41 +00:00
Matt Arsenault	b8b5153935	R600/SI: Handle i64 sub. We can handle it the same way as add llvm-svn: 211514	2014-06-23 18:00:38 +00:00
Matt Arsenault	9fa3f93173	R600/SI: Move selection of i64 add to separate function. Also don't use a SmallVector for fixed size array. llvm-svn: 211513	2014-06-23 18:00:34 +00:00
Matt Arsenault	c791f39912	R600: Rename AMDIL file llvm-svn: 211512	2014-06-23 18:00:31 +00:00
Matt Arsenault	f4d871b113	Fix missing words in sentence llvm-svn: 211511	2014-06-23 18:00:26 +00:00
Matt Arsenault	762ef017db	Use helper function llvm-svn: 211510	2014-06-23 18:00:24 +00:00
Matt Arsenault	236d9afd18	Alphabetize forward declarations llvm-svn: 211509	2014-06-23 18:00:20 +00:00
Duncan P. N. Exon Smith	0067ff4678	Support: Extract ScaledNumbers::compare() llvm-svn: 211507	2014-06-23 17:47:40 +00:00
Rafael Espindola	886048276f	Allow using .cfi_startproc without a leading symbol. This is possible now that we don't produce .eh symbols. This fixes pr19430. llvm-svn: 211502	2014-06-23 15:34:32 +00:00
Rafael Espindola	440bb21b5a	Stop producing func.eh symbols on Darwin. According to Nick Kledzik (http://llvm.org/bugs/show_bug.cgi?id=19430#c2): "... mach-o no longer needs names in the __eh_frame section (and has not for years)." Iain Sandoe confirms it is also unnecessary for their old darwin support. llvm-svn: 211500	2014-06-23 15:13:23 +00:00
Rafael Espindola	73f364ef5f	Remove a temporary hack. Amusingly this survived a lot longer than the CFI transition. We don't even support non-cfi assemblers any more. llvm-svn: 211498	2014-06-23 14:22:55 +00:00
Ulrich Weigand	8ca988f31a	[PowerPC] Refactor getMinCallFrameSize / getMinCallArgumentsSize As of r211495, the only remaining users of getMinCallFrameSize are in core ABI code (LowerFormalParameter / LowerCall). This is actually a good thing, since the details of the parameter save area are ABI specific. With the new ELFv2 ABI in particular, the rules defining the size of the save area will become significantly more complex, so it wouldn't make sense to implement those outside ABI code that has all required information. In preparation, this patch eliminates the getMinCallFrameSize (and associated getMinCallArgumentsSize) routines, and inlines them into all callers. Note that since nearly all call arguments are constant, this allows simplifying the inlined copies to a single line everywhere. No change in generated code expected. llvm-svn: 211497	2014-06-23 14:15:53 +00:00
Ulrich Weigand	f316e1db75	[PowerPC] Allow stack frames without parameter save area The PPCFrameLowering::determineFrameLayout routine currently ensures that every function that allocates a stack frame provides space for the parameter save area (via PPCFrameLowering::getMinCallFrameSize). This is actually not necessary. There may be functions that never call another routine but still allocate a frame; those do not require the parameter save area. In the future, with the ELFv2 ABI, even some routines that do call other functions do not need to allocate the parameter save area. While it is not a bug to allocate the parameter area when it is not needed, it is better to avoid it to save stack space. Note that when any particular function call requires the parameter save area, this space will already have been included by ABI code in the size the CALLSEQ_START insn is annotated with, and therefore included in the size returned by MFI->getMaxCallFrameSize(). This means that determineFrameLayout simply does not need to care about the parameter save area. (It still needs to ensure that every frame provides the linkage area.) This is implemented by this patch. Note that this exposed a bug in the new fast-isel code where the parameter area was not included in the CALLSEQ_START size; this is also fixed. A couple of test cases needed to be adapted for the new (smaller) stack frame size those tests now see. llvm-svn: 211495	2014-06-23 13:47:52 +00:00
Ulrich Weigand	c6fcb7a5de	[PowerPC] Fix IsDarwin arg in PPCFrameLowering:: calls As remarked in the commit message to r211493, in several places throughout the 64-bit SVR4 ABI code there are calls to PPCFrameLowering::getLinkageSize and getMinCallFrameSize using an incorrect IsDarwin argument of "true". (Some of those were made explicit by the above refactoring patch, others have been there all along.) This patch fixes those places to pass "false" for IsDarwin. No change in generated code expected. llvm-svn: 211494	2014-06-23 13:21:43 +00:00
Ulrich Weigand	2bffb95915	[PowerPC] Refactor setMinReservedArea and CalculateParameterAndLinkageAreaSize The PPCISelLowering.cpp routines PPCTargetLowering::setMinReservedArea and CalculateParameterAndLinkageAreaSize are currently used as subroutines from both 64-bit SVR4 and Darwin ABI code. However, the two ABIs are already quite different w.r.t. AltiVec conventions, and they will become more different when the ELFv2 ABI is supported. Also, in general it seems better to disentangle ABI support routines for different ABIs to avoid accidentally affecting one ABI when intending to change only the other. (Actually, the current code strictly speaking already contains a bug: these routines call PPCFrameLowering::getMinCallFrameSize and PPCFrameLowering::getLinkageSize with the IsDarwin parameter set to "true" even on 64-bit SVR4. This bug currently has no adverse effect since those routines always return the same for 64-bit SVR4 and 64-bit Darwin, but it still seems wrong ... I'll fix this in a follow-up commit shortly.) To remove this code sharing, I'm simply inlining both routines into all call sites (there are just two each, one for 64-bit SVR4 and one for Darwin), and simplifying due to constant parameters where possible. A small piece of code that does make sense to share is refactored into the new routine EnsureStackAlignment, now also called from 32-bit SVR4 ABI code. No change in generated code is expected. llvm-svn: 211493	2014-06-23 13:08:27 +00:00
Ulrich Weigand	9ba552db89	[PowerPC] Fix on-stack AltiVec arguments with 64-bit SVR4 Current 64-bit SVR4 code seems to have some remnants of Darwin code in AltiVec argument handling. This had the effect that AltiVec arguments (or subsequent arguments) were not correctly placed in the parameter area in some cases. The correct behaviour with the 64-bit SVR4 ABI is: - All AltiVec arguments take up space in the parameter area, just like any other arguments, whether vararg or not. - They are always 16-byte aligned, skipping a parameter area doubleword (and the associated GPR, if any), if necessary. This patch implements the correct behaviour and adds a test case. (Verified against GCC behaviour via the ABI compat test suite.) llvm-svn: 211492	2014-06-23 12:36:34 +00:00
Tim Northover	2099862a50	ARM: mark UBFX as not allowing PC. Strictly, it's unpredictable. But we don't quite model that yet and an error is better than ignoring the issue. This one somehow got left out before though. rdar://problem/15997748 llvm-svn: 211490	2014-06-23 09:20:02 +00:00
David Majnemer	8114c1ae17	MC: Cleanup parseMSInlineAsm Utilize range based for-loops to simplify some code. Use insert() instead of a loop for simplicity/efficiency. No functionality change. llvm-svn: 211486	2014-06-23 02:17:16 +00:00
Saleem Abdulrasool	bdbc0088da	MC: adjust text section flags for WoA Correct the section flags for code built for Windows on ARM with `-ffunction-sections`. Windows on ARM uses solely Thumb-2 instructions, and indicates that the function is thumb by placing it in a text section that has IMAGE_SCN_MEM_16BIT flag set. When we encounter a .section directive, a new section is constructed. This may be a text segment. In order to identify that we need the additional flag, expose the target triple through the ObjectFileInfo as this information is lost otherwise. Since any modern ARM targeting environment on Windows would be Thumb-2 (Windows ARM NT or Windows Embedded Compact), introducing a new flag to indicate the section attribute seems to be a bit overkill. Simply depend on the target triple. Since there is one location that this information is currently needed, creating a target specific assembly parser and delegating the parsing of section switches also feels a bit heavy handed. If it turns out that this information ends up changing additional behaviour, then it may be worth considering that alternative. llvm-svn: 211481	2014-06-22 22:25:01 +00:00
NAKAMURA Takumi	d77cefe633	Revert r211399, "Generate native unwind info on Win64" It broke Legacy JIT Tests on x86_64-{mingw32|msvc}, aka Windows x64. llvm-svn: 211480	2014-06-22 22:00:56 +00:00
Jan Vesely	343cd6f056	R600: Use LowerSDIVREM for i64 node replace v2: move div/rem node replacement to R600ISelLowering make lowerSDIVREM protected Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 211478	2014-06-22 21:43:01 +00:00
Jan Vesely	109efdff6a	R600: Implement custom SDIVREM. Instead of separate SDIV/SREM. SDIV used UDIV which in turn used UDIVREM anyway. SREM used SDIV(UDIV->UDIVREM)+MUL+SUB, using UDIVREM directly is more efficient. v2: Don't use all caps names Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 211477	2014-06-22 21:43:00 +00:00
Filipe Cabecinhas	1af2dfd274	Fix PR20087 by using the source index when changing the vector load llvm-svn: 211472	2014-06-22 17:21:37 +00:00
Arnold Schwaighofer	c11107cb1e	LoopVectorizer: Fix a dominance issue The induction variable's start value needs to be defined before we branch (overflow check) to the scalar preheader where we used it. llvm-svn: 211460	2014-06-22 03:38:59 +00:00
Stepan Dyatkovskiy	38afcd70f8	MergeFunctions Pass, removed DenseMap helpers. Patch removes rest part of code related to old implementation. This patch belongs to patch series that improves MergeFunctions performance time from O(N*N) to O(N*log(N)). This one was the final patch. llvm-svn: 211457	2014-06-22 01:53:30 +00:00
Stepan Dyatkovskiy	471eab30b9	MergeFunctions Pass, updated header comments. Added short description for new comparison algorithm, that introduces total ordering among functions set. This patch belongs to patch series that improves MergeFunctions performance time from O(N*N) to O(N*log(N)). llvm-svn: 211456	2014-06-22 00:57:09 +00:00
Weiming Zhao	58eb5ab326	Report error for non-zero data in .bss User may initialize a var with non-zero value and specify .bss section. E.g. : int a __attribute__((section(".bss"))) = 2; This patch converts an assertion to error report for better user experience. Differential Revision: http://reviews.llvm.org/D4199 llvm-svn: 211455	2014-06-22 00:33:44 +00:00
Stepan Dyatkovskiy	f4af855930	MergeFunctions Pass, FnSet has been replaced with FnTree. Patch activates new implementation. So from now, merging process should take time O(N*log(N)). Where N size of module (we are free to measure it in functions or in instructions). Internally FnTree represents binary tree. So every lookup operation takes O(log(N)) time. It is still not the last patch in series, we also have to clean-up pass from old code, and update pass comments. This patch belongs to patch series that improves MergeFunctions performance time from O(N*N) to O(N*log(N)). llvm-svn: 211445	2014-06-21 20:54:36 +00:00
Stepan Dyatkovskiy	71038cadd4	MergeFunctions Pass, removed unused methods from old implementation. Patch removed next old FunctionComparator methods: * enumerate * isEquivalentOperation * isEquivalentGEP * isEquivalentType This patch belongs to patch series that improves MergeFunctions performance time from O(N*N) to O(N*log(N)). llvm-svn: 211444	2014-06-21 20:13:24 +00:00
Stepan Dyatkovskiy	0b58801b69	MergeFunctions, doSanityCheck: fixed body comments. llvm-svn: 211443	2014-06-21 19:07:51 +00:00
Stepan Dyatkovskiy	a77f3d8587	MergeFunctions Pass, introduced sanity check, that checks order relation, introduced among functions set. This patch belongs to patch series that improves MergeFunctions performance time from O(N*N) to O(N*log(N)). llvm-svn: 211442	2014-06-21 18:58:11 +00:00
Stepan Dyatkovskiy	17ee5ac20d	MergeFunctions Pass, introduced total ordering among top-level comparison methods. Patch changes return type of FunctionComparator::compare() and FunctionComparator::compare(const BasicBlock, const BasicBlock) methods from bool (equal or not) to {-1, 0, 1} (less, equal, greater). This patch belongs to patch series that improves MergeFunctions performance time from O(N*N) to O(N*log(N)). llvm-svn: 211437	2014-06-21 17:55:51 +00:00
Benjamin Kramer	0bf086f80f	LoopUnrollRuntime: Check for overflow in the trip count calculation. Fixes PR19823. llvm-svn: 211436	2014-06-21 13:46:25 +00:00
Benjamin Kramer	b7f5fb5751	Legalizer: Add support for splitting insert_subvectors. We handle this by spilling the whole thing to the stack and doing the insertion as a store. PR19492. This happens in real code because the vectorizer creates v2i128 when AVX is enabled. llvm-svn: 211435	2014-06-21 12:56:42 +00:00
Benjamin Kramer	8dd637aa04	SCEVExpander: Fold constant PHIs harder. The logic below only understands proper IVs. PR20093. llvm-svn: 211433	2014-06-21 11:47:18 +00:00
Richard Trieu	c1485223a6	Add back functionality removed in r210497. Instead of asserting, output a message stating that a null pointer was found. llvm-svn: 211430	2014-06-21 02:43:02 +00:00
Andrea Di Biagio	e5015d8aba	[X86] Add ISel patterns to select SSE3/AVX ADDSUB instructions. This patch adds ISel patterns to select SSE3/AVX ADDSUB instructions from a sequence of "vadd + vsub + blend". Example: /// typedef float float4 __attribute__((ext_vector_type(4))); float4 foo(float4 A, float4 B) { float4 X = A - B; float4 Y = A + B; return (float4){X[0], Y[1], X[2], Y[3]}; } /// Before this patch, (with flag -mcpu=corei7) llc produced the following assembly sequence: movaps %xmm0, %xmm2 addps %xmm1, %xmm2 subps %xmm1, %xmm0 blendps $10, %xmm2, %xmm0 With this patch, we now get a single addsubps %xmm1, %xmm0 llvm-svn: 211427	2014-06-21 01:31:15 +00:00
Zachary Turner	d119fa028a	Fix the MinGW builder. Apparently std::call_once and std::recursive_mutex are not available on MinGW and break the builder. Revert to using a function local static and sys::Mutex just to get the tree green until we figure out a better solution. llvm-svn: 211424	2014-06-21 00:24:51 +00:00
Rafael Espindola	b4076b290e	Always use a temp symbol for CIE. Fixes pr19185. llvm-svn: 211423	2014-06-20 23:54:32 +00:00
Rafael Espindola	c3510c74f7	Use compact unwind for the iOS simulator. Another step in fixing pr19185. llvm-svn: 211416	2014-06-20 22:40:55 +00:00
Rafael Espindola	becdf63f7d	Use a helper function and clang-format. No functionality change. llvm-svn: 211415	2014-06-20 22:37:01 +00:00
Rafael Espindola	df100c337c	Delete dead code. The compact unwind info is only used by code that knows it is supported. llvm-svn: 211412	2014-06-20 22:30:31 +00:00
Duncan P. N. Exon Smith	411840d963	Support: Write ScaledNumber::getQuotient() and getProduct() llvm-svn: 211409	2014-06-20 21:47:47 +00:00
Rafael Espindola	b4357fc293	Don't produce eh_frame relocations when targeting the IOS simulator. First step for fixing pr19185. llvm-svn: 211404	2014-06-20 21:15:27 +00:00
Zachary Turner	c04b892f93	Revert "Replace Execution Engine's mutex with std::recursive_mutex." This reverts commit 1f502bd9d7d2c1f98ad93a09ffe435e11a95aedd, due to GCC / MinGW's lack of support for C++11 threading. It's possible this will go back in after we come up with a reasonable solution. llvm-svn: 211401	2014-06-20 21:07:14 +00:00
Reid Kleckner	4a01230db4	Generate native unwind info on Win64 This patch enables LLVM to emit Win64-native unwind info rather than DWARF CFI. It handles all corner cases (I hope), including stack realignment. Because the unwind info is not flexible enough to describe stack frames with a gap of unknown size in the middle, such as the one caused by stack realignment, I modified register spilling code to place all spills into the fixed frame slots, so that they can be accessed relative to the frame pointer. Patch by Vadim Chugunov! Reviewed By: rnk Differential Revision: http://reviews.llvm.org/D4081 llvm-svn: 211399	2014-06-20 20:35:47 +00:00
Stepan Dyatkovskiy	6baeb8805c	Committed patch from Björn Steinbrink: Summary: Different range metadata can lead to different optimizations in later passes, possibly breaking the semantics of the merged function. So range metadata must be taken into consideration when comparing Load instructions. Thanks! llvm-svn: 211391	2014-06-20 19:11:56 +00:00
Ulrich Weigand	32626014a6	[RuntimeDyld] Fix ppc64 stub relocations on little-endian When RuntimeDyldELF creates stub functions, it needs to install relocations that will resolve to the final address of the target routine. Since those are 16-bit relocs, they need to be applied to the least-significant halfword of the instruction. On big-endian ppc64, this means that addresses have to be adjusted by 2, which is what the code currently does. However, on a little-endian system, the address must not be adjusted; the least-significant halfword is the first one. This patch updates the RuntimeDyldELF code to take the target byte order into account. llvm-svn: 211384	2014-06-20 18:17:56 +00:00
Kevin Enderby	4eff6cdd2e	Fix a warning about the use of const being ignored with a cast. llvm-svn: 211383	2014-06-20 18:07:34 +00:00
Ulrich Weigand	dbc8e1ae28	[RuntimeDyld] Support more PPC64 relocations This adds support for several missing PPC64 relocations in the straight-forward manner to RuntimeDyldELF.cpp. Note that this actually fixes a failure of a large-model test case on PowerPC, allowing the XFAIL to be removed. llvm-svn: 211382	2014-06-20 17:51:47 +00:00
Tom Stellard	ae4c9e7bc3	R600/SI: Add patterns for ctpop inside a branch llvm-svn: 211378	2014-06-20 17:06:11 +00:00
Tom Stellard	9c603ebca4	R600/SI: Add a pattern for f32 ftrunc llvm-svn: 211377	2014-06-20 17:06:09 +00:00
Tom Stellard	a79e9f0f6d	R600: Expand vector flog2 llvm-svn: 211376	2014-06-20 17:06:07 +00:00
Tom Stellard	5222a88653	R600: Expand vector fexp2 llvm-svn: 211375	2014-06-20 17:06:05 +00:00
Tom Stellard	de16a2e59f	R600/SI: SI Control Flow Annotation bug fixed Mixing of AddAvailableValue and GetValueAtEndOfBlock methods of SSAUpdater led to endless loop generation when nested loops were annotated. This fixes a bug in the OCL_ML/KNN OpenCV test. The test case is too complex for FileCheck and would be very fragile. Patch by: Elena Denisova llvm-svn: 211374	2014-06-20 17:06:02 +00:00
Tom Stellard	c9dedb8e29	R600/SI: Add a VALU pattern for i64 xor llvm-svn: 211373	2014-06-20 17:05:57 +00:00
Ulrich Weigand	59c6ab20d6	[PowerPC] Fix small argument stack slot offset for LE When small arguments (structures < 8 bytes or "float") are passed in a stack slot in the ppc64 SVR4 ABI, they must reside in the least significant part of that slot. On BE, this means that an offset needs to be added to the stack address of the parameter, but on LE, the least significant part of the slot has the same address as the slot itself. This changes the PowerPC back-end ABI code to only add the small argument stack slot offset for BE. It also adds test cases to verify the correct behavior on both BE and LE. llvm-svn: 211368	2014-06-20 16:34:05 +00:00
Rafael Espindola	1fc003e6c5	Allow a target to create a null streamer. Targets can assume that a target streamer is present, so they have to be able to construct a null streamer in order to set the target streamer in it. Fixes a crash when using the null streamer with arm. llvm-svn: 211358	2014-06-20 13:11:28 +00:00
Yaron Keren	6d3194f7d5	The count() function for STL datatypes returns unsigned, even where it's only 1/0 result like std::set. Some of the LLVM ADT already return unsigned count(), while others still return bool count(). In continuation to r197879, this patch modifies DenseMap, DenseSet, ScopedHashTable, ValueMap:: count() to return size_type instead of bool, 1 instead of true and 0 instead of false. size_type is typedef-ed locally within each class to size_t. http://reviews.llvm.org/D4018 Reviewed by dblaikie. llvm-svn: 211350	2014-06-20 10:26:56 +00:00
Oliver Stannard	5dc2934ba2	Emit the ARM build attributes ABI_PCS_wchar_t and ABI_enum_size. Emit the ARM build attributes ABI_PCS_wchar_t and ABI_enum_size based on module flags metadata. llvm-svn: 211349	2014-06-20 10:08:11 +00:00
Zoran Jovanovic	6a29b55a5a	[mips][mips64r6] Added LSA/DLSA instructions Differential Revision: http://reviews.llvm.org/D3897 llvm-svn: 211346	2014-06-20 09:28:09 +00:00
Matt Arsenault	f5e2997aff	R600: Trivial subtarget feature cleanups. Remove an unused AMDIL leftover, correct extra periods appearing in the help menu. llvm-svn: 211341	2014-06-20 06:50:05 +00:00
Justin Bogner	6f07046808	ArgList: use MakeArgList overloads in subclasses and clean up some calls. llvm-svn: 211340	2014-06-20 04:36:29 +00:00
Karthik Bhat	e03a25da70	Add Support to Recognize and Vectorize NON SIMD instructions in SLPVectorizer. This patch adds support to recognize patterns such as fadd,fsub,fadd,fsub.../add,sub,add,sub... and vectorizes them as vector shuffles if they are profitable. These patterns of vector shuffle can later be converted to instructions such as addsubpd etc on X86. Thanks to Arnold and Hal for the reviews. http://reviews.llvm.org/D4015 llvm-svn: 211339	2014-06-20 04:32:48 +00:00
Hans Wennborg	cfe341f5d0	Fix .cpp files claiming to be header files llvm-svn: 211334	2014-06-20 01:36:00 +00:00
Chandler Carruth	8366cebeb5	[x86] Make the x86 PACKSSWB, PACKSSDW, PACKUSWB, and PACKUSDW instructions available as synthetic SDNodes PACKSS and PACKUS that will select to the correct instruction variants based on the return type. This allows us to use these rather important instructions when lowering vector shuffles. Also moves the relevant instruction definitions to be split out from the fully generic multiclasses to allow them to match these new SDNodes in the same way that the UNPCK instructions do. No functionality should actually be changed here. llvm-svn: 211332	2014-06-20 01:05:28 +00:00
Hans Wennborg	4dc895164a	Don't build switch lookup tables for dllimport or TLS variables We would previously put dllimport variables in switch lookup tables, which doesn't work because the address cannot be used in a constant initializer. This is basically the same problem that we have in PR19955. Putting TLS variables in switch tables also doesn't work, because the address of such a variable is not constant. Differential Revision: http://reviews.llvm.org/D4220 llvm-svn: 211331	2014-06-20 00:38:12 +00:00
Rafael Espindola	393b2b594f	Revert "Add StringMap::insert(pair) consistent with the standard associative container concept." This reverts commit r211309. It looks like it broke some bots: http://lab.llvm.org:8011/builders/clang-x86_64-ubuntu-gdb-75/builds/15563/steps/compile/logs/stdio llvm-svn: 211328	2014-06-20 00:23:03 +00:00
Rafael Espindola	70d3c20b0f	Use the assignment operator. No functionality change. llvm-svn: 211319	2014-06-19 22:27:46 +00:00
Rafael Espindola	a064b0c476	Set missing options in LTOCodeGenerator::setTargetOptions. Patch by Tom Roeder, I just added the test. llvm-svn: 211317	2014-06-19 22:14:12 +00:00
Kevin Enderby	1983fcf86c	Change the output of llvm-nm and llvm-size for Mach-O universal files (aka fat files) to print “ (for architecture XYZ)” for fat files with more than one architecture to be like what the darwin tools do for fat files. Also clean up the Mach-O printing of archive membernames in llvm-nm to use the darwin form of "libx.a(foo.o)". llvm-svn: 211316	2014-06-19 22:03:18 +00:00
Eric Christopher	c40e5edbbc	Add a new subtarget hook for whether or not we'd like to enable the atomic load linked expander pass to run for a particular subtarget. This requires a check of the subtarget and so save the TargetMachine rather than only TargetLoweringInfo and update all callers. llvm-svn: 211314	2014-06-19 21:03:04 +00:00
David Blaikie	37700dc057	Add StringMap::insert(pair) consistent with the standard associative container concept. Patch by Agustín Bergé. llvm-svn: 211309	2014-06-19 20:08:56 +00:00
Eric Christopher	d29430dae9	Fix up a few formatting issues. llvm-svn: 211307	2014-06-19 20:00:09 +00:00
Alp Toker	1d099d9339	Fix typos llvm-svn: 211304	2014-06-19 19:41:26 +00:00
Justin Bogner	cd45f963e2	Support: Add llvm::sys::fs::copy_file A function to copy one file's contents to another. llvm-svn: 211302	2014-06-19 19:35:39 +00:00
David Blaikie	df4d5efc7c	Remove use of removed function, llvm_stop_multithreaded llvm-svn: 211291	2014-06-19 18:26:28 +00:00
Zachary Turner	9c9710eaf4	Remove support for LLVM runtime multi-threading. After a number of previous small iterations, the functions llvm_start_multithreaded() and llvm_stop_multithreaded() have been reduced essentially to no-ops. This change removes them entirely. Reviewed by: rnk, dblaikie Differential Revision: http://reviews.llvm.org/D4216 llvm-svn: 211287	2014-06-19 18:18:23 +00:00
David Blaikie	de8e12a49a	DebugInfo: Fission: Ensure the address pool entries for location lists are emitted. The address pool was being emitted before location lists. The latter could add more entries to the pool which would be lost/never emitted. llvm-svn: 211284	2014-06-19 17:59:14 +00:00
Alp Toker	660839f210	MCNullStreamer: assign file IDs to resolve crashes and errors Use the MCStreamer base implementations for file ID tracking instead of overriding them as no-ops. Avoids assertions when streaming Dwarf debug info, and fixes ASM parsing of loc and file directives. llvm-svn: 211282	2014-06-19 17:15:36 +00:00
Jingyue Wu	37fcb5919d	[ValueTracking] Extend range metadata to call/invoke Summary: With this patch, range metadata can be added to call/invoke including IntrinsicInst. Previously, it could only be added to load. Rename computeKnownBitsLoad to computeKnownBitsFromRangeMetadata because range metadata is not only used by load. Update the language reference to reflect this change. Test Plan: Add several tests in range-2.ll to confirm the verifier is happy with having range metadata on call/invoke. Add two tests in AddOverFlow.ll to confirm annotating range metadata to call/invoke can benefit InstCombine. Reviewers: meheff, nlewycky, reames, hfinkel, eliben Reviewed By: eliben Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D4187 llvm-svn: 211281	2014-06-19 16:50:16 +00:00
Zachary Turner	6ad2444d5b	Kill the LLVM global lock. This patch removes the LLVM global lock, and updates all existing users of the global lock to use their own mutex. None of the existing users of the global lock were protecting code that was mutually exclusive with any of the other users of the global lock, so its purpose was not being met. Reviewed by: rnk Differential Revision: http://reviews.llvm.org/D4142 llvm-svn: 211277	2014-06-19 16:17:42 +00:00
Oliver Stannard	8b27308617	Emit DWARF info for all code sections in an assembly file Currently, when using llvm as an assembler, DWARF debug information is only generated for the .text section. This patch modifies this so that DWARF info is emitted for all executable sections. llvm-svn: 211273	2014-06-19 15:52:37 +00:00
Oliver Stannard	f7693f4c1f	Emit DWARF3 call frame information when DWARF3+ debug info is requested Currently, llvm always emits a DWARF CIE with a version of 1, even when emitting DWARF 3 or 4, which both support CIE version 3. This patch makes it emit the newer CIE version when we are emitting DWARF 3 or 4. This will not reduce compatibility, as we already emit other DWARF3/4 features, and is worth doing as the DWARF3 spec removed some ambiguities in the interpretation of call frame information. It also fixes a minor bug where the "return address" field of the CIE was encoded as a ULEB128, which is only valid when the CIE version is 3. There are no test changes for this, because (as far as I can tell) none of the platforms that we test have a return address register with a DWARF register number >127. llvm-svn: 211272	2014-06-19 15:39:33 +00:00
Matheus Almeida	4f7ef8c6ef	[mips] Implementation of dli. Patch by David Chisnall His work was sponsored by: DARPA, AFRL Some small modifications to the original patch: we now error if it's not possible to expand an instruction (mips-expansions-bad.s has some examples). Added some comments to the expansions. llvm-svn: 211271	2014-06-19 15:08:04 +00:00
Matheus Almeida	3813d57929	[mips] Small update to the logic behind the expansion of assembly pseudo instructions. Summary: The functions that do the expansion now return false on success and true otherwise. This is so we can catch some errors during the expansion (e.g.: immediate too large). The next patch adds some test cases. Reviewers: vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D4214 llvm-svn: 211269	2014-06-19 14:39:14 +00:00
Dinesh Dwivedi	8bb5fb0661	Updated comments as suggested by Rafael. Thanks. llvm-svn: 211268	2014-06-19 14:11:53 +00:00
Dinesh Dwivedi	562fd7534c	Added instruction combine to transform few more negative values addition to subtraction (Part 1) This patch enables transforms for following patterns: x + (~(y & c) + 1) --> x - (y & c) and x + (~((y >> z) & c) + 1) --> x - ((y >> z) & c) Differential Revision: http://reviews.llvm.org/D3733 llvm-svn: 211266	2014-06-19 10:36:52 +00:00
Andrea Di Biagio	54b0949af9	[X86] Teach how to combine horizontal binop even in the presence of undefs. Before this change, the backend was unable to fold a build_vector dag node with UNDEF operands into a single horizontal add/sub. This patch teaches how to combine a build_vector with UNDEF operands into a horizontal add/sub when possible. The algorithm conservatively avoids combining a build_vector with only a single non-UNDEF operand. Added test haddsub-undef.ll to verify that we correctly fold horizontal binop even in the presence of UNDEFs. llvm-svn: 211265	2014-06-19 10:29:41 +00:00
Dinesh Dwivedi	b62e52e1b5	Refactored and updated SimplifyUsingDistributiveLaws() to * Find factorization opportunities using identity values. * Find factorization opportunities by treating shl(X, C) as mul (X, shl(C)) * Keep NSW flag while simplifying instruction using factorization. This fixes PR19263. Differential Revision: http://reviews.llvm.org/D3799 llvm-svn: 211261	2014-06-19 08:29:18 +00:00
Alp Toker	fb39de3be7	CommandLine: bail out when options get multiply registered These errors are strictly unrecoverable and indicate serious issues such as conflicting option names or an incorrectly linked LLVM distribution. With this change, the errors actually get detected so tests don't pass silently. llvm-svn: 211260	2014-06-19 07:25:25 +00:00
David Majnemer	6cf6c05322	InstCombine: Stop two transforms dueling InstCombineMulDivRem has: // Canonicalize (X+C1)*CI -> X*CI+C1*CI. InstCombineAddSub has: // W*X + Y*Z --> W * (X+Z) iff W == Y These two transforms could fight with each other if C1*CI would not fold away to something simpler than a ConstantExpr mul. The InstCombineMulDivRem transform only acted on ConstantInts until r199602 when it was changed to operate on all Constants in order to let it fire on ConstantVectors. To fix this, make this transform more careful by checking to see if we actually folded away C1*CI. This fixes PR20079. llvm-svn: 211258	2014-06-19 07:14:33 +00:00
Eric Christopher	4c5bff36ad	Move -dwarf-version to an MC level command line option so it's used by all of the MC level tools and codegen. Fix up all uses in the compiler to use this and set it on the context accordingly. llvm-svn: 211257	2014-06-19 06:22:08 +00:00
Eric Christopher	07634e2a5b	Remove unnecessary include. llvm-svn: 211256	2014-06-19 06:22:05 +00:00
Craig Topper	35b2f75733	Convert some assert(0) to llvm_unreachable or fold an 'if' condition into the assert. llvm-svn: 211254	2014-06-19 06:10:58 +00:00
Nick Lewycky	8561a49c27	Move optimization of some cases of (A & C1)|(B & C2) from instcombine to instsimplify. Patch by Rahul Jain, plus some last minute changes by me -- you can blame me for any bugs. llvm-svn: 211252	2014-06-19 03:51:46 +00:00
Nick Lewycky	c961030ac2	Make instsimplify's analysis of icmp eq/ne use computeKnownBits to determine whether the icmp is always true or false. Patch by Suyog Sarda! llvm-svn: 211251	2014-06-19 03:35:49 +00:00
Nick Lewycky	802df52424	Remove redundant code in InstCombineShift, no functionality change because instsimplify already does this and instcombine calls instsimplify a few lines above. Patch by Suyog Sarda! llvm-svn: 211250	2014-06-19 03:28:28 +00:00
David Majnemer	6a5b812c7b	MS asm: Properly handle quoted symbol names We would get confused by '@' characters in symbol names and mistake the text following them for the variant kind. When an identifier is a string, the variant kind will never show up inside of it. Instead, check to see if there is a variant following the string. This fixes PR19965. llvm-svn: 211249	2014-06-19 01:25:43 +00:00
Matt Arsenault	a0050b0961	R600/SI: Add intrinsics for various math instructions. These will be used for custom lowering and for library implementations of various math functions, so it's useful to expose these as builtins. llvm-svn: 211247	2014-06-19 01:19:19 +00:00
Eric Christopher	3d19f1388f	Move ARMJITInfo off of the TargetMachine and down onto the subtarget. This required untangling a mess of headers that included around. This is a recommit of r210953 with a fix for the removed accessor for JITInfo. llvm-svn: 211233	2014-06-18 22:48:09 +00:00
Matt Arsenault	2b0fa433a0	Use stdint macros for specifying size of constants llvm-svn: 211231	2014-06-18 22:11:03 +00:00
Kevin Enderby	4b8fc281d4	Teach llvm-size to know about Mach-O universal files (aka fat files) and fat files containing archives. Also fix a bug in MachOUniversalBinary::ObjectForArch::ObjectForArch() where it needed a >= when comparing the Index with the number of objects in a fat file, as the index starts at 0. llvm-svn: 211230	2014-06-18 22:04:40 +00:00
Matt Arsenault	692bd5ec2f	R600: Handle fnearbyint The difference from rint isn't really relevant here, so treat them as equivalent. OpenCL doesn't have nearbyint, so this is sort of pointless other than for completeness. llvm-svn: 211229	2014-06-18 22:03:45 +00:00
Marek Olsak	51b8e7b2e7	R600/SI: add gather4 and getlod intrinsics (v3) This contains all the previous patches + getlod support on top of it. It doesn't use SDNodes anymore, so it's quite small. It also adds v16i8 to SReg_128, which is used for the sampler descriptor. Reviewed-by: Tom Stellard llvm-svn: 211228	2014-06-18 22:00:29 +00:00
Matt Arsenault	b55c68f171	Use LL suffix for literal that should be 64-bits. This hopefully fixes Windows llvm-svn: 211225	2014-06-18 21:40:43 +00:00
Saleem Abdulrasool	71ede29e9c	MC: do not add comment string to the AsmToken in AsmLexer::LexLineComment Fixes macros with varargs if the macro instantiation has a trailing comment. Patch by Janne Grunau! llvm-svn: 211219	2014-06-18 20:57:32 +00:00
Saleem Abdulrasool	763e2cb6e5	MCAsmParser: full support for gas' '.if{cond} expression' directives Patch by Janne Grunau! llvm-svn: 211218	2014-06-18 20:57:28 +00:00
Zachary Turner	62ce4e88fd	Replace Execution Engine's mutex with std::recursive_mutex. This change has a bit of a trickle down effect due to the fact that there are a number of derived implementations of ExecutionEngine, and that the mutex is not tightly encapsulated so is used by other classes directly. Reviewed by: rnk Differential Revision: http://reviews.llvm.org/D4196 llvm-svn: 211214	2014-06-18 20:17:35 +00:00
Rafael Espindola	8fb3111248	Revert a C API difference that I incorrectly introduced. LLVMGetBitcodeModuleInContext should not take ownership on error. I will try to localize this odd api requirement, but this should get the bots green. llvm-svn: 211213	2014-06-18 20:07:35 +00:00
Rafael Espindola	ccf10727b0	Make getBaseObject static. Thanks to David Majnemer for noticing. llvm-svn: 211208	2014-06-18 19:08:47 +00:00
Rafael Espindola	8af5cb2c28	Change IRObjectFile to parse the bitcode lazily. The main point of this class is to provide a cheap object interface to a bitcode file, so it has to be as lazy as possible. llvm-svn: 211207	2014-06-18 19:05:24 +00:00
Rafael Espindola	a1ea4ccc06	Remove BitcodeReader::setBufferOwned. We do have use cases for the bitcode reader owning the buffer or not, but we always know which one we have when we construct it. It might be possible to simplify this further, but this is a step in the right direction. llvm-svn: 211205	2014-06-18 18:55:41 +00:00
Diego Novillo	b2ad56effc	Simply test for available locations in optimization remarks. When emitting optimization remarks, we test for the presence of instruction locations by testing for a valid llvm.dbg.cu annotation. This is slightly inefficient because we can simply ask whether the debug location we have is known or not. Additionally, if my current plan works, I will need to remove the llvm.dbg.cu annotation from the IL (or prevent it from being generated) when -Rpass is used without -g. In those cases, we'll want to generate line tables but we will want to prevent code generation from emitting DWARF code for them. Tested on x86_64. llvm-svn: 211204	2014-06-18 18:46:58 +00:00
Ulrich Weigand	f460d69ada	[PowerPC] Remove unnecessary load of r12 in indirect call When looking at the 64-bit SVR4 indirect call sequence, I noticed an unnecessary load of r12. And indeed the code says: // R12 must contain the address of an indirect callee. But this is not correct; in the 64-bit SVR4 (ELFv1) ABI, there is no need to load r12 at this point. It seems this code and comment is a remnant of code originally shared with the Darwin ABI ... This patch simply removes the unnecessary load. llvm-svn: 211203	2014-06-18 18:33:36 +00:00
Rafael Espindola	cd2de416eb	Run clang-format in a small chunk of code I am about to change. llvm-svn: 211201	2014-06-18 18:26:53 +00:00
Weiming Zhao	8c89973462	[ARM] [MC] Refactor the constant pool classes ARMTargetStreamer implements ConstantPool and AssemblerConstantPools to keep track of assembler-generated constant pools that are used for ldr-pseudo. When implementing ldr-pseudo for AArch64, these two classes can be reused. So this patch factors them out from ARM target to the general MC lib. llvm-svn: 211198	2014-06-18 18:17:25 +00:00
Jan Vesely	85f0dbce5c	R600: Expand vector fceil Move fp64 fceil tests to fceil64.ll v2: rebase Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 211194	2014-06-18 17:57:29 +00:00
Ulrich Weigand	ad0cb91ed9	[PowerPC] Simplify and improve loading into TOC register During an indirect function call sequence on the 64-bit SVR4 ABI, generated code must load and then restore the TOC register. This does not use a regular LOAD instruction since the TOC register r2 is marked as reserved. Instead, there are two special instruction patterns: let RST = 2, DS = 2 in def LDinto_toc: DSForm_1a<58, 0, (outs), (ins g8rc:$reg), "ld 2, 8($reg)", IIC_LdStLD, [(PPCload_toc i64:$reg)]>, isPPC64; let RST = 2, DS = 10, RA = 1 in def LDtoc_restore : DSForm_1a<58, 0, (outs), (ins), "ld 2, 40(1)", IIC_LdStLD, [(PPCtoc_restore)]>, isPPC64; Note that these not only restrict the destination of the load to r2, but they also restrict the source of the load to particular address combinations. The latter is a problem when we want to support the ELFv2 ABI, since there the TOC save slot is no longer at 40(1). This patch replaces those two instructions with a single instruction pattern that only hard-codes r2 as destination, but supports generic addresses as source. This will allow supporting the ELFv2 ABI, and also helps generate more efficient code for calls to absolute addresses (allowing simplification of the ppc64-calls.ll test case). llvm-svn: 211193	2014-06-18 17:52:49 +00:00
Matt Arsenault	d22626f6bb	Work around ridiculous warning. Apparently C++ doesn't really have hex floating point constants. llvm-svn: 211192	2014-06-18 17:45:58 +00:00
Matt Arsenault	43160e7af2	R600/SI: Add intrinsics for brev instructions llvm-svn: 211187	2014-06-18 17:13:57 +00:00
Matt Arsenault	dbc9aae1fb	R600/SI: Prettier operand printing for 64-bit ops. Copy what is done for 32-bit already so the order is about the same. llvm-svn: 211186	2014-06-18 17:13:51 +00:00
Matheus Almeida	784f797d4c	[mips] SYNC $stype instruction was added in Mips32 but SYNC with an implied operand ($stype = 0) is valid since Mips2. llvm-svn: 211185	2014-06-18 17:10:30 +00:00
Rafael Espindola	24d8b84838	Fix a memory leak in the error path. llvm-svn: 211184	2014-06-18 17:07:15 +00:00
Matt Arsenault	4601093267	R600: Implement f64 ftrunc, ffloor and fceil. CI has instructions for these, so this fixes them for older hardware. llvm-svn: 211183	2014-06-18 17:05:30 +00:00
Matt Arsenault	e8208ec95b	R600: Custom lower f64 frint for pre-CI llvm-svn: 211182	2014-06-18 17:05:26 +00:00
Matt Arsenault	7aeb813b2a	R600/SI: Temporary fix for f64 fneg This should be a source modifier, but this unblocks most of my math patches. llvm-svn: 211181	2014-06-18 17:05:22 +00:00
Matt Arsenault	520e7c44c1	R600/SI: Comparisons set vcc. llvm-svn: 211178	2014-06-18 16:53:48 +00:00
Adam Nemet	efd0785d82	[X86] AVX512: Add non-temporal stores Note that I followed the AVX2 convention here and didn't add LLVM intrinsics for stores. These can be generated with the nontemporal hint on LLVM IR stores (see new test). The GCC builtins are lowered directly into nontemporal stores. <rdar://problem/17082571> llvm-svn: 211176	2014-06-18 16:51:10 +00:00
Adam Nemet	ded81a810c	[X86] AVX512: Specify compressed displacement for vmovntdqa Use the max 64-bit element size with EVEX_CD8. This should work since element size is ignored for a full-vector access (FVM). llvm-svn: 211175	2014-06-18 16:51:07 +00:00
Ulrich Weigand	9aa09ef30f	[PowerPC] Do not use BLA with the 64-bit SVR4 ABI The PowerPC back-end uses BLA to implement calls to functions at known-constant addresses, which is apparently used for certain system routines on Darwin. However, with the 64-bit SVR4 ABI, this is actually incorrect. An immediate function pointer value on this platform is not directly usable as a target address for BLA: - in the ELFv1 ABI, the function pointer value refers to the function descriptor, not the code address - in the ELFv2 ABI, the function pointer value refers to the global entry point, but BL(A) would only be correct when calling the local entry point This bug didn't show up since using immediate function pointer values is not usually done in the 64-bit SVR4 ABI in the first place. However, I ran into this issue with a certain use case of LLVM as JIT, where immediate function pointer values were used to implement callbacks from JITted code to helpers in statically compiled code. Fixed by simply not using BLA with the 64-bit SVR4 ABI. llvm-svn: 211174	2014-06-18 16:14:04 +00:00
Ulrich Weigand	7c3f0dc7e4	[PowerPC] Fix emitting instruction pairs on LE My patch r204634 to emit instructions in little-endian format failed to handle those special cases where we emit a pair of instructions from a single LLVM MC instruction (like the bl; nop pairs used to implement the call sequence). In those cases, we still need to emit the "first" instruction (the one in the more significant word) first, on both big and little endian, and not swap them. llvm-svn: 211171	2014-06-18 15:37:07 +00:00
Matheus Almeida	78f8b7b652	[mips] Fix expansion of memory operation if destination register is not a GPR. Summary: The assembler tries to reuse the destination register for memory operations whenever it can but it's not possible to do so if the destination register is not a GPR. Example: ldc1 $f0, sym should expand to: lui $at, %hi(sym) ldc1 $f0, %lo(sym)($at) It's entirely wrong to expand to: lui $f0, %hi(sym) ldc1 $f0, %lo(sym)($f0) Reviewers: dsanders Reviewed By: dsanders Differential Revision: http://reviews.llvm.org/D4173 llvm-svn: 211169	2014-06-18 14:49:56 +00:00
Matheus Almeida	7de68e77aa	[mips] Report correct location when "erroring" about the use of $at when it's not available. Summary: This removes the FIXMEs from test/MC/Mips/mips-noat.s. Reviewers: dsanders Reviewed By: dsanders Differential Revision: http://reviews.llvm.org/D4172 llvm-svn: 211168	2014-06-18 14:46:05 +00:00
Zoran Jovanovic	5c14b06940	[mips][mips64r6] Add BLTC and BLTUC instructions Differential Revision: http://reviews.llvm.org/D3923 llvm-svn: 211167	2014-06-18 14:36:00 +00:00
Matheus Almeida	29e254f849	[mips] Access $at only if necessary. Summary: This patch doesn't really change the logic behind expandMemInst but it allows us to assemble .S files that use .set noat with some macros. For example: .set noat lw $k0, offset($k1) Can expand to: lui $k0, %hi(offset) addu $k0, $k0, $k1 lw $k0, %lo(offset)($k0) with no need to access $at. Reviewers: dsanders, vmedic Reviewed By: dsanders, vmedic Differential Revision: http://reviews.llvm.org/D4159 llvm-svn: 211165	2014-06-18 14:15:42 +00:00
Cameron McInally	f10a7c963b	Add pattern for unsigned v4i32->v4f64 convert on AVX512. llvm-svn: 211164	2014-06-18 14:04:37 +00:00
Matheus Almeida	ee73cc5894	[mips] Update MipsAsmParser so that it's possible to handle immediates that start with the binary operator NOT (~). Reviewers: dsanders Reviewed By: dsanders Differential Revision: http://reviews.llvm.org/D4158 llvm-svn: 211163	2014-06-18 13:55:18 +00:00
Matheus Almeida	c3c18956de	[mips] Implement alias for 'and' and 'or' instructions for all ISAs. Summary: Examples: and $2, 4 <=> andi $2, $2, 4 or $2, 4 <=> ori $2, $2, 4 Reviewers: dsanders Reviewed By: dsanders Differential Revision: http://reviews.llvm.org/D4155 llvm-svn: 211161	2014-06-18 13:30:57 +00:00
Matheus Almeida	7e81576246	[mips] Remove the last usage of parseRegister from MipsAsmParser. Summary: Added negative test case so that we can be sure we handle erroneous situations while parsing the .cpsetup directive. Reviewers: dsanders Reviewed By: dsanders Differential Revision: http://reviews.llvm.org/D3681 llvm-svn: 211160	2014-06-18 13:08:59 +00:00
Jan Vesely	ecf5133a2b	R600: Implement 64bit SRA v2: Use capitalized variable name Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 211159	2014-06-18 12:27:17 +00:00
Jan Vesely	900ff2e74b	R600: Implement 64bit SRL v2: use C++ style comment Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 211158	2014-06-18 12:27:15 +00:00
Jan Vesely	25f362766e	R600: Implement 64bit SHL v2: Use c++ style comment Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 211157	2014-06-18 12:27:13 +00:00
Evgeniy Stepanov	4ea1647e8b	[msan] Handle X86 .psad. and .pmadd. intrinsics. llvm-svn: 211156	2014-06-18 12:02:29 +00:00
Tim Northover	d82ed2e581	DAG: move sret demotion into most basic LowerCallTo implementation. It looks like there are two versions of LowerCallTo here: the SelectionDAGBuilder one is designed to operate on LLVM IR, and the TargetLowering one in the case where everything is at DAG level. Previously, only the SelectionDAGBuilder variant could handle demoting an impossible return to sret semantics (before delegating to the TargetLowering version), but this functionality is also useful for certain libcalls (e.g. 128-bit operations on 32-bit x86). So this commit moves the sret handling down a level. rdar://problem/17242889 llvm-svn: 211155	2014-06-18 11:52:44 +00:00
JF Bastien	acf5bc16e3	Revert "Random Number Generator (llvm)" This reverts commit cccba093090d127e0b6d17473b14c264c14c5259. It causes build breakage. llvm-svn: 211146	2014-06-18 06:33:23 +00:00
JF Bastien	f8ad92da5c	Random Number Generator (llvm) Summary: Provides an abstraction for a random number generator (RNG) that produces a stream of pseudo-random numbers. The current implementation uses C++11 facilities and is therefore not cryptographically secure. The RNG is salted with the text of the current command line invocation. In addition, a user may specify a seed (reproducible builds). In clang, the seed can be set via -frandom-seed=X In the back end, the seed can be set via -rng-seed=X This is the llvm part of the patch. clang part: D3391 Reviewers: ahomescu, rinon, nicholas, jfb Reviewed By: jfb Subscribers: jfb, perl Differential Revision: http://reviews.llvm.org/D3390 llvm-svn: 211145	2014-06-18 06:23:25 +00:00
Kevin Qin	f0ec9aff2a	[AArch64] Fix a pattern match failure caused by creating improper CONCAT_VECTOR. ReconstructShuffle() may wrongly create a CONCAT_VECTOR trying to concat 2 of v2i32 into v4i16. This commit is to fix this issue and try to generate UZP1 instead of lots of MOV and INS. Patch is initialized by Kevin Qin, and refactored by Tim Northover. llvm-svn: 211144	2014-06-18 05:54:42 +00:00
Craig Topper	2a30d7889f	Replace some assert(0)'s with llvm_unreachable. llvm-svn: 211141	2014-06-18 05:05:13 +00:00
Louis Gerbarg	343f5cdfad	Allow X86FastIsel to cope with 64 bit absolute relocations This patch is a follow up to r211040 & r211052. Rather than bailing out of fast isel this patch will generate an alternate instruction (movabsq) instead of the leaq. While this will always have enough room to handle the 64 bit displacement, it is generally overkill for internal symbols (most displacements will be within 32 bits), but we have no way of communicating the code model to the assembler in order to avoid flagging an absolute leal/leaq as illegal when using a symbolic displacement. llvm-svn: 211130	2014-06-17 23:22:41 +00:00
Juergen Ributzka	aa60209311	[FastISel][X86] Optimize predicates and fold CMP instructions. This optimizes predicates for certain compares, such as fcmp oeq %x, %x to fcmp ord %x, %x. The latter one is more efficient to generate. The same optimization is applied to conditional branches. llvm-svn: 211126	2014-06-17 21:55:43 +00:00
Zachary Turner	a9380b3efd	Remove more occurrences of the unused-mutex-parameter pattern. This pattern loses some of its usefulness when the mutex type is statically polymorphic as opposed to runtime polymorphic, as swapping out the mutex type requires changing a significant number of function parameters, and templatizing the function parameter requires the methods to be defined in the headers. Furthermore, if LLVM is compiled with threads disabled then there may even be no mutex to acquire anyway, so it should not be up to individual APIs to know whether or not acquiring a mutex is required to use those APIs to begin with. It should be up to the user of the API. llvm-svn: 211125	2014-06-17 21:54:18 +00:00
Tom Stellard	092f332ef2	R600/SI: Make sure target flags are set on pseudo VOP3 instructions llvm-svn: 211120	2014-06-17 19:34:46 +00:00
Rafael Espindola	58cb745f31	Merge lib/Support/WindowsError.cpp into lib/Support/ErrorHandling.cpp. The OSX ranlib warns on files with no symbols, and lib/Support/WindowsError.cpp was empty when building on non-windows. llvm-svn: 211118	2014-06-17 18:06:45 +00:00
Matt Arsenault	295b86e81d	R600/SI: Match cttz_zero_undef llvm-svn: 211116	2014-06-17 17:36:27 +00:00
Matt Arsenault	8579601050	R600/SI: Match ctlz_zero_undef llvm-svn: 211115	2014-06-17 17:36:24 +00:00
Tom Stellard	880a80ad07	R600: Use LDS and vectors for private memory llvm-svn: 211110	2014-06-17 16:53:14 +00:00
Tom Stellard	85ad429f1f	R600/SI: Add a pattern for llvm.AMDGPU.barrier.global llvm-svn: 211109	2014-06-17 16:53:09 +00:00
Tom Stellard	aad4659470	SelectionDAG: Expand i64 = FP_TO_SINT i32 llvm-svn: 211108	2014-06-17 16:53:07 +00:00
Tom Stellard	8942276a2a	R600/SI: Re-initialize the m0 register after using it for indirect addressing We need to store a value greater than or equal to the number of LDS bytes allocated by the shader in the m0 register in order for LDS instructions to work correctly. We always initialize m0 at the beginning of a shader, but this register is also used for indirect addressing offsets, so we need to re-initialize it any time we use indirect addressing. llvm-svn: 211107	2014-06-17 16:53:04 +00:00
Juergen Ributzka	e35705675f	[FastISel][X86] Fix previous refactoring commit (r211077) Overlooked that fcmp_une uses an "or" instead of an "and" for combining the flags. llvm-svn: 211104	2014-06-17 14:47:45 +00:00
Dinesh Dwivedi	657105e582	Fixed jump threading going into an infinite loop. This patch adds code to remove unreachable blocks from a function, as they may cause jump threading to get stuck in an infinite loop. Differential Revision: http://reviews.llvm.org/D3991 llvm-svn: 211103	2014-06-17 14:34:19 +00:00
James Molloy	f1653b5260	Move SetTheory from utils/TableGen into lib/TableGen so Clang can use it. llvm-svn: 211100	2014-06-17 13:10:38 +00:00
James Molloy	c1fd09ba2c	Fix memory leak of RegScavenger accidentally added in r211037. llvm-svn: 211097	2014-06-17 12:31:41 +00:00
Tim Northover	d5531f72dc	AArch64: estimate inline asm length during branch relaxation To make sure branches are in range, we need to do a better job of estimating the length of an inline assembly block than "it's probably 1 instruction, who'd write asm with more than that?". Fortunately there's already a (highly suspect, see how many ways you can think of to break it!) callback for this purpose, which is used by the other targets. rdar://problem/17277590 llvm-svn: 211095	2014-06-17 11:31:42 +00:00
Evgeniy Stepanov	5d97293e26	[msan] Fix a comment. llvm-svn: 211094	2014-06-17 11:26:00 +00:00
Evgeniy Stepanov	df187feae4	[msan] Fix handling of multiplication by a constant with a number of trailing zeroes. Multiplication by an integer with a number of trailing zero bits leaves the same number of lower bits of the result initialized to zero. This change makes MSan take this into account in the case of multiplication by a compile-time constant. We don't handle the general, non-constant, case because (a) it's not going to be cheap (computation-wise); (b) multiplication by a partially uninitialized value in user code is a bad idea anyway. The constant case must be handled because it arises from LLVM optimization of completely valid user code, as the test case in compiler-rt demonstrates. llvm-svn: 211092	2014-06-17 09:23:12 +00:00
Justin Bogner	f7f2cd35dc	Support: Inject LLVM_VERSION_INFO into the Support library Mimic r116632 in passing LLVM_VERSION_INFO from the Makefile build system to the build. This improves the -version output of tools that use llvm::cl under the configure+make system. llvm-svn: 211091	2014-06-17 06:52:47 +00:00
Justin Bogner	581b592414	tools: Add a space between package version and LLVM_VERSION_INFO This reads a little strangely. Add a space to clean it up. llvm-svn: 211090	2014-06-17 06:52:41 +00:00
Rafael Espindola	087d6274ae	Convert a few loops to use ranges. llvm-svn: 211089	2014-06-17 03:00:40 +00:00
Jordan Rose	57ffdb07fd	Add an overload for SourceMgr::PrintMessage that takes an existing diagnostic. llvm-svn: 211087	2014-06-17 02:15:40 +00:00
Jordan Rose	b4cfd0070d	Modernize doc comments for SourceMgr. No functionality change. llvm-svn: 211086	2014-06-17 02:15:36 +00:00
Jingyue Wu	33bd53df7f	[InstCombine] mark ADD with nuw if no unsigned overflow Summary: As a starting step, we only use one simple heuristic: if the sign bits of both a and b are zero, we can prove that "add a, b" does not overflow in the unsigned sense, and thus convert it to "add nuw a, b". Updated all affected tests and added two new tests (@zero_sign_bit and @zero_sign_bit2) in AddOverflow.ll Test Plan: make check-all Reviewers: eliben, rafael, meheff, chandlerc Reviewed By: chandlerc Subscribers: chandlerc, llvm-commits Differential Revision: http://reviews.llvm.org/D4144 llvm-svn: 211084	2014-06-17 00:42:07 +00:00
Duncan P. N. Exon Smith	73686d305a	SROA: Only split loads on byte boundaries r199771 accidentally broke the logic that makes sure that SROA only splits loads on byte boundaries. If such a split happens, some bits get lost when reassembling loads of wider types, causing data corruption. Move the width check up to reject such splits early, avoiding the corruption. Fixes PR19250. Patch by: Björn Steinbrink <bsteinbr@gmail.com> llvm-svn: 211082	2014-06-17 00:19:35 +00:00
Juergen Ributzka	2da1bbc113	[FastISel][X86] Refactor the code to get the X86 condition from a helper function. NFC. Make use of helper functions to simplify the branch and compare instruction selection in FastISel. Also add test cases for compare and conditional branch. llvm-svn: 211077	2014-06-16 23:58:24 +00:00
Eli Bendersky	ff90324599	Teach LoopUnrollPass to respect loop unrolling hints in metadata. [This is resubmitting r210721, which was reverted due to suspected breakage which turned out to be unrelated]. Some extra review comments were addressed. See D4090 and D4147 for more details. The Clang change that produces this metadata was committed in r210667 Patch by Mark Heffernan. llvm-svn: 211076	2014-06-16 23:53:02 +00:00
Zachary Turner	ccbf3d01f0	Revert r211066, 211067, 211068, 211069, 211070. These were committed accidentally from the wrong branch before having a review sign-off. llvm-svn: 211072	2014-06-16 22:49:41 +00:00
Zachary Turner	ff7d1f4af7	Cleanup more unreferenced MutexGuard parameters on functions. These parameters are intended to serve as sort of a contract that you cannot access the functions outside of a mutex. However, the entire JIT class cannot be accessed outside of a mutex anyway, and all methods acquire a lock as soon as they are entered. Since the containing class already is not intended to be thread-safe, it only serves to add code clutter. llvm-svn: 211071	2014-06-16 22:41:08 +00:00
Zachary Turner	89ae856c46	Kill the LLVM global lock. llvm-svn: 211069	2014-06-16 22:40:42 +00:00
Zachary Turner	d4f7dfe7f2	Remove some code churn. llvm-svn: 211068	2014-06-16 22:40:29 +00:00
Zachary Turner	0f2c641f86	Remove some more code out into a separate CL. llvm-svn: 211067	2014-06-16 22:40:17 +00:00
Zachary Turner	b344f057d0	Users of the llvm global mutex must now acquire it manually. This allows the mutex to be acquired in a guarded, RAII fashion. llvm-svn: 211066	2014-06-16 22:39:38 +00:00
Reed Kotler	9fe3bfd087	Add load/store functionality Summary: This patches allows non conversions like i1=i2; where both are global ints. In addition, arithmetic and other things start to work since fast-isel will use existing patterns for non fast-isel from tablegen files where applicable. In addition i8, i16 will work in this limited context for assignment without the need for sign extension (zero or signed). It does not matter how i8 or i16 are loaded (zero or sign extended) since only the 8 or 16 relevant bits are used and clang will ask for sign extension before using them in arithmetic. This is all made more complete in forthcoming patches. for example: int i, j=1, k=3; void foo() { i = j + k; } Keep in mind that this pass is not enabled right now and is an experimental pass It can only be enabled with a hidden option to llvm of -mips-fast-isel. Test Plan: Run test-suite, loadstore2.ll and I will run some executable tests. Reviewers: dsanders Subscribers: mcrosier Differential Revision: http://reviews.llvm.org/D3856 llvm-svn: 211061	2014-06-16 22:05:47 +00:00
Jim Grosbach	cc71514d3a	AArch64: Add backend intrinsic for rbit. Define an intrinsic for the frontend to use and pattern match it to the RBIT instruction. rdar://9283021 llvm-svn: 211058	2014-06-16 21:55:35 +00:00
Jim Grosbach	07393ba31b	ARM: intrinsic support for rbit. We already have an ARMISD node. Create an intrinsic to map to it so we can add support for the frontend __rbit() intrinsic. rdar://9283021 llvm-svn: 211057	2014-06-16 21:55:30 +00:00
Bill Schmidt	5d82f09b53	[PPC64] Fix PR19893 - improve code generation for local function addresses Rafael opened http://llvm.org/bugs/show_bug.cgi?id=19893 to track non-optimal code generation for forming a function address that is local to the compile unit. The existing code was treating both local and non-local functions identically. This patch fixes the problem by properly identifying local functions and generating the proper addis/addi code. I also noticed that Rafael's earlier changes to correct the surrounding code in PPCISelLowering.cpp were also needed for fast instruction selection in PPCFastISel.cpp, so this patch fixes that code as well. The existing test/CodeGen/PowerPC/func-addr.ll is modified to test the new code generation. I've added a -O0 run line to test the fast-isel code as well. Tested on powerpc64[le]-unknown-linux-gnu with no regressions. llvm-svn: 211056	2014-06-16 21:36:02 +00:00
Eric Christopher	daca3cc54a	Since the DataLayout is always found off of the subtarget go ahead and query the base target machine implementation for it. llvm-svn: 211055	2014-06-16 21:18:27 +00:00
Zachary Turner	2f825df60b	Clean up some unnecessary mutex guards. These were being used as unreferenced parameters to enforce that the methods must not be called without holding a mutex, but all of the methods in question were internal, and the methods were only exposed through an interface whose entire purpose was to serialize access to these structures, so expecting the methods to be accessed under a mutex is reasonable enough. Reviewed by: blaikie Differential Revision: http://reviews.llvm.org/D4162 llvm-svn: 211054	2014-06-16 20:54:28 +00:00
Louis Gerbarg	dcf00251ea	Improve comments for r211040 Added comment to clarify why r211040 chose to bail out of fast isel instead of generating a more complicated relocation, and fixed a mislabelled register in the comments of the asan test case. llvm-svn: 211052	2014-06-16 20:31:50 +00:00
Tim Northover	b45c3b74b4	ARM: implement correct atomic operations on v7M ARM v7M has ldrex/strex but not ldrexd/strexd. This means 32-bit operations should work as normal, but 64-bit ones are almost certainly doomed. Patch by Phoebe Buckheister. llvm-svn: 211042	2014-06-16 18:49:36 +00:00
Louis Gerbarg	a5360c4cd8	Fix illegal relocations in X86FastISel On x86_64 the lea instruction can only use a 32 bit immediate value. When the code is compiled statically the RIP register is not used, meaning the immediate is all that can be used for the relocation, which is not sufficient in the case of targets more than +/- 2GB away. This patch bails out of fast isel in those cases and reverts to DAG which does the right thing. Test case included. llvm-svn: 211040	2014-06-16 17:35:40 +00:00
Jim Grosbach	fff5663d48	LowerSwitch: track bounding range for the condition tree. When LowerSwitch transforms a switch instruction into a tree of ifs it is actually performing a binary search into the various case ranges, to see if the current value falls into one case's range of values. So, if we have a program with something like this: switch (a) { case 0: do0(); break; case 1: do1(); break; case 2: do2(); break; default: break; } the code produced is something like this: if (a < 1) { if (a == 0) { do0(); } } else { if (a < 2) { if (a == 1) { do1(); } } else { if (a == 2) { do2(); } } } This code is inefficient because the check (a == 1) to execute do1() is not needed. The reason is that, having already established (a >= 1), checking (a < 2) lets us infer (a == 1) without spawning an extra basic block to check whether (a == 1). The patch addresses this problem by keeping track of already checked bounds in the LowerSwitch algorithm, so that when the time arrives to produce a Leaf Block that checks the equality with the case value / range the algorithm can decide if that block is really needed depending on the already checked bounds. For example, the above with "a = 1" would work like this: the bounds start as LB: NONE, UB: NONE. As (a < 1) is emitted, the bounds for the else path become LB: 1, UB: NONE. This happens because by failing the test (a < 1) we know that the value "a" cannot be smaller than 1 if we enter the else branch. After emitting the check (a < 2) the bounds in the if branch become LB: 1, UB: 1. This is because by checking that "a" is smaller than 2 the upper bound becomes 2 - 1 = 1. When it is time to emit the leaf block for "case 1:" we notice that 1 can be squeezed exactly in between the LB and UB, which means that if we arrived at that block there is no need to emit a block that checks if (a == 1). Patch by: Marcello Maggioni <hayarms@gmail.com> llvm-svn: 211038	2014-06-16 16:55:20 +00:00
James Molloy	f6419cfb14	Refactor the disabling of Thumb-1 LDM/STM generation Originally I switched the LD/ST optimizer off in TargetMachine as it was previously, but Eric has suggested he'd prefer that it be short-circuited in the pass itself. No functionality change. llvm-svn: 211037	2014-06-16 16:42:53 +00:00
Rafael Espindola	95cf2f25fe	Fix pr17056. This makes llvm-nm ignore members that are not sufficiently aligned for lib/Object to handle. These archives are invalid. GNU AR is able to handle this, but in general just warns about broken archive members. We should probably start warning too, but for now just make sure llvm-nm exits with a 0. llvm-svn: 211036	2014-06-16 16:41:00 +00:00
Rafael Espindola	ae460027a4	Convert the Archive API to use ErrorOr. Now that we have c++11, even things like ErrorOr<std::unique_ptr<...>> are easy to use. No intended functionality change. llvm-svn: 211033	2014-06-16 16:08:36 +00:00
Tilmann Scheller	9252057a07	[AArch64] Remove dead code. Both function declarations lack a callee and an implementation. llvm-svn: 211029	2014-06-16 15:15:41 +00:00
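
The bounds tracking described in the LowerSwitch entry above (fff5663d48, r211038) can be illustrated with a small sketch. This is not LLVM's C++ implementation: it is a simplified Python model that assumes sorted, distinct, single-value cases and no default destination, and the names `lower_switch`, `lb`, and `ub` are hypothetical. It emits the if-tree as pseudo-code lines and skips the equality check in a leaf whenever the accumulated lower and upper bounds already pin the value.

```python
from typing import List, Optional


def lower_switch(cases: List[int],
                 lb: Optional[int] = None,
                 ub: Optional[int] = None,
                 indent: int = 0) -> List[str]:
    """Emit pseudo-code for a binary-search if-tree over sorted case values.

    lb/ub are the inclusive bounds on `a` already proven by enclosing
    comparisons (None means "no bound known yet").
    """
    pad = "  " * indent
    if len(cases) == 1:
        c = cases[0]
        if lb == c and ub == c:
            # Bounds pin `a` to exactly c: the leaf equality check is redundant.
            return [f"{pad}do{c}();"]
        return [f"{pad}if (a == {c}) {{ do{c}(); }}"]

    mid = len(cases) // 2
    pivot = cases[mid]
    lines = [f"{pad}if (a < {pivot}) {{"]
    # Taking the branch proves a <= pivot - 1.
    lines += lower_switch(cases[:mid], lb, pivot - 1, indent + 1)
    lines.append(f"{pad}}} else {{")
    # Failing the test proves a >= pivot.
    lines += lower_switch(cases[mid:], pivot, ub, indent + 1)
    lines.append(f"{pad}}}")
    return lines


if __name__ == "__main__":
    print("\n".join(lower_switch([0, 1, 2])))
```

For the three-case example from the commit message, the sketch produces the same tree shape (a test against 1, then a test against 2), and the leaf for case 1 calls do1() directly, while the leaves for 0 and 2 keep their equality checks because one of their bounds is still unknown.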

70640 Commits