dynarmic

Author	SHA1	Message	Date
Lioncash	b6df34cdde	backend_x64/a64_interface: Re-enable the constant folding pass This was disabled for debugging, but never re-enabled. Just to be sure, testing was done downstream in yuzu to make sure this didn't happen to break anything (which seems to be the case).	2020-04-22 20:55:06 +01:00
MerryMage	06ba397af2	emit_x64_vector_floating_point: Hardware FMA implementation for RSqrtStepFused	2020-04-22 20:55:06 +01:00
MerryMage	e553c4fe8d	emit_x64_vector_floating_point: Hardware FMA implementation of FPVectorRecipStepFused	2020-04-22 20:55:06 +01:00
MerryMage	3caeb62ef1	emit_x64_floating_point: Hardware FMA implementation of FPRSqrtStepFused	2020-04-22 20:55:06 +01:00
MerryMage	344ee76aba	emit_x64_floating_point: Hardware FMA implementation of FPRecipStepFused{32,64}	2020-04-22 20:55:06 +01:00
MerryMage	1492573267	emit_x64_vector: SSE implementation of VectorSignedSaturatedAccumulateUnsigned{8,16,32}	2020-04-22 20:55:06 +01:00
Lioncash	26df6e5e7b	emit_x64_vector: Correct static asserts for < 64-bit type checks in saturated accumulate fallbacks I had initially meant to use BitSize() here, not sizeof()	2020-04-22 20:55:06 +01:00
MerryMage	a4a26ac226	emit_x64_vector: EmitVectorSignedSaturatedAccumulateUnsigned64: SSE implementation	2020-04-22 20:55:06 +01:00
MerryMage	a7c66d2d28	emit_x64_vector: Simplify fpsr_qc related code Move the bool conversion into A64JitState::GetFpsr so we don't have to continuously pay the cost of conversion for every saturation instruction.	2020-04-22 20:55:06 +01:00
Lioncash	112cff9ab9	A64: Implement CLZ's vector variant	2020-04-22 20:55:06 +01:00
Lioncash	e739624296	ir: Add opcodes for vector CLZ operations We can optimize these cases further with the use of a fair bit of shuffling via pshufb and the use of masks, but given the uncommon use of this instruction, I wouldn't consider it to be beneficial in terms of amount of code to be worth it over a simple manageable naive solution like this. If we ever do hit a case where vectorized CLZ happens to be a bottleneck, then we can revisit this. At least with AVX-512CD, this can be done with a single instruction for the 32-bit word case.	2020-04-22 20:55:05 +01:00
MerryMage	d4c37a68a8	A64/translate: VectorZeroUpper for V(64) stores Ensures correctness.	2020-04-22 20:55:05 +01:00
MerryMage	b8daa4feac	simd_two_register_misc: FNEG (vector) with Q == 0 had dirty upper	2020-04-22 20:55:05 +01:00
Lioncash	5653e7637e	emit_x64_vector: Remove unnecessary [[maybe_unused]] attributes These were unintentionally left in when introducing SUQADD and USQADD	2020-04-22 20:55:05 +01:00
Lioncash	14e026a7f0	A64: Implement USQADD's scalar and vector variants	2020-04-22 20:55:05 +01:00
Lioncash	d4a76aaa04	ir: Add opcodes for unsigned saturated accumulations of signed values	2020-04-22 20:55:05 +01:00
Lioncash	18ad7f237d	A64: Implement SUQADD's scalar and vector variants	2020-04-22 20:55:05 +01:00
Lioncash	6f911a26da	ir: Add opcodes for signed saturated accumulations of unsigned values	2020-04-22 20:55:05 +01:00
Lioncash	9a3d38d2ee	A64: Implement SMLAL{2}, SMLSL{2}, UMLAL{2}, and UMLSL{2}'s vector by-element variants We can simply modify the general function made for SMULL{2} and UMULL{2}'s by-element variants to also handle the other multiply-based by-element variants.	2020-04-22 20:55:05 +01:00
Lioncash	6ccfbc9b39	A64: Implement UMULL{2}'s vector by-element variant	2020-04-22 20:55:05 +01:00
Lioncash	58e21f175c	A64: Implement SMULL{2}'s vector by-element variant	2020-04-22 20:55:05 +01:00
Lioncash	134bb02e19	ir/value: Replace includes with forward declarations enum classes are still considered complete types when forward declared (as the compiler knows the exact size of the type from the declaration alone). The only difference in this case being that the members of the enum class aren't visible. Given we don't use the members within this header in any way, we can simply forward declare them here and remove the inclusions.	2020-04-22 20:55:05 +01:00
Lioncash	2c8e07e7d0	ir/cond: Migrate to C++17 nested namespace specifiers	2020-04-22 20:55:05 +01:00
Lioncash	c3b7819a55	CMakeLists: Add missing cond.h header to file listing Allows the file to show up within IDEs more easily.	2020-04-22 20:55:05 +01:00
Lioncash	0a3976059f	A64: Implement URSQRTE	2020-04-22 20:55:05 +01:00
Lioncash	b6e74fd17d	ir: Add opcodes for performing unsigned reciprocal square root estimates	2020-04-22 20:55:05 +01:00
Lioncash	bd3582e811	A64: Implement URECPE	2020-04-22 20:55:05 +01:00
Lioncash	af83360f89	ir: Add opcodes for unsigned reciprocal estimate	2020-04-22 20:55:05 +01:00
Lioncash	740ffa52ae	A64: Implement SQNEG's scalar and vector variant	2020-04-22 20:53:46 +01:00
Lioncash	fca7eddb9e	A64: Add opcodes for signed saturating negations	2020-04-22 20:53:46 +01:00
Lioncash	f1ebbcd7bc	emit_x64_vector: Simplify "position == 0" case for EmitVectorExtract() In the event position is zero, we can just treat it as a NOP, given there's no need to move the data.	2020-04-22 20:53:46 +01:00
Lioncash	87372917f9	emit_x64_vector: Simplify "position == 0" case for EmitVectorExtractLower() In the event position == 0, we can just treat it as a simple movq, clearing the upper half of the XMM register. This also makes that case use only one register.	2020-04-22 20:53:46 +01:00
Lioncash	f5fb496e7e	A64: Implement SQDMULH's by-element scalar variant	2020-04-22 20:53:46 +01:00
Lioncash	40f0576995	A64: Implement SQDMULH's by-element vector variant	2020-04-22 20:53:46 +01:00
MerryMage	8f9206901d	backend/x64: Do not clear fast_dispatch_table if not enabled There is no need to pay for the cost of setting a large block of memory if we're not using it.	2020-04-22 20:53:46 +01:00
MerryMage	9b65100660	A64: Implement FastDispatchHint	2020-04-22 20:53:46 +01:00
MerryMage	f96c43d422	A32: Implement FastDispatchHint	2020-04-22 20:53:46 +01:00
MerryMage	aa8d826c13	ir/terminal: Add FastDispatchHint	2020-04-22 20:53:46 +01:00
Lioncash	1a69a61cb4	A64: Implement SQDMULH's scalar variant	2020-04-22 20:53:46 +01:00
Lioncash	7ebfd0f31c	ir: Add opcodes for scalar signed saturated doubling multiplies	2020-04-22 20:53:46 +01:00
Lioncash	9c03311fed	A64: Implement SQDMULH's vector variant	2020-04-22 20:53:46 +01:00
Lioncash	a0231e5546	ir: Add opcodes for signed saturated doubling multiplies	2020-04-22 20:53:46 +01:00
Lioncash	db24e1f09b	A64: Implement SQABS' scalar variant	2020-04-22 20:53:46 +01:00
Lioncash	bda5d14c7f	A64: Implement SQABS' vector variant.	2020-04-22 20:53:46 +01:00
Lioncash	0507e47420	ir: Add opcodes for signed saturated absolute values	2020-04-22 20:53:46 +01:00
MerryMage	27427595b7	emit_x64_floating_point: EmitFPToFixed: maxsd optimization maxsd is not required when doing a signed conversion, because x64 produces a 0x80...00 value for out of range values.	2020-04-22 20:53:46 +01:00
MerryMage	1abf82ac4a	emit_x64_floating_point: ZeroIfNaN: pxor -> xorps xorps is shorter and more appropriate here.	2020-04-22 20:53:46 +01:00
MerryMage	3415828fb4	IR: Simplify FP{Single,Double}ToFixed{U,S}{32,64}	2020-04-22 20:53:46 +01:00
Lioncash	e30f9816ec	A32/decoder: Add missing <algorithm> includes These includes should be present, as we use std::find_if() within these headers.	2020-04-22 20:53:46 +01:00
Lioncash	4507627905	emit_x64_vector: Provide AVX path for EmitVectorMinU64()	2020-04-22 20:53:46 +01:00
Lioncash	fd49a62b06	emit_x64_vector: Provide AVX path for EmitVectorMinS64()	2020-04-22 20:53:46 +01:00
Lioncash	770723f449	emit_x64_vector: Provide AVX path for EmitVectorMaxU64()	2020-04-22 20:53:46 +01:00
Lioncash	8fb90c0cf1	emit_x64_vector: Provide AVX path for EmitVectorMaxS64()	2020-04-22 20:53:46 +01:00
Lioncash	2cac6ad129	emit_x64_vector: Simplify EmitVectorLogicalLeftShift8() Similar to EmitVectorLogicalRightShift8(), we can determine a mask ahead of time and just AND it against the result of a halfword left shift.	2020-04-22 20:53:46 +01:00
Lioncash	135107279d	emit_x64_vector: Simplify EmitVectorLogicalShiftRight8() We can generate the mask and AND it against the result of a halfword shift instead of looping.	2020-04-22 20:53:46 +01:00
Lioncash	2952b46b16	emit_x64_vector: Amend value definition in SSE 4.1 path for EmitVectorSignExtend16() We should be defining the value after the results have been calculated to be consistent with the rest of the code.	2020-04-22 20:53:46 +01:00
Lioncash	fda19095ea	emit_x64_vector: Remove fallback in EmitVectorSignExtend64() This is fairly trivial to do manually.	2020-04-22 20:53:46 +01:00
Lioncash	39593fcd26	emit_x64_vector: Remove fallback for EmitVectorSignExtend32() We can just do the extension manually, which gets rid of the need to fall back here.	2020-04-22 20:53:46 +01:00
Lioncash	053175f69b	ir_emitter: Rename fpscr_controlled parameters to fpcr_controlled Part of addressing #333	2020-04-22 20:53:46 +01:00
MerryMage	f0184c4b8d	a32/exception_generating: BKPT: Define unpredictable behaviour Define unpredictable behaviour to be BKPT executes conditionally	2020-04-22 20:53:46 +01:00
MerryMage	a12854857b	A32: Add define_unpredictable_behaviour option	2020-04-22 20:53:46 +01:00
MerryMage	b0abaa8312	A32/location_descriptor: Change formatting to use hex	2020-04-22 20:53:46 +01:00
MerryMage	ccbf6c7f63	microinstruction: A32ExceptionRaised causes CPU exception	2020-04-22 20:53:46 +01:00
MerryMage	6595e49a31	A32/types: CondToString: Add nv	2020-04-22 20:53:46 +01:00
MerryMage	d5b9c4a4bb	block_of_code: Hide NX support behind compiler flag Systems that require W^X can use the DYNARMIC_ENABLE_NO_EXECUTE_SUPPORT cmake option.	2020-04-22 20:53:46 +01:00
MerryMage	de4494ffa5	Implement perfmap	2020-04-22 20:53:46 +01:00
MerryMage	f73104633b	a32_emit_x64: Fix incorrect BMI2 implementation for SetCpsr * The MSB for each byte in cpsr_ge were not being appropriately set. * We also expand test coverage to test this case. * We fix the disassembly of the MSR (imm) and MSR (reg) instructions as well.	2020-04-22 20:53:46 +01:00
MerryMage	3432a08e0a	backend/x64: Support W^X systems Closes #176.	2020-04-22 20:53:46 +01:00
BreadFish64	2a65442933	Backend: Create "backend" folder similar to the "frontend" folder	2020-04-22 20:53:46 +01:00
MerryMage	3b13f1eb12	A64/translate: Standardize arguments of helper functions Don't pass in IREmitter when TranslatorVisitor is already available.	2020-04-22 20:53:45 +01:00
MerryMage	a4e556d59c	A64/translate: Standardize TranslatorVisitor abbreviation Prefer v to tv.	2020-04-22 20:53:45 +01:00
MerryMage	9a0dc61efd	emit_x64_vector: Avoid recalculating addresses in EmitVectorTableLookup	2020-04-22 20:53:45 +01:00
Lioncash	3d465e2c36	A64: Implement SQXTN, SQXTUN, and UQXTN's scalar variants We can implement these in terms of the vector variants	2020-04-22 20:53:45 +01:00
Lioncash	4ff39c6ea8	A64: Implement SDOT and UDOT's (by element) variants Gets all of the dot product instructions out of the way.	2020-04-22 20:53:45 +01:00
MerryMage	21df1fb539	emit_x64_vector: Don't load zero constant from memory in EmitVectorTableLookup	2020-04-22 20:53:45 +01:00
MerryMage	3bbcca8757	emit_x64_vector: Special-case is_defaults_zero && table_size == 2 in EmitVectorTableLookup	2020-04-22 20:53:45 +01:00
MerryMage	9cc00f900c	emit_x64_vector: Release registers when possible in EmitVectorTableLookup	2020-04-22 20:53:45 +01:00
MerryMage	a12afd1065	reg_alloc: Add the ability to Release an allocation early	2020-04-22 20:53:45 +01:00
MerryMage	e68bd3c6c1	emit_x64_vector: Special-case table_size == 1 in EmitVectorTableLookup	2020-04-22 20:53:45 +01:00
MerryMage	a4e1f8a63a	emit_x64_vector: SSE4.1 implementation of EmitVectorTableLookup	2020-04-22 20:53:45 +01:00
MerryMage	0c18b85c27	A64: Implement TBL and TBX	2020-04-22 20:53:45 +01:00
MerryMage	89d08c7d61	IR: Add VectorTable and VectorTableLookup IR instructions	2020-04-22 20:53:45 +01:00
MerryMage	0288974512	opcodes: Cleanup opcodes table * Remove T:: prefix from types. * Add another column for a 4th argument.	2020-04-22 20:53:45 +01:00
Lioncash	d9fc6cf31f	A64: Implement SDOT and UDOT's vector variant	2020-04-22 20:53:45 +01:00
Lioncash	cb5e5c5d49	A64: Implement SADALP and UADALP While we're at it we can join the code for SADDLP and UADDLP with these instructions, since the only difference is we do an accumulate at the end of the operation.	2020-04-22 20:53:45 +01:00
Lioncash	29f8b30634	A64: Implement SRSHL and URSHL Implements both scalar and vector variants.	2020-04-22 20:53:45 +01:00
Lioncash	0efa2ce3b0	ir: Add opcodes for performing rounding left shifts	2020-04-22 20:53:45 +01:00
MerryMage	656ceff225	emit_x64_floating_point: Fix smallest normal check in EmitFPMulAdd	2020-04-22 20:53:45 +01:00
Lioncash	f3f60cd179	A64: Implement ISB Given we want to ensure that all instructions are fetched again, we can treat an ISB instruction as a code cache flush.	2020-04-22 20:53:45 +01:00
Lioncash	be53e356a2	A64: Implement FCVTN{2}	2020-04-22 20:53:45 +01:00
Lioncash	4c3d7c5a8d	A64: Implement FCVTL{2}	2020-04-22 20:53:45 +01:00
Lioncash	7eb6be7a6a	A64: Implement FMAXNM and FMINNM vector variants. Currently we can implement these in terms of the scalar IR variants.	2020-04-22 20:53:45 +01:00
Lioncash	8b65ea68c0	A64: Implement FMAXP, FMAXNMP, FMINP, and FMINNMP's vector variants We can just implement these in terms of scalars for the time being.	2020-04-22 20:53:45 +01:00
MerryMage	ec76f95f5a	emit_x64_vector_floating_point: Correct value of smallest_normal_number	2020-04-22 20:53:45 +01:00
MerryMage	e60d6c0d20	fp/info: Incorrect point_position in FPValue	2020-04-22 20:53:45 +01:00
MerryMage	8a3b6364c2	load_store_exclusive: Define s == t state to be Constraint_NONE Downstream (yuzu) mentioned that the instruction: STXR W9, W9, [X0] was executed in the program "Crash N-Sane Trilogy".	2020-04-22 20:53:45 +01:00
MerryMage	cd40e4dae0	A64/translate: Allow for unpredictable behaviour to be defined	2020-04-22 20:53:45 +01:00
MerryMage	d1d6f4feb5	system: Implement MRS CNTFRQ_EL0	2020-04-22 20:53:45 +01:00
Lioncash	7ef7def661	A64: Implement SQ{ADD, SUB}, and UQ{ADD, SUB}'s vector variants Currently we implement these in terms of the scalar variants. Falling back to the interpreter is slow enough to make it more effective than doing that.	2020-04-22 20:46:23 +01:00
Lioncash	a4b0e2ace6	A64: Implement UQADD/UQSUB's scalar variants	2020-04-22 20:46:23 +01:00
Lioncash	acbaf04fef	ir: Add opcodes for unsigned saturating add and subtract	2020-04-22 20:46:23 +01:00
Lioncash	c41b5a3492	x64/reg_alloc: Use type alias for array returned by GetArgumentInfo() This way if the number ever changes, we don't need to change the type in other places.	2020-04-22 20:46:23 +01:00
Lioncash	2188765e28	ir/value: Use type alias CoprocessorInfo for std::array<u8, 8> Provides a more descriptive label for the interface, and avoids the need to hardcode the array size in multiple places.	2020-04-22 20:46:23 +01:00
MerryMage	71e137715d	status_register_access: Add support for bits 0 and 1 of mask to MSR	2020-04-22 20:46:23 +01:00
MerryMage	ac51c2547d	A32/translate/load_store: Correct detection of writeback	2020-04-22 20:46:23 +01:00
MerryMage	d345220251	A32/translate: Add TranslateSingleInstruction	2020-04-22 20:46:23 +01:00
MerryMage	5fc197c564	A32/ir_emitter: Bug fix: IREmitter::ExceptionRaised using incorrect opcode	2020-04-22 20:46:23 +01:00
MerryMage	ff3805e332	A32/decoders: Split instruction list into include file	2020-04-22 20:46:23 +01:00
MerryMage	3f4d118d73	microinstruction: Improve assert messages	2020-04-22 20:46:23 +01:00
MerryMage	a7e6f2a235	emit_x64_vector: EmitVectorNarrow16: AVX512 implementation	2020-04-22 20:46:23 +01:00
MerryMage	b6350e3947	emit_x64_vector: EmitVectorNarrow32: prefer pblendw to loading constant	2020-04-22 20:46:23 +01:00
MerryMage	8fdba189cb	emit_x64_vector: packusdw is SSE4.1	2020-04-22 20:46:23 +01:00
MerryMage	1ef388d1cd	emit_x64_vector_floating_point: Simplify FPVector{Min,Max}	2020-04-22 20:46:23 +01:00
MerryMage	4a1ce797cb	emit_x64_vector_floating_point: Simplify Get*Vector functions	2020-04-22 20:46:23 +01:00
MerryMage	bcaced297a	emit_x64_floating_point: Remove EmitProcessNaNs	2020-04-22 20:46:23 +01:00
MerryMage	2e0885388e	devirtualize: Replace DEVIRT macro with function template	2020-04-22 20:46:23 +01:00
Lioncash	54d8552177	a32_emit_x64: std::move A32::UserConfig in the constructor This avoids a few redundant atomic increments and decrements, considering the UserConfig instance contains a std::array of std::shared_ptr<Coprocessor> instances.	2020-04-22 20:46:23 +01:00
MerryMage	b098c650df	emit_x64_floating_point: Use EmitPostProcessNaNs in EmitFPMulX	2020-04-22 20:46:23 +01:00
MerryMage	c1babf41b2	emit_x64_floating_point: Remove unnecessary DenormalsAreZero from EmitFPSingleToDouble and EmitFPDoubleToSingle	2020-04-22 20:46:23 +01:00
MerryMage	700088408d	emit_x64_floating_point: Simplify EmitFP{Min,Max}{,Numeric}{32,64}	2020-04-22 20:46:23 +01:00
MerryMage	07e0585994	emit_x64_floating_point: Reduce NaN processing overhead	2020-04-22 20:46:23 +01:00
MerryMage	f5e11d117a	A64: Implement FMULX, scalar single/double variant	2020-04-22 20:46:23 +01:00
MerryMage	17f73974f2	IR: Implement FPMulX IR instruction	2020-04-22 20:46:23 +01:00
Lioncash	391e16be64	emit_x64_vector: Vectorize 32-bit variants of paired min/max Gets rid of the fallbacks for these cases.	2020-04-22 20:46:23 +01:00
MerryMage	5ae045d67e	emit_x64_vector: Improve code emission of VectorGetElement* for index == 0	2020-04-22 20:46:23 +01:00
MerryMage	e9ab7f7664	reg_alloc: Do a UseScratch if a Use destination is too small	2020-04-22 20:46:23 +01:00
MerryMage	90f8dda966	emit_x64_floating_point: AVX implementation of ForceToDefaultNaN	2020-04-22 20:46:23 +01:00
MerryMage	dfb660cd16	emit_x64_vector_floating_point: Prefer blendvp{s,d} to vblendvp{s,d} where possible It's a cheaper instruction.	2020-04-22 20:46:23 +01:00
MerryMage	476c0f15da	backend_x64: Remove all use of xmm0	2020-04-22 20:46:23 +01:00
MerryMage	8252efd7b1	emit_x64_vector_floating_point: AVX implementation of ForceToDefaultNaN	2020-04-22 20:46:23 +01:00
MerryMage	746dc521b9	emit_x64_vector_floating_point: Reduce codesize of ForceToDefaultNaN	2020-04-22 20:46:23 +01:00
MerryMage	7731dcdca9	emit_x64_vector_floating_point: Reduce codesize of EmitTwoOpVectorOperation	2020-04-22 20:46:23 +01:00
MerryMage	bb93353f94	emit_x64_vector_floating_point: Correct FMA in FTZ mode. x64 rounds before flushing to zero; AArch64 rounds after flushing to zero. This difference of behaviour is noticeable if something would round to a smallest normalized number.	2020-04-22 20:46:23 +01:00
MerryMage	8ef195db3c	emit_x64_floating_point: DenormalsAreZero is redundant as hardware already does DAZ Exceptions: F{MIN,MAX}{,NM}	2020-04-22 20:46:23 +01:00
MerryMage	de9d8c461c	emit_x64_floating_point: FlushToZero is redundant as hardware already does FTZ	2020-04-22 20:46:23 +01:00
MerryMage	822fd4a875	backend_x64: Fix FPVectorMulAdd and FPMulAdd NaN handling with denormals Denormals should be treated as zero in NaN handler	2020-04-22 20:46:23 +01:00
MerryMage	b393e15ab6	backend_x64: Fix bugs when FPCR.FZ=1 Bugs: * DenormalsAreZero flushed to positive zero instead of preserving sign. * FMAXNM/FMINNM (scalar) should perform DAZ before special zero handling. * FMAX/FMIN/FMAXNM/FMINNM (vector) did not DAZ.	2020-04-22 20:46:23 +01:00
MerryMage	5e88d66470	fp/info: Deduplicate functions	2020-04-22 20:46:23 +01:00
MerryMage	2019d32743	emit_x64_floating_point: Deduplicate EmitFPMulAdd implementation	2020-04-22 20:46:23 +01:00
MerryMage	e038fe72df	emit_x64_floating_point: Deduplicate code	2020-04-22 20:46:23 +01:00
MerryMage	ec82a845b7	emit_x64_vector_floating_point: Fix FPVector{Max,Min} when FPCR.DN = 1	2020-04-22 20:46:23 +01:00
MerryMage	7f27945411	emit_x64_floating_point: Fix FP{Max,Min} when FPCR.DN = 1	2020-04-22 20:46:23 +01:00
MerryMage	21a28c2545	IR: SSE4.1 implementation of FPVectorRoundInt	2020-04-22 20:46:23 +01:00
MerryMage	9669e49817	A64: Implement FRINT{N,M,P,Z,A,X,I} (vector), single/double variant	2020-04-22 20:46:23 +01:00
MerryMage	f976c47008	IR: Initial implementation of FPVectorRoundInt	2020-04-22 20:46:23 +01:00
MerryMage	f2393488fe	A64: Implement SQADD and SQSUB, scalar variant	2020-04-22 20:46:23 +01:00
MerryMage	10e196480f	IR: Generalise SignedSaturated{Add,Sub} to support more bitwidths	2020-04-22 20:46:23 +01:00
MerryMage	71db0e67ae	a64_emit_x64: Bugfix EmitA64OrQC - Incorrect argument	2020-04-22 20:46:23 +01:00
Lioncash	d0fdd3c6e6	simd_three_same: Extract non-paired SMAX, SMIN, UMAX, UMIN code to a common function Deduplicates a bit of code and makes its layout consistent with the paired variants	2020-04-22 20:46:23 +01:00
Lioncash	2bea2d0512	A64: Implement SMAXP, SMINP, UMAXP, UMINP	2020-04-22 20:46:23 +01:00
Lioncash	463b9a3d02	ir: Add opcodes for vector paired maximum and minimums For the time being, we can just do a naive implementation which avoids falling back to the interpreter a bit. Horizontal operations aren't necessarily x86 SIMD's forte anyways.	2020-04-22 20:46:23 +01:00
Lioncash	43344c5400	A64: Implement SMAXV, SMINV, UMAXV, and UMINV	2020-04-22 20:46:23 +01:00
Lioncash	2501bfbfae	ir: Add opcodes for performing scalar integral min/max	2020-04-22 20:46:23 +01:00
Lioncash	7fdd8b0197	A64: Implement PMULL{2}	2020-04-22 20:46:23 +01:00
Lioncash	5ebf496d4e	translate: Deduplicate GetDataSize() functions Avoids defining the same function multiple times in different files.	2020-04-22 20:46:22 +01:00
Lioncash	f83cd2da9a	floating_point_{conditional}_compare: Deduplicate code Deduplicates the implementation code of instructions by extracting the code to a common function.	2020-04-22 20:46:22 +01:00
MerryMage	f9c6d5e1a0	common: Move all cryptographic function to common/crypto	2020-04-22 20:46:22 +01:00
MerryMage	5dc23e49d7	a32_emit_x64: BMI2 implementation of A32SetCpsr	2020-04-22 20:46:22 +01:00
MerryMage	0f85305933	a32_emit_x64: Shorten EmitA32GetCpsr	2020-04-22 20:46:22 +01:00
MerryMage	9fe2bf8733	a32_emit_x64: Assert that memory layout assumption in EmitA32GetCpsr is valid	2020-04-22 20:46:22 +01:00
Lioncash	b48fb8ca6b	A64: Implement PMUL	2020-04-22 20:46:22 +01:00
Lioncash	affa312d1d	ir: Add opcode for performing polynomial multiplication	2020-04-22 20:46:22 +01:00
MerryMage	dd4ac86f8e	A64: Implement FCVT{N,M,A,P}{U,S} (vector), FCVTZU (vector, integer), single/double variant	2020-04-22 20:46:22 +01:00
MerryMage	28b38916a8	A64: Implement FCVTZS (vector, integer), single/double variant	2020-04-22 20:46:22 +01:00
MerryMage	507bcd8b8b	IR: Implement FPVectorTo{Signed,Unsigned}Fixed	2020-04-22 20:46:22 +01:00
MerryMage	8f75a1fe04	fp/info: Replace constant value generators with FPValue Instead of having multiple different functions we can just have one.	2020-04-22 20:46:22 +01:00
MerryMage	da261772ea	emit_x64_vector_floating_point: AVX implementation of FPVector{Max,Min}	2020-04-22 20:46:22 +01:00
MerryMage	a0d6f0de57	emit_x64_vector_floating_point: Remove unnecessary double jump in HandleNaNs	2020-04-22 20:46:22 +01:00
Lioncash	c778c7b868	A64: Implement FMAX's vector single and double precision variants	2020-04-22 20:46:22 +01:00
Lioncash	009879d92b	A64: Implement FMIN's vector single and double precision variants	2020-04-22 20:46:22 +01:00
MerryMage	7b03da86c2	IR: Implement FPVector{Max,Min}	2020-04-22 20:46:22 +01:00
MerryMage	e76e1186bb	FPRecipEstimate: Move offset out of function MSVC has weird lambda capturing rules.	2020-04-22 20:46:22 +01:00
MerryMage	ddcff86f9c	microinstruction: Update ReadsFromAndWritesToFPSRCumulativeExceptionBits	2020-04-22 20:46:22 +01:00
MerryMage	10de36394e	A64: Implement FRECPS, vector/scalar single/double variants	2020-04-22 20:46:22 +01:00
MerryMage	901bd9b4e2	IR: Implement FPRecipStepFused, FPVectorRecipStepFused	2020-04-22 20:46:22 +01:00
MerryMage	f66f61d8ab	A64: Implement FRECPE, vector single/double variant	2020-04-22 20:46:22 +01:00
MerryMage	939f5f5c7a	IR: Implement FPVectorRecipEstimate	2020-04-22 20:46:22 +01:00
MerryMage	27c73dd56a	A64: Implement FRECPE, scalar single/double variant	2020-04-22 20:46:22 +01:00
MerryMage	fc2d33ae7b	IR: Implement FPRecipEstimate	2020-04-22 20:46:22 +01:00
MerryMage	c1dcfe29f7	IR: Implement FPRecipEstimate	2020-04-22 20:46:22 +01:00
MerryMage	7a673a8a43	fp: Change FPUnpacked to a normalized representation Having a known position for the highest set bit makes writing algorithms easier	2020-04-22 20:46:22 +01:00
MerryMage	3fe45c6d8e	block_of_code: Add ABI_PARAMS array	2020-04-22 20:46:22 +01:00
MerryMage	642b6c31d2	A64: Implement MLA, MLS (by element), vector single/double variant	2020-04-22 20:46:22 +01:00
MerryMage	0de37b11ad	A64: Implement FMLS (vector), single/double variant	2020-04-22 20:46:22 +01:00
MerryMage	64c2f698a2	emit_x64_vector_floating_point: Specify NanHandler::function_type explicitly MSVC doesn't like dealing with auto return types	2020-04-22 20:46:22 +01:00
MerryMage	2ef59b4f03	emit_x64_vector_floating_point: ChooseOnFsize arguments maybe_unused	2020-04-22 20:46:22 +01:00
MerryMage	04f325a05e	IR: Implement FPVectorNeg	2020-04-22 20:46:22 +01:00
MerryMage	934132e0c5	A64: Implement FMLA (vector), single/double variant	2020-04-22 20:46:22 +01:00
MerryMage	771a4fc20b	IR: Implement FPVectorMulAdd	2020-04-22 20:46:22 +01:00
MerryMage	3218bb9890	emit_x64_vector_floating_point: Standardize naming scheme	2020-04-22 20:46:22 +01:00
MerryMage	8f72be0a02	emit_x64_floating_point: Simplify indexers	2020-04-22 20:46:22 +01:00
MerryMage	25b28bb234	emit_x64_vector_floating_point: Simplify EmitVectorOperation*	2020-04-22 20:46:22 +01:00
MerryMage	1edd0125b2	mp: rename mp.h to mp/function_info.h	2020-04-22 20:46:22 +01:00
MerryMage	0921678edb	emit_x64_vector: Slightly improve ArithmeticShiftRightByte	2020-04-22 20:46:22 +01:00
MerryMage	43407c4bb4	emit_x64_vector: Simplify VectorShuffleImpl	2020-04-22 20:46:22 +01:00
MerryMage	ecbf9dbae5	IR: Implement A64OrQC	2020-04-22 20:46:22 +01:00
MerryMage	f0fecf2615	A64: Implement UQSHRN, UQRSHRN (vector)	2020-04-22 20:46:22 +01:00
MerryMage	8f4c1a8558	emit_x64_vector: -0x80000000 isn't -0x80000000	2020-04-22 20:46:22 +01:00
MerryMage	b455b566e7	A64: Implement UQXTN (vector)	2020-04-22 20:46:22 +01:00
MerryMage	e686a81612	emit_x64_vector: Fix non-SSE4.1 saturated narrowing reconstruction comparison Allows non-SSE4.1 to produce the correct FPSR.QC flag	2020-04-22 20:46:22 +01:00
MerryMage	3874cb37e3	A64: Implement SQXTN (vector)	2020-04-22 20:46:22 +01:00
MerryMage	8ef114d48f	emit_x64_vector: packusdw requires SSE4.1 In EmitVectorSignedSaturatedNarrowToUnsigned32.	2020-04-22 20:46:22 +01:00
MerryMage	712c6c1d7e	A64: Implement SQSHRUN, SQRSHRUN (vector)	2020-04-22 20:46:22 +01:00
MerryMage	c5722ec963	simd_shift_by_immediate: Simplify ShiftRight	2020-04-22 20:46:22 +01:00
MerryMage	f020dbe4ed	A64: Implement SQXTUN	2020-04-22 20:46:22 +01:00
MerryMage	6918ef7360	microinstruction: Reorganize FPSCR related instruction queries	2020-04-22 20:46:22 +01:00
Lioncash	a639fa5534	microinstruction: Add missing FP scalar opcodes to ReadsFromFPSCR() and WritesToFPSCR() These were forgotten when the opcodes were added.	2020-04-22 20:46:22 +01:00
Lioncash	3ca18d8a6d	u128: Make Bit() a const-qualified member function This function doesn't modify the struct members, so it can be made const.	2020-04-22 20:46:22 +01:00
MerryMage	b2e4c16ef8	A64: Implement FRSQRTS (vector), single/double variant	2020-04-22 20:46:22 +01:00
MerryMage	45dc5f74f3	A64: Implement FRSQRTE (vector), single/double variant	2020-04-22 20:46:22 +01:00
MerryMage	b74d5520f9	A64: Implement FRSQRTS (scalar), single/double variant	2020-04-22 20:46:22 +01:00
MerryMage	506e544bfe	IR: Implement FPRSqrtStepFused	2020-04-22 20:46:22 +01:00
MerryMage	6eb069e80d	fp: Implement FPRSqrtStepFused	2020-04-22 20:46:22 +01:00
MerryMage	b0ff35fcd1	fp: Implement FPNeg	2020-04-22 20:46:22 +01:00
MerryMage	ca6774ccce	process_nan: Add two operand variant	2020-04-22 20:46:22 +01:00
Lioncash	ace7d2ba50	A64: Implement FMAXP, FMINP, FMAXNMP and FMINNMP's scalar double/single-precision variant	2020-04-22 20:46:21 +01:00
MerryMage	66bb05fc0a	emit_x64_floating_point: Fixup special NaN case in FMA FPMulAdd implementation	2020-04-22 20:46:21 +01:00
Lioncash	070637e0f6	fp: Use a forward declaration in fused.h It's permissible to forward declare here, so we can do so and eliminate a direct header dependency	2020-04-22 20:46:21 +01:00
Lioncash	030820f649	u128: Implement comparison operators in terms of one another We can just implement the comparisons in terms of operator< and implement inequality with the negation of operator==.	2020-04-22 20:46:21 +01:00
MerryMage	76b07d6646	u128: StickyLogicalShiftRight requires special-casing for amount == 64 In this case (128 - amount) == 64, and this invokes undefined behaviour	2020-04-22 20:46:21 +01:00
Lioncash	49c7edf7c6	A64: Implement FMLA and FMLS (by element)'s double/single-precision scalar variant	2020-04-22 20:46:21 +01:00
Lioncash	c704acafe4	A64: Implement FMUL (by element)'s scalar double/single-precision variant	2020-04-22 20:46:21 +01:00
MerryMage	0ce11b7b15	emit_x64_floating_point: Implement accurate fallback for FPMulAdd{32,64}	2020-04-22 20:46:21 +01:00
MerryMage	e199887fbc	fp: Implement FPMulAdd	2020-04-22 20:46:21 +01:00
MerryMage	53a8c15d12	process_nan: Add FPProcessNaNs3	2020-04-22 20:46:21 +01:00
MerryMage	1c8e93e74d	block_of_code: Add SysV ABI fifth and sixth parameters	2020-04-22 20:46:21 +01:00
MerryMage	1fe8f51c54	u128: Add StickyLogicalShiftRight	2020-04-22 20:46:21 +01:00
MerryMage	b0afd53ea7	u128: Add Multiply64To128	2020-04-22 20:46:21 +01:00
MerryMage	5566fab29a	u128: Add u128::Bit	2020-04-22 20:46:21 +01:00
MerryMage	3e62fea003	u128: Add comparison operators	2020-04-22 20:46:21 +01:00
MerryMage	f17cd6f2c5	unpacked: Use ResidualErrorOnRightShift in FPRoundBase Fixes a bug relating to exponents that are severely out of range.	2020-04-22 20:46:21 +01:00
MerryMage	805428e35e	fp: Remove MantissaT	2020-04-22 20:46:21 +01:00
MerryMage	bda86fd167	FPRSqrtEstimate: Improve documentation of RecipSqrtEstimate	2020-04-22 20:46:21 +01:00
Lioncash	0a64a66b26	FPRSqrtEstimate: Deduplicate array bounds Dehardcodes a few constants in the loops.	2020-04-22 20:46:21 +01:00
Lioncash	b7bd70fd19	A64: Implement FMAXV, FMINV, FMAXNMV, and FMINNMV	2020-04-22 20:46:21 +01:00
Lioncash	664fb12e21	FPRSqrtEstimate: Use forward declarations where applicable	2020-04-22 20:46:21 +01:00
Lioncash	3447c82656	translate: Return by bool in helpers where applicable Gets rid of a bit of duplication regarding the early-out cases and makes all helper functions consistent (previously some had a return type of bool, while others had a return type of void).	2020-04-22 20:46:21 +01:00
Lioncash	d65b056eba	Simplify fallback case for EmitVectorSetElement64()	2020-04-22 20:46:21 +01:00
MerryMage	6087c2af6f	emit_x64_floating_point: s/Esimate/Estimate/	2020-04-22 20:46:21 +01:00
MerryMage	f837ce8e78	simd_scalar_two_register_misc: Implement FRSQRTE, scalar variant	2020-04-22 20:46:21 +01:00
MerryMage	bde58b04d4	IR: Implement FPRSqrtEstimate	2020-04-22 20:46:21 +01:00
MerryMage	16061c28f3	simd_vector_x_indexed_element: Implement FMUL (by element), vector variant	2020-04-22 20:46:21 +01:00
MerryMage	55eaa16615	a64_emit_x64: Ensure host has updated ticks in EmitA64GetCNTPCT Discovered by @Subv. Fixes incomplete fix begun in 5a91c94dca47c9702dee20fbd5ae1f4c07eef9df. That fix fails to take into account that LinkBlock doesn't update ticks until there are no remaining ticks to be executed. Test added to confirm fix.	2020-04-22 20:46:21 +01:00
MerryMage	edd795e991	a64_emit_x64: Fix stack misalignment on Windows for 128-bit exclusive writes Discovered by @Subv. Includes a test to ensure this codepath is exercised on Windows.	2020-04-22 20:46:21 +01:00
Lioncash	04b4c8b0cf	emit_x64_aes: Eliminate extraneous usage of a scratch register in EmitAESInverseMixColumns() We can just use the same register the data is in as the result register, eliminating the need to use a completely separate register to store the result.	2020-04-22 20:46:21 +01:00
Lioncash	e5d80e998e	A64: Implement SADDLV	2020-04-22 20:46:21 +01:00
Lioncash	a1bc8ddb53	A64: Implement UADDLV	2020-04-22 20:46:21 +01:00
Lioncash	1dc1e3dcd8	fp: Use forward declarations where applicable Minimizes the amount of files that need to be rebuilt if the headers ever change.	2020-04-22 20:46:21 +01:00
Lioncash	46cb0d813b	emit_x64_vector: Append 'v' prefix onto movq in AVX path This is something I missed when adding in the AVX broadcast code.	2020-04-22 20:46:21 +01:00
Subv	4606a081c9	A64: The A64SetTPIDR IR instruction writes to a system register and should not be eliminated by the dead code elimination pass. Previously this instruction was always eliminated, resulting in incorrect values for TPIDR_EL0.	2020-04-22 20:46:21 +01:00
MerryMage	b53127600b	fp: A64::FPCR -> FP::FPCR	2020-04-22 20:46:21 +01:00
MerryMage	084bf63a10	bit_util: Implement ClearBits and ModifyBits	2020-04-22 20:46:21 +01:00
MerryMage	699c5f36d5	system: Simplify static_cast	2020-04-22 20:46:21 +01:00
MerryMage	3f602129f4	system: Ensure value of CNTPCT_EL0 is accurate Since we currently only update the host's tick count at the end of a block, we force an end-of-block before executing a MRS %, CNTPCT_EL0 instruction.	2020-04-22 20:46:21 +01:00
Lioncash	84affdb260	safe_ops: Avoid cases where shift bases are invalid with signed values For example, say the converted signed type is s64, shifting left by 63 bits would be undefined behavior. However, given an ASL is essentially the same behavior as an LSL we can just use an unsigned type instead of converting to a signed type.	2020-04-22 20:46:21 +01:00
Lioncash	d0274f412a	safe_ops: Avoid signed overflow in Negate() Negation of values such as -9223372036854775808 can't be represented in signed equivalents (such as long long), leading to signed overflow. Therefore, we can just invert bits and add 1 to perform this behavior with unsigned arithmetic.	2020-04-22 20:46:21 +01:00
Lioncash	af3e23b224	simd_scalar_shift_by_immediate: Implement FCVT{ZS, ZU} (vector, fixed-point)'s scalar double/single-precision variant	2020-04-22 20:46:21 +01:00
Lioncash	91abf87169	simd_scalar_two_register_misc: Implement FCVT{AS, AU, MS, MU, NS, NU, PS, PU, ZS, ZU} (vector)'s scalar double/single-precision variants We can simply implement this in terms of the fixed-point IR opcodes.	2020-04-22 20:46:21 +01:00
Lioncash	0ec8dac660	emit_x64: Remove FPSCR_RoundTowardsZero() virtual function from EmitContext struct This code was bugged in that we were comparing if the rounding mode was not equal to rounding towards zero. Fortunately, however, nothing uses this function anymore, and there's already the more general FPSCR_RMode() available, so this can be removed entirely.	2020-04-22 20:46:21 +01:00
Lioncash	fd92e2f186	emit_x64: Add missing <array> include Commit 755adef62e504a8d616de9dda8937d2428a9471b introduced a helper alias for std::array, eliminating the need to manually type out sizes for them, however I forgot to add the include for <array>	2020-04-22 20:46:21 +01:00
Lioncash	f939bd0228	emit_x64_vector{_floating_point}: Add helper alias for sizing arrays relative to vector width Avoids needing to remember to specify the proper size of the arrays, all that's needed is to specify the type of the array and the size will automatically be deduced from it. This helps prevent potential oversized or undersized arrays from being specified.	2020-04-22 20:46:21 +01:00
MerryMage	58f3399032	A64/PopRSBHint: Prevent RETing to a guest PC of ~0ull from crashing the jit	2020-04-22 20:46:21 +01:00
MerryMage	e18fca17dc	A64: Implement FABD in terms of existing IR instructions Fixes NaN issue. Closes #306.	2020-04-22 20:46:21 +01:00
MerryMage	1dbe9d95e6	FPRoundInt: Final FPRound based on new sign While this shouldn't change any of the results in theory, it's just logically more consistent	2020-04-22 20:46:21 +01:00
MerryMage	83be491875	emit_x64_floating_point: SSE4.1 implementation of EmitFPRound	2020-04-22 20:46:20 +01:00
MerryMage	a40127a054	A64: Implement FRINTX, FRINTI (scalar)	2020-04-22 20:46:20 +01:00
MerryMage	962fa3b65e	A64: Implement FRINTP, FRINTM, FRINTZ (scalar)	2020-04-22 20:46:20 +01:00
MerryMage	5200bf41cf	A64: Implement FRINTN (scalar)	2020-04-22 20:46:20 +01:00
MerryMage	8718dc1692	A64: Implement FRINTA (scalar)	2020-04-22 20:46:20 +01:00
MerryMage	b228694012	IR: Implement FPRoundInt	2020-04-22 20:46:20 +01:00
MerryMage	e24054f4d7	fp: Implement FPRoundInt	2020-04-22 20:46:20 +01:00
MerryMage	f876e4afa2	fp: Implement FPProcessNaN	2020-04-22 20:46:20 +01:00
MerryMage	591adee443	fp/info: Add DefaultNaN	2020-04-22 20:46:20 +01:00
MerryMage	797e18cd97	fp: Move FPToFixed to its own file	2020-04-22 20:46:20 +01:00
MerryMage	295deb4035	a64_jit_state: Add FPSR.QC flag	2020-04-22 20:46:20 +01:00
Lioncash	7797bc2fb2	emit_x64_vector: Use non-scratch Use* variants of registers within EmitVectorUnsignedAbsoluteDifference() In some cases, a register isn't modified, depending on the branch taken, so we can signify this by using the non-scratch variants in certain cases.	2020-04-22 20:46:20 +01:00
Lioncash	f7f83b76b7	simd_scalar_two_register_misc: Implement scalar double/single-precision variants of FCM{EQ, GE, GT, LE, LT} (zero)	2020-04-22 20:46:20 +01:00
Lioncash	9db6d1e98b	translate_arm: Remove unnecessary rotr() function We already have RotateRight() in our common code, so we can remove this function and replace it with it. We can also implement ArmExpandImm_C() in terms of ArmExpandImm().	2020-04-22 20:46:20 +01:00
Lioncash	9f8a44c982	cast_util: Remove unnecessary typename Given we use std::aligned_storage_t, we don't need to specify typename here. If we used std::aligned_storage, then we would need to.	2020-04-22 20:46:19 +01:00
MerryMage	89e43867c1	A64: Implement FADDP (scalar)	2020-04-22 20:46:19 +01:00
MerryMage	33fa65de23	A64: Implement FADDP (vector)	2020-04-22 20:46:19 +01:00
MerryMage	9dba273a8c	A64: Implement SADDLP	2020-04-22 20:46:19 +01:00
MerryMage	70ff2d73b5	A64: Implement UADDLP	2020-04-22 20:46:19 +01:00
MerryMage	5563bbbd79	A64: Implement EXT	2020-04-22 20:46:19 +01:00
MerryMage	304cc7f61e	emit_x64_floating_point: SSE4.1 implementation for FP{Double,Single}ToFixed{S,U}{32,64}	2020-04-22 20:46:19 +01:00
MerryMage	3d9677d094	A64: Implement FCVTMU (scalar)	2020-04-22 20:46:19 +01:00
MerryMage	79c9018d60	A64: Implement FCVTMS (scalar)	2020-04-22 20:46:19 +01:00
MerryMage	49c4499a87	A64: Implement FCVTPU (scalar)	2020-04-22 20:46:19 +01:00
MerryMage	af661ef5a6	A64: Implement FCVTPS (scalar)	2020-04-22 20:46:19 +01:00
MerryMage	27319822bb	A64: Implement FCVTAU (scalar)	2020-04-22 20:46:19 +01:00
MerryMage	c0c7a26314	A64: Implement FCVTAS (scalar)	2020-04-22 20:46:19 +01:00
MerryMage	a1965a74a0	A64: Implement FCVTNU (scalar)	2020-04-22 20:46:19 +01:00
MerryMage	7d36dbcdfd	A64: Implement FCVTNS (scalar)	2020-04-22 20:46:19 +01:00
MerryMage	617ca0adf0	floating_point_conversion_integer: Refactor implementation of FCVTZS_float_int and FCVTZU_float_int	2020-04-22 20:46:19 +01:00
MerryMage	caaf36dfd6	IR: Initial implementation of FP{Double,Single}ToFixed{S,U}{32,64} This implementation just falls-back to the software floating point implementation.	2020-04-22 20:46:19 +01:00
MerryMage	760cc3ca89	EmitContext: Expose FPCR	2020-04-22 20:46:19 +01:00
MerryMage	9571269552	fp/op: Implement FPToFixed	2020-04-22 20:46:19 +01:00
MerryMage	8087e8df05	mantissa_util: Implement ResidualErrorOnRightShift Accurately calculate residual error that is shifted out	2020-04-22 20:46:19 +01:00
MerryMage	8668d61881	fp/unpacked: Implement FPRound	2020-04-22 20:46:19 +01:00
MerryMage	55d590c01f	FPCR: Add AHP setter and FZ16 getter	2020-04-22 20:46:19 +01:00
MerryMage	7360a2579b	mp: Implement metaprogramming library	2020-04-22 20:46:19 +01:00
MerryMage	4ab029c114	fp: Implement FPUnpack	2020-04-22 20:46:19 +01:00
MerryMage	4875658917	fp: Implement FPProcessException	2020-04-22 20:46:19 +01:00
MerryMage	3cb98e1560	fp: Move fp_util to fp/util	2020-04-22 20:46:19 +01:00
MerryMage	c41a38b13e	fp: Add FPSR	2020-04-22 20:46:19 +01:00
MerryMage	66381352f3	fp: Add FPInfo Provides information about floating-point format for various bit sizes	2020-04-22 20:46:19 +01:00
MerryMage	d21659152c	safe_ops: Implement safe shifting operations Implement shifting operations that perform consistently across architectures without running into undefined or implementation-defined behaviour.	2020-04-22 20:46:19 +01:00
MerryMage	b00fe23b91	bit_util: Implement MostSignificantBit	2020-04-22 20:46:19 +01:00
MerryMage	95ad0d0a66	bit_util: Use Ones to implement Bits	2020-04-22 20:46:19 +01:00
MerryMage	62b640b2fa	bit_util: Add ClearBit and ModifyBit	2020-04-22 20:46:19 +01:00
MerryMage	8651c2d10e	u128: Implement u128 For when we need a 128-bit integer	2020-04-22 20:46:19 +01:00
Lioncash	e7409fdfe4	A64: Implement UCVTF (vector, integer)'s double/single-precision variant	2020-04-22 20:46:19 +01:00
Lioncash	4aa4885ba7	ir: Add opcodes for vector conversion of u32/u64 to floating-point	2020-04-22 20:46:19 +01:00
Lioncash	fcae4e2418	simd_three_different: Deduplicate common implementations Generally, the only difference between the signed variants and the unsigned variants is whether or not we use a sign-extension or zero-extension, so we can simply use common functions to implement both cases without totally duplicating code twice here.	2020-04-22 20:46:19 +01:00
Lioncash	9c0d5cf15c	floating_point_conversion_integer: Handle S64/U64 -> F32 conversions in SCVTF_float_int and UCVTF_float_int	2020-04-22 20:46:19 +01:00
Lioncash	7a84b6e8d8	ir: Add opcodes for converting S64 and U64 to single-precision floating-point values	2020-04-22 20:46:19 +01:00
Lioncash	066061fa50	constant_pool: Remove unnecessary std::memset from constructor AllocateFromCodeSpace() already zeroes out the allocated memory.	2020-04-22 20:46:19 +01:00
Lioncash	a1d6a86e8c	A64: Implement ADDV	2020-04-22 20:46:19 +01:00
Lioncash	35026a6ce3	emit_x64_vector: Vectorize fallback path for EmitVectorMaxU32()	2020-04-22 20:46:19 +01:00
Lioncash	245c903129	simd_three_same: Join FPAbsoluteComparison() into FPCompareRegister() These are part of the same comparison family, so there's no real point in keeping them separate.	2020-04-22 20:46:19 +01:00
Lioncash	9912836b59	A64: Implement scalar double/single-precision variants of FACGE, FACGT, FCMEQ, FCMGE, FCMGT	2020-04-22 20:46:18 +01:00
MerryMage	0b97e9bd8d	emit_x64_floating_point: Fix EmitFPU64ToDouble for TowardsMinusInfinity rounding mode	2020-04-22 20:46:18 +01:00
MerryMage	a2eb9a02e0	backend_x86: Add FPSCR_RMode to EmitContext	2020-04-22 20:46:18 +01:00
MerryMage	d875c08ebf	fp: Extract common RoundingMode enum	2020-04-22 20:46:18 +01:00
Lioncash	3714bc0ed4	floating_point_conversion_integer: Use FPS64ToDouble and FPU64ToDouble in SCVTF_float_int and UCVTF_float_int The opcodes introduced in 979b6f39f1621b80bd463645ec5b08661cb6b1bf can also be used here, avoiding more falling back to the interpreter.	2020-04-22 20:46:18 +01:00
Lioncash	b97358075e	simd_scalar_two_register_misc: Handle 64-bit case in SCVTF and UCVTF's scalar double/single-precision variant Avoids falling back to the interpreter in the 64-bit case.	2020-04-22 20:46:18 +01:00
Lioncash	7252293184	emit_x64_floating_point: Correct use of UseGpr() in EmitFPU32ToDouble() and EmitFPU32ToSingle() In the non-AVX512 path, the following code is present: code.mov(from.cvt32(), from.cvt32()); since this potentially modifies 'from', we should be using UseScratchGpr() instead.	2020-04-22 20:46:18 +01:00
Lioncash	fbd7623fe5	emit_x64_floating_point: Add AVX512F conversion operations to EmitFPU32ToSingle() and EmitFPU32ToDouble() AVX-512F provides convenient instructions for these kinds of conversions directly	2020-04-22 20:46:18 +01:00
Lioncash	3a41465eaf	ir: Add opcodes for converting S64 and U64 to double-precision values	2020-04-22 20:46:18 +01:00
MerryMage	436ca80bcd	Merge branch 'global_monitor'	2020-04-22 20:46:18 +01:00
Lioncash	0f4bf26e05	simd_two_register_misc: Utilize FPVectorAbs in FABS implementations Since we already have opcodes introduced to implement FACGE and FACGT, we can reutilize it for the FABS implementations.	2020-04-22 20:46:18 +01:00
MerryMage	821cff1227	A64: Add ClearExclusiveState method	2020-04-22 20:46:18 +01:00
Lioncash	81e572c78c	ir: Extend FPVectorAbs opcode to also handle 16-bit elements for FP16	2020-04-22 20:46:18 +01:00
MerryMage	2a8de5f733	a64_emit_x64: Clear exclusive state in EmitA64CallSupervisor The kernel would have to execute an ERET instruction to return to userland; this clears exclusive state.	2020-04-22 20:46:18 +01:00
Lioncash	53dbb6a92a	A64: Implement FACGE's vector single/double precision variants	2020-04-22 20:46:18 +01:00
MerryMage	57f7c7e1b0	Implement global exclusive monitor	2020-04-22 20:46:18 +01:00
Lioncash	6912a02d9b	A64: Implement FACGT's vector single/double precision variants	2020-04-22 20:46:18 +01:00
MerryMage	85234338d3	a64_emit_x64: Simplify EmitExclusiveWrite	2020-04-22 20:46:18 +01:00
Lioncash	fc731dddae	ir: Add opcodes for performing vector absolute floating-point values This will be usable for implementing FACGE and FACGT	2020-04-22 20:46:18 +01:00
MerryMage	2fc6b33829	CMakeLists: Add missing files	2020-04-22 20:46:18 +01:00
Lioncash	0bee648b4f	emit_x64_vector: Deduplicate a bit of code in EmitVectorSetElement{8, 32, 64} functions Given both branches are the same, we can hoist out the common code.	2020-04-22 20:46:18 +01:00
Lioncash	d86fea0d28	A64: Implement FCMEQ (zero)'s vector single and double precision variant	2020-04-22 20:46:18 +01:00
Lioncash	593eca7fb1	A64: Implement load/store single structure instructions Implements LD{1, 2, 3, 4}, LD{1, 2, 3, 4}R, and ST{1, 2, 3, 4} single structure variants.	2020-04-22 20:46:18 +01:00
Lioncash	9bec354791	A64: Implement FCMEQ (register)'s vector single and double precision variant	2020-04-22 20:46:18 +01:00
Lioncash	b6e223fc58	emit_x64_vector: Deduplicate a bit of code within EmitVectorGetElement8() Given both branches use the same destination register size, we can hoist the common code out.	2020-04-22 20:46:18 +01:00
Lioncash	5ce187a54e	ir: Add opcodes for floating-point vector equalities	2020-04-22 20:46:18 +01:00
MerryMage	be354dbfd0	ir/basic_block: Add missing U16 immediate type to DumpBlock	2020-04-22 20:46:18 +01:00
Lioncash	cf188448d4	emit_x64_vector: Vectorize fallback case in EmitVectorMultiply64() Gets rid of the need to perform a fallback.	2020-04-22 20:46:18 +01:00
MerryMage	5503ff28c3	llvm_disassemble: Allow disassembly of invalid AArch64 instructions	2020-04-22 20:46:18 +01:00
Lioncash	954deff2d4	emit_x64_vector: Add break to final case in EmitVectorRoundingHalvingAddUnsigned() This doesn't alter behavior but does make the code better if anything else is ever added to this function in the future.	2020-04-22 20:46:18 +01:00
Lioncash	11a92eaaef	A64: Implement SRHADD and URHADD	2020-04-22 20:46:18 +01:00
Lioncash	9e75d08860	A64: Implement FABD's scalar single/double precision variant	2020-04-22 20:46:18 +01:00
Lioncash	bc718c5b28	ir: Add opcodes for performing rounding halving adds	2020-04-22 20:46:18 +01:00
Lioncash	d898d1779d	A64: Implement FABD's vector single/double precision variant	2020-04-22 20:46:18 +01:00
Lioncash	054549da35	emit_x64_vector: Simplify AVX-512 codepath in EmitVectorMultiply64 I realized I introduced a helper for simple AVX operation emitting, so use that instead of writing it all out long-form.	2020-04-22 20:46:18 +01:00
Lioncash	8a4f8aed06	ir: Add opcode for performing FP vector absolute differences	2020-04-22 20:46:18 +01:00
Lioncash	cb456f914b	A64: Implement UMLAL{2}, UMLSL{2}, and UMULL{2} Now that we have the helper function set up for the signed variants, we can also modify it to be used with the unsigned ones by performing a zero extension instead of a sign extension.	2020-04-22 20:46:18 +01:00
MerryMage	ba84e7a8de	A64: Implement FNMSUB	2020-04-22 20:46:18 +01:00
Lioncash	3576c02d91	A64: Implement SMLSL{2}	2020-04-22 20:46:18 +01:00
MerryMage	a1042cfcd8	A64: Implement FNMADD	2020-04-22 20:46:18 +01:00
Lioncash	ada5c0b2fa	A64: Implement SMLAL{2}	2020-04-22 20:46:18 +01:00
MerryMage	0d83032a6f	A64: Implement FMSUB	2020-04-22 20:46:18 +01:00
Lioncash	2d1aca25e6	A64: Implement SMULL{2}	2020-04-22 20:46:18 +01:00
MerryMage	69e00d225c	A64: Implement FMADD	2020-04-22 20:46:18 +01:00
MerryMage	8c90fcf58e	IR: Implement FPMulAdd	2020-04-22 20:46:18 +01:00
Lioncash	c5ae9107a9	A64: Implement SABAL/SABAL2 and SABDL/SABDL2 Now that we have a helper function for the unsigned variants, we can modify it to also be usable with the signed variants.	2020-04-22 20:46:18 +01:00
Lioncash	24e3299276	A64: Implement FCMGT, FCMGE (register) vector double and single precision variants	2020-04-22 20:46:18 +01:00
Lioncash	26d4473851	A64: Implement UABAL/UABAL2	2020-04-22 20:46:18 +01:00
Lioncash	350bc70be8	A64: Implement FCMGT, FCMGE, FCMLE, FCMLT (zero) vector double and single precision variants.	2020-04-22 20:46:18 +01:00
Lioncash	3397742c74	A64: Implement UABDL/UABDL2	2020-04-22 20:46:18 +01:00
Lioncash	c695da1cf3	ir: Add opcode for floating-point GE and GT comparisons The rest of the comparisons can be implemented in terms of these two	2020-04-22 20:46:18 +01:00
Lioncash	6de5ed96e5	emit_x64_vector: Emit VPMULLQ in EmitVectorMultiply64 on AVX-512{DQ, VL} capable CPUs Shortens code-gen down to a single instruction in the 64-bit path.	2020-04-22 20:46:18 +01:00
Lioncash	9054d1c20b	A64: Implement LDR (literal, SIMD&FP)	2020-04-22 20:46:18 +01:00
Lioncash	0da5e949a8	Correct typo in DataCacheOperation enum Fixes a typo for the InvalidateByVAToPoC enum entry. Given yuzu is the only known user of 64-bit mode and it doesn't use this value, we can get away with changing this.	2020-04-22 20:46:18 +01:00
Lioncash	9736e2cce2	A64: Implement FABS' half-precision variant	2020-04-22 20:46:18 +01:00
Lioncash	6e5750e4ec	A64: Implement FABS' single and double precision variant	2020-04-22 20:46:18 +01:00
Lioncash	7bce8d8757	A64: Implement URSHR (scalar) and URSRA (scalar) Now that the utility function is all set up from implementing SRSRA, the unsigned variants can now be trivially implemented by modifying the utility function to perform a logical shift right instead of an arithmetical shift right for the unsigned case.	2020-04-22 20:46:18 +01:00
Lioncash	1e70a589b0	A64: Implement SRSRA (scalar)	2020-04-22 20:46:18 +01:00
Lioncash	998aef07f6	A64: Implement SRSHR (scalar)	2020-04-22 20:46:17 +01:00
Lioncash	7c0250e9f8	A64: Implement SABA	2020-04-22 20:46:17 +01:00
Lioncash	f00789e6f7	A64: Implement SABD	2020-04-22 20:46:17 +01:00
Lioncash	1e10017f4b	ir: Add opcodes for signed absolute differences	2020-04-22 20:46:17 +01:00
Tillmann Karras	d3b44c1b5a	decoder_detail: use structured bindings	2020-04-22 20:46:17 +01:00
Lioncash	f745eb28bf	simd_two_register_misc: Handle 64-bit case for SCVTF_int_4	2020-04-22 20:46:17 +01:00
Lioncash	3f6c529da2	ir: Add opcode to perform the vector conversion S64->F64 Unfortunately x86 prior to AVX-512 doesn't really give us any convenient instruction to do the work for us	2020-04-22 20:46:17 +01:00
Lioncash	0e61ee6bf6	A64: Implement SHLL/SHLL2	2020-04-22 20:46:17 +01:00
Lioncash	43e6e98c3b	A64: Add missing decoding for PRFM (unscaled offset)	2020-04-22 20:46:17 +01:00
Lioncash	f2a85d5601	A64: Implement UHSUB	2020-04-22 20:46:17 +01:00
Lioncash	b33360a324	A64: Implement SHSUB	2020-04-22 20:46:17 +01:00
Lioncash	44a5f8095a	ir: Add opcodes for performing vector halving subtracts	2020-04-22 20:46:17 +01:00
Lioncash	4f37c0ec5a	A64: Implement SM4EKEY	2020-04-22 20:46:17 +01:00
Lioncash	3bde3347a5	A64: Implement SM4E	2020-04-22 20:46:17 +01:00
Lioncash	b312d28295	ir: Add an opcode for doing an SM4 lookup table query	2020-04-22 20:46:17 +01:00
Lioncash	27a6d5f6ce	emit_x64_vector: Use VPOPCNTB in EmitVectorPopulationCount() if AVX-512 BITALG is available	2020-04-22 20:46:17 +01:00
Lioncash	4dcc7724e0	A64: Implement UHADD	2020-04-22 20:46:17 +01:00
Lioncash	f8714f7250	A64: Implement SHADD	2020-04-22 20:46:17 +01:00
Lioncash	089096948a	ir: Add opcodes for performing halving adds	2020-04-22 20:46:17 +01:00
Lioncash	3d00dd63b4	emit_x64_vector: Emit VPMINSQ and VPMINUQ for 64-bit vector min operations if AVX-512VL is available	2020-04-22 20:46:17 +01:00
Lioncash	b97b71b8aa	emit_x64_vector: Emit VPMAXSQ and VPMAXUQ for 64-bit vector max operations if AVX-512VL is available	2020-04-22 20:46:17 +01:00
Lioncash	033e400df0	emit_x64_vector_floating_point: Deduplicate accurate NaN handling code Allows the code to both be used from the 32 bit and 64 bit operations without duplicating code.	2020-04-22 20:46:17 +01:00
Lioncash	0f067b7330	emit_x64_vector: Emit VPABSQ in EmitVectorAbs() for the 64-bit case if AVX-512VL is available	2020-04-22 20:46:17 +01:00
Lioncash	d4ee878cbd	emit_x64_vector: Use VPSRAQ in EmitVectorArithmeticShiftRight64() if AVX-512VL is available	2020-04-22 20:46:17 +01:00
Lioncash	b38dd191bd	disassembler_arm: Remove rotation helper function in favor of Common::RotateRight Mildly reduces the amount of duplicated behavior	2020-04-22 20:46:17 +01:00
Lioncash	51e4f1d9db	emit_x64_vector: Vectorize fallback path of EmitVectorMaxS32()	2020-04-22 20:46:17 +01:00
Lioncash	c692ccdd6d	emit_x64_vector: Vectorize fallback path of EmitVectorMaxS8()	2020-04-22 20:46:17 +01:00
Lioncash	b194313d8c	emit_x64_vector: Vectorize fallback path in EmitVectorMinU32()	2020-04-22 20:46:17 +01:00
Lioncash	7ceda6d919	emit_x64_vector: Vectorize fallback path in EmitVectorMinU16()	2020-04-22 20:46:17 +01:00
Lioncash	cda85a1da0	emit_x64_vector: Vectorize fallback path in EmitVectorMinS32()	2020-04-22 20:46:17 +01:00
Lioncash	6e08eed210	emit_x64_vector: Vectorize fallback path in EmitVectorMinS8()	2020-04-22 20:46:17 +01:00
Lioncash	0fb6dce689	emit_x64_vector: Remove unnecessary if constexpr expression in LogicalVShift This can simply be merged with the previous one.	2020-04-22 20:46:17 +01:00
Lioncash	5b71b1337b	emit_x64_vector: Avoid left shift of negative value in LogicalVShift Now that we handle the signed variants, we also have to be careful about left shifts with negative values, as this is considered undefined behavior.	2020-04-22 20:46:17 +01:00
Lioncash	9954d28868	a64_jitstate: Zero SP and PC on construction of A64JitState Given we zero out/reset everything else in the struct, do the same for these members to keep initialization consistent	2020-04-22 20:46:17 +01:00
Lioncash	4efbd40ea4	backend_x64/callback: Default virtual destructor in the cpp file Prevents the vtable being generated in each translation unit that includes the header (and silences -Wweak-vtables warnings)	2020-04-22 20:46:17 +01:00
Lioncash	edd0b5c8c7	a32_interface/a64_interface: Change reinterpret_casts to static_casts in GetCurrentBlock thunks It's well-defined to static_cast a void* to its proper type.	2020-04-22 20:46:17 +01:00
Lioncash	e71612d394	A64: Implement SSHL (scalar)	2020-04-22 20:46:17 +01:00
Lioncash	ef1e69a1e3	A64: Implement SSHL (vector)	2020-04-22 20:46:17 +01:00
Lioncash	21974ee57e	backend_x64/ir: Amend generic LogicalVShift() template to also handle signed variants Also adds IR opcodes to dispatch said variants	2020-04-22 20:46:17 +01:00
Lioncash	9fc89f0a0e	emit_x64_vector_floating_point: Use arrays for retrieving size instead of hardcoding the size Similar changes were done in emit_x64_vector, but these were missed.	2020-04-22 20:46:17 +01:00
Lioncash	af28e89a13	emit_x64_vector: Vectorize fallback path in EmitVectorMaxU16()	2020-04-22 20:46:17 +01:00
Lioncash	cda75e2079	A64: Implement CMTST's scalar variant	2020-04-22 20:46:17 +01:00
Lioncash	0d20423ad5	emit_x64_vector: Vectorize non-SSE4.1 fallback path for VectorMultiply32()	2020-04-22 20:46:17 +01:00
Lioncash	d70ee7c0d1	emit_x64_vector: Use VPBROADCAST where applicable and available Uses the instruction that does what it says in its name if available. Allows avoiding the use of a scratch register in EmitVectorBroadcast8() and EmitVectorBroadcastLower8()'s SSSE3 path.	2020-04-22 20:46:17 +01:00
Lioncash	bebe7235ae	A64: Implement UZP1 and UZP2	2020-04-22 20:46:17 +01:00
Lioncash	26d77c6f09	ir: Add opcodes for performing vector deinterleaving	2020-04-22 20:46:17 +01:00
Lioncash	d6f9ed47d9	A64: Implement FNEG (half-precision)	2020-04-22 20:46:17 +01:00
Lioncash	7efbd73bac	A64: Implement USHL (scalar)	2020-04-22 20:46:17 +01:00
Lioncash	41f4717f2b	A64: Implement FNEG (vector)	2020-04-22 20:46:17 +01:00
Lioncash	ba1cc6366d	A64: Implement RSUBHN/RSUBHN2	2020-04-22 20:46:17 +01:00
Lioncash	e41640fe33	A64: Implement RADDHN/RADDHN2	2020-04-22 20:46:17 +01:00
Lioncash	b719a6b3f7	A64: Implement XAR	2020-04-22 20:46:17 +01:00
Lioncash	0b1b131ec2	simd_two_register_misc: Factor out common comparison code Gets rid of a tiny bit of duplicated code.	2020-04-22 20:46:17 +01:00
Lioncash	ed0b84da70	A64: Implement CMLE (zero)'s vector variant	2020-04-22 20:46:17 +01:00
Lioncash	b595a68ffa	A64: Implement CMTST (vector)	2020-04-22 20:46:17 +01:00
Lioncash	48c7f8630c	A64: Implement ADDHN{2} and SUBHN{2}	2020-04-22 20:46:17 +01:00
Lioncash	3acd9c9200	translate: zero extend result in Vpart when storing to lower part of vector	2020-04-22 20:46:17 +01:00
Lioncash	87ca63699f	emit_x64_vector: Emit PMAXUD in EmitVectorMaxU32 on SSE4.1-capable CPUs	2020-04-22 20:46:17 +01:00
Lioncash	f17702f608	emit_x64_vector: Emit PMINUD in EmitVectorMinU32 on SSE4.1-capable CPUs	2020-04-22 20:46:17 +01:00
Lioncash	596a8dd1dd	emit_x64_vector: Emit PMINSD in EmitVectorMinS32 on SSE4.1-capable CPUs Provides a better alternative to a fallback operation.	2020-04-22 20:46:17 +01:00
Lioncash	75fd4eaaaa	emit_x64_vector: Get rid of some magic numbers in loop bounds	2020-04-22 20:46:17 +01:00
Lioncash	7b80ac25eb	emit_x64_vector: Generify variable shift functions	2020-04-22 20:46:17 +01:00
Lioncash	4ec735f707	A64: Implement CMLE (zero)'s scalar variant	2020-04-22 20:46:17 +01:00
Lioncash	6534184df2	A64: Implement CMLT (zero)'s scalar single/double-precision variant	2020-04-22 20:46:17 +01:00
Lioncash	8863c9bb4b	A64: Implement SHA512H2	2020-04-22 20:46:17 +01:00
Lioncash	033b890e25	A64: Implement SHA512H	2020-04-22 20:46:17 +01:00
Lioncash	d1f5b084b4	A64: Handle S32->F32 case for SCVTF (vector)	2020-04-22 20:46:17 +01:00
Lioncash	38fa984b53	IR: Add opcode for packed word->f32 conversions	2020-04-22 20:46:16 +01:00
Lioncash	b8587d8e34	A64: Implement SHA512SU1	2020-04-22 20:46:16 +01:00
Lioncash	44d846045a	A64: Implement SHA512SU0	2020-04-22 20:46:16 +01:00
Lioncash	ca903c1585	A64: Implement SHA256H and SHA256H2	2020-04-22 20:46:16 +01:00
MerryMage	e4237c44eb	A64: Implement SCVTF (vector, integer), scalar variant	2020-04-22 20:46:16 +01:00
MerryMage	bfba38d0b6	impl: Reorganize scalar two-register misc instructions	2020-04-22 20:46:16 +01:00
Lioncash	ea582b17cc	A64: Implement SHA256SU1	2020-04-22 20:46:16 +01:00
Lioncash	06c5dcaf5e	simd_two_register_misc: Add missing zeroing of the vector for CMGT and CMLT	2020-04-22 20:46:16 +01:00
Lioncash	0d50d7314b	A64: Implement CMGE (zero)'s vector variant	2020-04-22 20:46:16 +01:00
Lioncash	ab35dc0e78	A64: Implement MLS (by element)	2020-04-22 20:46:16 +01:00
Lioncash	1651e60462	A64: Implement MUL (by element)	2020-04-22 20:46:16 +01:00
MerryMage	a86d4093cd	A64: Implement MLA (by element)	2020-04-22 20:46:16 +01:00
Lioncash	7f47402609	A64: Implement ABS (scalar)	2020-04-22 20:46:16 +01:00
Lioncash	c8eb4528be	A64: Implement SHA256SU0	2020-04-22 20:46:16 +01:00
Lioncash	181c3b0790	A64: Implement SHA1M	2020-04-22 20:46:16 +01:00
Lioncash	47bc97a71b	A64: Implement SHA1P	2020-04-22 20:46:16 +01:00
Lioncash	718f3e9bb4	A64: Implement scalar variants of CMEQ, CMGT, and CMGE zero comparison instructions These can trivially use the ScalarCompare helper function.	2020-04-22 20:46:16 +01:00
Lioncash	3ad4e547e4	A64: Implement scalar variant of NEG	2020-04-22 20:46:16 +01:00
Lioncash	b4f3051e4b	simd: Relocate REV16, REV32 and REV64 vector variants to the proper file These aren't scalar instruction variants.	2020-04-22 20:46:16 +01:00
Lioncash	19e276d10f	A64: Implement CMEQ (register, scalar)	2020-04-22 20:46:16 +01:00
Lioncash	5b8c9e5146	A64: Implement CMHS (register, scalar)	2020-04-22 20:46:16 +01:00
Lioncash	78bb12276a	A64: Implement CMHI (register, scalar)	2020-04-22 20:46:16 +01:00
Lioncash	c18b20b8d1	A64: Implement CMGE (register, scalar)	2020-04-22 20:46:16 +01:00
Lioncash	755981d0da	A64: Implement CMGT (register, scalar)	2020-04-22 20:46:16 +01:00
Lioncash	da6627124b	A64: Implement SHA1C	2020-04-22 20:46:16 +01:00
Lioncash	3c013bd9f8	A64: Implement SLI (scalar)	2020-04-22 20:46:16 +01:00
Lioncash	154cac594a	A64: Implement SRI (scalar)	2020-04-22 20:46:16 +01:00
Lioncash	6bcfdba1ad	general: Remove unused lambda captures Resolves warnings that occur in Xcode 9.3	2020-04-22 20:46:16 +01:00
Lioncash	205ca6b4cb	A64: Implement SHA1SU1	2020-04-22 20:46:16 +01:00
Lioncash	16a001b9ff	A64: Implement SHA1SU0	2020-04-22 20:46:16 +01:00
Lioncash	3b6db59850	A64: Implement TRN2	2020-04-22 20:46:16 +01:00
Lioncash	30e158f8d0	A64: Implement TRN1	2020-04-22 20:46:16 +01:00
Lioncash	52cad2d9d0	A64: Implement SSRA (scalar)	2020-04-22 20:46:16 +01:00
Lioncash	255a33936d	A64: Implement SSHR (scalar)	2020-04-22 20:46:16 +01:00
Lioncash	6723b00497	A64: Implement USRA (scalar)	2020-04-22 20:46:16 +01:00
Lioncash	d56fa8f735	A64: Implement USHR (scalar)	2020-04-22 20:46:16 +01:00
Lioncash	870e418b0b	A64: Implement SHL (scalar)	2020-04-22 20:46:16 +01:00
Lioncash	97f2bea4f2	A64: Implement SM3PARTW1	2020-04-22 20:46:16 +01:00
Lioncash	e268b110f0	simd_sha512: Simplify RAX1 Now that the vector rotation helpers are in, replace the explicit shifting with the relevant helper function that does the same thing. Simply tidies up code; no behavioral changes are made.	2020-04-22 20:46:16 +01:00
Lioncash	20d2491267	A64: Implement SM3PARTW2	2020-04-22 20:46:16 +01:00
Lioncash	e1b662e90c	ir: Add helper functions for vector rotation	2020-04-22 20:46:16 +01:00
Lioncash	8a60a63a8b	A64: Implement SM3TT2B	2020-04-22 20:46:16 +01:00
Lioncash	b3d4c02098	A64: Implement SM3TT2A	2020-04-22 20:46:16 +01:00
Lioncash	7fbccabd81	A64: Implement SM3TT1B	2020-04-22 20:46:16 +01:00
Lioncash	769373b3ed	A64: Implement SM3TT1A	2020-04-22 20:46:16 +01:00
Lioncash	2d269fdcc7	simd_shift_by_immediate: Merge signed/unsigned helper functions Gets rid of a little more code duplication.	2020-04-22 20:46:16 +01:00
Lioncash	d5461be6b4	A64: Implement SM3SS1	2020-04-22 20:46:16 +01:00
Lioncash	2db032ac83	A64: Implement SRI (vector)	2020-04-22 20:46:16 +01:00
Lioncash	11005cfe26	A64: Implement SLI (vector)	2020-04-22 20:46:16 +01:00
Lioncash	e3d9bf55e7	A64: Implement SRSRA (vector)	2020-04-22 20:46:16 +01:00
Lioncash	bc6016cad7	A64: Implement SRSHR (vector)	2020-04-22 20:46:16 +01:00
MerryMage	6c9c829a08	imm: Add additional bit position checks to Imm::Bits	2020-04-22 20:46:16 +01:00
MerryMage	be907a61f7	math_util: rvalue references for std::forward	2020-04-22 20:46:16 +01:00
Lioncash	a2f8cdf0a3	A64: Implement SSUBL/SSUBL2	2020-04-22 20:46:16 +01:00
Lioncash	d456fb85c8	A64: Implement SADDL/SADDL2	2020-04-22 20:46:16 +01:00
Lioncash	5c9e7f328d	A64: Implement USUBL/USUBL2	2020-04-22 20:46:16 +01:00
Lioncash	88d70e3b8a	A64: Implement UADDL/UADDL2	2020-04-22 20:46:16 +01:00
Lioncash	4b3d70de5f	simd_shift_by_immediate: Factor out common code in shift instructions Gets rid of partial duplication of the same code for instructions that only have a small behavior difference to them. e.g. The only difference between SSHR and SSRA is that SSRA adds an accumulator before storing the result.	2020-04-22 20:46:16 +01:00
Lioncash	56803f5203	A64: Implement URSRA (vector)	2020-04-22 20:46:16 +01:00
Lioncash	8afdf4b23d	A64: Implement URSHR (vector)	2020-04-22 20:46:16 +01:00
Lioncash	16613ee066	A64: Implement RSHRN/RSHRN2	2020-04-22 20:46:15 +01:00
Lioncash	937990fd2a	A64: Implement SHRN/SHRN2	2020-04-22 20:46:15 +01:00
Lioncash	80e005e5b5	A64/translate: Amend I() to also handle u8 and u16 immediates This is necessary for instructions like SRSHR, and other related instructions.	2020-04-22 20:46:15 +01:00
MerryMage	7969871aa3	A64: Implement FMOV (vector, immediate) and mark other SIMD modified immediate instructions as unallocated	2020-04-22 20:46:15 +01:00
MerryMage	5c95e28ed0	A64: Implement ZIP2	2020-04-22 20:46:15 +01:00
MerryMage	871aefb9a0	decoder/a64: Tweak ordering algorithm Ensuring only instruction families are sorted with each other in the fashion previously devised does not admit a total ordering.	2020-04-22 20:46:15 +01:00
MerryMage	575590d18d	ir_emitter: Remove overloads Having overloads made explicit casting necessary for these functions when using types like UAny.	2020-04-22 20:46:15 +01:00
Lioncash	83ff7a43d1	A64: Implement RBIT (vector)	2020-04-22 20:46:15 +01:00
Lioncash	64b1f2d468	ir: Add opcode for reversing bits in a vector	2020-04-22 20:46:15 +01:00
Lioncash	9de60b60bb	A64/translate: Amend instruction prototypes erroneously marked as taking Reg Makes the prototypes consistent	2020-04-22 20:46:15 +01:00
Lioncash	cf81f04ed3	A64: Implement RAX1	2020-04-22 20:46:15 +01:00
Lioncash	7371e63a7b	a64_get_set_elimination_pass: Make TrackingType enum an enum class Prevents placing single letter enum members into the surrounding scope.	2020-04-22 20:46:15 +01:00
Lioncash	7bcb1c115a	A64: Implement ABS (vector)	2020-04-22 20:46:15 +01:00
Lioncash	e33dcce14a	ir: Add opcodes for performing vector absolute values	2020-04-22 20:46:15 +01:00
Lioncash	84d49309b9	A64: Implement USUBW/USUBW2	2020-04-22 20:46:15 +01:00
Lioncash	e20fce6b5a	A64: Implement SSUBW/SSUBW2	2020-04-22 20:46:15 +01:00
Lioncash	00af6eeab9	A64: Implement SADDW/SADDW2	2020-04-22 20:46:15 +01:00
MerryMage	78a047f0f9	A64: Implement EXT	2020-04-22 20:46:15 +01:00
MerryMage	3472f371df	IR: Implement VectorExtract, VectorExtractLower IR instructions	2020-04-22 20:46:15 +01:00
MerryMage	8bba37089e	A64: Implement UADDW	2020-04-22 20:46:15 +01:00
MerryMage	5c47f03888	A64: Implement FMUL (vector)	2020-04-22 20:46:15 +01:00
Lioncash	a6e264c2dd	A64: Implement UABA Now that we have unsigned absolute difference capabilities, we can just use this to append onto the result via a vector add.	2020-04-22 20:46:15 +01:00
Lioncash	c2e7364d3e	A64: Implement UABD	2020-04-22 20:46:15 +01:00
Lioncash	ad5cf584ce	ir: Add opcodes for performing vector unsigned absolute differences	2020-04-22 20:46:15 +01:00
Lioncash	7780af56e3	ir_emitter: Make immediate member functions const qualified These don't modify class state	2020-04-22 20:46:15 +01:00
Lioncash	701f43d61e	IR: Add opcodes for interleaving upper-order bytes/halfwords/words/doublewords I should have added this when I introduced the functions for interleaving low-order equivalents for consistency in the interface.	2020-04-22 20:46:15 +01:00
Lioncash	94f0fba16b	A64: Implement SHA1H This is a fairly trivial instruction; it's essentially: result = ROL(data, 30);	2020-04-22 20:46:15 +01:00
Lioncash	3985f7bf84	emit_x64_data_processing: Deduplicate some code in zero-extension functions EmitZeroExtendByteToLong() can be implemented in terms of EmitZeroExtendByteToWord() and EmitZeroExtendHalfToLong() can be implemented in terms of EmitZeroExtendHalfToWord().	2020-04-22 20:46:15 +01:00
Lioncash	40ec25356b	A64: NOP immediate variant of PRFM Makes behavior identical to the literal variant of PRFM. Given this is simply a hint instruction, this is valid behavior. The upside is that we don't fall back to Unicorn unnecessarily whenever the instruction is encountered.	2020-04-22 20:46:15 +01:00
MerryMage	e7b60189b3	abi: Missing includes	2020-04-22 20:46:15 +01:00
MerryMage	cdc5c3ad95	emit_x64_floating_point: Near jump instead of short jump in FPMinNumeric{32,64}	2020-04-22 20:46:15 +01:00
Lioncash	73b9e4b276	A64: system: Use an enum class for MRS/MSR register encodings Reduces the need to manually write out the register bit encodings repeatedly.	2020-04-22 20:46:15 +01:00
MerryMage	df4ee0f51e	emit_x64_floating_point: Near jmp to end instead of short jmp Jump destination can be further than what can be reached in a short jump under some FPCR options.	2020-04-22 20:46:15 +01:00
Lioncash	b8d5765f9b	emit_x64_vector: Fix typo in VectorShuffleImpl This is supposed to be pshufd, not pshufw (which only allows a 64-bit operand)	2020-04-22 20:46:15 +01:00
Lioncash	586b00d11d	A64: Implement REV64	2020-04-22 20:46:15 +01:00
Lioncash	ade595e377	bit_util: Do nothing in RotateRight if the rotation amount is zero Without this sanitizing it's possible to perform a shift with a shift amount that's the same size as the type being shifted. This actually occurs when decoding ORR variants. We could get fancier here and make this branchless, but we don't really use RotateRight in any performance intensive areas.	2020-04-22 20:46:15 +01:00
Lioncash	9128988dc3	A64: Implement REV32 (vector)	2020-04-22 20:46:15 +01:00
Lioncash	6b0010c940	ir: Add IR opcodes for emitting vector shuffles This uses the ARM terminology for sizes (Halfword -> 2 bytes, Word -> 4 bytes) as opposed to the x86 terminology of (Word -> 2 bytes, Double word -> 4 bytes)	2020-04-22 20:46:15 +01:00
Lioncash	eb2d28d2b1	emit_x64_vector_floating_point: Fix out of bounds array access in EmitVectorOperation64	2020-04-22 20:46:15 +01:00
Lioncash	6ad1bce5e0	A64: Implement REV16 (vector)	2020-04-22 20:46:15 +01:00
Lioncash	6177c2c63d	CMakeLists: Add fp_util, macro_util and math_util headers Allows the headers to show up within IDEs	2020-04-22 20:46:15 +01:00
Lioncash	7a66224d9a	A64: Implement EOR3 and BCAX	2020-04-22 20:46:15 +01:00
MerryMage	be5047c7c2	impl: Update PC when raising exception	2020-04-22 20:46:15 +01:00
MerryMage	49cc6d7fad	A64: Implement FDIV (vector)	2020-04-22 20:46:15 +01:00
MerryMage	fd075d8d68	system: Raise exception for YIELD, WFE, WFI, SEV, SEVL	2020-04-22 20:46:15 +01:00
MerryMage	c832cec96d	Correct FPSR and FPCR	2020-04-22 20:46:15 +01:00
MerryMage	147284427b	A64: Implement USHL	2020-04-22 20:46:15 +01:00
MerryMage	fd8f4c1195	A64: Implement UCVTF (vector, integer), scalar variant	2020-04-22 20:46:15 +01:00
MerryMage	be57608353	A64: Partially implement FCVTZU (scalar, fixed-point) and FCVTZS (scalar, fixed-point)	2020-04-22 20:46:15 +01:00
MerryMage	e4697b1676	A64: Implement system register TPIDR_EL0	2020-04-22 20:46:15 +01:00
MerryMage	e3da92024e	A64: Implement system registers FPCR and FPSR	2020-04-22 20:46:15 +01:00
MerryMage	9e4e4e9c1d	A64: Implement system register CNTPCT_EL0	2020-04-22 20:46:15 +01:00
MerryMage	1e15283d00	A64: Implement system register CTR_EL0	2020-04-22 20:46:15 +01:00
MerryMage	58fbb3ff1b	A64: Implement NEG (vector)	2020-04-22 20:46:15 +01:00
MerryMage	710d09471b	IR: Add IR instruction ZeroVector	2020-04-22 20:46:15 +01:00
MerryMage	2721bb5ace	emit_x64_floating_point: Add maybe_unused to preprocess parameter	2020-04-22 20:46:15 +01:00
MerryMage	0575e7421b	A64: Implement FMINNM (scalar)	2020-04-22 20:46:15 +01:00
MerryMage	1c9804ea07	A64: Implement FMAXNM (scalar)	2020-04-22 20:46:15 +01:00
MerryMage	1dfce0894d	constant_pool: Add frame parameter	2020-04-22 20:46:14 +01:00
MerryMage	bd2b415850	A64: Implement ADDP (scalar)	2020-04-22 20:46:14 +01:00
MerryMage	84f1c9b7f4	reg_alloc: Only exchange GPRs	2020-04-22 20:46:14 +01:00
MerryMage	9df3793af0	A64: Implement DUP (element), scalar variant	2020-04-22 20:46:14 +01:00
MerryMage	6541ec064d	emit_x64_floating_point: Correct FP{Max,Min}{32,64} implementations for -0/+0	2020-04-22 20:46:14 +01:00
MerryMage	2080a51f41	A64: Implement FMAX (scalar), FMIN (scalar)	2020-04-22 20:46:14 +01:00
MerryMage	7c193485e1	a64/config: Allow NaN emulation accuracy to be set	2020-04-22 20:46:14 +01:00
MerryMage	a3df46a75a	a64_emit_x64: Add conf to A64EmitContext	2020-04-22 20:46:14 +01:00
MerryMage	0e157b0198	A64: Implement FSQRT (scalar)	2020-04-22 20:46:14 +01:00
MerryMage	07520f32c3	backend_x64: Accurately handle NaNs	2020-04-22 20:46:14 +01:00
MerryMage	e97581d063	fuzz_with_unicorn: Print AArch64 disassembly	2020-04-22 20:46:14 +01:00
MerryMage	01c1e9017e	T32: Add initial decoder list	2020-04-22 20:46:14 +01:00
MerryMage	ccf7df057b	simd_three_same: Add VectorZeroUpper to CMGE (vector) and CMHS (vector)	2020-04-22 20:46:14 +01:00
MerryMage	8cebb87d0d	A64: Implement CMGT (zero), CMEQ (zero), CMLT (zero)	2020-04-22 20:46:14 +01:00
MerryMage	7f68d556ab	decoder/a64: Rearrange SIMD two-register misc decoders	2020-04-22 20:46:14 +01:00
MerryMage	d5af052f06	A64: Implement CMGE (register)	2020-04-22 20:46:14 +01:00
MerryMage	9d85991906	A64: Implement CMHI, CMHS	2020-04-22 20:46:14 +01:00
MerryMage	e2b9b7c5b0	IR: Implement Vector{Less,Greater}{,Equal}{Signed,Unsigned}	2020-04-22 20:46:14 +01:00
MerryMage	0df6725f73	A64: Implement SMAX, SMIN, UMAX, UMIN	2020-04-22 20:46:14 +01:00
MerryMage	47c0ad0fc8	IR: Implement Vector{Max,Min}{Signed,Unsigned}	2020-04-22 20:46:14 +01:00
MerryMage	adb7f5f86f	A64: Implement CMGT (register)	2020-04-22 20:46:14 +01:00
MerryMage	f4775910f5	IR: Implement VectorGreaterSigned	2020-04-22 20:46:14 +01:00
MerryMage	1f5b3bca43	Exclusive fixups * Incorrect size of exclusive_address * Disable tests on exclusive memory instructions for now	2020-04-22 20:46:14 +01:00
MerryMage	f3fa4a042f	a64_emit_x64: EmitExclusiveWrite: Make MSVC happy (narrowing conversion warning)	2020-04-22 20:46:14 +01:00
MerryMage	8698f057d0	A64: Implement STXP, STLXP, LDXP, LDAXP	2020-04-22 20:46:14 +01:00
MerryMage	2a6619d59c	A64: Implement CLREX	2020-04-22 20:46:14 +01:00
MerryMage	b7a2c1a7df	A64: Implement STXRB, STXRH, STXR, STLXRB, STLXRH, STLXR, LDXRB, LDXRH, LDXR, LDAXRB, LDAXRH, LDAXR	2020-04-22 20:46:14 +01:00
MerryMage	a6cc667509	Direct Page Table Access: Handle address spaces less than the full 64-bit in size	2020-04-22 20:46:14 +01:00
MerryMage	f45a5e17c6	Implement direct page table access	2020-04-22 20:46:14 +01:00
MerryMage	bfd3e30c75	callbacks: Member functions should be const	2020-04-22 20:46:14 +01:00
MerryMage	9f2f08db8d	a64_emit_x64: Implement {Read,Write}Memory128 in terms of a function call	2020-04-22 20:46:14 +01:00
MerryMage	6c4773e85b	abi: Add RAX to ABI_ALL_CALLER_SAVE	2020-04-22 20:46:14 +01:00
MerryMage	8756487554	A64: Partially implement MRS	2020-04-22 20:46:14 +01:00
MerryMage	bfd65bedfe	A64: Implement DSB, DMB	2020-04-22 20:46:14 +01:00
MerryMage	5edd623b9d	Implement DC instructions	2020-04-22 20:46:14 +01:00
Lioncash	a9153218bd	A64: Implement NOT (vector)	2020-04-22 20:46:14 +01:00
MerryMage	2cb0a699ba	IR: Implement FPMax, FPMin	2020-04-22 20:46:14 +01:00
MerryMage	aed4fd3ec3	A64: Implement FADD (vector), vector variant	2020-04-22 20:46:14 +01:00
MerryMage	98c8e7d1af	IR: Implement FPVectorAdd	2020-04-22 20:46:14 +01:00
MerryMage	5f77ab28ee	A64: Implement SSHLL, SSHLL2	2020-04-22 20:46:14 +01:00
MerryMage	eae518a338	IR: Implement VectorSignExtend	2020-04-22 20:46:14 +01:00
MerryMage	3738043e58	A64: Implement DUP (element), vector variant	2020-04-22 20:46:14 +01:00
MerryMage	ce7628b6b5	load_store_multiple_structures: Improve IR codegen for selem == 1 case	2020-04-22 20:46:14 +01:00
MerryMage	f1cb5581c9	A64: Implement FSUB (vector)	2020-04-22 20:46:14 +01:00
MerryMage	b9cd345ddc	IR: Implement FPVectorSub	2020-04-22 20:46:14 +01:00
MerryMage	851fc83445	emit_x64_vector: EmitOneArgumentFallback	2020-04-22 20:46:14 +01:00
MerryMage	f378d2ef1b	Forward declare IR::Opcode and IR::Type where possible	2020-04-22 20:46:14 +01:00
MerryMage	6c9b4f0114	A64: Implement CNT	2020-04-22 20:46:14 +01:00
MerryMage	303088a51e	IR: Implement VectorPopulationCount	2020-04-22 20:46:14 +01:00
MerryMage	1dd2b33b87	A64: Implement MLS (vector)	2020-04-22 20:46:14 +01:00
MerryMage	5eac3abf52	A64: Implement MLA (vector)	2020-04-22 20:46:14 +01:00
MerryMage	bf2cd92da9	emit_x64_vector: Add SSE4.1 implementation for EmitVectorMultiply64	2020-04-22 20:46:14 +01:00
MerryMage	b062266b8e	emit_x64_vector: More explicit lambda decay	2020-04-22 20:46:14 +01:00
MerryMage	3afd2fcbad	A64: Implement MUL (vector)	2020-04-22 20:46:14 +01:00
MerryMage	b6de612e01	IR: Implement VectorMultiply	2020-04-22 20:46:14 +01:00
MerryMage	90a053a5e4	emit_x64_vector: Order alphabetically	2020-04-22 20:46:14 +01:00
MerryMage	e7041d7196	A64: Implement STR (register, SIMD&FP), LDR (register, SIMD&FP)	2020-04-22 20:46:14 +01:00
MerryMage	a455ff70c9	decoder/a64: Don't rearrange unrelated decoders	2020-04-22 20:46:14 +01:00
MerryMage	faeb77e8c4	A64: Implement SUB (vector)	2020-04-22 20:46:14 +01:00
MerryMage	bd106c3ae7	A64: Implement SIMD instruction SSRA, vector variant	2020-04-22 20:46:14 +01:00
MerryMage	f58aba9871	A64: Implement SIMD instruction SSHR, vector variant	2020-04-22 20:46:14 +01:00
MerryMage	715ae1c229	IR: Implement VectorArithmeticShiftRight	2020-04-22 20:46:14 +01:00
MerryMage	653c82d8f0	impl: Improve Vpart setter	2020-04-22 20:46:14 +01:00
MerryMage	e858ce0b35	A64: Implement SIMD instructions XTN, XTN2	2020-04-22 20:46:13 +01:00
MerryMage	132c783320	IR: Implement VectorNarrow	2020-04-22 20:46:13 +01:00
MerryMage	1423584f9f	constant_pool: Allow for 128-bit constants	2020-04-22 20:46:13 +01:00
MerryMage	69de50a878	emit_x64_vector: Add SSE4.1 implementations for VectorZeroExtend	2020-04-22 20:46:13 +01:00
MerryMage	cbc9f361b0	IR: Implement VectorSub	2020-04-22 20:46:13 +01:00
MerryMage	3f93c77ace	A64: Implement SIMD instruction USRA, vector variant	2020-04-22 20:46:13 +01:00
MerryMage	fb9d20f27f	A64: Implement SIMD instruction USHR, vector variant	2020-04-22 20:46:13 +01:00
MerryMage	b22c5961f9	IR: Implement VectorLogicalShiftRight	2020-04-22 20:46:13 +01:00
MerryMage	7ff280827b	A64: Implement SIMD instructions USHLL, USHLL2	2020-04-22 20:46:13 +01:00
MerryMage	59ace60b03	IR: Implement VectorZeroExtend	2020-04-22 20:46:13 +01:00
MerryMage	d3a4e1efe2	IR: Vector instructions now take esize argument in emitter	2020-04-22 20:46:13 +01:00
MerryMage	1d0cd95b23	A64: Implement SIMD instruction SHL	2020-04-22 20:46:13 +01:00
MerryMage	f6247125c0	IR: Implement VectorLogicalShiftLeft{8,16,32,64}	2020-04-22 20:46:13 +01:00
MerryMage	15e8231f24	opcodes: Sort vector IR opcodes alphabetically	2020-04-22 20:46:13 +01:00
MerryMage	d74f4e35f6	block_of_code: Increase constant pool size	2020-04-22 20:46:13 +01:00
MerryMage	e69288f803	devirtualize: MinGW uses Itanium MFP ABI	2020-04-22 20:46:13 +01:00
MerryMage	ad428cbd7a	callback: Properly handle calls with return pointers and simplify interface	2020-04-22 20:46:13 +01:00
FernandoS27	15871910af	Implemented BSL, BIC, BIT and BIF vector instructions	2020-04-22 20:46:13 +01:00
MerryMage	7a87e3fc55	devirtualize: Handle Windows ABI	2020-04-22 20:46:13 +01:00
MerryMage	ba4a779c62	A32/decoder/arm: bug: Correct bitstring for SRS	2020-04-22 20:46:13 +01:00
MerryMage	f808a0fbde	devirtualize: Devirtualize Itanium ABI MFPs at runtime	2020-04-22 20:46:13 +01:00
MerryMage	afe16fa0f3	cast_util: Add BitCast and BitCastPointee	2020-04-22 20:46:13 +01:00
Lioncash	4e33629b0e	A64: Move SDIV and UDIV out of data_processing_multiply.cpp	2020-04-22 20:46:13 +01:00
Lioncash	35a29a9665	A64: Implement ZIP1	2020-04-22 20:46:13 +01:00
FernandoS27	586854117b	Implemented UMULH and SMULH instructions	2020-04-22 20:46:13 +01:00
MerryMage	1a7b7b541a	A64: Implement MOVI, MVNI, ORR (vector, immediate), BIC (vector, immediate) There wasn't a clean way to separate these instructions out.	2020-04-22 20:46:13 +01:00
MerryMage	8ab7d8175c	impl: Add AdvSIMDExpandImm	2020-04-22 20:46:13 +01:00
MerryMage	ea69cb4474	A64: Implement SUB (vector), scalar variant	2020-04-22 20:46:13 +01:00
MerryMage	4c5871d5d5	A64: Implement ADD (vector), scalar variant	2020-04-22 20:46:13 +01:00
MerryMage	2a0850c068	A64: Reorganize decoder tables (some vector entries were grouped with scalar entries)	2020-04-22 20:46:13 +01:00
MerryMage	7b33772ac6	A64: Implement BIC (vector, register)	2020-04-22 20:46:13 +01:00
MerryMage	eb5591859c	A64: Implement FMOV (general)	2020-04-22 20:46:13 +01:00
MerryMage	dd88cee15a	translate/impl: Add Vpart	2020-04-22 20:46:13 +01:00
MerryMage	cc9efd13c9	A64: Implement STLLRB, STLLRH, STLLR, LDLARB, LDLARH, LDLAR	2020-04-22 20:46:13 +01:00
MerryMage	81713c2b77	A64: Implement FCCMPE	2020-04-22 20:46:13 +01:00
MerryMage	ef906dbbfa	A64: Implement FCCMP	2020-04-22 20:46:13 +01:00
MerryMage	44c3c2312a	a64_jitstate: Remove unnecessary FPSCR_nzcv member	2020-04-22 20:46:13 +01:00
MerryMage	aac5af50e2	IR: FPCompare{32,64} now return NZCV flags instead of implicitly setting them	2020-04-22 20:46:13 +01:00
Lioncash	2ee39d6b36	A64: Implement FMOV (register)	2020-04-22 20:46:13 +01:00
MerryMage	b02b861242	A64: Implement STLRB, STLRH, STLR, LDARB, LDARH, LDAR	2020-04-22 20:46:13 +01:00
Lioncash	5a65313236	A64: Implement CCMP (immediate)	2020-04-22 20:46:13 +01:00
Lioncash	ab4664de61	A64: Implement CCMN (immediate)	2020-04-22 20:46:13 +01:00
Lioncash	a6c6539109	A64: Implement CCMP (register)	2020-04-22 20:46:13 +01:00
Lioncash	22632db337	microinstruction: Add ConditionalSelectNZCV opcode to ReadsFromCPSR()'s switch statement	2020-04-22 20:46:13 +01:00
MerryMage	c5033b5dda	A64: Implement CCMN (register)	2020-04-22 20:46:13 +01:00
MerryMage	dd2a6684fe	IR: Add ConditionalSelectNZCV instruction	2020-04-22 20:46:13 +01:00
MerryMage	4491746eae	A64: Implement FNEG	2020-04-22 20:46:13 +01:00
MerryMage	db958061a3	A64: Implement FABS	2020-04-22 20:46:13 +01:00
MerryMage	8765b421b7	A64: Implement FCSEL	2020-04-22 20:46:13 +01:00
MerryMage	7e82d8eede	A64: Implement SCVTF (scalar, integer), UCVTF (scalar, integer)	2020-04-22 20:46:13 +01:00
MerryMage	2409e5d082	A64: Implement FCVTZS (scalar, integer), FCVTZU (scalar, integer)	2020-04-22 20:46:13 +01:00
MerryMage	b173fcf34e	backend_x64: Simplify FPDoubleToU32 and FPSingleToU32 They're inaccurate in terms of FPSR at the moment anyway.	2020-04-22 20:46:13 +01:00
MerryMage	56bc7825ef	A64: Implement STR{,B,H} (register), LDR{,B,H,SB,SH,SW} (register), PRFM (register)	2020-04-22 20:46:13 +01:00
Lioncash	d040920727	Common: Put AES code within its own nested namespace Prevents the functions from potentially clashing with other stuff in Common in the future	2020-04-22 20:46:13 +01:00
Lioncash	40614202e7	A64: Implement AESD	2020-04-22 20:46:13 +01:00
Lioncash	ccef85dbb7	A64: Implement AESE	2020-04-22 20:46:13 +01:00
MerryMage	68f46c8334	backend_x64: Use a reference to BlockOfCode instead of a pointer	2020-04-22 20:46:13 +01:00
MerryMage	8931ee346b	IR: Add IR instruction NZCVFromPackedFlags This instruction expects NZCV to be in the high bits. i.e.: The positions they were in PSTATE.	2020-04-22 20:46:13 +01:00
MerryMage	0bb4474fb9	A64: Implement INS (general)	2020-04-22 20:46:13 +01:00
MerryMage	d13704fdef	A64: Implement INS (element)	2020-04-22 20:46:13 +01:00
MerryMage	0642d49919	A64: Implement SMOV	2020-04-22 20:46:13 +01:00
MerryMage	5297027ebe	A64: Implement UMOV	2020-04-22 20:46:13 +01:00
MerryMage	47661b746b	basic_block: Fix bogus GCC maybe-uninitialized warning	2020-04-22 20:46:13 +01:00
MerryMage	1fb0957aa3	A64: Implement FCVT	2020-04-22 20:46:13 +01:00
MerryMage	ca38225e08	fuzz_with_unicorn: Skip instructions that need to be interpreted	2020-04-22 20:46:13 +01:00
MerryMage	4be55b8b84	A64: Implement FMOV (scalar, immediate)	2020-04-22 20:46:13 +01:00
MerryMage	a07c05ea51	A64: Implement STUR (SIMD&FP), LDUR (SIMD&FP)	2020-04-22 20:46:13 +01:00
MerryMage	93fcbdf1e2	A64: Implement FCMP, FCMPE	2020-04-22 20:46:13 +01:00
MerryMage	75b8a76630	a64_jitstate: A64 does not have a separate FPSCR.NZCV	2020-04-22 20:46:13 +01:00
MerryMage	99d8ebe4d5	A64: Implement FMUL (scalar), FDIV (scalar), FADD (scalar), FSUB (scalar), FNMUL (scalar)	2020-04-22 20:46:13 +01:00
MerryMage	429dc24587	IR: Merge U32 and U64 variants of FP instructions	2020-04-22 20:46:13 +01:00
MerryMage	ed2bedec43	A64: Implement {ST,LD}{1,2,3,4} (multiple structures)	2020-04-22 20:46:13 +01:00
MerryMage	6414736a8d	emit_x64_vector: bug: VectorGetElement8 returning incorrect values for non-SSE4.1 This bug wasn't discovered earlier because we previously only used index == 0.	2020-04-22 20:46:13 +01:00
MerryMage	ebfc51c609	IR: Implement VectorSetElement{8,16,32,64}	2020-04-22 20:46:13 +01:00
Lioncash	a5c4fbc783	A64: Implement AESIMC and AESMC	2020-04-22 20:46:13 +01:00
Lioncash	744495e23d	iterator_util: Make Reverse constexpr C++17 makes non-member rbegin(), rend(), crbegin(), and crend() constexpr, allowing this to also be constexpr.	2020-04-22 20:46:12 +01:00
Lioncash	ab9b5fb8aa	Common: Relocate common bits of CRC32 Allows the algorithm to be used in any other potential backend.	2020-04-22 20:46:12 +01:00
Lioncash	af1384d700	A64: Implement CRC32	2020-04-22 20:46:12 +01:00
MerryMage	64761dbc72	scope_exit: Add SCOPE_SUCCESS and SCOPE_EXIT	2020-04-22 20:46:12 +01:00
MerryMage	bafb39ebc5	A64: Add Disassemble method	2020-04-22 20:46:12 +01:00
MerryMage	cc0eb18a0b	A32: data_processing: Remove !S assertions	2020-04-22 20:46:12 +01:00
MerryMage	865a30eb0d	A32: Implement BKPT	2020-04-22 20:46:12 +01:00
MerryMage	f023bbb893	A32: Add ExceptionRaised IR instruction and use it	2020-04-22 20:46:12 +01:00
Lioncash	7ffbebf290	A64: Implement CRC32C	2020-04-22 20:46:12 +01:00
MerryMage	d7044bc751	assert: Use fmt in ASSERT_MSG	2020-04-22 20:46:12 +01:00
MerryMage	52268298a8	a64_emit_x64: Perform RSB predictions	2020-04-22 20:46:12 +01:00
MerryMage	98ec9c5f90	A32: Change UserCallbacks to be similar to A64's interface	2020-04-22 20:46:12 +01:00
Lioncash	b9ce660113	reg_alloc: std::move RegAlloc's function argument	2020-04-22 20:46:12 +01:00
Lioncash	ed561d6653	General: Add missing override specifiers	2020-04-22 20:46:12 +01:00
MerryMage	b2d99eddc6	EmitZeroExtendLongToQuad: Do not rely on register allocator to zero extend 64->128	2020-04-22 20:46:12 +01:00
MerryMage	f4f774f9f6	a64_get_set_elimination_pass: Simplify algorithm	2020-04-22 20:46:12 +01:00
MerryMage	54de64f5bf	a64_emit_x64: bug: x64 sign-extends 32-bit immediates	2020-04-22 20:46:12 +01:00
MerryMage	6fc228f7fd	ir_opt: Add A64 Get/Set Elimination Pass	2020-04-22 20:46:12 +01:00
MerryMage	e01b500aea	ir_emitter: Allow the insertion point for new instructions to be set	2020-04-22 20:46:12 +01:00
MerryMage	af793c2527	{a32,a64}_interface: Predict entrypoint	2020-04-22 20:46:12 +01:00
Lioncash	7734cf1050	A64: Implement EXTR	2020-04-22 20:46:12 +01:00
MerryMage	88ae7fce52	A64: Implement LDP (SIMD&FP) and STP (SIMD&FP)	2020-04-22 20:44:38 +01:00
MerryMage	d497464c9f	a64_jitstate: Have 128-bit wide spills	2020-04-22 20:44:38 +01:00
MerryMage	b513b2ef05	IR: Implement IR instructions A64{Get,Set}S	2020-04-22 20:44:38 +01:00
MerryMage	16fa2cd8f6	a64_emit_x64: Use xword from Xbyak::util	2020-04-22 20:44:38 +01:00
Lioncash	67443efb62	General: Convert multiple namespace specifiers to nested namespace specifiers where applicable Makes namespacing a little less noisy	2020-04-22 20:44:38 +01:00
Lioncash	7abd673a49	A64: Zero upper 64 bits in ORN if using the 64-bit variant Resolves a TODO	2020-04-22 20:44:38 +01:00
MerryMage	ba3d6da0c8	load_store_register_unprivileged: bug: LDTRSW	2020-04-22 20:44:38 +01:00
MerryMage	75756137c6	A64: Implement CMEQ (register, vector)	2020-04-22 20:44:38 +01:00
MerryMage	d5283e46e8	IR: Implement IR instructions VectorEqual{8,16,32,64,128}	2020-04-22 20:44:38 +01:00
MerryMage	4ce9c65cfb	reg_alloc: Use std::exchange	2020-04-22 20:44:38 +01:00
Fernando Sahmkow	e0c12ec2ad	A64: Implemented EOR (vector), ORR (vector, register) and ORN (vector) Instructions (#142)	2020-04-22 20:44:38 +01:00
MerryMage	94383fd934	microinstruction: Missed A64{Read,Write}Memory128 from opcode information	2020-04-22 20:44:38 +01:00
MerryMage	d124a1d761	emit_x64_packed: EmitPackedSubU16 modified xmm_b wasn't writeable For CPUs that didn't support SSE4.1, this was a bug.	2020-04-22 20:44:38 +01:00
James Rowe	589ad7232f	Fixup: Xn|SP are 64 bit addresses encoded in the Rn field	2020-04-22 20:44:38 +01:00
James Rowe	ae880d8391	A64: Fix bugs and address review comments	2020-04-22 20:44:38 +01:00
James Rowe	3aeb7ca50c	Add missing returns	2020-04-22 20:44:38 +01:00
James Rowe	41e6e659c5	A64: Implement Load/Store register (unprivileged)	2020-04-22 20:44:37 +01:00
MerryMage	01a26fa644	fixup: travis: Test with disabled CPU feature detection	2020-04-22 20:44:37 +01:00
Lioncash	5281d3c6d5	CMakeLists: Add opcodes.inc to the source file list Allows the file to show up nicely within IDEs	2020-04-22 20:44:37 +01:00
MerryMage	30936f5e94	travis: Test with disabled CPU feature detection Ensure that fallbacks are working correctly.	2020-04-22 20:44:37 +01:00
MerryMage	285fd22c30	IR: Add IR instruction VectorZeroUpper	2020-04-22 20:44:37 +01:00
MerryMage	da3e9a5704	a64_emit_x64: bug: EmitA64WriteMemory128 should write not read	2020-04-22 20:44:37 +01:00
FernandoS27	ab84524806	Implemented SDIV and UDIV instructions	2020-04-22 20:44:37 +01:00
MerryMage	6033b05ca6	A64: Implement LDR/STR (immediate, SIMD&FP)	2020-04-22 20:44:37 +01:00
MerryMage	f698848e26	IR: Add IR instructions A64Memory{Read,Write}128 Add the Windows ABI implementation	2020-04-22 20:44:37 +01:00
MerryMage	e1df7ae621	IR: Add IR instructions A64Memory{Read,Write}128 This implementation only works on macOS and Linux.	2020-04-22 20:44:37 +01:00
MerryMage	e00a522cba	IR: Add IR instruction VectorGetElement{8,16,32,64}	2020-04-22 20:44:37 +01:00
MerryMage	28ccd85e5c	IR: Add IR instruction ZeroExtendToQuad	2020-04-22 20:44:37 +01:00
MerryMage	af848c627d	block_of_code: Add ABI_RETURN2	2020-04-22 20:44:37 +01:00
MerryMage	1749780929	interface: Move Vector typedef to config.h	2020-04-22 20:44:37 +01:00
MerryMage	33bba6028c	bit_util: bug: Infinite loop in HighestSetBit	2020-04-22 20:44:37 +01:00
MerryMage	3caf192f60	A64: Implement DUP (general)	2020-04-22 20:44:37 +01:00
MerryMage	793753bf63	IR: Implement Vector{Lower,}Broadcast{8,16,32,64}	2020-04-22 20:44:37 +01:00
Lioncash	8ee854232c	General: Default constructors and destructors where applicable	2020-04-22 20:44:37 +01:00
Lioncash	d1e4526e1c	ir_emitter: Remove unused includes	2020-04-22 20:44:37 +01:00
Lioncash	6f9216d544	A64: Implement RBIT	2020-04-22 20:44:37 +01:00
MerryMage	9b0a21915f	ir_emitter: Remove unimplemented IR instruction Unimplemented	2020-04-22 20:44:37 +01:00
MerryMage	db30e02ac8	emit_x64: Extract BlockRangeInformation, remove template parameter	2020-04-22 20:44:36 +01:00
MerryMage	58c4a25527	emit_x64: Use JitStateInfo	2020-04-22 20:42:46 +01:00
MerryMage	d4b05b28cf	A64: Implement CLS This is not the cleanest implementation.	2020-04-22 20:42:46 +01:00
MerryMage	b8e26bfdc3	A64: Implement ADDP (vector)	2020-04-22 20:42:46 +01:00
MerryMage	eaf545877a	IR: Implement Vector{Lower,}PairedAdd{8,16,32,64}	2020-04-22 20:42:46 +01:00
MerryMage	a554e4a329	backend_x64: Split emit_x64	2020-04-22 20:42:46 +01:00
MerryMage	394bd57bb6	microinstruction: bug: Add missing opcodes	2020-04-22 20:42:46 +01:00
Lioncash	bb1c5bd3b2	A64: Implement SMADDL, SMSUBL, UMADDL, and UMSUBL	2020-04-22 20:42:46 +01:00
Lioncash	c1a25bfc2f	A64: Implement MADD and MSUB	2020-04-22 20:42:46 +01:00
Lioncash	b7c5055d42	A64: Implement CLZ	2020-04-22 20:42:46 +01:00
Lioncash	b612782445	opcodes: Add 64-bit CountLeadingZeroes opcode	2020-04-22 20:42:46 +01:00
MerryMage	4c4efb2213	data_processing_register: Clean-up	2020-04-22 20:42:46 +01:00
Lioncash	ae5dbcbed6	A64: Implement HINT, NOP, YIELD, WFE, WFI, SEV, and SEVL Truly the most difficult A64 instructions to implement.	2020-04-22 20:42:46 +01:00
Lioncash	4d8f4aa8af	A64: Implement ASRV, LSLV, LSRV, and RORV	2020-04-22 20:42:46 +01:00
Lioncash	a8a65beb2b	data_processing_conditional_select: Implement CSINC, CSINV and CSNEG	2020-04-22 20:42:46 +01:00
Lioncash	b08be71775	a32/a64_emit_x64: Remove unused includes	2020-04-22 20:42:46 +01:00
MerryMage	f81d0a2536	A64: Implement AND (vector)	2020-04-22 20:42:46 +01:00
MerryMage	a63fc6c89b	A64: Implement ADD (vector, vector)	2020-04-22 20:42:46 +01:00
Thomas Guillemard	896cf44f96	A64: Implement REV, REV32, and REV16 (#126)	2020-04-22 20:42:46 +01:00
MerryMage	5eb0bdecdf	IR: Simplify types. F32 -> U32, F64 -> U64, F128 -> U128 ARM's Architecture Specification Language doesn't distinguish between floats and integers as much as we do. This makes some things difficult to implement. Since our register allocator is now capable of allocating values to XMMs and GPRs as necessary, the Transfer IR instructions are no longer necessary as they used to be and they can be removed.	2020-04-22 20:42:46 +01:00
MerryMage	9a812b0c61	reg_alloc: GetBitWidth: Add UNREACHABLE	2020-04-22 20:42:46 +01:00
MerryMage	fff8e019dc	reg_alloc: Consider bitwidth of data and registers when emitting instructions	2020-04-22 20:42:46 +01:00
MerryMage	144b629d8a	A64: Implement CSEL	2020-04-22 20:42:45 +01:00
MerryMage	6395f09f94	IR: Implement Conditional Select	2020-04-22 20:42:45 +01:00
MerryMage	19da68568e	A64/translate/branch: bug: Read-after-write error in BLR	2020-04-22 20:42:45 +01:00
MerryMage	9f57283a30	A64: Implement SBFM, BFM, UBFM	2020-04-22 20:42:45 +01:00
MerryMage	cdbc8d07a5	A64: Implement MOVN, MOVZ, MOVK	2020-04-22 20:42:45 +01:00
MerryMage	ecebe14a01	ir/location_descriptor: Add missing <functional> header for std::hash	2020-04-22 20:42:45 +01:00
MerryMage	4e3675da7b	a64_merge_interpret_blocks: Remove debug output	2020-04-22 20:42:45 +01:00
MerryMage	c6a091d874	A64: Optimization: Merge interpret blocks	2020-04-22 20:42:45 +01:00
MerryMage	21fe61eac6	A64/data_processing_pcrel: bug: ADR{,P} instructions sign extend their immediate	2020-04-22 20:42:45 +01:00
MerryMage	7c4b70751c	A64/data_processing_addsub: bug: {ADD,SUB}S (extended register) instructions write to ZR when d = 31	2020-04-22 20:42:45 +01:00
MerryMage	996ffd5488	a64_emit_x64: bug: A64CallSupervisor trampled callee-save registers	2020-04-22 20:42:45 +01:00
MerryMage	e4615a4562	emit_x64: bug: OP m/r64, imm32 form instructions sign-extend their immediate on x64	2020-04-22 20:42:45 +01:00
MerryMage	0992987c98	A64: Add ExceptionRaised IR instruction The purpose of this instruction is to raise exceptions when certain decode-time issues happen, instead of asserting at translate time. This allows us to use the translator for code analysis without worrying about unnecessary asserts, but also provides flexibility for the library user to perform custom behaviour when one of these states is raised.	2020-04-22 20:42:45 +01:00
MerryMage	61125d6dd1	A64/translate: Add TranslateSingleInstruction function	2020-04-22 20:42:45 +01:00
MerryMage	aa74a8130b	Misc. fixups of MSVC build	2020-04-22 20:42:45 +01:00
MerryMage	a1dfa01515	imm: Suppress MSVC warning C4244: value will never be truncated	2020-04-22 20:42:45 +01:00
MerryMage	26da149639	imm: compiler bug: MSVC 19.12 with /permissive- flag doesn't support fold expressions	2020-04-22 20:42:45 +01:00
MerryMage	b34c6616d4	A64/decoder: Split decoder data from header	2020-04-22 20:42:45 +01:00
MerryMage	72a793f5b0	ir_opt: Split off A32 specific passes	2020-04-22 20:42:45 +01:00
MerryMage	595f157e5e	A64: Implement LDP, STP	2020-04-22 20:42:45 +01:00
MerryMage	511215342b	A64/location_descriptor: Fix -fpermissive warning on GCC	2020-04-22 20:42:45 +01:00
MerryMage	243f06c613	A64: Implement LDP, STP	2020-04-22 20:42:45 +01:00
MerryMage	25411da838	A32: Implement load stores (immediate)	2020-04-22 20:42:45 +01:00
MerryMage	2aadeec291	A64: Implement SVC	2020-04-22 20:42:45 +01:00
MerryMage	9e27e4d250	imm: bug: SignExtend wasn't working for T with bit size > 32	2020-04-22 20:42:45 +01:00
MerryMage	10c60dda97	a64_emit_x64: Don't use far code for now	2020-04-22 20:42:45 +01:00
MerryMage	593a569b53	EmitA64SetW: bug: should zero extend to entire 64-bit register	2020-04-22 20:42:45 +01:00
MerryMage	6bd9f02911	EmitA64SetNZCV: bug: to_store is scratch	2020-04-22 20:42:45 +01:00
MerryMage	f0276dd53b	emit_x64: Fix nzcv for EmitSub	2020-04-22 20:42:45 +01:00
MerryMage	68391b0a05	A64: Implement SVC	2020-04-22 20:42:45 +01:00
MerryMage	e5ace37560	a64_emit_x64: Call interpreter	2020-04-22 20:42:45 +01:00
MerryMage	b12dead76a	A64: Add batch register retrieval to interface	2020-04-22 20:42:45 +01:00
MerryMage	cb481a3a48	A64: Implement compare and branch	2020-04-22 20:42:45 +01:00
MerryMage	e8bcf72ee5	A64: PSTATE access and tests	2020-04-22 20:42:45 +01:00
MerryMage	23f3afe0b3	A64: Implement branch (register)	2020-04-22 20:42:45 +01:00
MerryMage	86d1095df7	A64: Implement branch	2020-04-22 20:42:45 +01:00
MerryMage	0641445e51	A64: Implement logical	2020-04-22 20:42:45 +01:00
MerryMage	5a1d88c5dc	A64: Implement pcrel	2020-04-22 20:42:45 +01:00
MerryMage	c09e69bb97	A64: Implement addsub instructions	2020-04-22 20:42:44 +01:00
MerryMage	d1cef6ffb0	A64: Implement ADD_shifted	2020-04-22 20:42:44 +01:00
MerryMage	d1eb757f93	A64: Backend framework	2020-04-22 20:42:44 +01:00
MerryMage	e161cf16f5	A64: Initial framework	2020-04-22 20:42:44 +01:00
MerryMage	f61da0b5a9	IR: Compile-time type-checking of IR	2020-04-22 20:39:27 +01:00
MerryMage	44f7f04b5c	IR/Value: Rename RegRef and ExtRegRef to A32Reg and A32ExtReg	2020-04-22 20:39:27 +01:00
MerryMage	83022322d1	Make IR->A32 LocationDescriptor conversion explicit	2020-04-22 20:39:27 +01:00
MerryMage	9d15e0a8e1	Final A32 refactor	2020-04-22 20:39:27 +01:00
MerryMage	455757d7b6	EmitX64: JitState type as template parameter	2020-04-22 20:39:26 +01:00
MerryMage	2d164d9345	Package up emit context	2020-04-22 20:38:31 +01:00
MerryMage	7bf421dd38	Rename JitState to A32JitState	2020-04-22 20:38:31 +01:00
MerryMage	63bd1ece23	backend_x64: Split A32 specific emission into separate class	2020-04-22 20:38:29 +01:00
MerryMage	8bef20c24d	IR: Split off A32 specific opcodes	2020-04-22 20:33:32 +01:00
MerryMage	b1f0cf9278	A32: Split off A32 specific IREmitter	2020-04-22 20:33:32 +01:00
MerryMage	b3c73e2622	Label A32 specific code appropriately	2020-04-22 20:33:30 +01:00
MerryMage	dc357c780d	EmitPackedHalvingSub{U,S}16: SSE2 implementation	2020-04-22 20:27:15 +01:00
MerryMage	a98821da41	Merge branch 'misc' These commits introduce context save and restore, and a small number of optimizations that depend on their use for performance.	2020-04-22 20:27:15 +01:00
MerryMage	fc885ac80f	EmitPackedHalvingAddU8: Add SSE2 implementation	2020-04-22 20:27:15 +01:00
MerryMage	4682211729	EmitPackedHalvingAdd{U,S}16: Add SSE2 implementation	2020-04-22 20:27:15 +01:00
MerryMage	9ac1c87a51	emit_x64: EmitSet{Register,ExtendedRegister32,ExtendedRegister64}: Store from current source	2020-04-22 20:27:15 +01:00
MerryMage	6e834de072	Add re-entry prediction to avoid std::unordered_map lookups	2020-04-22 20:26:40 +01:00
MerryMage	984ce22431	emit_x64: Arguments to MostSignificantBit and IsZero are 32-bit	2020-04-22 20:26:40 +01:00
MerryMage	5c6fcf378f	emit_x64: Optimize code emitted by EmitGetCpsr	2020-04-22 20:26:40 +01:00
MerryMage	f595f85039	block_of_code: Remove vzeroupper	2020-04-22 20:26:40 +01:00
MerryMage	4393473d06	interface: Allow saving and storing of contexts	2020-04-22 20:26:40 +01:00
MerryMage	05f3f07704	emit_x64: Reduce MXCSR operations in EmitGetFpscr and EmitSetFpscr	2020-04-22 20:26:40 +01:00
MerryMage	19a7fb8992	jit_state: Split off CPSR.NZCV	2020-04-22 20:26:40 +01:00
MerryMage	0af1e7723d	CMakeLists: Fixup boost * boost is part of the public interface. * Consider boost a system library so warnings from boost do not cause a build failure. * If the parent project defines boost, use that.	2020-04-22 20:26:40 +01:00
MerryMage	a3432102b8	jit_state: Split off CPSR.Q	2020-04-22 20:26:40 +01:00
MerryMage	4f8675083c	interface_x64: Fix MSVC cast warning	2020-04-22 20:26:40 +01:00
MerryMage	311361b409	jit_state: Split off CPSR.{E,T} This allows us to improve code emission for PopRSBHint. We also improve code emission for other terminals at the same time.	2020-04-22 20:26:40 +01:00
MerryMage	cb119c2f72	emit_x64: Use boost::icl::interval_map to speed up ranged invalidation	2020-04-22 20:26:40 +01:00
MerryMage	3cca3bbd0b	jit_state: Split off CPSR.GE	2020-04-22 20:26:40 +01:00
MerryMage	6fde29f5d8	emit_x64: Remove unnecessary ABI overhead in ReadMemory, WriteMemory	2020-04-22 20:26:40 +01:00
MerryMage	6adc554b53	jit_state: Hide cpsr implementation	2020-04-22 20:26:40 +01:00
MerryMage	eb80aae9c0	block_of_code: Move MXCSR switching out of dispatch loop Also clarify MXCSR entry/exit terminology	2020-04-22 20:26:40 +01:00
MerryMage	a4e85ad565	emit_x64: Make RSB a stack	2020-04-22 20:26:40 +01:00
MerryMage	2a818f9d8e	Merge branch 'timing' We do this to improve timing information before entering a supervisor function. We also do this to try and stay within JITted code as much as possible, by updating the cycles we have remaining.	2020-04-22 20:26:37 +01:00
MerryMage	ea4c3292d5	BlockOfCode: Detect space remaining We also clear the code cache when we run out of space. This closes #111.	2020-04-22 20:26:12 +01:00
MerryMage	256749910f	Add AddTicks and GetTicksRemaining callbacks	2020-04-22 20:26:12 +01:00
MerryMage	80c56aa89d	Remove unnecessary use of boost::make_optional Closes #119.	2020-04-22 20:26:12 +01:00
MerryMage	de6a93a160	decoder_detail: Lambda captures may be unused if iota is an empty sequence Closes #120	2020-04-22 20:26:12 +01:00
MerryMage	3141dadea9	Remove UNUSED macro	2020-04-22 20:26:12 +01:00
MerryMage	7cac9519b0	microinstruction: Remove DecrementRemainingUses	2020-04-22 20:26:12 +01:00
MerryMage	639f7cfd2d	reg_alloc: Add IsLastUse optimization for UseScratch	2020-04-22 20:26:12 +01:00
MerryMage	6b122751fe	reg_alloc: Remove reliance on IR::Inst::DecrementRemainingUses	2020-04-22 20:26:12 +01:00
MerryMage	30049ca928	emit_x86: Standardize time of DefineValue call	2020-04-22 20:26:12 +01:00
MerryMage	5d72f7048f	basic_block: Add inst address and use count to DumpBlock This additional output assists with debugging.	2020-04-22 20:26:12 +01:00
Mat M	c6d09adcb7	CMakeLists: Derive the source file listings from targets directly (#118) This gets rid of the need to store to individual variables before creating the target itself, cleaning up the variables in the surrounding scope a little bit.	2020-04-22 20:26:07 +01:00
MerryMage	12eaf496fd	emit_x64: Perform mask creation for packed instructions in SSE	2020-04-22 20:26:07 +01:00
MerryMage	305e4baa29	emit_x64: Eliminate conversion of GE flags * We do this so that we can simplify PackedSelect. * We also try to minimise xmm-gpr/gpr-xmm transfers in PackedSelect.	2020-04-22 20:26:07 +01:00
MerryMage	d1e0a29cd9	Implement IR instruction PackedSelect, reimplement SEL	2020-04-22 20:26:07 +01:00
MerryMage	18f11972c6	emit_x64: Remove SSSE3 implementation of PackedHalvingAddU8 It is much slower than the SSE2 implementation, so there's no point keeping it around.	2020-04-22 20:26:07 +01:00
MerryMage	c4b40909f7	emit_x64: Improve code emission of FPCompare{32,64} Replace if-chain with table lookup	2020-04-22 20:26:07 +01:00
MerryMage	814e378249	VCMP and VCMPE were the other way around - This was due to a misunderstanding of what the E in VCMPE means. - The E refers to an exception being raised when a QNaN is encountered. - Added unit tests for VCMP{E}	2020-04-22 20:26:07 +01:00
MerryMage	08f638d447	emit_x64: pmaxuw and pminuw require SSE 4.1 This commit is intended to close citra-emu/citra#3137. pmaxuw and pminuw were used to perform unsigned comparisons; we emulate these using a signed comparison by offsetting the inputs by 0x8000 for CPUs that do not support SSE 4.1.	2020-04-22 20:26:07 +01:00
Mat M	522992965a	Common: Delete Pool's copy constructor and copy/move assignment operators (#117) The language defines a copy constructor as: TypeName(const TypeName&) so this was just deleting a constructor variant that would catch most cases of attempted copies.	2020-04-22 20:22:01 +01:00
Mat M	77fe2aeeaa	emit_x64: Amend doxygen parameters for InvalidateCacheRange() (#116)	2020-04-22 20:22:01 +01:00
MerryMage	19dcdde90b	block_of_code: Add vzeroupper instructions where AVX-SSE transitions may occur	2020-04-22 20:22:01 +01:00
MerryMage	60d9392b5c	block_of_code: BlockOfCode should provide cpu info	2020-04-22 20:22:01 +01:00
MerryMage	148c01e08f	interface_x64: Remove is_executing assert from HaltExecution In multithreaded code this can be triggered due to a race.	2017-10-14 23:35:01 +01:00
MerryMage	f6cf265bc5	block_of_code: BlockOfCode::ABI_* should be const	2017-09-29 01:35:24 +01:00
MerryMage	29471be317	Standardize location of storage-class specifiers: Place at beginning of declarations Justification: C99 specifies that doing otherwise is an obsolescent feature.	2017-09-29 01:23:45 +01:00
MerryMage	b992e5f8ec	Ranged cache invalidation * Fix clearing code block on a partial invalidation * Remove unnecessary use of boost::variant * Code cleanup	2017-09-11 00:11:05 +01:00
Lioncash	80477b5a67	externals: update fmt to 4.0	2017-08-27 21:43:21 +01:00
MerryMage	568b52d4ba	externals: Update Xbyak to v5.51 Xbyak now supports multi-byte nops	2017-08-17 21:34:54 +01:00
MerryMage	1613846ab0	reg_alloc: Handle XMM registers in LoadImmediate	2017-08-16 23:11:05 +01:00
MerryMage	993e142c6b	disassembler: Fix RegList	2017-08-05 01:57:29 +01:00
MerryMage	6197bde0fc	disassembler_arm: Fix disassembly of LDRH (reg)	2017-07-30 18:45:55 +01:00
Yuri Kunde Schlesner	38eb7e0314	emit_x64: Use alternative Xbyak names for and, or, xor Also enabled XBYAK_NO_OP_NAMES, allowing us to stop using -fno-operator-names.	2017-06-12 07:57:46 +01:00
James Rowe	82e8c99a47	Link against static fmtlib instead of header only When including fmtlib as a header only library in dynarmic, downstream projects cannot include fmtlib as a static library without getting linker errors.	2017-05-22 08:23:03 +01:00
MerryMage	599a613fea	Move SEL from status_register_access to misc	2017-04-25 13:57:27 +01:00
MerryMage	50bb317104	parallel: UQADD8 and UQADD16 are unpredictable when {d|n|m} == 15	2017-04-25 13:45:31 +01:00
MerryMage	7639dfea51	coprocessor: Use && instead of & with boolean arguments	2017-04-22 15:05:31 +01:00
MerryMage	2c9dcfa2db	backend_x64: Rename UnwindHandler to ExceptionHandler	2017-04-20 14:08:56 +01:00
MerryMage	0d47f50f57	block_of_code: Implement farcode	2017-04-19 18:58:36 +01:00
MerryMage	1c21ae6bcd	saturated: Implement QASX, QSAX, UQASX, UQSAX	2017-04-10 10:21:51 +01:00
MerryMage	9ac890c62d	reg_alloc: Fix for LLVM's interpretation of the System V ABI This aspect of the System V ABI is under-defined. LLVM chooses a different interpretation from GCC and ICC. Most other compilers assume the callee is responsible for zeroing the upper bits of the register if necessary. LLVM assumes the caller has zero-extended the register. This is a quick fix for this problem until zext-tracking is implemented.	2017-04-08 22:12:37 +01:00
MerryMage	a5bb81a97c	backend_x64: Remove dispatch loop in Jit::Run	2017-04-08 10:04:53 +01:00
MerryMage	1b37420459	backend_x64: Simplify dispatcher	2017-04-08 09:35:45 +01:00
MerryMage	523ae542f4	microinstruction: Implement HasAssociatedPseudoOperation	2017-04-04 13:10:50 +01:00
MerryMage	4c5de3905b	emit_x64: Correct mutation of immutable in FPThreeOp{32,64} operand (args[1]) was erroneously declared as non-scratch. operand's value could be modified if FTZ was enabled.	2017-04-01 09:57:14 +01:00
MerryMage	05e97058c3	parallel: Add and Subtract with Exchange improvements * Remove asx argument from PackedHalvingSubAdd{U16,S16} IR instruction * Implement Packed{Halving,}{AddSub,SubAdd}{U16,S16} IR instructions * Implement SASX, SSAX, UASX, USAX	2017-03-24 15:56:24 +00:00
Lynn	fd068ed6b8	Ranged cache invalidation	2017-03-20 11:58:25 +00:00
MerryMage	d9c69ad997	constant_pool: Implement a constant pool	2017-03-19 13:08:04 +00:00
Lioncash	5a02da445a	CMakeLists: Only link LLVM libs against the library LLVM library code is only used within the main dynarmic library, not the test executable.	2017-03-11 13:25:14 +00:00
Lioncash	d85137ed65	interface_x64: Amend LLVM disassembly code This would previously attempt to perform pointer arithmetic on void pointers, which would cause compilation errors.	2017-03-07 18:32:04 +00:00
Lioncash	d0efbb9348	CMakeLists: Remove unnecessary linker language specifiers This is already inferred by the cmake project being declared a CXX project.	2017-03-07 18:30:58 +00:00
Lioncash	9906be746f	CMakeLists: Make boost an interface library target Gets rid of the use of a non-target include and makes libraries explicitly link against the identifier name in order to get includes.	2017-03-04 11:52:32 +00:00
MerryMage	6396bd02f0	Merge branch 'simplify-reg-alloc'	2017-02-27 00:11:52 +00:00
MerryMage	92a01b0cd8	Prefer ASSERT to DEBUG_ASSERT	2017-02-26 23:30:40 +00:00
MerryMage	135346eb2e	reg_alloc: Move implementations out of header	2017-02-26 23:30:39 +00:00
MerryMage	184db36caf	reg_alloc: Call DecrementRemainingUses in only one place	2017-02-26 23:30:38 +00:00
MerryMage	51fc9fec05	reg_alloc: Reorganize	2017-02-26 23:30:37 +00:00
MerryMage	cf93ab3d31	reg_alloc: Remove old register allocator interface	2017-02-26 23:12:26 +00:00
MerryMage	08a467bf9a	emit_x64: Port to new register allocator interface	2017-02-26 23:12:25 +00:00
Lioncash	662e07337f	CMakeLists: Don't explicitly signify dynarmic as a static lib This allows a user of the library to explicitly control which kind of library type should be built with the CMake BUILD_SHARED_LIBS flag. By default, libraries will build as static without this specifier.	2017-02-26 23:08:49 +00:00
MerryMage	f883bad2cc	reg_alloc: New register allocation interface	2017-02-26 21:37:35 +00:00
MerryMage	13ac0c234e	reg_alloc: Differentiate between ReadLock and WriteLock	2017-02-26 21:37:34 +00:00
MerryMage	6c3df057fa	reg_alloc: Remove unused functions	2017-02-26 21:37:33 +00:00
MerryMage	1ee4c07f14	reg_alloc: Reimplement ScratchHostLocReg	2017-02-26 21:37:32 +00:00
MerryMage	640faab8a7	reg_alloc: UseHostLoc is no longer necessary	2017-02-26 21:37:30 +00:00
MerryMage	9518bbe06e	reg_alloc: Reimplement UseScratchHostLocReg	2017-02-26 21:37:29 +00:00
MerryMage	e1d8238c50	reg_alloc: Stub UseOpArg	2017-02-26 21:37:27 +00:00
MerryMage	2b078152e7	reg_alloc: Reimplement UseHostLocReg	2017-02-26 21:37:26 +00:00
MerryMage	aefe550428	reg_alloc: Remove the Def concept from register allocator internals	2017-02-26 21:37:25 +00:00
MerryMage	65cccf070e	reg_alloc: Properly encapsulate HostLocInfo	2017-02-26 21:37:24 +00:00
MerryMage	469bb6253f	backend_x64: Factor EmitExclusiveWriteMemory64 into ExclusiveWrite	2017-02-26 15:34:26 +00:00
MerryMage	d7ab1f9c64	backend_x64: Fix ABI violation in ReadMemory and WriteMemory Caller-save registers were not saved before call instruction. Refer to issue #98.	2017-02-26 15:34:25 +00:00
MerryMage	3768174783	ir_opt: Constant propagation pass works better with a DCE just before it	2017-02-26 15:28:35 +00:00
MerryMage	157585887e	ir_opt: Simplify dead-code elimination pass	2017-02-26 15:28:34 +00:00
MerryMage	bbeea72eba	ir_opt: Remove redundant shift instructions	2017-02-26 15:28:14 +00:00
MerryMage	517fe0f18e	emit_x64: WriteMemory* microinstructions do not define a value	2017-02-25 11:54:47 +00:00
MerryMage	1ff60bc69f	reg_alloc: Move OpArg into own header	2017-02-21 23:38:36 +00:00
MerryMage	4ed8ee7489	microinstruction: Void arguments when invalidating instruction	2017-02-18 21:29:23 +00:00
MerryMage	7fa5845c1f	extension: Implement SXTAB16 and SXTB16	2017-02-18 20:14:44 +00:00
MerryMage	73d1cf36c3	extension: Simplify UXTB16	2017-02-18 20:14:39 +00:00
MerryMage	6edcfeba0b	extension: Simplify rotation code	2017-02-18 20:14:37 +00:00
MerryMage	cc9d2c4603	saturated: Implement SSAT16 and USAT16	2017-02-18 17:43:57 +00:00
MerryMage	358cf7c322	vfp: Implement vectorized VFP instructions	2017-02-18 01:13:25 +00:00
MerryMage	f2dd82967f	load_store: Simplify implementation * Remove dead code * Standardise code style with rest of code base	2017-02-16 22:28:56 +00:00
MerryMage	058f7b5de6	emit_x64: Make EmitTerminal type-safe Avoid the use of boost::variant::which, which tends to produce code which is not verifiable at compile-time.	2017-02-16 19:40:51 +00:00
MerryMage	e197b10b96	common: Introduce utility function VisitVariant VisitVariant allows one to use a generic lambda to visit a boost::variant. This is necessary because boost::visit_variant requires the visitor type to provide a return type.	2017-02-16 19:30:56 +00:00
MerryMage	5a20a37d3f	arm/fpscr: Correct Stride implementation	2017-02-11 12:13:57 +00:00
MerryMage	033e8b9b1e	vfp: Rename variables a, b, c to more sensible names	2017-02-06 21:14:36 +00:00
MerryMage	2af39dfaa8	emit_x64: Make reg_alloc a local variable reg_alloc contains state that is only valid on a per-block basis, so there is no reason for it to be a member variable.	2017-02-04 09:29:35 +00:00
MerryMage	a0e9417912	ir_opt: Initial constant propagation pass implementation	2017-01-30 21:49:46 +00:00
MerryMage	2447f2f360	callbacks: Factorize memory callbacks into inner structure	2017-01-30 21:42:51 +00:00
MerryMage	642ccb0f66	ir/value: Support U16 immediates	2017-01-29 22:58:11 +00:00
MerryMage	5f7ffe0d0b	microinstruction: Implement Inst::AreAllArgsImmediates	2017-01-29 22:56:59 +00:00
MerryMage	22804dc6a5	microinstruction: Arguments of Inst::Use and Inst::UndoUse should be const	2017-01-29 22:53:46 +00:00

2267 commits