dynarmic

Author	SHA1	Message	Date
MerryMage	b74d5520f9	A64: Implement FRSQRTS (scalar), single/double variant	2020-04-22 20:46:22 +01:00
MerryMage	506e544bfe	IR: Implement FPRSqrtStepFused	2020-04-22 20:46:22 +01:00
MerryMage	6eb069e80d	fp: Implement FPRSqrtStepFused	2020-04-22 20:46:22 +01:00
MerryMage	b0ff35fcd1	fp: Implement FPNeg	2020-04-22 20:46:22 +01:00
MerryMage	ca6774ccce	process_nan: Add two operand variant	2020-04-22 20:46:22 +01:00
Lioncash	ace7d2ba50	A64: Implement FMAXP, FMINP, FMAXNMP and FMINNMP's scalar double/single-precision variant	2020-04-22 20:46:21 +01:00
MerryMage	66bb05fc0a	emit_x64_floating_point: Fixup special NaN case in FMA FPMulAdd implementation	2020-04-22 20:46:21 +01:00
Lioncash	070637e0f6	fp: Use a forward declaration in fused.h It's permissible to forward declare here, so we can do so and eliminate a direct header dependency	2020-04-22 20:46:21 +01:00
Lioncash	030820f649	u128: Implement comparison operators in terms of one another We can just implement the comparisons in terms of operator< and implement inequality with the negation of operator==.	2020-04-22 20:46:21 +01:00
MerryMage	76b07d6646	u128: StickyLogicalShiftRight requires special-casing for amount == 64 In this case (128 - amount) == 64, and this invokes undefined behaviour	2020-04-22 20:46:21 +01:00
Lioncash	49c7edf7c6	A64: Implement FMLA and FMLS (by element)'s double/single-precision scalar variant	2020-04-22 20:46:21 +01:00
Lioncash	c704acafe4	A64: Implement FMUL (by element)'s scalar double/single-precision variant	2020-04-22 20:46:21 +01:00
MerryMage	0ce11b7b15	emit_x64_floating_point: Implement accurate fallback for FPMulAdd{32,64}	2020-04-22 20:46:21 +01:00
MerryMage	e199887fbc	fp: Implement FPMulAdd	2020-04-22 20:46:21 +01:00
MerryMage	53a8c15d12	process_nan: Add FPProcessNaNs3	2020-04-22 20:46:21 +01:00
MerryMage	1c8e93e74d	block_of_code: Add SysV ABI fifth and sixth parameters	2020-04-22 20:46:21 +01:00
MerryMage	1fe8f51c54	u128: Add StickyLogicalShiftRight	2020-04-22 20:46:21 +01:00
MerryMage	b0afd53ea7	u128: Add Multiply64To128	2020-04-22 20:46:21 +01:00
MerryMage	5566fab29a	u128: Add u128::Bit	2020-04-22 20:46:21 +01:00
MerryMage	3e62fea003	u128: Add comparison operators	2020-04-22 20:46:21 +01:00
MerryMage	f17cd6f2c5	unpacked: Use ResidualErrorOnRightShift in FPRoundBase Fixes a bug relating to exponents that are severely out of range.	2020-04-22 20:46:21 +01:00
MerryMage	805428e35e	fp: Remove MantissaT	2020-04-22 20:46:21 +01:00
MerryMage	bda86fd167	FPRSqrtEstimate: Improve documentation of RecipSqrtEstimate	2020-04-22 20:46:21 +01:00
Lioncash	0a64a66b26	FPRSqrtEstimate: Deduplicate array bounds Dehardcodes a few constants in the loops.	2020-04-22 20:46:21 +01:00
Lioncash	b7bd70fd19	A64: Implement FMAXV, FMINV, FMAXNMV, and FMINNMV	2020-04-22 20:46:21 +01:00
Lioncash	664fb12e21	FPRSqrtEstimate: Use forward declarations where applicable	2020-04-22 20:46:21 +01:00
Lioncash	3447c82656	translate: Return by bool in helpers where applicable Gets rid of a bit of duplication regarding the early-out cases and makes all helper functions consistent (previously some had a return type of bool, while others had a return type of void).	2020-04-22 20:46:21 +01:00
Lioncash	d65b056eba	Simplify fallback case for EmitVectorSetElement64()	2020-04-22 20:46:21 +01:00
MerryMage	6087c2af6f	emit_x64_floating_point: s/Esimate/Estimate/	2020-04-22 20:46:21 +01:00
MerryMage	f837ce8e78	simd_scalar_two_register_misc: Implement FRSQRTE, scalar variant	2020-04-22 20:46:21 +01:00
MerryMage	bde58b04d4	IR: Implement FPRSqrtEstimate	2020-04-22 20:46:21 +01:00
MerryMage	16061c28f3	simd_vector_x_indexed_element: Implement FMUL (by element), vector variant	2020-04-22 20:46:21 +01:00
MerryMage	55eaa16615	a64_emit_x64: Ensure host has updated ticks in EmitA64GetCNTPCT Discovered by @Subv. Fixes incomplete fix begun in 5a91c94dca47c9702dee20fbd5ae1f4c07eef9df. That fix fails to take into account that LinkBlock doesn't update ticks until there are no remaining ticks to be executed. Test added to confirm fix.	2020-04-22 20:46:21 +01:00
MerryMage	edd795e991	a64_emit_x64: Fix stack misalignment on Windows for 128-bit exclusive writes Discovered by @Subv. Includes a test to ensure this codepath is exercised on Windows.	2020-04-22 20:46:21 +01:00
Lioncash	04b4c8b0cf	emit_x64_aes: Eliminate extraneous usage of a scratch register in EmitAESInverseMixColumns() We can just use the same register the data is in as the result register, eliminating the need to use a completely separate register to store the result.	2020-04-22 20:46:21 +01:00
Lioncash	e5d80e998e	A64: Implement SADDLV	2020-04-22 20:46:21 +01:00
Lioncash	a1bc8ddb53	A64: Implement UADDLV	2020-04-22 20:46:21 +01:00
Lioncash	1dc1e3dcd8	fp: Use forward declarations where applicable Minimizes the amount of files that need to be rebuilt if the headers ever change.	2020-04-22 20:46:21 +01:00
Lioncash	46cb0d813b	emit_x64_vector: Append 'v' prefix onto movq in AVX path This is something I missed when adding in the AVX broadcast code.	2020-04-22 20:46:21 +01:00
Subv	4606a081c9	A64: The A64SetTPIDR IR instruction writes to a system register and should not be eliminated by the dead code elimination pass. Previously this instruction was always eliminated, resulting in incorrect values for TPIDR_EL0.	2020-04-22 20:46:21 +01:00
MerryMage	b53127600b	fp: A64::FPCR -> FP::FPCR	2020-04-22 20:46:21 +01:00
MerryMage	084bf63a10	bit_util: Implement ClearBits and ModifyBits	2020-04-22 20:46:21 +01:00
MerryMage	699c5f36d5	system: Simplify static_cast	2020-04-22 20:46:21 +01:00
MerryMage	3f602129f4	system: Ensure value of CNTPCT_EL0 is accurate Since we currently only update the host's tick count at the end of a block, we force an end-of-block before executing a MRS %, CNTPCT_EL0 instruction.	2020-04-22 20:46:21 +01:00
Lioncash	84affdb260	safe_ops: Avoid cases where shift bases are invalid with signed values For example, say the converted signed type is s64, shifting left by 63 bits would be undefined behavior. However, given an ASL is essentially the same behavior as an LSL we can just use an unsigned type instead of converting to a signed type.	2020-04-22 20:46:21 +01:00
Lioncash	d0274f412a	safe_ops: Avoid signed overflow in Negate() Negation of values such as -9223372036854775808 can't be represented in signed equivalents (such as long long), leading to signed overflow. Therefore, we can just invert bits and add 1 to perform this behavior with unsigned arithmetic.	2020-04-22 20:46:21 +01:00
Lioncash	af3e23b224	simd_scalar_shift_by_immediate: Implement FCVT{ZS, ZU} (vector, fixed-point)'s scalar double/single-precision variant	2020-04-22 20:46:21 +01:00
Lioncash	91abf87169	simd_scalar_two_register_misc: Implement FCVT{AS, AU, MS, MU, NS, NU, PS, PU, ZS, ZU} (vector)'s scalar double/single-precision variants We can simply implement this in terms of the fixed-point IR opcodes.	2020-04-22 20:46:21 +01:00
Lioncash	0ec8dac660	emit_x64: Remove FPSCR_RoundTowardsZero() virtual function from EmitContext struct This code was bugged in that we were comparing if the rounding mode was not equal to rounding towards zero. Fortunately, however, nothing uses this function anymore, and there's already the more general FPSCR_RMode() available, so this can be removed entirely.	2020-04-22 20:46:21 +01:00
Lioncash	fd92e2f186	emit_x64: Add missing <array> include Commit 755adef62e504a8d616de9dda8937d2428a9471b introduced a helper alias for std::array, eliminating the need to manually type out sizes for them, however I forgot to add the include for <array>	2020-04-22 20:46:21 +01:00
Lioncash	f939bd0228	emit_x64_vector{_floating_point}: Add helper alias for sizing arrays relative to vector width Avoids needing to remember to specify the proper size of the arrays, all that's needed is to specify the type of the array and the size will automatically be deduced from it. This helps prevent potential oversized or undersized arrays from being specified.	2020-04-22 20:46:21 +01:00
MerryMage	58f3399032	A64/PopRSBHint: Prevent RETing to a guest PC of ~0ull from crashing the jit	2020-04-22 20:46:21 +01:00
MerryMage	e18fca17dc	A64: Implement FABD in terms of existing IR instructions Fixes NaN issue. Closes #306.	2020-04-22 20:46:21 +01:00
MerryMage	1dbe9d95e6	FPRoundInt: Final FPRound based on new sign While this shouldn't change any of the results in theory, it's just logically more consistent	2020-04-22 20:46:21 +01:00
MerryMage	83be491875	emit_x64_floating_point: SSE4.1 implementation of EmitFPRound	2020-04-22 20:46:20 +01:00
MerryMage	a40127a054	A64: Implement FRINTX, FRINTI (scalar)	2020-04-22 20:46:20 +01:00
MerryMage	962fa3b65e	A64: Implement FRINTP, FRINTM, FRINTZ (scalar)	2020-04-22 20:46:20 +01:00
MerryMage	5200bf41cf	A64: Implement FRINTN (scalar)	2020-04-22 20:46:20 +01:00
MerryMage	8718dc1692	A64: Implement FRINTA (scalar)	2020-04-22 20:46:20 +01:00
MerryMage	b228694012	IR: Implement FPRoundInt	2020-04-22 20:46:20 +01:00
MerryMage	e24054f4d7	fp: Implement FPRoundInt	2020-04-22 20:46:20 +01:00
MerryMage	f876e4afa2	fp: Implement FPProcessNaN	2020-04-22 20:46:20 +01:00
MerryMage	591adee443	fp/info: Add DefaultNaN	2020-04-22 20:46:20 +01:00
MerryMage	797e18cd97	fp: Move FPToFixed to its own file	2020-04-22 20:46:20 +01:00
MerryMage	295deb4035	a64_jit_state: Add FPSR.QC flag	2020-04-22 20:46:20 +01:00
Lioncash	7797bc2fb2	emit_x64_vector: Use non-scratch Use* variants of registers within EmitVectorUnsignedAbsoluteDifference() In some cases, a register isn't modified, depending on the branch taken, so we can signify this by using the non-scratch variants in certain cases.	2020-04-22 20:46:20 +01:00
Lioncash	f7f83b76b7	simd_scalar_two_register_misc: Implement scalar double/single-precision variants of FCM{EQ, GE, GT, LE, LT} (zero)	2020-04-22 20:46:20 +01:00
Lioncash	9db6d1e98b	translate_arm: Remove unnecessary rotr() function We already have RotateRight() in our common code, so we can remove this function and replace it with it. We can also implement ArmExpandImm_C() in terms of ArmExpandImm().	2020-04-22 20:46:20 +01:00
Lioncash	9f8a44c982	cast_util: Remove unnecessary typename Given we use std::aligned_storage_t, we don't need to specify typename here. If we used std::aligned_storage, then we would need to.	2020-04-22 20:46:19 +01:00
MerryMage	89e43867c1	A64: Implement FADDP (scalar)	2020-04-22 20:46:19 +01:00
MerryMage	33fa65de23	A64: Implement FADDP (vector)	2020-04-22 20:46:19 +01:00
MerryMage	9dba273a8c	A64: Implement SADDLP	2020-04-22 20:46:19 +01:00
MerryMage	70ff2d73b5	A64: Implement UADDLP	2020-04-22 20:46:19 +01:00
MerryMage	5563bbbd79	A64: Implement EXT	2020-04-22 20:46:19 +01:00
MerryMage	304cc7f61e	emit_x64_floating_point: SSE4.1 implementation for FP{Double,Single}ToFixed{S,U}{32,64}	2020-04-22 20:46:19 +01:00
MerryMage	3d9677d094	A64: Implement FCVTMU (scalar)	2020-04-22 20:46:19 +01:00
MerryMage	79c9018d60	A64: Implement FCVTMS (scalar)	2020-04-22 20:46:19 +01:00
MerryMage	49c4499a87	A64: Implement FCVTPU (scalar)	2020-04-22 20:46:19 +01:00
MerryMage	af661ef5a6	A64: Implement FCVTPS (scalar)	2020-04-22 20:46:19 +01:00
MerryMage	27319822bb	A64: Implement FCVTAU (scalar)	2020-04-22 20:46:19 +01:00
MerryMage	c0c7a26314	A64: Implement FCVTAS (scalar)	2020-04-22 20:46:19 +01:00
MerryMage	a1965a74a0	A64: Implement FCVTNU (scalar)	2020-04-22 20:46:19 +01:00
MerryMage	7d36dbcdfd	A64: Implement FCVTNS (scalar)	2020-04-22 20:46:19 +01:00
MerryMage	617ca0adf0	floating_point_conversion_integer: Refactor implementation of FCVTZS_float_int and FCVTZU_float_int	2020-04-22 20:46:19 +01:00
MerryMage	caaf36dfd6	IR: Initial implementation of FP{Double,Single}ToFixed{S,U}{32,64} This implementation just falls-back to the software floating point implementation.	2020-04-22 20:46:19 +01:00
MerryMage	760cc3ca89	EmitContext: Expose FPCR	2020-04-22 20:46:19 +01:00
MerryMage	9571269552	fp/op: Implement FPToFixed	2020-04-22 20:46:19 +01:00
MerryMage	8087e8df05	mantissa_util: Implement ResidualErrorOnRightShift Accurately calculate residual error that is shifted out	2020-04-22 20:46:19 +01:00
MerryMage	8668d61881	fp/unpacked: Implement FPRound	2020-04-22 20:46:19 +01:00
MerryMage	55d590c01f	FPCR: Add AHP setter and FZ16 getter	2020-04-22 20:46:19 +01:00
MerryMage	7360a2579b	mp: Implement metaprogramming library	2020-04-22 20:46:19 +01:00
MerryMage	4ab029c114	fp: Implement FPUnpack	2020-04-22 20:46:19 +01:00
MerryMage	4875658917	fp: Implement FPProcessException	2020-04-22 20:46:19 +01:00
MerryMage	3cb98e1560	fp: Move fp_util to fp/util	2020-04-22 20:46:19 +01:00
MerryMage	c41a38b13e	fp: Add FPSR	2020-04-22 20:46:19 +01:00
MerryMage	66381352f3	fp: Add FPInfo Provides information about floating-point format for various bit sizes	2020-04-22 20:46:19 +01:00
MerryMage	d21659152c	safe_ops: Implement safe shifting operations Implement shifting operations that perform consistently across architectures without running into undefined or implementation-defined behaviour.	2020-04-22 20:46:19 +01:00
MerryMage	b00fe23b91	bit_util: Implement MostSignificantBit	2020-04-22 20:46:19 +01:00
MerryMage	95ad0d0a66	bit_util: Use Ones to implement Bits	2020-04-22 20:46:19 +01:00
MerryMage	62b640b2fa	bit_util: Add ClearBit and ModifyBit	2020-04-22 20:46:19 +01:00
MerryMage	8651c2d10e	u128: Implement u128 For when we need a 128-bit integer	2020-04-22 20:46:19 +01:00
Lioncash	e7409fdfe4	A64: Implement UCVTF (vector, integer)'s double/single-precision variant	2020-04-22 20:46:19 +01:00
Lioncash	4aa4885ba7	ir: Add opcodes for vector conversion of u32/u64 to floating-point	2020-04-22 20:46:19 +01:00
Lioncash	fcae4e2418	simd_three_different: Deduplicate common implementations Generally, the only difference between the signed variants and the unsigned variants is whether or not we use a sign-extension or zero-extension, so we can simply use common functions to implement both cases without totally duplicating code twice here.	2020-04-22 20:46:19 +01:00
Lioncash	9c0d5cf15c	floating_point_conversion_integer: Handle S64/U64 -> F32 conversions in SCVTF_float_int and UCVTF_float_int	2020-04-22 20:46:19 +01:00
Lioncash	7a84b6e8d8	ir: Add opcodes for converting S64 and U64 to single-precision floating-point values	2020-04-22 20:46:19 +01:00
Lioncash	066061fa50	constant_pool: Remove unnecessary std::memset from constructor AllocateFromCodeSpace() already zeroes out the allocated memory.	2020-04-22 20:46:19 +01:00
Lioncash	a1d6a86e8c	A64: Implement ADDV	2020-04-22 20:46:19 +01:00
Lioncash	35026a6ce3	emit_x64_vector: Vectorize fallback path for EmitVectorMaxU32()	2020-04-22 20:46:19 +01:00
Lioncash	245c903129	simd_three_same: Join FPAbsoluteComparison() into FPCompareRegister() These are part of the same comparison family, so there's no real point in keeping them separate.	2020-04-22 20:46:19 +01:00
Lioncash	9912836b59	A64: Implement scalar double/single-precision variants of FACGE, FACGT, FCMEQ, FCMGE, FCMGT	2020-04-22 20:46:18 +01:00
MerryMage	0b97e9bd8d	emit_x64_floating_point: Fix EmitFPU64ToDouble for TowardsMinusInfinity rounding mode	2020-04-22 20:46:18 +01:00
MerryMage	a2eb9a02e0	backend_x86: Add FPSCR_RMode to EmitContext	2020-04-22 20:46:18 +01:00
MerryMage	d875c08ebf	fp: Extract common RoundingMode enum	2020-04-22 20:46:18 +01:00
Lioncash	3714bc0ed4	floating_point_conversion_integer: Use FPS64ToDouble and FPU64ToDouble in SCVTF_float_int and UCVTF_float_int The opcodes introduced in 979b6f39f1621b80bd463645ec5b08661cb6b1bf can also be used here, avoiding more falling back to the interpreter.	2020-04-22 20:46:18 +01:00
Lioncash	b97358075e	simd_scalar_two_register_misc: Handle 64-bit case in SCVTF and UCVTF's scalar double/single-precision variant Avoids falling back to the interpreter in the 64-bit case.	2020-04-22 20:46:18 +01:00
Lioncash	7252293184	emit_x64_floating_point: Correct use of UseGpr() in EmitFPU32ToDouble() and EmitFPU32ToSingle() In the non-AVX512 path, the following code is present: code.mov(from.cvt32(), from.cvt32()); since this potentially modifies 'from', we should be using UseScratchGpr() instead.	2020-04-22 20:46:18 +01:00
Lioncash	fbd7623fe5	emit_x64_floating_point: Add AVX512F conversion operations to EmitFPU32ToSingle() and EmitFPU32ToDouble() AVX-512F provides convenient instructions for these kinds of conversions directly	2020-04-22 20:46:18 +01:00
Lioncash	3a41465eaf	ir: Add opcodes for converting S64 and U64 to double-precision values	2020-04-22 20:46:18 +01:00
MerryMage	436ca80bcd	Merge branch 'global_monitor'	2020-04-22 20:46:18 +01:00
Lioncash	0f4bf26e05	simd_two_register_misc: Utilize FPVectorAbs in FABS implementations Since we already have opcodes introduced to implement FACGE and FACGT, we can reutilize it for the FABS implementations.	2020-04-22 20:46:18 +01:00
MerryMage	821cff1227	A64: Add ClearExclusiveState method	2020-04-22 20:46:18 +01:00
Lioncash	81e572c78c	ir: Extend FPVectorAbs opcode to also handle 16-bit elements for FP16	2020-04-22 20:46:18 +01:00
MerryMage	2a8de5f733	a64_emit_x64: Clear exclusive state in EmitA64CallSupervisor The kernel would have to execute an ERET instruction to return to userland; this clears exclusive state.	2020-04-22 20:46:18 +01:00
Lioncash	53dbb6a92a	A64: Implement FACGE's vector single/double precision variants	2020-04-22 20:46:18 +01:00
MerryMage	57f7c7e1b0	Implement global exclusive monitor	2020-04-22 20:46:18 +01:00
Lioncash	6912a02d9b	A64: Implement FACGT's vector single/double precision variants	2020-04-22 20:46:18 +01:00
MerryMage	85234338d3	a64_emit_x64: Simplify EmitExclusiveWrite	2020-04-22 20:46:18 +01:00
Lioncash	fc731dddae	ir: Add opcodes for performing vector absolute floating-point values This will be usable for implementing FACGE and FACGT	2020-04-22 20:46:18 +01:00
MerryMage	2fc6b33829	CMakeLists: Add missing files	2020-04-22 20:46:18 +01:00
Lioncash	0bee648b4f	emit_x64_vector: Deduplicate a bit of code in EmitVectorSetElement{8, 32, 64} functions Given both branches are the same, we can hoist out the common code.	2020-04-22 20:46:18 +01:00
Lioncash	d86fea0d28	A64: Implement FCMEQ (zero)'s vector single and double precision variant	2020-04-22 20:46:18 +01:00
Lioncash	593eca7fb1	A64: Implement load/store single structure instructions Implements LD{1, 2, 3, 4}, LD{1, 2, 3, 4}R, and ST{1, 2, 3, 4} single structure variants.	2020-04-22 20:46:18 +01:00
Lioncash	9bec354791	A64: Implement FCMEQ (register)'s vector single and double precision variant	2020-04-22 20:46:18 +01:00
Lioncash	b6e223fc58	emit_x64_vector: Deduplicate a bit of code within EmitVectorGetElement8() Given both branches use the same destination register size, we can hoist the common code out.	2020-04-22 20:46:18 +01:00
Lioncash	5ce187a54e	ir: Add opcodes for floating-point vector equalities	2020-04-22 20:46:18 +01:00
MerryMage	be354dbfd0	ir/basic_block: Add missing U16 immediate type to DumpBlock	2020-04-22 20:46:18 +01:00
Lioncash	cf188448d4	emit_x64_vector: Vectorize fallback case in EmitVectorMultiply64() Gets rid of the need to perform a fallback.	2020-04-22 20:46:18 +01:00
MerryMage	5503ff28c3	llvm_disassemble: Allow disassembly of invalid AArch64 instructions	2020-04-22 20:46:18 +01:00
Lioncash	954deff2d4	emit_x64_vector: Add break to final case in EmitVectorRoundingHalvingAddUnsigned() This doesn't alter behavior but does make the code better if anything else is ever added to this function in the future.	2020-04-22 20:46:18 +01:00
Lioncash	11a92eaaef	A64: Implement SRHADD and URHADD	2020-04-22 20:46:18 +01:00
Lioncash	9e75d08860	A64: Implement FABD's scalar single/double precision variant	2020-04-22 20:46:18 +01:00
Lioncash	bc718c5b28	ir: Add opcodes for performing rounding halving adds	2020-04-22 20:46:18 +01:00
Lioncash	d898d1779d	A64: Implement FABD's vector single/double precision variant	2020-04-22 20:46:18 +01:00
Lioncash	054549da35	emit_x64_vector: Simplify AVX-512 codepath in EmitVectorMultiply64 I realized I introduced a helper for simple AVX operation emitting, so use that instead of writing it all out long-form.	2020-04-22 20:46:18 +01:00
Lioncash	8a4f8aed06	ir: Add opcode for performing FP vector absolute differences	2020-04-22 20:46:18 +01:00
Lioncash	cb456f914b	A64: Implement UMLAL{2}, UMLSL{2}, and UMULL{2} Now that we have the helper function set up for the signed variants, we can also modify it to be used with the unsigned ones by performing a zero extension instead of a sign extension.	2020-04-22 20:46:18 +01:00
MerryMage	ba84e7a8de	A64: Implement FNMSUB	2020-04-22 20:46:18 +01:00
Lioncash	3576c02d91	A64: Implement SMLSL{2}	2020-04-22 20:46:18 +01:00
MerryMage	a1042cfcd8	A64: Implement FNMADD	2020-04-22 20:46:18 +01:00
Lioncash	ada5c0b2fa	A64: Implement SMLAL{2}	2020-04-22 20:46:18 +01:00
MerryMage	0d83032a6f	A64: Implement FMSUB	2020-04-22 20:46:18 +01:00
Lioncash	2d1aca25e6	A64: Implement SMULL{2}	2020-04-22 20:46:18 +01:00
MerryMage	69e00d225c	A64: Implement FMADD	2020-04-22 20:46:18 +01:00
MerryMage	8c90fcf58e	IR: Implement FPMulAdd	2020-04-22 20:46:18 +01:00
Lioncash	c5ae9107a9	A64: Implement SABAL/SABAL2 and SABDL/SABDL2 Now that we have a helper function for the unsigned variants, we can modify it to also be usable with the signed variants.	2020-04-22 20:46:18 +01:00
Lioncash	24e3299276	A64: Implement FCMGT, FCMGE (register) vector double and single precision variants	2020-04-22 20:46:18 +01:00
Lioncash	26d4473851	A64: Implement UABAL/UABAL2	2020-04-22 20:46:18 +01:00
Lioncash	350bc70be8	A64: Implement FCMGT, FCMGE, FCMLE, FCMLT (zero) vector double and single precision variants.	2020-04-22 20:46:18 +01:00
Lioncash	3397742c74	A64: Implement UABDL/UABDL2	2020-04-22 20:46:18 +01:00
Lioncash	c695da1cf3	ir: Add opcode for floating-point GE and GT comparisons The rest of the comparisons can be implemented in terms of these two	2020-04-22 20:46:18 +01:00
Lioncash	6de5ed96e5	emit_x64_vector: Emit VPMULLQ in EmitVectorMultiply64 on AVX-512{DQ, VL} capable CPUs Shortens code-gen down to a single instruction in the 64-bit path.	2020-04-22 20:46:18 +01:00
Lioncash	9054d1c20b	A64: Implement LDR (literal, SIMD&FP)	2020-04-22 20:46:18 +01:00
Lioncash	0da5e949a8	Correct typo in DataCacheOperation enum Fixes a typo for the InvalidateByVAToPoC enum entry. Given yuzu is the only known user of 64-bit mode and it doesn't use this value, we can get away with changing this.	2020-04-22 20:46:18 +01:00
Lioncash	9736e2cce2	A64: Implement FABS' half-precision variant	2020-04-22 20:46:18 +01:00
Lioncash	6e5750e4ec	A64: Implement FABS' single and double precision variant	2020-04-22 20:46:18 +01:00
Lioncash	7bce8d8757	A64: Implement URSHR (scalar) and URSRA (scalar) Now that the utility function is all set up from implementing SRSRA, the unsigned variants can now be trivially implemented by modifying the utility function to perform a logical shift right instead of an arithmetical shift right for the unsigned case.	2020-04-22 20:46:18 +01:00
Lioncash	1e70a589b0	A64: Implement SRSRA (scalar)	2020-04-22 20:46:18 +01:00
Lioncash	998aef07f6	A64: Implement SRSHR (scalar)	2020-04-22 20:46:17 +01:00
Lioncash	7c0250e9f8	A64: Implement SABA	2020-04-22 20:46:17 +01:00
Lioncash	f00789e6f7	A64: Implement SABD	2020-04-22 20:46:17 +01:00
Lioncash	1e10017f4b	ir: Add opcodes for signed absolute differences	2020-04-22 20:46:17 +01:00
Tillmann Karras	d3b44c1b5a	decoder_detail: use structured bindings	2020-04-22 20:46:17 +01:00
Lioncash	f745eb28bf	simd_two_register_misc: Handle 64-bit case for SCVTF_int_4	2020-04-22 20:46:17 +01:00
Lioncash	3f6c529da2	ir: Add opcode to perform the vector conversion S64->F64 Unfortunately x86 prior to AVX-512 doesn't really give us any convenient instruction to do the work for us	2020-04-22 20:46:17 +01:00
Lioncash	0e61ee6bf6	A64: Implement SHLL/SHLL2	2020-04-22 20:46:17 +01:00
Lioncash	43e6e98c3b	A64: Add missing decoding for PRFM (unscaled offset)	2020-04-22 20:46:17 +01:00
Lioncash	f2a85d5601	A64: Implement UHSUB	2020-04-22 20:46:17 +01:00
Lioncash	b33360a324	A64: Implement SHSUB	2020-04-22 20:46:17 +01:00
Lioncash	44a5f8095a	ir: Add opcodes for performing vector halving subtracts	2020-04-22 20:46:17 +01:00
Lioncash	4f37c0ec5a	A64: Implement SM4EKEY	2020-04-22 20:46:17 +01:00
Lioncash	3bde3347a5	A64: Implement SM4E	2020-04-22 20:46:17 +01:00
Lioncash	b312d28295	ir: Add an opcode for doing an SM4 lookup table query	2020-04-22 20:46:17 +01:00
Lioncash	27a6d5f6ce	emit_x64_vector: Use VPOPCNTB in EmitVectorPopulationCount() if AVX-512 BITALG is available	2020-04-22 20:46:17 +01:00
Lioncash	4dcc7724e0	A64: Implement UHADD	2020-04-22 20:46:17 +01:00
Lioncash	f8714f7250	A64: Implement SHADD	2020-04-22 20:46:17 +01:00
Lioncash	089096948a	ir: Add opcodes for performing halving adds	2020-04-22 20:46:17 +01:00
Lioncash	3d00dd63b4	emit_x64_vector: Emit VPMINSQ and VPMINUQ for 64-bit vector min operations if AVX-512VL is available	2020-04-22 20:46:17 +01:00
Lioncash	b97b71b8aa	emit_x64_vector: Emit VPMAXSQ and VPMAXUQ for 64-bit vector max operations if AVX-512VL is available	2020-04-22 20:46:17 +01:00
Lioncash	033e400df0	emit_x64_vector_floating_point: Deduplicate accurate NaN handling code Allows the code to both be used from the 32 bit and 64 bit operations without duplicating code.	2020-04-22 20:46:17 +01:00
Lioncash	0f067b7330	emit_x64_vector: Emit VPABSQ in EmitVectorAbs() for the 64-bit case if AVX-512VL is available	2020-04-22 20:46:17 +01:00
Lioncash	d4ee878cbd	emit_x64_vector: Use VPSRAQ in EmitVectorArithmeticShiftRight64() if AVX-512VL is available	2020-04-22 20:46:17 +01:00
Lioncash	b38dd191bd	disassembler_arm: Remove rotation helper function in favor of Common::RotateRight Mildly reduces the amount of duplicated behavior	2020-04-22 20:46:17 +01:00
Lioncash	51e4f1d9db	emit_x64_vector: Vectorize fallback path of EmitVectorMaxS32()	2020-04-22 20:46:17 +01:00
Lioncash	c692ccdd6d	emit_x64_vector: Vectorize fallback path of EmitVectorMaxS8()	2020-04-22 20:46:17 +01:00
Lioncash	b194313d8c	emit_x64_vector: Vectorize fallback path in EmitVectorMinU32()	2020-04-22 20:46:17 +01:00
Lioncash	7ceda6d919	emit_x64_vector: Vectorize fallback path in EmitVectorMinU16()	2020-04-22 20:46:17 +01:00
Lioncash	cda85a1da0	emit_x64_vector: Vectorize fallback path in EmitVectorMinS32()	2020-04-22 20:46:17 +01:00
Lioncash	6e08eed210	emit_x64_vector: Vectorize fallback path in EmitVectorMinS8()	2020-04-22 20:46:17 +01:00
Lioncash	0fb6dce689	emit_x64_vector: Remove unnecessary if constexpr expression in LogicalVShift This can simply be merged with the previous one.	2020-04-22 20:46:17 +01:00
Lioncash	5b71b1337b	emit_x64_vector: Avoid left shift of negative value in LogicalVShift Now that we handle the signed variants, we also have to be careful about left shifts with negative values, as this is considered undefined behavior.	2020-04-22 20:46:17 +01:00
Lioncash	9954d28868	a64_jitstate: Zero SP and PC on construction of A64JitState Given we zero out/reset everything else in the struct, do the same for these members to keep initialization consistent	2020-04-22 20:46:17 +01:00
Lioncash	4efbd40ea4	backend_x64/callback: Default virtual destructor in the cpp file Prevents the vtable being generated in each translation unit that includes the header (and silences -Wweak-vtables warnings)	2020-04-22 20:46:17 +01:00
Lioncash	edd0b5c8c7	a32_interface/a64_interface: Change reinterpret_casts to static_casts in GetCurrentBlock thunks It's well-defined to static_cast a void* to its proper type.	2020-04-22 20:46:17 +01:00
Lioncash	e71612d394	A64: Implement SSHL (scalar)	2020-04-22 20:46:17 +01:00
Lioncash	ef1e69a1e3	A64: Implement SSHL (vector)	2020-04-22 20:46:17 +01:00
Lioncash	21974ee57e	backend_x64/ir: Amend generic LogicalVShift() template to also handle signed variants Also adds IR opcodes to dispatch said variants	2020-04-22 20:46:17 +01:00
Lioncash	9fc89f0a0e	emit_x64_vector_floating_point: Use arrays for retrieving size instead of hardcoding the size Similar changes were done in emit_x64_vector, but these were missed.	2020-04-22 20:46:17 +01:00
Lioncash	af28e89a13	emit_x64_vector: Vectorize fallback path in EmitVectorMaxU16()	2020-04-22 20:46:17 +01:00
Lioncash	cda75e2079	A64: Implement CMTST's scalar variant	2020-04-22 20:46:17 +01:00
Lioncash	0d20423ad5	emit_x64_vector: Vectorize non-SSE4.1 fallback path for VectorMultiply32()	2020-04-22 20:46:17 +01:00
Lioncash	d70ee7c0d1	emit_x64_vector: Use VPBROADCAST where applicable and available Uses the instruction that does what it says in its name if available. Allows avoiding the use of a scratch register in EmitVectorBroadcast8() and EmitVectorBroadcastLower8()'s SSSE3 path.	2020-04-22 20:46:17 +01:00
Lioncash	bebe7235ae	A64: Implement UZP1 and UZP2	2020-04-22 20:46:17 +01:00
Lioncash	26d77c6f09	ir: Add opcodes for performing vector deinterleaving	2020-04-22 20:46:17 +01:00
Lioncash	d6f9ed47d9	A64: Implement FNEG (half-precision)	2020-04-22 20:46:17 +01:00
Lioncash	7efbd73bac	A64: Implement USHL (scalar)	2020-04-22 20:46:17 +01:00
Lioncash	41f4717f2b	A64: Implement FNEG (vector)	2020-04-22 20:46:17 +01:00
Lioncash	ba1cc6366d	A64: Implement RSUBHN/RSUBHN2	2020-04-22 20:46:17 +01:00
Lioncash	e41640fe33	A64: Implement RADDHN/RADDHN2	2020-04-22 20:46:17 +01:00
Lioncash	b719a6b3f7	A64: Implement XAR	2020-04-22 20:46:17 +01:00
Lioncash	0b1b131ec2	simd_two_register_misc: Factor out common comparison code Gets rid of a tiny bit of duplicated code.	2020-04-22 20:46:17 +01:00
Lioncash	ed0b84da70	A64: Implement CMLE (zero)'s vector variant	2020-04-22 20:46:17 +01:00
Lioncash	b595a68ffa	A64: Implement CMTST (vector)	2020-04-22 20:46:17 +01:00
Lioncash	48c7f8630c	A64: Implement ADDHN{2} and SUBHN{2}	2020-04-22 20:46:17 +01:00
Lioncash	3acd9c9200	translate: zero extend result in Vpart when storing to lower part of vector	2020-04-22 20:46:17 +01:00
Lioncash	87ca63699f	emit_x64_vector: Emit PMAXUD in EmitVectorMaxU32 on SSE4.1-capable CPUs	2020-04-22 20:46:17 +01:00
Lioncash	f17702f608	emit_x64_vector: Emit PMINUD in EmitVectorMinU32 on SSE4.1-capable CPUs	2020-04-22 20:46:17 +01:00
Lioncash	596a8dd1dd	emit_x64_vector: Emit PMINSD in EmitVectorMinS32 on SSE4.1-capable CPUs Provides a better alternative to a fallback operation.	2020-04-22 20:46:17 +01:00
Lioncash	75fd4eaaaa	emit_x64_vector: Get rid of some magic numbers in loop bounds	2020-04-22 20:46:17 +01:00
Lioncash	7b80ac25eb	emit_x64_vector: Generify variable shift functions	2020-04-22 20:46:17 +01:00
Lioncash	4ec735f707	A64: Implement CMLE (zero)'s scalar variant	2020-04-22 20:46:17 +01:00
Lioncash	6534184df2	A64: Implement CMLT (zero)'s scalar single/double-precision variant	2020-04-22 20:46:17 +01:00
Lioncash	8863c9bb4b	A64: Implement SHA512H2	2020-04-22 20:46:17 +01:00
Lioncash	033b890e25	A64: Implement SHA512H	2020-04-22 20:46:17 +01:00
Lioncash	d1f5b084b4	A64: Handle S32->F32 case for SCVTF (vector)	2020-04-22 20:46:17 +01:00
Lioncash	38fa984b53	IR: Add opcode for packed word->f32 conversions	2020-04-22 20:46:16 +01:00
Lioncash	b8587d8e34	A64: Implement SHA512SU1	2020-04-22 20:46:16 +01:00
Lioncash	44d846045a	A64: Implement SHA512SU0	2020-04-22 20:46:16 +01:00
Lioncash	ca903c1585	A64: Implement SHA256H and SHA256H2	2020-04-22 20:46:16 +01:00
MerryMage	e4237c44eb	A64: Implement SCVTF (vector, integer), scalar variant	2020-04-22 20:46:16 +01:00
MerryMage	bfba38d0b6	impl: Reorganize scalar two-register misc instructions	2020-04-22 20:46:16 +01:00
Lioncash	ea582b17cc	A64: Implement SHA256SU1	2020-04-22 20:46:16 +01:00
Lioncash	06c5dcaf5e	simd_two_register_misc: Add missing zeroing of the vector for CMGT and CMLT	2020-04-22 20:46:16 +01:00
Lioncash	0d50d7314b	A64: Implement CMGE (zero)'s vector variant	2020-04-22 20:46:16 +01:00
Lioncash	ab35dc0e78	A64: Implement MLS (by element)	2020-04-22 20:46:16 +01:00
Lioncash	1651e60462	A64: Implement MUL (by element)	2020-04-22 20:46:16 +01:00
MerryMage	a86d4093cd	A64: Implement MLA (by element)	2020-04-22 20:46:16 +01:00
Lioncash	7f47402609	A64: Implement ABS (scalar)	2020-04-22 20:46:16 +01:00
Lioncash	c8eb4528be	A64: Implement SHA256SU0	2020-04-22 20:46:16 +01:00
Lioncash	181c3b0790	A64: Implement SHA1M	2020-04-22 20:46:16 +01:00
Lioncash	47bc97a71b	A64: Implement SHA1P	2020-04-22 20:46:16 +01:00
Lioncash	718f3e9bb4	A64: Implement scalar variants of CMEQ, CMGT, and CMGE zero comparison instructions These can trivially use the ScalarCompare helper function.	2020-04-22 20:46:16 +01:00
Lioncash	3ad4e547e4	A64: Implement scalar variant of NEG	2020-04-22 20:46:16 +01:00
Lioncash	b4f3051e4b	simd: Relocate REV16, REV32 and REV64 vector variants to the proper file These aren't scalar instruction variants.	2020-04-22 20:46:16 +01:00
Lioncash	19e276d10f	A64: Implement CMEQ (register, scalar)	2020-04-22 20:46:16 +01:00
Lioncash	5b8c9e5146	A64: Implement CMHS (register, scalar)	2020-04-22 20:46:16 +01:00
Lioncash	78bb12276a	A64: Implement CMHI (register, scalar)	2020-04-22 20:46:16 +01:00
Lioncash	c18b20b8d1	A64: Implement CMGE (register, scalar)	2020-04-22 20:46:16 +01:00
Lioncash	755981d0da	A64: Implement CMGT (register, scalar)	2020-04-22 20:46:16 +01:00
Lioncash	da6627124b	A64: Implement SHA1C	2020-04-22 20:46:16 +01:00
Lioncash	3c013bd9f8	A64: Implement SLI (scalar)	2020-04-22 20:46:16 +01:00
Lioncash	154cac594a	A64: Implement SRI (scalar)	2020-04-22 20:46:16 +01:00
Lioncash	6bcfdba1ad	general: Remove unused lambda captures Resolves warnings that occur in Xcode 9.3	2020-04-22 20:46:16 +01:00
Lioncash	205ca6b4cb	A64: Implement SHA1SU1	2020-04-22 20:46:16 +01:00
Lioncash	16a001b9ff	A64: Implement SHA1SU0	2020-04-22 20:46:16 +01:00
Lioncash	3b6db59850	A64: Implement TRN2	2020-04-22 20:46:16 +01:00
Lioncash	30e158f8d0	A64: Implement TRN1	2020-04-22 20:46:16 +01:00
Lioncash	52cad2d9d0	A64: Implement SSRA (scalar)	2020-04-22 20:46:16 +01:00
Lioncash	255a33936d	A64: Implement SSHR (scalar)	2020-04-22 20:46:16 +01:00
Lioncash	6723b00497	A64: Implement USRA (scalar)	2020-04-22 20:46:16 +01:00
Lioncash	d56fa8f735	A64: Implement USHR (scalar)	2020-04-22 20:46:16 +01:00
Lioncash	870e418b0b	A64: Implement SHL (scalar)	2020-04-22 20:46:16 +01:00
Lioncash	97f2bea4f2	A64: Implement SM3PARTW1	2020-04-22 20:46:16 +01:00
Lioncash	e268b110f0	simd_sha512: Simplify RAX1 Now that the vector rotation helpers are in, replace the explicit shifting with the relevant helper function that does the same thing. Simply tidies up code; no behavioral changes are made.	2020-04-22 20:46:16 +01:00
Lioncash	20d2491267	A64: Implement SM3PARTW2	2020-04-22 20:46:16 +01:00
Lioncash	e1b662e90c	ir: Add helper functions for vector rotation	2020-04-22 20:46:16 +01:00
Lioncash	8a60a63a8b	A64: Implement SM3TT2B	2020-04-22 20:46:16 +01:00
Lioncash	b3d4c02098	A64: Implement SM3TT2A	2020-04-22 20:46:16 +01:00
Lioncash	7fbccabd81	A64: Implement SM3TT1B	2020-04-22 20:46:16 +01:00
Lioncash	769373b3ed	A64: Implement SM3TT1A	2020-04-22 20:46:16 +01:00
Lioncash	2d269fdcc7	simd_shift_by_immediate: Merge signed/unsigned helper functions Gets rid of a little more code duplication.	2020-04-22 20:46:16 +01:00
Lioncash	d5461be6b4	A64: Implement SM3SS1	2020-04-22 20:46:16 +01:00
Lioncash	2db032ac83	A64: Implement SRI (vector)	2020-04-22 20:46:16 +01:00
Lioncash	11005cfe26	A64: Implement SLI (vector)	2020-04-22 20:46:16 +01:00
Lioncash	e3d9bf55e7	A64: Implement SRSRA (vector)	2020-04-22 20:46:16 +01:00
Lioncash	bc6016cad7	A64: Implement SRSHR (vector)	2020-04-22 20:46:16 +01:00
MerryMage	6c9c829a08	imm: Add additional bit position checks to Imm::Bits	2020-04-22 20:46:16 +01:00
MerryMage	be907a61f7	math_util: rvalue references for std::forward	2020-04-22 20:46:16 +01:00
Lioncash	a2f8cdf0a3	A64: Implement SSUBL/SSUBL2	2020-04-22 20:46:16 +01:00
Lioncash	d456fb85c8	A64: Implement SADDL/SADDL2	2020-04-22 20:46:16 +01:00
Lioncash	5c9e7f328d	A64: Implement USUBL/USUBL2	2020-04-22 20:46:16 +01:00
Lioncash	88d70e3b8a	A64: Implement UADDL/UADDL2	2020-04-22 20:46:16 +01:00
Lioncash	4b3d70de5f	simd_shift_by_immediate: Factor out common code in shift instructions Gets rid of partial duplication of the same code for instructions that only have a small behavior difference to them. e.g. The only difference between SSHR and SSRA is that SSRA adds an accumulator before storing the result.	2020-04-22 20:46:16 +01:00
Lioncash	56803f5203	A64: Implement URSRA (vector)	2020-04-22 20:46:16 +01:00
Lioncash	8afdf4b23d	A64: Implement URSHR (vector)	2020-04-22 20:46:16 +01:00
Lioncash	16613ee066	A64: Implement RSHRN/RSHRN2	2020-04-22 20:46:15 +01:00
Lioncash	937990fd2a	A64: Implement SHRN/SHRN2	2020-04-22 20:46:15 +01:00
Lioncash	80e005e5b5	A64/translate: Amend I() to also handle u8 and u16 immediates This is necessary for instructions like SRSHR, and other related instructions.	2020-04-22 20:46:15 +01:00
MerryMage	7969871aa3	A64: Implement FMOV (vector, immediate) and mark other SIMD modified immediate instructions as unallocated	2020-04-22 20:46:15 +01:00
MerryMage	5c95e28ed0	A64: Implement ZIP2	2020-04-22 20:46:15 +01:00
MerryMage	871aefb9a0	decoder/a64: Tweak ordering algorithm Ensuring only instruction families are sorted with each other in the fashion previously devised does not admit a total ordering.	2020-04-22 20:46:15 +01:00
MerryMage	575590d18d	ir_emitter: Remove overloads Having overloads made explicit casting necessary for these functions when using types like UAny.	2020-04-22 20:46:15 +01:00
Lioncash	83ff7a43d1	A64: Implement RBIT (vector)	2020-04-22 20:46:15 +01:00
Lioncash	64b1f2d468	ir: Add opcode for reversing bits in a vector	2020-04-22 20:46:15 +01:00
Lioncash	9de60b60bb	A64/translate: Amend instruction prototypes erroneously marked as taking Reg Makes the prototypes consistent	2020-04-22 20:46:15 +01:00
Lioncash	cf81f04ed3	A64: Implement RAX1	2020-04-22 20:46:15 +01:00
Lioncash	7371e63a7b	a64_get_set_elimination_pass: Make TrackingType enum an enum class Prevents placing single letter enum members into the surrounding scope.	2020-04-22 20:46:15 +01:00
Lioncash	7bcb1c115a	A64: Implement ABS (vector)	2020-04-22 20:46:15 +01:00
Lioncash	e33dcce14a	ir: Add opcodes for performing vector absolute values	2020-04-22 20:46:15 +01:00
Lioncash	84d49309b9	A64: Implement USUBW/USUBW2	2020-04-22 20:46:15 +01:00
Lioncash	e20fce6b5a	A64: Implement SSUBW/SSUBW2	2020-04-22 20:46:15 +01:00
Lioncash	00af6eeab9	A64: Implement SADDW/SADDW2	2020-04-22 20:46:15 +01:00
MerryMage	78a047f0f9	A64: Implement EXT	2020-04-22 20:46:15 +01:00
MerryMage	3472f371df	IR: Implement VectorExtract, VectorExtractLower IR instructions	2020-04-22 20:46:15 +01:00
MerryMage	8bba37089e	A64: Implement UADDW	2020-04-22 20:46:15 +01:00
MerryMage	5c47f03888	A64: Implement FMUL (vector)	2020-04-22 20:46:15 +01:00
Lioncash	a6e264c2dd	A64: Implement UABA Now that we have unsigned absolute difference capabilities, we can just use this to append onto the result via a vector add.	2020-04-22 20:46:15 +01:00
Lioncash	c2e7364d3e	A64: Implement UABD	2020-04-22 20:46:15 +01:00
Lioncash	ad5cf584ce	ir: Add opcodes for performing vector unsigned absolute differences	2020-04-22 20:46:15 +01:00
Lioncash	7780af56e3	ir_emitter: Make immediate member functions const qualified These don't modify class state	2020-04-22 20:46:15 +01:00
Lioncash	701f43d61e	IR: Add opcodes for interleaving upper-order bytes/halfwords/words/doublewords I should have added this when I introduced the functions for interleaving low-order equivalents for consistency in the interface.	2020-04-22 20:46:15 +01:00
Lioncash	94f0fba16b	A64: Implement SHA1H This is a fairly trivial instruction it's essentially: result = ROL(data, 30);	2020-04-22 20:46:15 +01:00
Lioncash	3985f7bf84	emit_x64_data_processing: Deduplicate some code in zero-extension functions EmitZeroExtendByteToLong() can be implemented in terms of EmitZeroExtendByteToWord() and EmitZeroExtendHalfToLong() can be implemented in terms of EmitZeroExtendHalfToWord().	2020-04-22 20:46:15 +01:00
Lioncash	40ec25356b	A64: NOP immediate variant of PRFM Makes behavior identical to the literal variant of PRFM. Given this is simply a hint instruction, this is valid behavior. The upside is that we don't fall back to Unicorn unnecessarily whenever the instruction is encountered.	2020-04-22 20:46:15 +01:00
MerryMage	e7b60189b3	abi: Missing includes	2020-04-22 20:46:15 +01:00
MerryMage	cdc5c3ad95	emit_x64_floating_point: Near jump instead of short jump in FPMinNumeric{32,64}	2020-04-22 20:46:15 +01:00
Lioncash	73b9e4b276	A64: system: Use an enum class for MRS/MSR register encodings Reduces the need to manually write out the register bit encodings repeatedly.	2020-04-22 20:46:15 +01:00
MerryMage	df4ee0f51e	emit_x64_floating_point: Near jmp to end instead of short jmp Jump destination can be further than what can be reached in a short jump under some FPCR options.	2020-04-22 20:46:15 +01:00
Lioncash	b8d5765f9b	emit_x64_vector: Fix typo in VectorShuffleImpl This is supposed to be pshufd, not pshufw (which only allows a 64-bit operand)	2020-04-22 20:46:15 +01:00
Lioncash	586b00d11d	A64: Implement REV64	2020-04-22 20:46:15 +01:00
Lioncash	ade595e377	bit_util: Do nothing in RotateRight if the rotation amount is zero Without this sanitizing it's possible to perform a shift with a shift amount that's the same size as the type being shifted. This actually occurs when decoding ORR variants. We could get fancier here and make this branchless, but we don't really use RotateRight in any performance intensive areas.	2020-04-22 20:46:15 +01:00
Lioncash	9128988dc3	A64: Implement REV32 (vector)	2020-04-22 20:46:15 +01:00
Lioncash	6b0010c940	ir: Add IR opcodes for emitting vector shuffles This uses the ARM terminology for sizes (Halfword -> 2 bytes, Word -> 4 bytes) as opposed to the x86 terminology of (Word -> 2 bytes, Double word -> 4 bytes)	2020-04-22 20:46:15 +01:00
Lioncash	eb2d28d2b1	emit_x64_vector_floating_point: Fix out of bounds array access in EmitVectorOperation64	2020-04-22 20:46:15 +01:00
Lioncash	6ad1bce5e0	A64: Implement REV16 (vector)	2020-04-22 20:46:15 +01:00
Lioncash	6177c2c63d	CMakeLists: Add fp_util, macro_util and math_util headers Allows the headers to show up within IDEs	2020-04-22 20:46:15 +01:00
Lioncash	7a66224d9a	A64: Implement EOR3 and BCAX	2020-04-22 20:46:15 +01:00
MerryMage	be5047c7c2	impl: Update PC when raising exception	2020-04-22 20:46:15 +01:00
MerryMage	49cc6d7fad	A64: Implement FDIV (vector)	2020-04-22 20:46:15 +01:00
MerryMage	fd075d8d68	system: Raise exception for YIELD, WFE, WFI, SEV, SEVL	2020-04-22 20:46:15 +01:00
MerryMage	c832cec96d	Correct FPSR and FPCR	2020-04-22 20:46:15 +01:00
MerryMage	147284427b	A64: Implement USHL	2020-04-22 20:46:15 +01:00
MerryMage	fd8f4c1195	A64: Implement UCVTF (vector, integer), scalar variant	2020-04-22 20:46:15 +01:00
MerryMage	be57608353	A64: Partially implement FCVTZU (scalar, fixed-point) and FCVTZS (scalar, fixed-point)	2020-04-22 20:46:15 +01:00
MerryMage	e4697b1676	A64: Implement system register TPIDR_EL0	2020-04-22 20:46:15 +01:00
MerryMage	e3da92024e	A64: Implement system registers FPCR and FPSR	2020-04-22 20:46:15 +01:00
MerryMage	9e4e4e9c1d	A64: Implement system register CNTPCT_EL0	2020-04-22 20:46:15 +01:00
MerryMage	1e15283d00	A64: Implement system register CTR_EL0	2020-04-22 20:46:15 +01:00
MerryMage	58fbb3ff1b	A64: Implement NEG (vector)	2020-04-22 20:46:15 +01:00
MerryMage	710d09471b	IR: Add IR instruction ZeroVector	2020-04-22 20:46:15 +01:00
MerryMage	2721bb5ace	emit_x64_floating_point: Add maybe_unused to preprocess parameter	2020-04-22 20:46:15 +01:00
MerryMage	0575e7421b	A64: Implement FMINNM (scalar)	2020-04-22 20:46:15 +01:00
MerryMage	1c9804ea07	A64: Implement FMAXNM (scalar)	2020-04-22 20:46:15 +01:00
MerryMage	1dfce0894d	constant_pool: Add frame parameter	2020-04-22 20:46:14 +01:00
MerryMage	bd2b415850	A64: Implement ADDP (scalar)	2020-04-22 20:46:14 +01:00
MerryMage	84f1c9b7f4	reg_alloc: Only exchange GPRs	2020-04-22 20:46:14 +01:00
MerryMage	9df3793af0	A64: Implement DUP (element), scalar variant	2020-04-22 20:46:14 +01:00
MerryMage	6541ec064d	emit_x64_floating_point: Correct FP{Max,Min}{32,64} implementations for -0/+0	2020-04-22 20:46:14 +01:00
MerryMage	2080a51f41	A64: Implement FMAX (scalar), FMIN (scalar)	2020-04-22 20:46:14 +01:00
MerryMage	7c193485e1	a64/config: Allow NaN emulation accuracy to be set	2020-04-22 20:46:14 +01:00
MerryMage	a3df46a75a	a64_emit_x64: Add conf to A64EmitContext	2020-04-22 20:46:14 +01:00
MerryMage	0e157b0198	A64: Implement FSQRT (scalar)	2020-04-22 20:46:14 +01:00
MerryMage	07520f32c3	backend_x64: Accurately handle NaNs	2020-04-22 20:46:14 +01:00
MerryMage	e97581d063	fuzz_with_unicorn: Print AArch64 disassembly	2020-04-22 20:46:14 +01:00
MerryMage	01c1e9017e	T32: Add initial decoder list	2020-04-22 20:46:14 +01:00
MerryMage	ccf7df057b	simd_three_same: Add VectorZeroUpper to CMGE (vector) and CMHS (vector)	2020-04-22 20:46:14 +01:00
MerryMage	8cebb87d0d	A64: Implement CMGT (zero), CMEQ (zero), CMLT (zero)	2020-04-22 20:46:14 +01:00
MerryMage	7f68d556ab	decoder/a64: Rearrange SIMD two-register misc decoders	2020-04-22 20:46:14 +01:00
MerryMage	d5af052f06	A64: Implement CMGE (register)	2020-04-22 20:46:14 +01:00
MerryMage	9d85991906	A64: Implement CMHI, CMHS	2020-04-22 20:46:14 +01:00
MerryMage	e2b9b7c5b0	IR: Implement Vector{Less,Greater}{,Equal}{Signed,Unsigned}	2020-04-22 20:46:14 +01:00
MerryMage	0df6725f73	A64: Implement SMAX, SMIN, UMAX, UMIN	2020-04-22 20:46:14 +01:00
MerryMage	47c0ad0fc8	IR: Implement Vector{Max,Min}{Signed,Unsigned}	2020-04-22 20:46:14 +01:00
MerryMage	adb7f5f86f	A64: Implement CMGT (register)	2020-04-22 20:46:14 +01:00
MerryMage	f4775910f5	IR: Implement VectorGreaterSigned	2020-04-22 20:46:14 +01:00
MerryMage	1f5b3bca43	Exclusive fixups * Incorrect size of exclusive_address * Disable tests on exclusive memory instructions for now	2020-04-22 20:46:14 +01:00
MerryMage	f3fa4a042f	a64_emit_x64: EmitExclusiveWrite: Make MSVC happy (narrowing conversion warning)	2020-04-22 20:46:14 +01:00
MerryMage	8698f057d0	A64: Implement STXP, STLXP, LDXP, LDAXP	2020-04-22 20:46:14 +01:00
MerryMage	2a6619d59c	A64: Implement CLREX	2020-04-22 20:46:14 +01:00
MerryMage	b7a2c1a7df	A64: Implement STXRB, STXRH, STXR, STLXRB, STLXRH, STLXR, LDXRB, LDXRH, LDXR, LDAXRB, LDAXRH, LDAXR	2020-04-22 20:46:14 +01:00
MerryMage	a6cc667509	Direct Page Table Access: Handle address spaces less than the full 64-bit in size	2020-04-22 20:46:14 +01:00
MerryMage	f45a5e17c6	Implement direct page table access	2020-04-22 20:46:14 +01:00
MerryMage	bfd3e30c75	callbacks: Member functions should be const	2020-04-22 20:46:14 +01:00
MerryMage	9f2f08db8d	a64_emit_x64: Implement {Read,Write}Memory128 in terms of a function call	2020-04-22 20:46:14 +01:00
MerryMage	6c4773e85b	abi: Add RAX to ABI_ALL_CALLER_SAVE	2020-04-22 20:46:14 +01:00
MerryMage	8756487554	A64: Partially implement MRS	2020-04-22 20:46:14 +01:00
MerryMage	bfd65bedfe	A64: Implement DSB, DMB	2020-04-22 20:46:14 +01:00
MerryMage	5edd623b9d	Implement DC instructions	2020-04-22 20:46:14 +01:00
Lioncash	a9153218bd	A64: Implement NOT (vector)	2020-04-22 20:46:14 +01:00
MerryMage	2cb0a699ba	IR: Implement FPMax, FPMin	2020-04-22 20:46:14 +01:00
MerryMage	aed4fd3ec3	A64: Implement FADD (vector), vector variant	2020-04-22 20:46:14 +01:00
MerryMage	98c8e7d1af	IR: Implement FPVectorAdd	2020-04-22 20:46:14 +01:00
MerryMage	5f77ab28ee	A64: Implement SSHLL, SSHLL2	2020-04-22 20:46:14 +01:00
MerryMage	eae518a338	IR: Implement VectorSignExtend	2020-04-22 20:46:14 +01:00
MerryMage	3738043e58	A64: Implement DUP (element), vector variant	2020-04-22 20:46:14 +01:00
MerryMage	ce7628b6b5	load_store_multiple_structures: Improve IR codegen for selem == 1 case	2020-04-22 20:46:14 +01:00
MerryMage	f1cb5581c9	A64: Implement FSUB (vector)	2020-04-22 20:46:14 +01:00
MerryMage	b9cd345ddc	IR: Implement FPVectorSub	2020-04-22 20:46:14 +01:00
MerryMage	851fc83445	emit_x64_vector: EmitOneArgumentFallback	2020-04-22 20:46:14 +01:00
MerryMage	f378d2ef1b	Forward declare IR::Opcode and IR::Type where possible	2020-04-22 20:46:14 +01:00
MerryMage	6c9b4f0114	A64: Implement CNT	2020-04-22 20:46:14 +01:00
MerryMage	303088a51e	IR: Implement VectorPopulationCount	2020-04-22 20:46:14 +01:00
MerryMage	1dd2b33b87	A64: Implement MLS (vector)	2020-04-22 20:46:14 +01:00
MerryMage	5eac3abf52	A64: Implement MLA (vector)	2020-04-22 20:46:14 +01:00
MerryMage	bf2cd92da9	emit_x64_vector: Add SSE4.1 implementation for EmitVectorMultiply64	2020-04-22 20:46:14 +01:00
MerryMage	b062266b8e	emit_x64_vector: More explicit lambda decay	2020-04-22 20:46:14 +01:00
MerryMage	3afd2fcbad	A64: Implement MUL (vector)	2020-04-22 20:46:14 +01:00
MerryMage	b6de612e01	IR: Implement VectorMultiply	2020-04-22 20:46:14 +01:00
MerryMage	90a053a5e4	emit_x64_vector: Order alphabetically	2020-04-22 20:46:14 +01:00
MerryMage	e7041d7196	A64: Implement STR (register, SIMD&FP), LDR (register, SIMD&FP)	2020-04-22 20:46:14 +01:00
MerryMage	a455ff70c9	decoder/a64: Don't rearrange unrelated decoders	2020-04-22 20:46:14 +01:00
MerryMage	faeb77e8c4	A64: Implement SUB (vector)	2020-04-22 20:46:14 +01:00
MerryMage	bd106c3ae7	A64: Implement SIMD instruction SSRA, vector variant	2020-04-22 20:46:14 +01:00
MerryMage	f58aba9871	A64: Implement SIMD instruction SSHR, vector variant	2020-04-22 20:46:14 +01:00
MerryMage	715ae1c229	IR: Implement VectorArithmeticShiftRight	2020-04-22 20:46:14 +01:00
MerryMage	653c82d8f0	impl: Improve Vpart setter	2020-04-22 20:46:14 +01:00
MerryMage	e858ce0b35	A64: Implement SIMD instructions XTN, XTN2	2020-04-22 20:46:13 +01:00
MerryMage	132c783320	IR: Implement VectorNarrow	2020-04-22 20:46:13 +01:00
MerryMage	1423584f9f	constant_pool: Allow for 128-bit constants	2020-04-22 20:46:13 +01:00
MerryMage	69de50a878	emit_x64_vector: Add SSE4.1 implementations for VectorZeroExtend	2020-04-22 20:46:13 +01:00
MerryMage	cbc9f361b0	IR: Implement VectorSub	2020-04-22 20:46:13 +01:00
MerryMage	3f93c77ace	A64: Implement SIMD instruction USRA, vector variant	2020-04-22 20:46:13 +01:00
MerryMage	fb9d20f27f	A64: Implement SIMD instruction USHR, vector variant	2020-04-22 20:46:13 +01:00
MerryMage	b22c5961f9	IR: Implement VectorLogicalShiftRight	2020-04-22 20:46:13 +01:00
MerryMage	7ff280827b	A64: Implement SIMD instructions USHLL, USHLL2	2020-04-22 20:46:13 +01:00
MerryMage	59ace60b03	IR: Implement VectorZeroExtend	2020-04-22 20:46:13 +01:00
MerryMage	d3a4e1efe2	IR: Vector instructions now take esize argument in emitter	2020-04-22 20:46:13 +01:00
MerryMage	1d0cd95b23	A64: Implement SIMD instruction SHL	2020-04-22 20:46:13 +01:00
MerryMage	f6247125c0	IR: Implement VectorLogicalShiftLeft{8,16,32,64}	2020-04-22 20:46:13 +01:00
MerryMage	15e8231f24	opcodes: Sort vector IR opcodes alphabetically	2020-04-22 20:46:13 +01:00
MerryMage	d74f4e35f6	block_of_code: Increase constant pool size	2020-04-22 20:46:13 +01:00
MerryMage	e69288f803	devirtualize: MinGW uses Itanium MFP ABI	2020-04-22 20:46:13 +01:00
MerryMage	ad428cbd7a	callback: Properly handle calls with return pointers and simplify interface	2020-04-22 20:46:13 +01:00
FernandoS27	15871910af	Implemented BSL, BIC, BIT and BIF vector instructions	2020-04-22 20:46:13 +01:00
MerryMage	7a87e3fc55	devirtualize: Handle Windows ABI	2020-04-22 20:46:13 +01:00
MerryMage	ba4a779c62	A32/decoder/arm: bug: Correct bitstring for SRS	2020-04-22 20:46:13 +01:00
MerryMage	f808a0fbde	devirtualize: Devirtualize Itanium ABI MFPs at runtime	2020-04-22 20:46:13 +01:00
MerryMage	afe16fa0f3	cast_util: Add BitCast and BitCastPointee	2020-04-22 20:46:13 +01:00
Lioncash	4e33629b0e	A64: Move SDIV and UDIV out of data_processing_multiply.cpp	2020-04-22 20:46:13 +01:00
Lioncash	35a29a9665	A64: Implement ZIP1	2020-04-22 20:46:13 +01:00
FernandoS27	586854117b	Implemented UMULH and SMULH instructions	2020-04-22 20:46:13 +01:00
MerryMage	1a7b7b541a	A64: Implement MOVI, MVNI, ORR (vector, immediate), BIC (vector, immediate) There wasn't a clean way to separate these instructions out.	2020-04-22 20:46:13 +01:00
MerryMage	8ab7d8175c	impl: Add AdvSIMDExpandImm	2020-04-22 20:46:13 +01:00
MerryMage	ea69cb4474	A64: Implement SUB (vector), scalar variant	2020-04-22 20:46:13 +01:00
MerryMage	4c5871d5d5	A64: Implement ADD (vector), scalar variant	2020-04-22 20:46:13 +01:00
MerryMage	2a0850c068	A64: Reorganize decoder tables (some vector entries were grouped with scalar entries)	2020-04-22 20:46:13 +01:00
MerryMage	7b33772ac6	A64: Implement BIC (vector, register)	2020-04-22 20:46:13 +01:00
MerryMage	eb5591859c	A64: Implement FMOV (general)	2020-04-22 20:46:13 +01:00
MerryMage	dd88cee15a	translate/impl: Add Vpart	2020-04-22 20:46:13 +01:00
MerryMage	cc9efd13c9	A64: Implement STLLRB, STLLRH, STLLR, LDLARB, LDLARH, LDLAR	2020-04-22 20:46:13 +01:00
MerryMage	81713c2b77	A64: Implement FCCMPE	2020-04-22 20:46:13 +01:00
MerryMage	ef906dbbfa	A64: Implement FCCMP	2020-04-22 20:46:13 +01:00
MerryMage	44c3c2312a	a64_jitstate: Remove unnecessary FPSCR_nzcv member	2020-04-22 20:46:13 +01:00
MerryMage	aac5af50e2	IR: FPCompare{32,64} now return NZCV flags instead of implicitly setting them	2020-04-22 20:46:13 +01:00
Lioncash	2ee39d6b36	A64: Implement FMOV (register)	2020-04-22 20:46:13 +01:00
MerryMage	b02b861242	A64: Implement STLRB, STLRH, STLR, LDARB, LDARH, LDAR	2020-04-22 20:46:13 +01:00
Lioncash	5a65313236	A64: Implement CCMP (immediate)	2020-04-22 20:46:13 +01:00
Lioncash	ab4664de61	A64: Implement CCMN (immediate)	2020-04-22 20:46:13 +01:00
Lioncash	a6c6539109	A64: Implement CCMP (register)	2020-04-22 20:46:13 +01:00
Lioncash	22632db337	microinstruction: Add ConditionalSelectNZCV opcode to ReadsFromCPSR()'s switch statement	2020-04-22 20:46:13 +01:00
MerryMage	c5033b5dda	A64: Implement CCMN (register)	2020-04-22 20:46:13 +01:00
MerryMage	dd2a6684fe	IR: Add ConditionalSelectNZCV instruction	2020-04-22 20:46:13 +01:00
MerryMage	4491746eae	A64: Implement FNEG	2020-04-22 20:46:13 +01:00
MerryMage	db958061a3	A64: Implement FABS	2020-04-22 20:46:13 +01:00
MerryMage	8765b421b7	A64: Implement FCSEL	2020-04-22 20:46:13 +01:00
MerryMage	7e82d8eede	A64: Implement SCVTF (scalar, integer), UCVTF (scalar, integer)	2020-04-22 20:46:13 +01:00
MerryMage	2409e5d082	A64: Implement FCVTZS (scalar, integer), FCVTZU (scalar, integer)	2020-04-22 20:46:13 +01:00
MerryMage	b173fcf34e	backend_x64: Simplify FPDoubleToU32 and FPSingleToU32 They're inaccurate in terms of FPSR at the moment anyway.	2020-04-22 20:46:13 +01:00
MerryMage	56bc7825ef	A64: Implement STR{,B,H} (register), LDR{,B,H,SB,SH,SW} (register), PRFM (register)	2020-04-22 20:46:13 +01:00
Lioncash	d040920727	Common: Put AES code within its own nested namespace Prevents the functions from potentially clashing with other stuff in Common in the future	2020-04-22 20:46:13 +01:00
Lioncash	40614202e7	A64: Implement AESD	2020-04-22 20:46:13 +01:00
Lioncash	ccef85dbb7	A64: Implement AESE	2020-04-22 20:46:13 +01:00
MerryMage	68f46c8334	backend_x64: Use a reference to BlockOfCode instead of a pointer	2020-04-22 20:46:13 +01:00
MerryMage	8931ee346b	IR: Add IR instruction NZCVFromPackedFlags This instruction expects NZCV to be in the high bits. i.e.: The positions they were in PSTATE.	2020-04-22 20:46:13 +01:00
MerryMage	0bb4474fb9	A64: Implement INS (general)	2020-04-22 20:46:13 +01:00
MerryMage	d13704fdef	A64: Implement INS (element)	2020-04-22 20:46:13 +01:00
MerryMage	0642d49919	A64: Implement SMOV	2020-04-22 20:46:13 +01:00
MerryMage	5297027ebe	A64: Implement UMOV	2020-04-22 20:46:13 +01:00
MerryMage	47661b746b	basic_block: Fix bogus GCC maybe-uninitialized warning	2020-04-22 20:46:13 +01:00
MerryMage	1fb0957aa3	A64: Implement FCVT	2020-04-22 20:46:13 +01:00
MerryMage	ca38225e08	fuzz_with_unicorn: Skip instructions that need to be interpreted	2020-04-22 20:46:13 +01:00
MerryMage	4be55b8b84	A64: Implement FMOV (scalar, immediate)	2020-04-22 20:46:13 +01:00
MerryMage	a07c05ea51	A64: Implement STUR (SIMD&FP), LDUR (SIMD&FP)	2020-04-22 20:46:13 +01:00
MerryMage	93fcbdf1e2	A64: Implement FCMP, FCMPE	2020-04-22 20:46:13 +01:00
MerryMage	75b8a76630	a64_jitstate: A64 does not have a separate FPSCR.NZCV	2020-04-22 20:46:13 +01:00
MerryMage	99d8ebe4d5	A64: Implement FMUL (scalar), FDIV (scalar), FADD (scalar), FSUB (scalar), FNMUL (scalar)	2020-04-22 20:46:13 +01:00
MerryMage	429dc24587	IR: Merge U32 and U64 variants of FP instructions	2020-04-22 20:46:13 +01:00
MerryMage	ed2bedec43	A64: Implement {ST,LD}{1,2,3,4} (multiple structures)	2020-04-22 20:46:13 +01:00
MerryMage	6414736a8d	emit_x64_vector: bug: VectorGetElement8 returning incorrect values for non-SSE4.1 This bug wasn't discovered earlier because we previously only used index == 0.	2020-04-22 20:46:13 +01:00
MerryMage	ebfc51c609	IR: Implement VectorSetElement{8,16,32,64}	2020-04-22 20:46:13 +01:00
Lioncash	a5c4fbc783	A64: Implement AESIMC and AESMC	2020-04-22 20:46:13 +01:00
Lioncash	744495e23d	iterator_util: Make Reverse constexpr C++17 makes non-member rbegin(), rend(), crbegin(), and crend() constexpr, allowing this to also be constexpr.	2020-04-22 20:46:12 +01:00
Lioncash	ab9b5fb8aa	Common: Relocate common bits of CRC32 Allows the algorithm to be used in any other potential backend.	2020-04-22 20:46:12 +01:00
Lioncash	af1384d700	A64: Implement CRC32	2020-04-22 20:46:12 +01:00
MerryMage	64761dbc72	scope_exit: Add SCOPE_SUCCESS and SCOPE_EXIT	2020-04-22 20:46:12 +01:00
MerryMage	bafb39ebc5	A64: Add Disassemble method	2020-04-22 20:46:12 +01:00
MerryMage	cc0eb18a0b	A32: data_processing: Remove !S assertions	2020-04-22 20:46:12 +01:00
MerryMage	865a30eb0d	A32: Implement BKPT	2020-04-22 20:46:12 +01:00
MerryMage	f023bbb893	A32: Add ExceptionRaised IR instruction and use it	2020-04-22 20:46:12 +01:00
Lioncash	7ffbebf290	A64: Implement CRC32C	2020-04-22 20:46:12 +01:00
MerryMage	d7044bc751	assert: Use fmt in ASSERT_MSG	2020-04-22 20:46:12 +01:00
MerryMage	52268298a8	a64_emit_x64: Perform RSB predictions	2020-04-22 20:46:12 +01:00
MerryMage	98ec9c5f90	A32: Change UserCallbacks to be similar to A64's interface	2020-04-22 20:46:12 +01:00
Lioncash	b9ce660113	reg_alloc: std::move RegAlloc's function argument	2020-04-22 20:46:12 +01:00
Lioncash	ed561d6653	General: Add missing override specifiers	2020-04-22 20:46:12 +01:00
MerryMage	b2d99eddc6	EmitZeroExtendLongToQuad: Do not rely on register allocator to zero extend 64->128	2020-04-22 20:46:12 +01:00
MerryMage	f4f774f9f6	a64_get_set_elimination_pass: Simplify algorithm	2020-04-22 20:46:12 +01:00
MerryMage	54de64f5bf	a64_emit_x64: bug: x64 sign-extends 32-bit immediates	2020-04-22 20:46:12 +01:00
MerryMage	6fc228f7fd	ir_opt: Add A64 Get/Set Elimination Pass	2020-04-22 20:46:12 +01:00
MerryMage	e01b500aea	ir_emitter: Allow the insertion point for new instructions to be set	2020-04-22 20:46:12 +01:00
MerryMage	af793c2527	{a32,a64}_interface: Predict entrypoint	2020-04-22 20:46:12 +01:00
Lioncash	7734cf1050	A64: Implement EXTR	2020-04-22 20:46:12 +01:00
MerryMage	88ae7fce52	A64: Implement LDP (SIMD&FP) and STP (SIMD&FP)	2020-04-22 20:44:38 +01:00
MerryMage	d497464c9f	a64_jitstate: Have 128-bit wide spills	2020-04-22 20:44:38 +01:00
MerryMage	b513b2ef05	IR: Implement IR instructions A64{Get,Set}S	2020-04-22 20:44:38 +01:00
MerryMage	16fa2cd8f6	a64_emit_x64: Use xword from Xbyak::util	2020-04-22 20:44:38 +01:00
Lioncash	67443efb62	General: Convert multiple namespace specifiers to nested namespace specifiers where applicable Makes namespacing a little less noisy	2020-04-22 20:44:38 +01:00
Lioncash	7abd673a49	A64: Zero upper 64 bits in ORN if using the 64-bit variant Resolves a TODO	2020-04-22 20:44:38 +01:00
MerryMage	ba3d6da0c8	load_store_register_unprivileged: bug: LDTRSW	2020-04-22 20:44:38 +01:00
MerryMage	75756137c6	A64: Implement CMEQ (register, vector)	2020-04-22 20:44:38 +01:00
MerryMage	d5283e46e8	IR: Implement IR instructions VectorEqual{8,16,32,64,128}	2020-04-22 20:44:38 +01:00
MerryMage	4ce9c65cfb	reg_alloc: Use std::exchange	2020-04-22 20:44:38 +01:00
Fernando Sahmkow	e0c12ec2ad	A64: Implemented EOR (vector), ORR (vector, register) and ORN (vector) Instructions (#142)	2020-04-22 20:44:38 +01:00
MerryMage	94383fd934	microinstruction: Missed A64{Read,Write}Memory128 from opcode information	2020-04-22 20:44:38 +01:00
MerryMage	d124a1d761	emit_x64_packed: EmitPackedSubU16 modified xmm_b wasn't writeable For CPUs that didn't support SSE4.1, this was a bug.	2020-04-22 20:44:38 +01:00
James Rowe	589ad7232f	Fixup: Xn\|SP are 64 bit addresses encoded in the Rn field	2020-04-22 20:44:38 +01:00
James Rowe	ae880d8391	A64: Fix bugs and address review comments	2020-04-22 20:44:38 +01:00
James Rowe	3aeb7ca50c	Add missing returns	2020-04-22 20:44:38 +01:00
James Rowe	41e6e659c5	A64: Implement Load/Store register (unprivileged)	2020-04-22 20:44:37 +01:00
MerryMage	01a26fa644	fixup: travis: Test with disabled CPU feature detection	2020-04-22 20:44:37 +01:00
Lioncash	5281d3c6d5	CMakeLists: Add opcodes.inc to the source file list Allows the file to show up nicely within IDEs	2020-04-22 20:44:37 +01:00
MerryMage	30936f5e94	travis: Test with disabled CPU feature detection Ensure that fallbacks are working correctly.	2020-04-22 20:44:37 +01:00
MerryMage	285fd22c30	IR: Add IR instruction VectorZeroUpper	2020-04-22 20:44:37 +01:00
MerryMage	da3e9a5704	a64_emit_x64: bug: EmitA64WriteMemory128 should write not read	2020-04-22 20:44:37 +01:00
FernandoS27	ab84524806	Implemented SDIV and UDIV instructions	2020-04-22 20:44:37 +01:00
MerryMage	6033b05ca6	A64: Implement LDR/STR (immediate, SIMD&FP)	2020-04-22 20:44:37 +01:00
MerryMage	f698848e26	IR: Add IR instructions A64Memory{Read,Write}128 Add the Windows ABI implementation	2020-04-22 20:44:37 +01:00
MerryMage	e1df7ae621	IR: Add IR instructions A64Memory{Read,Write}128 This implementation only works on macOS and Linux.	2020-04-22 20:44:37 +01:00
MerryMage	e00a522cba	IR: Add IR instruction VectorGetElement{8,16,32,64}	2020-04-22 20:44:37 +01:00
MerryMage	28ccd85e5c	IR: Add IR instruction ZeroExtendToQuad	2020-04-22 20:44:37 +01:00
MerryMage	af848c627d	block_of_code: Add ABI_RETURN2	2020-04-22 20:44:37 +01:00
MerryMage	1749780929	interface: Move Vector typedef to config.h	2020-04-22 20:44:37 +01:00
MerryMage	33bba6028c	bit_util: bug: Infinite loop in HighestSetBit	2020-04-22 20:44:37 +01:00
MerryMage	3caf192f60	A64: Implement DUP (general)	2020-04-22 20:44:37 +01:00
MerryMage	793753bf63	IR: Implement Vector{Lower,}Broadcast{8,16,32,64}	2020-04-22 20:44:37 +01:00
Lioncash	8ee854232c	General: Default constructors and destructors where applicable	2020-04-22 20:44:37 +01:00
Lioncash	d1e4526e1c	ir_emitter: Remove unused includes	2020-04-22 20:44:37 +01:00
Lioncash	6f9216d544	A64: Implement RBIT	2020-04-22 20:44:37 +01:00
MerryMage	9b0a21915f	ir_emitter: Remove unimplemented IR instruction Unimplemented	2020-04-22 20:44:37 +01:00
MerryMage	db30e02ac8	emit_x64: Extract BlockRangeInformation, remove template parameter	2020-04-22 20:44:36 +01:00
MerryMage	58c4a25527	emit_x64: Use JitStateInfo	2020-04-22 20:42:46 +01:00
MerryMage	d4b05b28cf	A64: Implement CLS This is not the cleanest implementation.	2020-04-22 20:42:46 +01:00
MerryMage	b8e26bfdc3	A64: Implement ADDP (vector)	2020-04-22 20:42:46 +01:00
MerryMage	eaf545877a	IR: Implement Vector{Lower,}PairedAdd{8,16,32,64}	2020-04-22 20:42:46 +01:00
MerryMage	a554e4a329	backend_x64: Split emit_x64	2020-04-22 20:42:46 +01:00
MerryMage	394bd57bb6	microinstruction: bug: Add missing opcodes	2020-04-22 20:42:46 +01:00
Lioncash	bb1c5bd3b2	A64: Implement SMADDL, SMSUBL, UMADDL, and UMSUBL	2020-04-22 20:42:46 +01:00
Lioncash	c1a25bfc2f	A64: Implement MADD and MSUB	2020-04-22 20:42:46 +01:00
Lioncash	b7c5055d42	A64: Implement CLZ	2020-04-22 20:42:46 +01:00
Lioncash	b612782445	opcodes: Add 64-bit CountLeadingZeroes opcode	2020-04-22 20:42:46 +01:00
MerryMage	4c4efb2213	data_processing_register: Clean-up	2020-04-22 20:42:46 +01:00
Lioncash	ae5dbcbed6	A64: Implement HINT, NOP, YIELD, WFE, WFI, SEV, and SEVL Truly the most difficult A64 instructions to implement.	2020-04-22 20:42:46 +01:00
Lioncash	4d8f4aa8af	A64: Implement ASRV, LSLV, LSRV, and RORV	2020-04-22 20:42:46 +01:00
Lioncash	a8a65beb2b	data_processing_conditional_select: Implement CSINC, CSINV and CSNEG	2020-04-22 20:42:46 +01:00
Lioncash	b08be71775	a32/a64_emit_x64: Remove unused includes	2020-04-22 20:42:46 +01:00
MerryMage	f81d0a2536	A64: Implement AND (vector)	2020-04-22 20:42:46 +01:00
MerryMage	a63fc6c89b	A64: Implement ADD (vector, vector)	2020-04-22 20:42:46 +01:00
Thomas Guillemard	896cf44f96	A64: Implement REV, REV32, and REV16 (#126)	2020-04-22 20:42:46 +01:00
MerryMage	5eb0bdecdf	IR: Simplify types. F32 -> U32, F64 -> U64, F128 -> U128 ARM's Architecture Specification Language doesn't distinguish between floats and integers as much as we do. This makes some things difficult to implement. Since our register allocator is now capable of allocating values to XMMs and GPRs as necessary, the Transfer IR instructions are no longer necessary as they used to be and they can be removed.	2020-04-22 20:42:46 +01:00
MerryMage	9a812b0c61	reg_alloc: GetBitWidth: Add UNREACHABLE	2020-04-22 20:42:46 +01:00
MerryMage	fff8e019dc	reg_alloc: Consider bitwidth of data and registers when emitting instructions	2020-04-22 20:42:46 +01:00
MerryMage	144b629d8a	A64: Implement CSEL	2020-04-22 20:42:45 +01:00
MerryMage	6395f09f94	IR: Implement Conditional Select	2020-04-22 20:42:45 +01:00
MerryMage	19da68568e	A64/translate/branch: bug: Read-after-write error in BLR	2020-04-22 20:42:45 +01:00
MerryMage	9f57283a30	A64: Implement SBFM, BFM, UBFM	2020-04-22 20:42:45 +01:00
MerryMage	cdbc8d07a5	A64: Implement MOVN, MOVZ, MOVK	2020-04-22 20:42:45 +01:00
MerryMage	ecebe14a01	ir/location_descriptor: Add missing <functional> header for std::hash	2020-04-22 20:42:45 +01:00
MerryMage	4e3675da7b	a64_merge_interpret_blocks: Remove debug output	2020-04-22 20:42:45 +01:00
MerryMage	c6a091d874	A64: Optimization: Merge interpret blocks	2020-04-22 20:42:45 +01:00
MerryMage	21fe61eac6	A64/data_processing_pcrel: bug: ADR{,P} instructions sign extend their immediate	2020-04-22 20:42:45 +01:00
MerryMage	7c4b70751c	A64/data_processing_addsub: bug: {ADD,SUB}S (extended register) instructions write to ZR when d = 31	2020-04-22 20:42:45 +01:00
MerryMage	996ffd5488	a64_emit_x64: bug: A64CallSupervisor trampled callee-save registers	2020-04-22 20:42:45 +01:00
MerryMage	e4615a4562	emit_x64: bug: OP m/r64, imm32 form instructions sign-extend their immediate on x64	2020-04-22 20:42:45 +01:00
MerryMage	0992987c98	A64: Add ExceptionRaised IR instruction The purpose of this instruction is to raise exceptions when certain decode-time issues happen, instead of asserting at translate time. This allows us to use the translator for code analysis without worrying about unnecessary asserts, but also provides flexibility for the library user to perform custom behaviour when one of these states is raised.	2020-04-22 20:42:45 +01:00
MerryMage	61125d6dd1	A64/translate: Add TranslateSingleInstruction function	2020-04-22 20:42:45 +01:00
MerryMage	aa74a8130b	Misc. fixups of MSVC build	2020-04-22 20:42:45 +01:00
MerryMage	a1dfa01515	imm: Suppress MSVC warning C4244: value will never be truncated	2020-04-22 20:42:45 +01:00
MerryMage	26da149639	imm: compiler bug: MSVC 19.12 with /permissive- flag doesn't support fold expressions	2020-04-22 20:42:45 +01:00
MerryMage	b34c6616d4	A64/decoder: Split decoder data from header	2020-04-22 20:42:45 +01:00
MerryMage	72a793f5b0	ir_opt: Split off A32 specific passes	2020-04-22 20:42:45 +01:00
MerryMage	595f157e5e	A64: Implement LDP, STP	2020-04-22 20:42:45 +01:00
MerryMage	511215342b	A64/location_descriptor: Fix -fpermissive warning on GCC	2020-04-22 20:42:45 +01:00
MerryMage	243f06c613	A64: Implement LDP, STP	2020-04-22 20:42:45 +01:00
MerryMage	25411da838	A32: Implement load stores (immediate)	2020-04-22 20:42:45 +01:00
MerryMage	2aadeec291	A64: Implement SVC	2020-04-22 20:42:45 +01:00
MerryMage	9e27e4d250	imm: bug: SignExtend wasn't working for T with bit size > 32	2020-04-22 20:42:45 +01:00
MerryMage	10c60dda97	a64_emit_x64: Don't use far code for now	2020-04-22 20:42:45 +01:00
MerryMage	593a569b53	EmitA64SetW: bug: should zero extend to entire 64-bit register	2020-04-22 20:42:45 +01:00
MerryMage	6bd9f02911	EmitA64SetNZCV: bug: to_store is scratch	2020-04-22 20:42:45 +01:00
MerryMage	f0276dd53b	emit_x86: Fix nzcv for EmitSub	2020-04-22 20:42:45 +01:00
MerryMage	68391b0a05	A64: Implement SVC	2020-04-22 20:42:45 +01:00
MerryMage	e5ace37560	a64_emit_x64: Call interpreter	2020-04-22 20:42:45 +01:00
MerryMage	b12dead76a	A64: Add batch register retrieval to interface	2020-04-22 20:42:45 +01:00
MerryMage	cb481a3a48	A64: Implement compare and branch	2020-04-22 20:42:45 +01:00
MerryMage	e8bcf72ee5	A64: PSTATE access and tests	2020-04-22 20:42:45 +01:00
MerryMage	23f3afe0b3	A64: Implement branch (register)	2020-04-22 20:42:45 +01:00
MerryMage	86d1095df7	A64: Implement branch	2020-04-22 20:42:45 +01:00
MerryMage	0641445e51	A64: Implement logical	2020-04-22 20:42:45 +01:00
MerryMage	5a1d88c5dc	A64: Implement pcrel	2020-04-22 20:42:45 +01:00
MerryMage	c09e69bb97	A64: Implement addsub instructions	2020-04-22 20:42:44 +01:00
MerryMage	d1cef6ffb0	A64: Implement ADD_shifted	2020-04-22 20:42:44 +01:00
MerryMage	d1eb757f93	A64: Backend framework	2020-04-22 20:42:44 +01:00
MerryMage	e161cf16f5	A64: Initial framework	2020-04-22 20:42:44 +01:00
MerryMage	f61da0b5a9	IR: Compile-time type-checking of IR	2020-04-22 20:39:27 +01:00
MerryMage	44f7f04b5c	IR/Value: Rename RegRef and ExtRegRef to A32Reg and A32ExtReg	2020-04-22 20:39:27 +01:00
MerryMage	83022322d1	Make IR->A32 LocationDescriptor conversion explicit	2020-04-22 20:39:27 +01:00
MerryMage	9d15e0a8e1	Final A32 refactor	2020-04-22 20:39:27 +01:00
MerryMage	455757d7b6	EmitX64: JitState type as template parameter	2020-04-22 20:39:26 +01:00
MerryMage	2d164d9345	Package up emit context	2020-04-22 20:38:31 +01:00
MerryMage	7bf421dd38	Rename JitState to A32JitState	2020-04-22 20:38:31 +01:00
MerryMage	63bd1ece23	backend_x64: Split A32 specific emission into separate class	2020-04-22 20:38:29 +01:00
MerryMage	8bef20c24d	IR: Split off A32 specific opcodes	2020-04-22 20:33:32 +01:00
MerryMage	b1f0cf9278	A32: Split off A32 specific IREmitter	2020-04-22 20:33:32 +01:00
MerryMage	b3c73e2622	Label A32 specific code appropriately	2020-04-22 20:33:30 +01:00
MerryMage	dc357c780d	EmitPackedHalvingSub{U,S}16: SSE2 implementation	2020-04-22 20:27:15 +01:00
MerryMage	a98821da41	Merge branch 'misc' These commits introduce context save and restore, and a small number of optimizations that depend on their use for performance.	2020-04-22 20:27:15 +01:00
MerryMage	fc885ac80f	EmitPackedHalvingAddU8: Add SSE2 implementation	2020-04-22 20:27:15 +01:00
MerryMage	4682211729	EmitPackedHalvingAdd{U,S}16: Add SSE2 implementation	2020-04-22 20:27:15 +01:00
MerryMage	9ac1c87a51	emit_x64: EmitSet{Register,ExtendedRegister32,ExtendedRegister64}: Store from current source	2020-04-22 20:27:15 +01:00
MerryMage	6e834de072	Add re-entry prediction to avoid std::unordered_map lookups	2020-04-22 20:26:40 +01:00
MerryMage	984ce22431	emit_x64: Arguments to MostSignificantBit and IsZero are 32-bit	2020-04-22 20:26:40 +01:00
MerryMage	5c6fcf378f	emit_x64: Optimize code emitted by EmitGetCpsr	2020-04-22 20:26:40 +01:00
MerryMage	f595f85039	block_of_code: Remove vzeroupper	2020-04-22 20:26:40 +01:00
MerryMage	4393473d06	interface: Allow saving and storing of contexts	2020-04-22 20:26:40 +01:00
MerryMage	05f3f07704	emit_x64: Reduce MXCSR operations in EmitGetFpscr and EmitSetFpscr	2020-04-22 20:26:40 +01:00
MerryMage	19a7fb8992	jit_state: Split off CPSR.NZCV	2020-04-22 20:26:40 +01:00
MerryMage	0af1e7723d	CMakeLists: Fixup boost * boost is part of the public interface. * Consider boost a system library so warnings from boost do not cause a build failure. * If the parent project defines boost, use that.	2020-04-22 20:26:40 +01:00
MerryMage	a3432102b8	jit_state: Split off CPSR.Q	2020-04-22 20:26:40 +01:00
MerryMage	4f8675083c	interface_x64: Fix MSVC cast warning	2020-04-22 20:26:40 +01:00
MerryMage	311361b409	jit_state: Split off CPSR.{E,T} This allows us to improve code-emission for PopRSBHint. We also improve code emission other terminals at the same time.	2020-04-22 20:26:40 +01:00
MerryMage	cb119c2f72	emit_x64: Use boost::icl::interval_map to speed up ranged invalidation	2020-04-22 20:26:40 +01:00
MerryMage	3cca3bbd0b	jit_state: Split off CPSR.GE	2020-04-22 20:26:40 +01:00
MerryMage	6fde29f5d8	emit_x64: Remove unnecessary ABI overhead in ReadMemory, WriteMemory	2020-04-22 20:26:40 +01:00
MerryMage	6adc554b53	jit_state: Hide cpsr implementation	2020-04-22 20:26:40 +01:00
MerryMage	eb80aae9c0	block_of_code: Move MXCSR switching out of dispatch loop Also clarify MXCSR entry/exit terminology	2020-04-22 20:26:40 +01:00
MerryMage	a4e85ad565	emit_x64: Make RSB a stack	2020-04-22 20:26:40 +01:00
MerryMage	2a818f9d8e	Merge branch 'timing' We do this to improve timing information before entering a supervisor function. We also do this to try and stay within JITted code as much as possible, by updating the cycles we have remaining.	2020-04-22 20:26:37 +01:00
MerryMage	ea4c3292d5	BlockOfCode: Detect space remaining We also clear the code cache when we run out of space. This closes #111.	2020-04-22 20:26:12 +01:00
MerryMage	256749910f	Add AddTicks and GetTicksRemaining callbacks	2020-04-22 20:26:12 +01:00
MerryMage	80c56aa89d	Remove unnecessary use of boost::make_optional Closes #119.	2020-04-22 20:26:12 +01:00
MerryMage	de6a93a160	decoder_detail: Lambda captures may be unused if iota is an empty sequence Closes #120	2020-04-22 20:26:12 +01:00
MerryMage	3141dadea9	Remove UNUSED macro	2020-04-22 20:26:12 +01:00
MerryMage	7cac9519b0	microinstruction: Remove DecrementRemainingUses	2020-04-22 20:26:12 +01:00
MerryMage	639f7cfd2d	reg_alloc: Add IsLastUse optimization for UseScratch	2020-04-22 20:26:12 +01:00
MerryMage	6b122751fe	reg_alloc: Remove reliance on IR::Inst::DecrementRemainingUses	2020-04-22 20:26:12 +01:00
MerryMage	30049ca928	emit_x86: Standardize time of DefineValue call	2020-04-22 20:26:12 +01:00
MerryMage	5d72f7048f	basic_block: Add inst address and use count to DumpBlock This additional output assists with debugging.	2020-04-22 20:26:12 +01:00
Mat M	c6d09adcb7	CMakeLists: Derive the source file listings from targets directly (#118) This gets rid of the need to store to individual variables before creating the target itself, cleaning up the variables in the surrounding scope a little bit.	2020-04-22 20:26:07 +01:00
MerryMage	12eaf496fd	emit_x64: Perform mask creation for packed instructions in SSE	2020-04-22 20:26:07 +01:00
MerryMage	305e4baa29	emit_x64: Eliminate conversion of GE flags * We do this so that we can simplify PackedSelect. * We also try to minimise xmm-gpr/gpr-xmm transfers in PackedSelect.	2020-04-22 20:26:07 +01:00
MerryMage	d1e0a29cd9	Implement IR instruction PackedSelect, reimplement SEL	2020-04-22 20:26:07 +01:00
MerryMage	18f11972c6	emit_x64: Remove SSSE3 implementation of PackedHalvingAddU8 It is much slower than the SSE2 implementation, so there's no point keeping it around.	2020-04-22 20:26:07 +01:00
MerryMage	c4b40909f7	emit_x64: Improve code emission of FPCompare{32,64} Replace if-chain with table lookup	2020-04-22 20:26:07 +01:00
MerryMage	814e378249	VCMP and VCMPE were the other way around - This was due to a misunderstanding of what the E in VCMPE means. - The E refers to an exception being raised when a QNaN is encountered. - Added unit tests for VCMP{E}	2020-04-22 20:26:07 +01:00
MerryMage	08f638d447	emit_x64: pmaxuw and pminuw require SSE 4.1 This commit is intended to close citra-emu/citra#3137. pmaxuw and pminuw were used to perform unsigned comparisons; we emulate these using a signed comparison by offsetting the inputs by 0x8000 for CPUs that do not support SSE 4.1.	2020-04-22 20:26:07 +01:00
Mat M	522992965a	Common: Delete Pool's copy constructor and copy/move assignment operators (#117) The language defines a copy constructor as: TypeName(const TypeName&) so this was just deleting a constructor variant that would catch most cases of attempted copies.	2020-04-22 20:22:01 +01:00
Mat M	77fe2aeeaa	emit_x64: Amend doxygen parameters for InvalidateCacheRange() (#116)	2020-04-22 20:22:01 +01:00
MerryMage	19dcdde90b	block_of_code: Add vzeroupper instructions where AVX-SSE transitions may occur	2020-04-22 20:22:01 +01:00
MerryMage	60d9392b5c	block_of_code: BlockOfCode should provide cpu info	2020-04-22 20:22:01 +01:00
MerryMage	148c01e08f	interface_x64: Remove is_executing assert from HaltExecution In multithreaded code this can be triggered due to a race.	2017-10-14 23:35:01 +01:00
MerryMage	f6cf265bc5	block_of_code: BlockOfCode::ABI_* should be const	2017-09-29 01:35:24 +01:00
MerryMage	29471be317	Standardize location of storage-class specifiers: Place at beginning of declarations Justification: C99 specifies that doing otherwise is an obsolescent feature.	2017-09-29 01:23:45 +01:00
MerryMage	b992e5f8ec	Ranged cache invalidation * Fix clearing code block on a partial invalidation * Remove unnecessary use of boost::variant * Code cleanup	2017-09-11 00:11:05 +01:00
Lioncash	80477b5a67	externals: update fmt to 4.0	2017-08-27 21:43:21 +01:00
MerryMage	568b52d4ba	externals: Update Xbyak to v5.51 Xbyak now supports multi-byte nops	2017-08-17 21:34:54 +01:00
MerryMage	1613846ab0	reg_alloc: Handle XMM registers in LoadImmediate	2017-08-16 23:11:05 +01:00
MerryMage	993e142c6b	disassembler: Fix RegList	2017-08-05 01:57:29 +01:00
MerryMage	6197bde0fc	disassembler_arm: Fix disassembly of LDRH (reg)	2017-07-30 18:45:55 +01:00
Yuri Kunde Schlesner	38eb7e0314	emit_x64: Use alternative Xbyak names for and, or, xor Also enabled XBYAK_NO_OP_NAMES, allowing us to stop using -fno-operator-names.	2017-06-12 07:57:46 +01:00
James Rowe	82e8c99a47	Link against static fmtlib instead of header only When including fmtlib as a header only library in dynarmic, downstream projects cannot include fmtlib as a static library without getting linker errors.	2017-05-22 08:23:03 +01:00
MerryMage	599a613fea	Move SEL from status_register_access to misc	2017-04-25 13:57:27 +01:00
MerryMage	50bb317104	parallel: UQADD8 and UQADD16 are unpredictable when {d|n|m} == 15	2017-04-25 13:45:31 +01:00
MerryMage	7639dfea51	coprocessor: Use && instead of & with boolean arguments	2017-04-22 15:05:31 +01:00
MerryMage	2c9dcfa2db	backend_x64: Rename UnwindHandler to ExceptionHandler	2017-04-20 14:08:56 +01:00
MerryMage	0d47f50f57	block_of_code: Implement farcode	2017-04-19 18:58:36 +01:00
MerryMage	1c21ae6bcd	saturated: Implement QASX, QSAX, UQASX, UQSAX	2017-04-10 10:21:51 +01:00
MerryMage	9ac890c62d	reg_alloc: Fix for LLVM's interpretation of the System V ABI This aspect of the System V ABI is under-defined. LLVM chooses a different interpretation from GCC and ICC. Most other compilers assume the callee is responsible for zeroing the upper bits of the register if necessary. LLVM assumes the caller has zero-extended the register. This is a quick fix for this problem until zext-tracking is implemented.	2017-04-08 22:12:37 +01:00
MerryMage	a5bb81a97c	backend_x64: Remove dispatch loop in Jit::Run	2017-04-08 10:04:53 +01:00
MerryMage	1b37420459	backend_x64: Simplify dispatcher	2017-04-08 09:35:45 +01:00
MerryMage	523ae542f4	microinstruction: Implement HasAssociatedPseudoOperation	2017-04-04 13:10:50 +01:00
MerryMage	4c5de3905b	emit_x64: Correct mutation of immutable in FPThreeOp{32,64} operand (args[1]) was erroneously declared as non-scratch. operand's value could be modified if FTZ was enabled.	2017-04-01 09:57:14 +01:00
MerryMage	05e97058c3	parallel: Add and Subtract with Exchange improvements * Remove asx argument from PackedHalvingSubAdd{U16,S16} IR instruction * Implement Packed{Halving,}{AddSub,SubAdd}{U16,S16} IR instructions * Implement SASX, SSAX, UASX, USAX	2017-03-24 15:56:24 +00:00
Lynn	fd068ed6b8	Ranged cache invalidation	2017-03-20 11:58:25 +00:00
MerryMage	d9c69ad997	constant_pool: Implement a constant pool	2017-03-19 13:08:04 +00:00
Lioncash	5a02da445a	CMakeLists: Only link LLVM libs against the library LLVM library code is only used within the main dynarmic library, not the test executable.	2017-03-11 13:25:14 +00:00
Lioncash	d85137ed65	interface_x64: Amend LLVM disassembly code This would previously attempt to perform pointer arithmetic on void pointers, which would cause compilation errors.	2017-03-07 18:32:04 +00:00
Lioncash	d0efbb9348	CMakeLists: Remove unnecessary linker language specifiers This is already inferred by the cmake project being declared a CXX project.	2017-03-07 18:30:58 +00:00
Lioncash	9906be746f	CMakeLists: Make boost an interface library target Gets rid of the use of a non-target include and makes libraries explicitly link against the identifier name in order to get includes.	2017-03-04 11:52:32 +00:00
MerryMage	6396bd02f0	Merge branch 'simplify-reg-alloc'	2017-02-27 00:11:52 +00:00
MerryMage	92a01b0cd8	Prefer ASSERT to DEBUG_ASSERT	2017-02-26 23:30:40 +00:00
MerryMage	135346eb2e	reg_alloc: Move implementations out of header	2017-02-26 23:30:39 +00:00
MerryMage	184db36caf	reg_alloc: Call DecrementRemainingUses in only one place	2017-02-26 23:30:38 +00:00
MerryMage	51fc9fec05	reg_alloc: Reorganize	2017-02-26 23:30:37 +00:00
MerryMage	cf93ab3d31	reg_alloc: Remove old register allocator interface	2017-02-26 23:12:26 +00:00
MerryMage	08a467bf9a	emit_x64: Port to new register allocator interface	2017-02-26 23:12:25 +00:00
Lioncash	662e07337f	CMakeLists: Don't explicitly signify dynarmic as a static lib This allows a user of the library to explicitly control which kind of library type should be built with the CMake BUILD_SHARED_LIBS flag. By default, libraries will build as static without this specifier.	2017-02-26 23:08:49 +00:00
MerryMage	f883bad2cc	reg_alloc: New register allocation interface	2017-02-26 21:37:35 +00:00
MerryMage	13ac0c234e	reg_alloc: Differentiate between ReadLock and WriteLock	2017-02-26 21:37:34 +00:00
MerryMage	6c3df057fa	reg_alloc: Remove unused functions	2017-02-26 21:37:33 +00:00
MerryMage	1ee4c07f14	reg_alloc: Reimplement ScratchHostLocReg	2017-02-26 21:37:32 +00:00
MerryMage	640faab8a7	reg_alloc: UseHostLoc is no longer necessary	2017-02-26 21:37:30 +00:00
MerryMage	9518bbe06e	reg_alloc: Reimplement UseScratchHostLocReg	2017-02-26 21:37:29 +00:00
MerryMage	e1d8238c50	reg_alloc: Stub UseOpArg	2017-02-26 21:37:27 +00:00
MerryMage	2b078152e7	reg_alloc: Reimplement UseHostLocReg	2017-02-26 21:37:26 +00:00
MerryMage	aefe550428	reg_alloc: Remove the Def concept from register allocator internals	2017-02-26 21:37:25 +00:00
MerryMage	65cccf070e	reg_alloc: Properly encapsulate HostLocInfo	2017-02-26 21:37:24 +00:00
MerryMage	469bb6253f	backend_x64: Factor EmitExclusiveWriteMemory64 into ExclusiveWrite	2017-02-26 15:34:26 +00:00
MerryMage	d7ab1f9c64	backend_x64: Fix ABI violation in ReadMemory and WriteMemory Caller-save registers were not saved before call instruction. Refer to issue #98.	2017-02-26 15:34:25 +00:00
MerryMage	3768174783	ir_opt: Constant propagation pass works better with a DCE just before it	2017-02-26 15:28:35 +00:00
MerryMage	157585887e	ir_opt: Simplify dead-code elimination pass	2017-02-26 15:28:34 +00:00
MerryMage	bbeea72eba	ir_opt: Remove redundant shift instructions	2017-02-26 15:28:14 +00:00
MerryMage	517fe0f18e	emit_x64: WriteMemory* microinstructions do not define a value	2017-02-25 11:54:47 +00:00
MerryMage	1ff60bc69f	reg_alloc: Move OpArg into own header	2017-02-21 23:38:36 +00:00
MerryMage	4ed8ee7489	microinstruction: Void arguments when invalidating instruction	2017-02-18 21:29:23 +00:00
MerryMage	7fa5845c1f	extension: Implement SXTAB16 and SXTB16	2017-02-18 20:14:44 +00:00
MerryMage	73d1cf36c3	extension: Simplify UXTB16	2017-02-18 20:14:39 +00:00
MerryMage	6edcfeba0b	extension: Simplify rotation code	2017-02-18 20:14:37 +00:00
MerryMage	cc9d2c4603	saturated: Implement SSAT16 and USAT16	2017-02-18 17:43:57 +00:00
MerryMage	358cf7c322	vfp: Implement vectorized VFP instructions	2017-02-18 01:13:25 +00:00
MerryMage	f2dd82967f	load_store: Simplify implementation * Remove dead code * Standardise code style with rest of code base	2017-02-16 22:28:56 +00:00
MerryMage	058f7b5de6	emit_x64: Make EmitTerminal type-safe Avoid the use of boost::variant::which, which tends to produce code which is not verifiable at compile-time.	2017-02-16 19:40:51 +00:00
MerryMage	e197b10b96	common: Introduce utility function VisitVariant VisitVariant allows one to use a generic lambda to visit a boost::variant. This is necessary because boost::visit_variant requires the visitor type to provide a return type.	2017-02-16 19:30:56 +00:00
MerryMage	5a20a37d3f	arm/fpscr: Correct Stride implementation	2017-02-11 12:13:57 +00:00
MerryMage	033e8b9b1e	vfp: Rename variables a, b, c to more sensible names	2017-02-06 21:14:36 +00:00
MerryMage	2af39dfaa8	emit_x64: Make reg_alloc a local variable reg_alloc contains state that is only valid on a per-block basis, so there is no reason for it to be a member variable.	2017-02-04 09:29:35 +00:00
MerryMage	a0e9417912	ir_opt: Initial constant propagation pass implementation	2017-01-30 21:49:46 +00:00
MerryMage	2447f2f360	callbacks: Factorize memory callbacks into inner structure	2017-01-30 21:42:51 +00:00
MerryMage	642ccb0f66	ir/value: Support U16 immediates	2017-01-29 22:58:11 +00:00
MerryMage	5f7ffe0d0b	microinstruction: Implement Inst::AreAllArgsImmediates	2017-01-29 22:56:59 +00:00
MerryMage	22804dc6a5	microinstruction: Arguments of Inst::Use and Inst::UndoUse should be const	2017-01-29 22:53:46 +00:00
MerryMage	1d4446cad5	microinstruction: Removed unnecessary reference from argument of Inst::ReplaceUsesWith	2017-01-29 22:52:33 +00:00
MerryMage	3e0e339d98	bit_util: Remove unnecessary include	2017-01-09 22:19:51 +00:00
MerryMage	9ecdd32b84	coprocessor: Implement fast-path for Coproc{Send,Get}{OneWord,TwoWords} Allow coprocessor interface to provide pointers instead of a callback. This allows for a fastpath when all that is required is to read or write a value and no other action needs to be taken.	2017-01-08 14:56:06 +00:00
MerryMage	e3bc7d039f	Implement CDP, LDC, MCR, MCRR, MRC, MRRC, STC	2017-01-08 14:56:06 +00:00
MerryMage	48693eb6ff	Implement coprocessor-related microinstructions * CoprocInternalOperation * CoprocSendOneWord * CoprocSendTwoWords * CoprocGetOneWord * CoprocGetTwoWords * CoprocLoadWords * CoprocStoreWords	2017-01-08 14:56:06 +00:00
MerryMage	b3ae57619d	types: Formatting for CoprogReg	2017-01-08 14:56:06 +00:00
MerryMage	d8a37e287c	IR: Add IR type CoprocInfo	2017-01-08 14:56:06 +00:00
MerryMage	890b2f75ad	callbacks: Add coprocessor interface	2017-01-08 14:56:06 +00:00
MerryMage	1efd3a764d	IR: Remove unused microinstructions NegateLowWord and NegateHighWord	2017-01-05 20:16:39 +00:00
Fernando Sahmkow	70f4235ee9	Implement UXTAB16 (#78)	2016-12-29 12:15:18 +00:00
MerryMage	0d1fa85402	bit_util: Bit<T>(size_t, const T) cannot be constexpr Compound statements are not permitted in constexpr functions in C++14	2016-12-29 10:08:35 +00:00
FernandoS27	d5610eb26c	Implement UHASX, UHSAX, SHASX and SHSAX (#75)	2016-12-28 21:32:22 +00:00
MerryMage	e9df248d56	decoder_detail: Support const member functions	2016-12-23 11:33:40 +00:00
MerryMage	163b67bf1f	mp: Add support for const member function pointers to FunctionInfo	2016-12-23 11:32:12 +00:00
MerryMage	b1bad4b5cc	decoder_detail: static_assert member function is from visitor class Improves readability of compiler errors.	2016-12-23 11:10:02 +00:00
MerryMage	c7e5216473	emit_x64: EraseInstruction now also invalidates the instruction There is now no longer a need to call DecrementRemainingUses on the parent instruction.	2016-12-22 18:43:11 +00:00
MerryMage	c78f153ddb	decoder/arm: Sort decoders according to number of bits in mask	2016-12-22 15:25:38 +00:00
MerryMage	cb38c94b58	decoder/arm: Fix decoding of RFE	2016-12-22 15:25:07 +00:00
MerryMage	7e77ee7fd6	decoder/arm: Fix decoding of MCR2	2016-12-22 15:11:47 +00:00
Fernando Sahmkow	677f62dd6f	Implement SHSUB8 and SHSUB16 (#74) * Implement IR operations PackedHalvingSubS8 and PackedHalvingSubS16	2016-12-22 12:02:24 +00:00
MerryMage	967f3cf7e1	Implement CPS (Thumb) * Since currently only User mode is emulated, CPS is a NOP.	2016-12-21 22:44:27 +00:00
MerryMage	c764a2b889	Implement MUL (T1)	2016-12-21 22:44:14 +00:00
MerryMage	36082087de	callbacks: Read code using MemoryReadCode callback	2016-12-21 21:39:14 +00:00
MerryMage	56ea2386d3	saturated: Implement SSAT and USAT	2016-12-21 19:51:25 +00:00
MerryMage	6a269a6ebd	IR: Add microinstructions UnsignedSaturation and SignedSaturation	2016-12-21 19:51:25 +00:00
MerryMage	b23b524b03	bit_util: Add SignExtend implementation with runtime bit_count argument	2016-12-21 19:51:25 +00:00
MerryMage	02b2ab7581	emit_x64: Pass tmp to ExtractMostSignificantBitFromPackedBytes in EmitPackedAddU8	2016-12-20 22:07:51 +00:00
MerryMage	097f6a83da	emit_x64: Document ExtractAndDuplicateMostSignificantBitFromPackedWords	2016-12-20 22:06:14 +00:00
MerryMage	03f168094d	emit_x64: Document ExtractMostSignificantBitFromPackedBytes	2016-12-20 22:05:51 +00:00
FernandoS27	8919265d2c	Implement SADD8, SADD16, SSUB8, SSUB16, USUB16	2016-12-20 21:52:38 +00:00
FernandoS27	3f6ecfe245	Implemented USAD8 and USADA8	2016-12-20 21:52:38 +00:00
MerryMage	b1d3e7aae9	emit_x64: Refactor patching code * Only have a single std::unordered_map for patching information * Factor patch emitters into own functions * Implement EmitX64::Unpatch	2016-12-20 14:06:55 +00:00
MerryMage	cc58666c06	CMakeLists: Use target_compile_options instead of add_compile_options	2016-12-19 00:48:25 +00:00
MerryMage	74a95ea51e	block_of_code: Rename alloc to AllocateFromCodeSpace	2016-12-18 23:43:48 +00:00
MerryMage	96e46ba6b5	Implement QADD, QSUB, QDADD, QDSUB	2016-12-15 22:34:29 +00:00
MerryMage	b178ab3bec	Replace (void)(...); idiom with UNUSED macro	2016-12-15 21:36:05 +00:00
MerryMage	276873bf70	Wrap #pragma warning with #ifdef _MSC_VER .. #endif	2016-12-15 21:36:02 +00:00
MerryMage	0e8b626d87	CMakeLists: Globally disable MSVC warning C4592 C4592: Symbol will be dynamically initialized (implementation limitation)	2016-12-15 21:09:55 +00:00
MerryMage	91e851a991	CMakeLists: Enable /W4 on MSVC	2016-12-15 20:52:23 +00:00
MerryMage	63caed7b09	emit_x64: Remove argument names of unused arguments	2016-12-15 20:52:22 +00:00
MerryMage	df197ff6b1	arm/types: Use smallest possible standard type that has sufficient bits for Imm{} types	2016-12-15 20:52:21 +00:00
MerryMage	546198d603	translate_arm: Mark arguments as unused	2016-12-15 20:52:20 +00:00
MerryMage	8d5522f4a0	dissassembler_arm: Support BKPT, QASX, QSAX, UQASX, UQSAX	2016-12-15 20:16:08 +00:00
Yuri Kunde Schlesner	34e19f135c	CMake: Re-use external xbyak target if present (#62)	2016-12-12 14:23:42 +00:00
MerryMage	5bea2e1680	block_of_code: Support stack unwinding on Windows	2016-12-12 07:49:18 +00:00
MerryMage	4962d92b79	block_of_code: Do not regenerate prelude when clearing cache	2016-12-12 07:49:18 +00:00
MerryMage	2a1cf94b1c	CMakeLists: Include backend_x64 only if we're targeting x86_64	2016-12-12 07:49:18 +00:00
MerryMage	dcc880a002	assert: _a_ expression string shouldn't be part of the format string The expression may contain the % operator.	2016-12-12 07:49:18 +00:00
MerryMage	179a3388f9	block_of_code: Provide an alloc function to allocate space in the code block	2016-12-12 07:49:18 +00:00
Lioncash	f467589346	emit_x64: Remove unnecessary casts	2016-12-05 20:30:19 +00:00
Lioncash	a37631c010	emit_x64: Use reinterpret_cast for pointer casts	2016-12-05 20:30:19 +00:00
Lioncash	fafa845f64	emit_x64: Make GetBasicBlock() const qualified	2016-12-05 12:46:22 +00:00
Lioncash	6a16edc0fb	emit_x64: Move implementations into the cpp file Prevents needing to rebuild everything including the emitter if any details ever change.	2016-12-05 12:46:22 +00:00
Lioncash	282029f60a	emit_x64: Forward declare BlockOfCode	2016-12-05 12:46:22 +00:00
Lioncash	6898b74c78	emit_x64: Get rid of indirect includes	2016-12-05 12:46:22 +00:00
MerryMage	54d051977f	emit_x64: Use movdqa instead of movaps in EmitPackedSubU8 While movaps and movdqa are behaviourally equivalent, using movaps may incur a domain crossing penalty on some microarchitectures. This is because movaps is an instruction in the floating-point domain while the following instructions are in the integer domain.	2016-12-05 01:00:51 +00:00
MerryMage	52e1445f43	Implement USUB8	2016-12-05 00:29:15 +00:00
MerryMage	5c1aab1666	Implement CLZ Includes tests	2016-12-04 22:56:33 +00:00
MerryMage	1a1646d962	Implement UADD8	2016-12-04 20:52:33 +00:00
MerryMage	7cad6949e7	IR: Implement new pseudo-operation GetGEFromOp	2016-12-04 20:52:06 +00:00
MerryMage	25f21b5371	emit_x64: Inline nzcv computation into EmitFPCompare32 and EmitFPCompare64	2016-12-04 11:43:31 +00:00
MerryMage	cede5e442a	emit_x64: Use xorps/xorpd when argument to TransferToFP32/TransferToFP64 is an immediate zero	2016-12-03 11:41:10 +00:00
MerryMage	e166965f3e	Implement VCMP	2016-12-03 11:41:09 +00:00
MerryMage	f2fe376fc6	Support 64-bit immediates	2016-12-03 11:29:50 +00:00
Mat M	95f34c683c	reg_alloc: Remove unnecessary breaks after returns (#54)	2016-12-02 19:14:44 +00:00
Mat M	de1f831d79	microinstruction: Make use_count private (#53) Makes the operation a part of the direct interface.	2016-11-30 21:51:06 +00:00
MerryMage	3621a925b2	reg_alloc: Register allocator related constraints belong with the rest of the register allocator HostLocToReg64 contained two DEBUG_ASSERTs involving constraints that really belonged to the register allocator. The register allocator prevents allocation of RSP and R15 because those are reserved for the stack pointer and the state pointer respectively.	2016-11-30 19:42:41 +00:00
MerryMage	5f11b4f50e	HostLoc: R15 is a GPR	2016-11-30 18:38:03 +00:00
Sebastian Valle	14eb70d7e4	VFP: Fixed the VCVT behavior when converting from unsigned 32-bit values. (#51) Use a 64-bit register to hold the values so that we don't end up interpreting them as signed values.	2016-11-27 23:25:50 +00:00
Merry	0ff8c375af	Implement UHSUB8 and UHSUB16 (#48)	2016-11-26 18:27:21 +00:00
Merry	cb17f9a3ed	Implement SHADD8 and SHADD16 (#47)	2016-11-26 18:12:29 +00:00
Sebastian Valle	11ae8d1ffa	Added disassembler support for the ARM parallel add/subtract (modulo arithmetic) instructions. (#50)	2016-11-26 17:58:09 +00:00
Sebastian Valle	ed71e31cea	Added disassembler support for the ARM parallel and saturated instructions (#44)	2016-11-26 17:49:46 +00:00
MerryMage	c0c1bb1094	Implemented UHADD16	2016-11-26 11:28:20 +00:00
Mat M	4f7dc81492	mp: Fix static_assert condition (#46) Not an issue currently, but this would have prevented type inspection on the last function parameter.	2016-11-25 22:09:45 +00:00
Yuri Kunde Schlesner	9ec51f74bd	libfmt: Update version to current master	2016-11-25 20:47:04 +00:00
Sebastian Valle	4d44474ad4	Implemented the ARM UHADD8 instruction. (#45) The x64 implementation uses the SSSE3 instruction PSHUFB. A non-SSE fallback is provided in case the CPU doesn't support it.	2016-11-25 20:32:22 +00:00
Sebastian Valle	f32921d493	ARM: Implemented UXTB16. (#42) It passes tests.	2016-11-24 08:21:12 +00:00
Sebastian Valle	32615d0eff	Implemented the PKHTB and PKHBT instructions with tests. (#40)	2016-11-23 21:45:18 +00:00
MerryMage	780ff8e00e	status_register_access: SEL: Use GetGEFlags	2016-11-23 19:47:35 +00:00
MerryMage	b6f7b8babd	ir: Implement GetGEFlags, SetGEFlags	2016-11-23 19:44:27 +00:00
MerryMage	e7d02a5439	get_set_elimination_pass: Refactor CPSR related eliminations	2016-11-23 18:42:13 +00:00
Sebastian Valle	d589c63107	Implemented the ARM SEL instruction, with tests. (#39) The test for this instruction is very peculiar. As the instruction's behavior depends on the value of the CPSR, we generate a MSR instruction after each SEL instruction to change the CPSR.	2016-11-23 18:14:07 +00:00
Mat M	65dcf45ca6	FPSCR: Mask away reserved bits (#34)	2016-09-21 17:51:13 +01:00
MerryMage	792f2bfd94	translate_arm: Remove unused method ArmTranslatorVisitor::LinkToNextInstruction	2016-09-21 14:07:53 +01:00
Mat M	f75acd6cfb	decoder: Generify the matcher interface (#33) Gets rid of a bit of duplication while remaining compatible with the current interfaces in place.	2016-09-17 09:48:18 +01:00
Mat M	943487ecee	disassembler: Provide includes to function declarations (#32)	2016-09-14 23:03:09 +01:00
Mat M	72897b5def	types: Provide ostream operator<< overloads where applicable (#30)	2016-09-07 14:21:17 +01:00
Mat M	b41de890fb	memory_pool: Deduplicate slab allocation code (#28)	2016-09-07 13:20:42 +01:00
Merry	d646c3119d	Merge pull request #29 from lioncash/list intrusive_list: Minor changes	2016-09-07 12:10:05 +01:00
Mat M	6a2174ebfa	Add missing explicit specifiers (#27)	2016-09-07 12:08:48 +01:00
Mat M	6e0f27a500	types: Add helpers for determining single and doubleword extension registers (#26)	2016-09-07 12:08:35 +01:00
Lioncash	c052f9f84c	intrusive_list: Amend doxygen parameter documentation	2016-09-06 22:54:33 -04:00
Lioncash	1c4868ccce	intrusive_list: Correct unused variable	2016-09-06 22:54:25 -04:00
Lioncash	8fb857f9da	intrusive_list: Specify noexcept on swap implementations Necessary to fully satisfy the Swappable concept.	2016-09-06 22:47:55 -04:00
Mat M	5bc9ce544f	arm_types: Move into arm folder (#25)	2016-09-06 00:52:33 +01:00
Mat M	b40d19c3b7	location_descriptor: Provide operator<< string overload (#24)	2016-09-05 21:31:25 +01:00
MerryMage	1f61a3d7bc	jitstate: Rename fields s/guest_FPSCR/FPSCR/	2016-09-05 14:42:21 +01:00
Mat M	6d53bb6d7e	arm_types: Split out LocationDescriptor (#20) This isn't really an ARM-specific type, since it's used to indicate a Block location.	2016-09-05 11:54:09 +01:00
Mat M	84336cf29d	value: Change Value into a class (#19) 'struct' is a little bit of a misnomer, considering it has invariants	2016-09-05 11:53:56 +01:00
Mat M	858796a029	Eliminate variable shadowing warnings with MSVC (#17)	2016-09-04 11:30:57 +01:00
Mat M	7f9a0c3c38	Remove unnecessary explicit includes (#16)	2016-09-03 21:48:03 +01:00
Mat M	26db11cd71	reg_alloc: Use a strongly-typed enum for representing OpArg type (#15)	2016-09-03 18:30:03 +01:00
Mat M	05b189bc26	arm_types: Specialize std::hash for LocationDescriptor (#14) Same thing, but with the benefit of working with anything that uses std::hash by default.	2016-09-03 12:48:47 +01:00
Mat M	8c4df46580	FPSCR: Make value constructor explicit (#13) Maintains consistency between the PSR helper.	2016-09-03 12:48:31 +01:00
Mat M	3e03524658	assert: Use attribute specifier syntax with non MSVC compilers (#12)	2016-09-03 12:48:07 +01:00
MerryMage	cc3e7e71aa	bit_util: std::bitset-based BitCount implementation Suggestion by @lioncash.	2016-09-02 22:00:48 +01:00
Mat M	5aa4f753b6	load_store: Add checks for unpredictability to other singular store instructions (#11)	2016-09-02 21:10:28 +01:00
MerryMage	e8764c129f	bit_util: Implement BitCount portably	2016-09-02 19:05:49 +01:00
Mat M	6ec651498d	arm: Add PSR helper type (#3)	2016-09-02 17:34:33 +01:00
Mat M	00d0f4d5ff	load_store: Add correctness checks for STRD variants (#7) STRD doesn't allow the use of the PC in either Rt or Rt2	2016-09-02 17:32:02 +01:00
Mat M	d16badbc04	get_set_elimination_pass: Replace decltype with direct type retrieval (#9)	2016-09-02 17:30:21 +01:00
Mat M	1e781d911a	reg_alloc: const correctness (#8)	2016-09-02 17:30:01 +01:00
MerryMage	ba04be5071	travis: Build on OS X	2016-09-02 17:08:09 +01:00
MerryMage	b3743e9453	Revert "arm_types: Don't use std::hash<u64>() for LocationDescriptorHash" This reverts commit `519c714dbc`.	2016-09-02 14:33:56 +01:00
MerryMage	519c714dbc	arm_types: Don't use std::hash<u64>() for LocationDescriptorHash Apple Clang (clang-600.0.54 on x86_64-apple-darwin13.4.0) complains with: implicit instantiation of undefined template 'std::__1::hash<unsigned long long>'	2016-09-02 12:45:09 +01:00
Mat M	a465b2ddbc	ir_emitter: Fix typo. ClearExlcusive -> ClearExclusive (#5)	2016-09-02 12:17:22 +01:00
Mat M	ea157dfd52	translate_arm: const-correctness (#6)	2016-09-02 12:17:02 +01:00
MerryMage	711b3e29d3	interface: Allow ClearCache to be called at any time	2016-09-02 10:59:33 +01:00
Mat M	fb6d838bd9	dynarmic: Remove poison_memory ClearCache parameter (#1) Unused since the switch to Xbyak	2016-09-01 09:47:09 +01:00
Mat M	7e3c981974	translate: Forward declare LocationDescriptor (#2)	2016-09-01 09:46:35 +01:00
MerryMage	4321e829d1	callbacks: Add user_arg argument to InterpreterFallback	2016-09-01 02:00:08 +01:00
MerryMage	3b5c43b427	Optimization: Read page-table directly for memory access	2016-09-01 00:58:02 +01:00
MerryMage	57169ec093	abi: Implement ABI_PushCallerSaveRegistersAndAdjustStack and ABI_PopCallerSaveRegistersAndAdjustStack	2016-09-01 00:57:22 +01:00
MerryMage	702e181b35	backend_x64/abi: Reversing XMM list leads to incorrect ordering	2016-08-31 23:06:49 +01:00
MerryMage	b10c438e8e	jitstate: Remove code argument from ResetRSB	2016-08-31 21:57:33 +01:00
Lioncash	ea6a4e82b5	block_of_code: Make CallFunction accept function pointers only	2016-08-31 21:51:44 +01:00
Lioncash	37d64f0c86	hostloc: Simplify static_assert	2016-08-28 22:10:23 +01:00
Lioncash	f2bf795876	intrusive_list: Interface changes - Remove the root pointer from iterators. This is unnecessary, since the only way to get a valid iterator is either from a node itself (it transiently becomes an iterator via the underlying interface), or through the iterator interface for the list. This should also result in better code generation, as each increment or decrement of an iterator is now branchless. - Remove iterator_to This is actually a pretty dangerous function, since it would immediately create an iterator into the list using the given item, even if it's not actually part of the list. This was only left around due to lack of type handling around constructors. - Add other overloads for erase() and remove() Now handles iterators, pointers, and references.	2016-08-28 20:56:40 +01:00
MerryMage	7912a79fa5	emit_x64: align before emitting blocks	2016-08-27 11:04:43 +01:00
MerryMage	41c8dabf0b	block_of_code: nop should probably default to a size of 1	2016-08-27 10:57:48 +01:00
MerryMage	dca3b2f079	Implement VMRS and VMSR	2016-08-26 22:47:54 +01:00
MerryMage	814348371e	emit_x64: EmitX64::Emit: block.Location() returns by value	2016-08-26 19:43:29 +01:00
Lioncash	79545661b3	intrusive_list: De-duplicate some iterator code These increment/decrement variants can just leverage the other overloads.	2016-08-26 19:15:11 +01:00
MerryMage	4f6ea715b2	emit_x64: EmitX64::Emit doesn't need descriptor argument	2016-08-26 19:14:25 +01:00
Lioncash	32c24d2cb3	Use 'false' instead of '0' in asserts	2016-08-26 18:52:08 +01:00
MerryMage	ba31f43672	reg_alloc: UseDefOpArgXmm: default value for argument desired_location should be any_xmm, not any_gpr	2016-08-26 18:50:08 +01:00
MerryMage	7fedf04e79	reg_alloc: Deduplicate constants in RegAlloc::HostCall that were already defined by abi.h	2016-08-26 18:43:50 +01:00
MerryMage	59a8e14d1c	reg_alloc: Correct OpArg::setBit for Reg	2016-08-26 15:23:38 +01:00
MerryMage	065c53ebfc	emit_x64: Make ZeroIfNaN64 branchless	2016-08-26 15:23:08 +01:00
MerryMage	9901ed0f51	block_of_code: Optimize nops	2016-08-26 13:46:19 +01:00
Lioncash	0102951bdd	Convert formatting over to fmtlib	2016-08-26 13:13:19 +01:00
Lioncash	ee4b30eee4	externals: Add fmt as a submodule	2016-08-26 13:13:19 +01:00
MerryMage	ed3a686d1d	Implement public header files	2016-08-26 00:44:50 +01:00
MerryMage	656d4f7252	emit_x64: inhibit_emission is obsolete Not used anymore; unused ever since intrusive lists were introduced.	2016-08-25 23:24:16 +01:00
MerryMage	4322c0907c	microinstruction: Rename FindUseWithOpcode to GetAssociatedPseudoOperation, encapsulate associated variables	2016-08-25 21:08:47 +01:00
MerryMage	30df51c2dc	ir_emitter: Should be in the IR namespace, not the Arm namespace	2016-08-25 17:36:42 +01:00
MerryMage	922d1fd198	Merge branch 'xbyak'	2016-08-25 16:54:48 +01:00
MerryMage	d04b9eaa81	backend_x64/block_of_code: Reset labels when ClearCache() is called	2016-08-25 16:18:18 +01:00
MerryMage	e32812cd00	Port x64 backend to xbyak	2016-08-25 16:18:17 +01:00
Lioncash	0e12fb6a56	basic_block: Move all variables behind a public interface	2016-08-25 16:14:37 +01:00
Lioncash	1d8432487d	arm_types: Provide the not-equals operator overload for LocationDescriptor Generally if == has an overload, != should be provided for symmetry.	2016-08-25 14:08:16 +01:00
MerryMage	13908c5a58	reg_alloc: Insert braces around DEBUG_ASSERT DEBUG_ASSERT becomes an empty statement in release-mode; an if statement with an empty statement produces a compiler warning.	2016-08-25 13:09:18 +01:00
MerryMage	dc26afbd7e	translate_arm: Translate more than one conditional instruction in a block	2016-08-25 13:05:33 +01:00
MerryMage	aa9b63bac4	basic_block: DumpBlock now dumps terminal details	2016-08-25 13:01:32 +01:00
Lioncash	1395baefa9	interface: Return register files by const reference Prevents unnecessary copies where they aren't particularly required.	2016-08-25 12:51:41 +01:00
Lioncash	37755cbfec	translate: Simplify function pointer calls They can just be called like regular functions	2016-08-24 23:19:50 +01:00
Lioncash	9b874c2e23	CMakeLists: Add FPSCR.h to the list of headers Whoops, that one's on me	2016-08-24 23:19:49 +01:00
MerryMage	22cca5ff72	emit_x64: Actually advance RSB pointer	2016-08-24 23:19:47 +01:00
Lioncash	eba3a06d80	frontend: Introduce FPSCR register helper class Encapsulates all of the FPSCR state.	2016-08-24 20:51:14 +01:00
MerryMage	b5a86889cd	Implement VCVT	2016-08-23 22:20:04 +01:00
MerryMage	445aad0639	x64/emitter: Add opBits argument to CVTSI2SS and CVTSI2SD	2016-08-23 21:58:34 +01:00
MerryMage	78464a8f01	translate_arm/vfp2: Implement VSTM (A1, A2)	2016-08-23 20:54:38 +01:00
MerryMage	a96704eb0f	arm_types: new_reg >= 0 is always true since new_reg is unsigned	2016-08-23 20:11:41 +01:00
MerryMage	7a01dba3c4	arm_types: Change type signature of operator+ to size_t instead of int	2016-08-23 20:07:53 +01:00
MerryMage	af9a68f0d1	translate_arm/vfp2: Implement VLDM (A1, A2)	2016-08-23 20:07:06 +01:00
Lioncash	d5805cc6eb	intrusive_list: Add size querying Since we store pointers and have an interface for iterators set up, the count is just the distance from the beginning to the end of the list. Nice thing is that because of this, basic blocks also get the ability to have a size count without needing to do anything directly.	2016-08-23 19:52:09 +01:00
Lioncash	2180a4be7a	basic_block: Use a range-based for loop for iteration	2016-08-23 19:51:01 +01:00
Lioncash	897b776250	string_util: Use C++ attribute specifier for format strings This is also compatible with both clang and GCC	2016-08-23 19:38:48 +01:00
Lioncash	867d345fdc	disassembler: Deduplicate SignStr Also just makes it return a character, rather than a pointer to a string.	2016-08-23 16:40:33 +01:00
Lioncash	8bed891011	x64 emitter: Fix swapped parameter names	2016-08-23 16:39:38 +01:00
MerryMage	c8b2f63c93	get_set_elimination_pass: Eliminate unnecessary gets/sets of extended registers	2016-08-23 15:57:20 +01:00
MerryMage	e0f9dead5d	microinstruction: Identity's type depends on the type of its argument	2016-08-23 15:48:30 +01:00
Lioncash	67706c208b	assert: Use false in asserts rather than 0 Quiets extended warnings.	2016-08-23 14:31:54 +01:00
MerryMage	8c7a81a308	VPOP and VPUSH are floating-point load-store instructions	2016-08-23 14:26:50 +01:00
MerryMage	34cffa86a4	dead_code_elimination_pass: Update to use IR::Inst::MayHaveSideEffects	2016-08-23 13:12:14 +01:00
Lioncash	46573eb538	intrusive_list: Add insert_before() and insert_after() helper functions Small helpers for inserting nodes before and after an existing one. insert() is the same as insert_before(), so insert() is just made to be an alias of this.	2016-08-23 12:38:57 +01:00
MerryMage	8d1b9f32ca	Standardize indentation of switch statments	2016-08-23 12:19:27 +01:00
MerryMage	2471be317e	arm_types: Implement LocationDescription::FPSCR_RMode	2016-08-23 02:22:04 +01:00
Lioncash	47f285249b	microinstruction: Introduce convenience informational functions Whenever more rigorous optimizations are attempted (or even basic ones), it's usually helpful to know what overall kind of instruction is being dealt with, in the event certain classes of instructions may be eligible for optimization.	2016-08-22 21:36:48 +01:00
Lioncash	06ec4b5977	microinstruction: Make constructor explicit	2016-08-22 16:01:18 +01:00
Lioncash	1bedd3bd7f	CMakeLists: Clean up Moves functions out of the main CMakeLists file into module files that can just be included whenever necessary. This also uses the CMake provided variables for enforcing compiler requirements.	2016-08-22 15:55:39 +01:00
MerryMage	72250b119f	backend_x64/block_of_code: Add more floating point constants * MFloatPositiveZero32 * MFloatPositiveZero64 * MFloatMinS32 * MFloatMaxS32 * MFloatMinU32 * MFloatMaxU32	2016-08-22 15:54:19 +01:00
MerryMage	a32689c832	x64/emitter: Implement CMPxxSD instructions	2016-08-22 15:54:18 +01:00
MerryMage	843d29b5a9	translate_arm/branch: Read-after-write in arm_BLX_reg When BLX LR is translated, BXWritePC(GetRegister(Reg::LR)) was executed after the SetRegister(Reg::LR, _) update was done.	2016-08-22 15:53:56 +01:00
MerryMage	d8bee60947	translate_thumb: Read-after-write in thumb16_BLX_reg When the instruction BLX LR is translated, BXWritePC(GetRegister(Reg::LR)) was executed after the SetRegister(Reg::LR, _) update was performed.	2016-08-22 14:28:51 +01:00
Lioncash	1abe881921	basic_block: Add proxy member functions for the instruction list Currently basic block kind of acts like a 'dumb struct' which makes things a little more verbose to write (as opposed to keeping it all in one place, I guess). It's also a little wonky conceptually, considering a block is composed of instructions (i.e. 'contains' them). So providing accessors that make it act more like a container can make working with algorithms a little nicer. It also makes the API a little more defined. Ideally, the list would be only available through a function, but currently, the pool allocator is exposed, which seems somewhat odd, considering the block itself should manage its overall allocations (with placement new, and regular new), rather than putting that sanitizing directly on the IR emitter (it should just care about emission, not block state). However, recontaining that can be followed up with, as it's very trivial to do.	2016-08-22 13:44:56 +01:00
Lioncash	226d66dd5b	intrusive_list: satisfy the Swappable concept	2016-08-22 12:38:16 +01:00
Lioncash	2a9fdacc60	intrusive_list: move iterator implementation above list Will make keeping non-member list functions easier to keep together with the class.	2016-08-22 12:38:16 +01:00
Lioncash	669ffb5f3a	intrusive_list: Add pop_back(), pop_front(), front(), and back() member functions	2016-08-20 21:26:16 +01:00
Lioncash	86f803da04	reg_alloc: Use Inst's HasUses() function where applicable	2016-08-20 21:26:09 +01:00
Lioncash	a8ba15f0d5	intrusive_list: Make Remove and IsEmpty stdlib compatible Makes the name match the standard library equivalents. C++17 introduces non-member empty() which allows for nicer handling in generic contexts. May as well make the data structure compatible with it.	2016-08-19 20:25:18 +01:00
Lioncash	23d190f7b0	intrusive_list: Support inserters Allows std::inserter, std::back_inserter, and std::front_inserter to work with intrusive lists.	2016-08-19 20:25:17 +01:00
Lioncash	36a0ad5bc2	reg_alloc: const correctness for ValueLocation()	2016-08-19 19:33:57 +01:00
MerryMage	2d6a86e43c	Remove <cassert>	2016-08-19 01:53:24 +01:00
MerryMage	192a0029be	ir/opcodes: Implement IR::AreTypesCompatible Type-checking is now occuring in more than one place.	2016-08-19 01:34:14 +01:00
Tillmann Karras	9782e7da3f	verification_pass: show type errors	2016-08-19 01:17:30 +01:00
Tillmann Karras	dad7724b86	TranlateArm: implement remaining multiplies SMLALxy, SMLAxy, SMULxy SMLAWy, SMULWy, SMLAD, SMLALD, SMLSD, SMLSLD, SMUAD, SMUSD	2016-08-19 01:08:38 +01:00
MerryMage	fe15cbd50e	translate_arm/parallel: Detect UNPREDICTABLE instructions	2016-08-19 00:59:07 +01:00
MerryMage	2119dfc926	translate_arm/multiply: MLA is UNPREDICTABLE when Ra == R15	2016-08-19 00:59:05 +01:00
MerryMage	0d0f4b1b4f	translate_arm/load_store: Correct implementation for LDM*	2016-08-19 00:59:04 +01:00
MerryMage	4acc481463	translate_arm/load_store: Handle unpredictable instructions This necessated handling literal versions of the instructions separately as they had different requirements. The rationale for detecting unpredictable instructions is because: a. they are unlikely to be outputted by a well-behaved compiler b. their behaviour may change between different processors I would rather unpredictable instructions fail loudly than silently do approximately the right thing.	2016-08-19 00:59:02 +01:00
MerryMage	5869e79b9c	translate_arm: Simplify EmitImmShift and EmitRegShift	2016-08-19 00:21:31 +01:00
Lioncash	fe9329ef3e	intrusive_list: Add list class type definitions; extend iterator interface Adds type definitions, and extends the list interface to support all standard library forms of iterator creation.	2016-08-18 23:47:26 +01:00
Lioncash	95a83543f2	intrusive_list: Get rid of unnecessary static_casts The only valid objects to add to the list are those that inherit from IntrusiveListNode. Therefore anything being added to the list that isn't inheriting from it will cause compilation to fail.	2016-08-18 23:47:26 +01:00
Lioncash	67509935f6	intrusive_list: Eliminate need for separate const iterator construct This generalizes the regular iterator to be compatible with both use cases. Passing in the list instance directly isn't needed, because the only way you'd ever get a valid instantiation of an iterator is from a list instance itself.	2016-08-18 23:47:26 +01:00
MerryMage	b8cf43c43e	translate_arm/data_processing: Rd == R15 is unpredictable for rsr instructions	2016-08-18 18:23:05 +01:00
MerryMage	efc8d2f772	arm_translator: NV conditional is obsolete	2016-08-18 18:21:48 +01:00
MerryMage	5f7d940fde	disassemble_arm: Partially implement coprocessor and hint instructions	2016-08-18 18:21:16 +01:00
MerryMage	36a916a766	decoder/arm: Correct NOP decoder	2016-08-18 18:20:29 +01:00

2057 commits