dynarmic

Author	SHA1	Message	Date
Lioncash	7797bc2fb2	emit_x64_vector: Use non-scratch Use* variants of registers within EmitVectorUnsignedAbsoluteDifference() In some cases, a register isn't modified, depending on the branch taken, so we can signify this by using the non-scratch variants in certain cases.	2020-04-22 20:46:20 +01:00
Lioncash	f7f83b76b7	simd_scalar_two_register_misc: Implement scalar double/single-precision variants of FCM{EQ, GE, GT, LE, LT} (zero)	2020-04-22 20:46:20 +01:00
Lioncash	9db6d1e98b	translate_arm: Remove unnecessary rotr() function We already have RotateRight() in our common code, so we can remove this function and replace its uses with RotateRight(). We can also implement ArmExpandImm_C() in terms of ArmExpandImm().	2020-04-22 20:46:20 +01:00
Lioncash	9f8a44c982	cast_util: Remove unnecessary typename Given we use std::aligned_storage_t, we don't need to specify typename here. If we used std::aligned_storage, then we would need to.	2020-04-22 20:46:19 +01:00
MerryMage	89e43867c1	A64: Implement FADDP (scalar)	2020-04-22 20:46:19 +01:00
MerryMage	33fa65de23	A64: Implement FADDP (vector)	2020-04-22 20:46:19 +01:00
MerryMage	9dba273a8c	A64: Implement SADDLP	2020-04-22 20:46:19 +01:00
MerryMage	70ff2d73b5	A64: Implement UADDLP	2020-04-22 20:46:19 +01:00
MerryMage	5563bbbd79	A64: Implement EXT	2020-04-22 20:46:19 +01:00
MerryMage	304cc7f61e	emit_x64_floating_point: SSE4.1 implementation for FP{Double,Single}ToFixed{S,U}{32,64}	2020-04-22 20:46:19 +01:00
MerryMage	3d9677d094	A64: Implement FCVTMU (scalar)	2020-04-22 20:46:19 +01:00
MerryMage	79c9018d60	A64: Implement FCVTMS (scalar)	2020-04-22 20:46:19 +01:00
MerryMage	49c4499a87	A64: Implement FCVTPU (scalar)	2020-04-22 20:46:19 +01:00
MerryMage	af661ef5a6	A64: Implement FCVTPS (scalar)	2020-04-22 20:46:19 +01:00
MerryMage	27319822bb	A64: Implement FCVTAU (scalar)	2020-04-22 20:46:19 +01:00
MerryMage	c0c7a26314	A64: Implement FCVTAS (scalar)	2020-04-22 20:46:19 +01:00
MerryMage	a1965a74a0	A64: Implement FCVTNU (scalar)	2020-04-22 20:46:19 +01:00
MerryMage	7d36dbcdfd	A64: Implement FCVTNS (scalar)	2020-04-22 20:46:19 +01:00
MerryMage	617ca0adf0	floating_point_conversion_integer: Refactor implementation of FCVTZS_float_int and FCVTZU_float_int	2020-04-22 20:46:19 +01:00
MerryMage	caaf36dfd6	IR: Initial implementation of FP{Double,Single}ToFixed{S,U}{32,64} This implementation just falls-back to the software floating point implementation.	2020-04-22 20:46:19 +01:00
MerryMage	760cc3ca89	EmitContext: Expose FPCR	2020-04-22 20:46:19 +01:00
MerryMage	9571269552	fp/op: Implement FPToFixed	2020-04-22 20:46:19 +01:00
MerryMage	8087e8df05	mantissa_util: Implement ResidualErrorOnRightShift Accurately calculate residual error that is shifted out	2020-04-22 20:46:19 +01:00
MerryMage	8668d61881	fp/unpacked: Implement FPRound	2020-04-22 20:46:19 +01:00
MerryMage	55d590c01f	FPCR: Add AHP setter and FZ16 getter	2020-04-22 20:46:19 +01:00
MerryMage	7360a2579b	mp: Implement metaprogramming library	2020-04-22 20:46:19 +01:00
MerryMage	4ab029c114	fp: Implement FPUnpack	2020-04-22 20:46:19 +01:00
MerryMage	4875658917	fp: Implement FPProcessException	2020-04-22 20:46:19 +01:00
MerryMage	3cb98e1560	fp: Move fp_util to fp/util	2020-04-22 20:46:19 +01:00
MerryMage	c41a38b13e	fp: Add FPSR	2020-04-22 20:46:19 +01:00
MerryMage	66381352f3	fp: Add FPInfo Provides information about floating-point format for various bit sizes	2020-04-22 20:46:19 +01:00
MerryMage	d21659152c	safe_ops: Implement safe shifting operations Implement shifting operations that perform consistently across architectures without running into undefined or implementation-defined behaviour.	2020-04-22 20:46:19 +01:00
MerryMage	b00fe23b91	bit_util: Implement MostSignificantBit	2020-04-22 20:46:19 +01:00
MerryMage	95ad0d0a66	bit_util: Use Ones to implement Bits	2020-04-22 20:46:19 +01:00
MerryMage	62b640b2fa	bit_util: Add ClearBit and ModifyBit	2020-04-22 20:46:19 +01:00
MerryMage	8651c2d10e	u128: Implement u128 For when we need a 128-bit integer	2020-04-22 20:46:19 +01:00
Lioncash	e7409fdfe4	A64: Implement UCVTF (vector, integer)'s double/single-precision variant	2020-04-22 20:46:19 +01:00
Lioncash	4aa4885ba7	ir: Add opcodes for vector conversion of u32/u64 to floating-point	2020-04-22 20:46:19 +01:00
Lioncash	fcae4e2418	simd_three_different: Deduplicate common implementations Generally, the only difference between the signed variants and the unsigned variants is whether or not we use a sign-extension or zero-extension, so we can simply use common functions to implement both cases without totally duplicating code twice here.	2020-04-22 20:46:19 +01:00
Lioncash	9c0d5cf15c	floating_point_conversion_integer: Handle S64/U64 -> F32 conversions in SCVTF_float_int and UCVTF_float_int	2020-04-22 20:46:19 +01:00
Lioncash	7a84b6e8d8	ir: Add opcodes for converting S64 and U64 to single-precision floating-point values	2020-04-22 20:46:19 +01:00
Lioncash	066061fa50	constant_pool: Remove unnecessary std::memset from constructor AllocateFromCodeSpace() already zeroes out the allocated memory.	2020-04-22 20:46:19 +01:00
Lioncash	a1d6a86e8c	A64: Implement ADDV	2020-04-22 20:46:19 +01:00
Lioncash	35026a6ce3	emit_x64_vector: Vectorize fallback path for EmitVectorMaxU32()	2020-04-22 20:46:19 +01:00
Lioncash	245c903129	simd_three_same: Join FPAbsoluteComparison() into FPCompareRegister() These are part of the same comparison family, so there's no real point in keeping them separate.	2020-04-22 20:46:19 +01:00
Lioncash	9912836b59	A64: Implement scalar double/single-precision variants of FACGE, FACGT, FCMEQ, FCMGE, FCMGT	2020-04-22 20:46:18 +01:00
MerryMage	0b97e9bd8d	emit_x64_floating_point: Fix EmitFPU64ToDouble for TowardsMinusInfinity rounding mode	2020-04-22 20:46:18 +01:00
MerryMage	a2eb9a02e0	backend_x86: Add FPSCR_RMode to EmitContext	2020-04-22 20:46:18 +01:00
MerryMage	d875c08ebf	fp: Extract common RoundingMode enum	2020-04-22 20:46:18 +01:00
Lioncash	3714bc0ed4	floating_point_conversion_integer: Use FPS64ToDouble and FPU64ToDouble in SCVTF_float_int and UCVTF_float_int The opcodes introduced in 979b6f39f1621b80bd463645ec5b08661cb6b1bf can also be used here, avoiding more falling back to the interpreter.	2020-04-22 20:46:18 +01:00
Lioncash	b97358075e	simd_scalar_two_register_misc: Handle 64-bit case in SCVTF and UCVTF's scalar double/single-precision variant Avoids falling back to the interpreter in the 64-bit case.	2020-04-22 20:46:18 +01:00
Lioncash	7252293184	emit_x64_floating_point: Correct use of UseGpr() in EmitFPU32ToDouble() and EmitFPU32ToSingle() In the non-AVX512 path, the following code is present: code.mov(from.cvt32(), from.cvt32()); since this potentially modifies 'from', we should be using UseScratchGpr() instead.	2020-04-22 20:46:18 +01:00
Lioncash	fbd7623fe5	emit_x64_floating_point: Add AVX512F conversion operations to EmitFPU32ToSingle() and EmitFPU32ToDouble() AVX-512F provides convenient instructions for these kinds of conversions directly	2020-04-22 20:46:18 +01:00
Lioncash	3a41465eaf	ir: Add opcodes for converting S64 and U64 to double-precision values	2020-04-22 20:46:18 +01:00
MerryMage	436ca80bcd	Merge branch 'global_monitor'	2020-04-22 20:46:18 +01:00
Lioncash	0f4bf26e05	simd_two_register_misc: Utilize FPVectorAbs in FABS implementations Since we already have opcodes introduced to implement FACGE and FACGT, we can reutilize it for the FABS implementations.	2020-04-22 20:46:18 +01:00
MerryMage	821cff1227	A64: Add ClearExclusiveState method	2020-04-22 20:46:18 +01:00
Lioncash	81e572c78c	ir: Extend FPVectorAbs opcode to also handle 16-bit elements for FP16	2020-04-22 20:46:18 +01:00
MerryMage	2a8de5f733	a64_emit_x64: Clear exclusive state in EmitA64CallSupervisor The kernel would have to execute an ERET instruction to return to userland; this clears exclusive state.	2020-04-22 20:46:18 +01:00
Lioncash	53dbb6a92a	A64: Implement FACGE's vector single/double precision variants	2020-04-22 20:46:18 +01:00
MerryMage	57f7c7e1b0	Implement global exclusive monitor	2020-04-22 20:46:18 +01:00
Lioncash	6912a02d9b	A64: Implement FACGT's vector single/double precision variants	2020-04-22 20:46:18 +01:00
MerryMage	85234338d3	a64_emit_x64: Simplify EmitExclusiveWrite	2020-04-22 20:46:18 +01:00
Lioncash	fc731dddae	ir: Add opcodes for performing vector absolute floating-point values This will be usable for implementing FACGE and FACGT	2020-04-22 20:46:18 +01:00
MerryMage	2fc6b33829	CMakeLists: Add missing files	2020-04-22 20:46:18 +01:00
Lioncash	0bee648b4f	emit_x64_vector: Deduplicate a bit of code in EmitVectorSetElement{8, 32, 64} functions Given both branches are the same, we can hoist out the common code.	2020-04-22 20:46:18 +01:00
Lioncash	d86fea0d28	A64: Implement FCMEQ (zero)'s vector single and double precision variant	2020-04-22 20:46:18 +01:00
Lioncash	593eca7fb1	A64: Implement load/store single structure instructions Implements LD{1, 2, 3, 4}, LD{1, 2, 3, 4}R, and ST{1, 2, 3, 4} single structure variants.	2020-04-22 20:46:18 +01:00
Lioncash	9bec354791	A64: Implement FCMEQ (register)'s vector single and double precision variant	2020-04-22 20:46:18 +01:00
Lioncash	b6e223fc58	emit_x64_vector: Deduplicate a bit of code within EmitVectorGetElement8() Given both branches use the same destination register size, we can hoist the common code out.	2020-04-22 20:46:18 +01:00
Lioncash	5ce187a54e	ir: Add opcodes for floating-point vector equalities	2020-04-22 20:46:18 +01:00
MerryMage	be354dbfd0	ir/basic_block: Add missing U16 immediate type to DumpBlock	2020-04-22 20:46:18 +01:00
Lioncash	cf188448d4	emit_x64_vector: Vectorize fallback case in EmitVectorMultiply64() Gets rid of the need to perform a fallback.	2020-04-22 20:46:18 +01:00
MerryMage	5503ff28c3	llvm_disassemble: Allow disassembly of invalid AArch64 instructions	2020-04-22 20:46:18 +01:00
Lioncash	954deff2d4	emit_x64_vector: Add break to final case in EmitVectorRoundingHalvingAddUnsigned() This doesn't alter behavior but does make the code better if anything else is ever added to this function in the future.	2020-04-22 20:46:18 +01:00
Lioncash	11a92eaaef	A64: Implement SRHADD and URHADD	2020-04-22 20:46:18 +01:00
Lioncash	9e75d08860	A64: Implement FABD's scalar single/double precision variant	2020-04-22 20:46:18 +01:00
Lioncash	bc718c5b28	ir: Add opcodes for performing rounding halving adds	2020-04-22 20:46:18 +01:00
Lioncash	d898d1779d	A64: Implement FABD's vector single/double precision variant	2020-04-22 20:46:18 +01:00
Lioncash	054549da35	emit_x64_vector: Simplify AVX-512 codepath in EmitVectorMultiply64 I realized I introduced a helper for simple AVX operation emitting, so use that instead of writing it all out long-form.	2020-04-22 20:46:18 +01:00
Lioncash	8a4f8aed06	ir: Add opcode for performing FP vector absolute differences	2020-04-22 20:46:18 +01:00
Lioncash	cb456f914b	A64: Implement UMLAL{2}, UMLSL{2}, and UMULL{2} Now that we have the helper function set up for the signed variants, we can also modify it to be used with the unsigned ones by performing a zero extension instead of a sign extension.	2020-04-22 20:46:18 +01:00
MerryMage	ba84e7a8de	A64: Implement FNMSUB	2020-04-22 20:46:18 +01:00
Lioncash	3576c02d91	A64: Implement SMLSL{2}	2020-04-22 20:46:18 +01:00
MerryMage	a1042cfcd8	A64: Implement FNMADD	2020-04-22 20:46:18 +01:00
Lioncash	ada5c0b2fa	A64: Implement SMLAL{2}	2020-04-22 20:46:18 +01:00
MerryMage	0d83032a6f	A64: Implement FMSUB	2020-04-22 20:46:18 +01:00
Lioncash	2d1aca25e6	A64: Implement SMULL{2}	2020-04-22 20:46:18 +01:00
MerryMage	69e00d225c	A64: Implement FMADD	2020-04-22 20:46:18 +01:00
MerryMage	8c90fcf58e	IR: Implement FPMulAdd	2020-04-22 20:46:18 +01:00
Lioncash	c5ae9107a9	A64: Implement SABAL/SABAL2 and SABDL/SABDL2 Now that we have a helper function for the unsigned variants, we can modify it to also be usable with the signed variants.	2020-04-22 20:46:18 +01:00
Lioncash	24e3299276	A64: Implement FCMGT, FCMGE (register) vector double and single precision variants	2020-04-22 20:46:18 +01:00
Lioncash	26d4473851	A64: Implement UABAL/UABAL2	2020-04-22 20:46:18 +01:00
Lioncash	350bc70be8	A64: Implement FCMGT, FCMGE, FCMLE, FCMLT (zero) vector double and single precision variants.	2020-04-22 20:46:18 +01:00
Lioncash	3397742c74	A64: Implement UABDL/UABDL2	2020-04-22 20:46:18 +01:00
Lioncash	c695da1cf3	ir: Add opcode for floating-point GE and GT comparisons The rest of the comparisons can be implemented in terms of these two	2020-04-22 20:46:18 +01:00
Lioncash	6de5ed96e5	emit_x64_vector: Emit VPMULLQ in EmitVectorMultiply64 on AVX-512{DQ, VL} capable CPUs Shortens code-gen down to a single instruction in the 64-bit path.	2020-04-22 20:46:18 +01:00
Lioncash	9054d1c20b	A64: Implement LDR (literal, SIMD&FP)	2020-04-22 20:46:18 +01:00
Lioncash	0da5e949a8	Correct typo in DataCacheOperation enum Fixes a typo for the InvalidateByVAToPoC enum entry. Given yuzu is the only known user of 64-bit mode and it doesn't use this value, we can get away with changing this.	2020-04-22 20:46:18 +01:00
Lioncash	9736e2cce2	A64: Implement FABS' half-precision variant	2020-04-22 20:46:18 +01:00

1142 commits