dynarmic

Author	SHA1	Message	Date
Lioncash	bfaeb08d3c	A64: Implement LDNP/STNP LDNP and STNP indicate that a memory access is non-temporal/streaming (i.e. unlikely to be repeated), allowing data caching to not be performed. However, given this is only a hint, we can treat these two instructions as regular LDP and STP instructions for the time being.	2020-04-22 20:58:10 +01:00
Lioncash	deb9dd4acc	block_of_code: Replace cast with [[maybe_unused]] in DoesCpuSupport()	2020-04-22 20:58:09 +01:00
Lioncash	3290a9fdc2	common: Remove address_range.h The AddressRange structure isn't used anywhere within the codebase, so this can be removed. Particularly because there's no real appeal/heavy potential use of it in the future that isn't trivial to add back if needed.	2020-04-22 20:57:38 +01:00
Lioncash	ee973f13c7	frontend/A32/ir_emitter: Mark PC() and AlignPC() as const-qualified member functions These don't modify instance state, so they can be const-qualified member functions.	2020-04-22 20:57:38 +01:00
Lioncash	3a2dd09122	frontend/A64/ir_emitter: Mark PC() and AlignPC() as const qualified member functions These don't actually alter any instance state.	2020-04-22 20:57:38 +01:00
Lioncash	575ae852a9	constant_propagation_pass: Fold byte reversal opcodes where applicable These are reasonably trivial to fold away when applicable. We just perform the swap and replace the instruction with the constant value.	2020-04-22 20:57:37 +01:00
Merry	2c53f354ab	Merge pull request #418 from lioncash/fold-op constant_propagation_pass: Handle folding for Least/MostSignificant{Bit, Byte, Half, Word} opcodes	2020-04-22 20:57:37 +01:00
Merry	ad14a33672	Merge pull request #417 from lioncash/swap common: Move byte swapping functions to bit_utils.h	2020-04-22 20:57:37 +01:00
Lioncash	d302d9bd0c	constant_propagation_pass: Handle folding for Least/MostSignificant{Bit, Byte, Half, Word} opcodes These are quite trivial to fold.	2020-04-22 20:57:37 +01:00
Lioncash	7139942976	common: Move byte swapping functions to bit_utils.h These are quite general functions, so they can just be moved into common instead of recreating a namespace here.	2020-04-22 20:57:37 +01:00
MerryMage	7c8fcaef26	emit_x64_vector_floating_point: AVX && DN implementation of EmitFPVectorMulX	2020-04-22 20:57:37 +01:00
MerryMage	e3898e628e	A64: Implement FMULX (by element), single and double precision variants	2020-04-22 20:57:37 +01:00
Lioncash	93351c7efb	a64_emit_x64: Make constness of loop elements explicit within GenFastmemFallbacks()	2020-04-22 20:57:37 +01:00
MerryMage	c106d8cedf	A64: Implement FMULX, vector single-precision and double-precision variant	2020-04-22 20:57:37 +01:00
Lioncash	7752ffc50c	a64_emit_x64: Convert std::vector instances in GenFastmemFallbacks() to std::array Given these are quite small, we can avoid the need to heap allocate here.	2020-04-22 20:57:37 +01:00
MerryMage	fa8925c4df	IR: Implement FPVectorMulX	2020-04-22 20:57:37 +01:00
Michał Janiszewski	bbd8abaa25	Provide justification for always-true condition (#412 )	2020-04-22 20:57:37 +01:00
Michał Janiszewski	7d0e918b51	Add missing include guards	2020-04-22 20:57:37 +01:00
V.Kalyuzhny	764a93bf5a	Switch boost::optional to std::optional	2020-04-22 20:57:37 +01:00
Lioncash	07c197e8d0	constant_propagation_pass: Add 64-bit variants of shifts to the pass These optimizations can also apply to the 64-bit variants of the shift opcodes; we just need to check if the instruction has an associated pseudo-op before performing the 32-bit variant's specifics. While we're at it, we can also relocate the code to its own function like the rest of the cases to keep organization consistent.	2020-04-22 20:57:37 +01:00
Lioncash	8248999c5d	constant_propagation_pass: Fold division operations where applicable We can fold division operations if: 1. The divisor is zero, then we can replace the result with zero (as this is how ARM platforms expect it). 2. Both values are known, in which case we can just do the operation and store the result 3. The divisor is 1, in which case just return the other operand.	2020-04-22 20:57:37 +01:00
Merry	d83eae2004	Merge pull request #406 from lioncash/mul constant_propagation_pass: Fold Mul32 and Mul64 cases where applicable	2020-04-22 20:57:37 +01:00
Merry	73d9393300	Merge pull request #405 from lioncash/inst a64: Add ARMv8.4+ instructions encodings to the encoding table	2020-04-22 20:57:37 +01:00
Lioncash	7ad6981437	constant_propagation_pass: deduplicate common 32/64 bit checking for results in folding functions It's common for an folding operation to apply to both the 32-bit and 64-bit variant of the same opcode, which leads to checking which kind of result we need to store the value as. This moves it to its own function, so that we don't need to duplicate it in various functions.	2020-04-22 20:57:37 +01:00
Lioncash	f1a66c37ba	a64: Add ARMv8.4+ instructions encodings to the encoding table Keeps the table up to date with the ARM specification.	2020-04-22 20:57:37 +01:00
Lioncash	72daf37208	constant_propagation_pass: Fold Mul32 and Mul64 cases where applicable Multiplication operations can currently be folded if: 1. Both arguments are known constant values 2. Either operand is zero (in which case the result is also zero) 3. Either operand is one (in which case the result is the non-one operand).	2020-04-22 20:57:37 +01:00
Lioncash	43b2eb4688	constant_propagation_pass: Fold SignExtend{Type}ToLong opcodes if possible	2020-04-22 20:57:37 +01:00
Lioncash	2da2cf9058	constant_propagation_pass: Fold SignExtend{Type}ToWord opcodes if possible	2020-04-22 20:57:37 +01:00
Lioncash	0583d401e3	ir/value: Add IsSignedImmediate() and IsUnsignedImmediate() functions to Value's interface This allows testing against arbitrary values while also simultaneously eliminating the need to check IsImmediate() all the time in expressions.	2020-04-22 20:57:37 +01:00
Lioncash	c42f6ea184	constant_propagation_pass: Fold ZeroExtend{Type}ToLong opcodes if possible These are equivalent to the ZeroExtendXToWord variants, so we can trivially do this as well.	2020-04-22 20:57:37 +01:00
Lioncash	e3258e8525	ir/value: Add a GetImmediateAsS64() function Provides a signed analogue to GetImmediateAsU64() for consistency with both integral classes when it comes to signed/unsigned..	2020-04-22 20:57:37 +01:00
Lioncash	2274214ff0	constant_propagation_pass: Combine zero-extension folding code into its own function Separates the behavior from the actual switch statement and gets rid of duplication, now that we can use the general GetImmediateAsU64() function.	2020-04-22 20:57:37 +01:00
Lioncash	4a3c064b15	ir/value: Add an IsZero() member function to Value's interface By far, one of the most common things to check for is whether or not a value is zero, as it typically allows folding away unnecesary operations (other close contenders that can help with eliding operations are 1 and -1). So instead of requiring a check for an immediate and then actually retrieving the integral value and checking it, we can wrap it within a function to make it more convenient.	2020-04-22 20:57:37 +01:00
Merry	c649f11c0a	Merge pull request #401 from lioncash/folding constant_propagation_pass: Fold &, \|, ^, and ~ operations where applicable	2020-04-22 20:56:01 +01:00
MerryMage	2524d536b0	A32/ir_emitter: Bugfix: ExceptionRaised was producing incorrect PC Use actual PC and not pipelined PC.	2020-04-22 20:56:01 +01:00
Lioncash	c09f4cf28e	constant_propagation_pass: Fold NOT operations	2020-04-22 20:55:50 +01:00
Lioncash	d69fceec55	value: Move ImmediateToU64() to be a part of Value's interface This'll make it slightly nicer to do basic constant folding for 32-bit and 64-bit variants of the same IR opcode type. By that, I mean it's possible to inspect immediate values without a bunch of conditional checks beforehand to verify that it's possible to call GetU32() or GetU64, etc.	2020-04-22 20:55:50 +01:00
Lioncash	8013548bbb	constant_propagation_pass: Fold OR operations	2020-04-22 20:55:50 +01:00
MerryMage	ca603c1215	reg_alloc: Emit AVX instructions where able Smaller codesize.	2020-04-22 20:55:50 +01:00
Lioncash	898d096e39	constant_propagation_pass: Fold AND operations	2020-04-22 20:55:50 +01:00
MerryMage	e2358af5ef	abi: Emit AVX instructions where able Smaller codesize.	2020-04-22 20:55:50 +01:00
Lioncash	f40fcda1f6	ir/value: Add member function to check whether or not all bits of a contained value are set This is useful when we wish to know if a contained value is something like 0xFFFFFFFF, as this helps perform constant folding. For example the operation: x & 0xFFFFFFFF can be folded to just x in the 32-bit case.	2020-04-22 20:55:50 +01:00
MerryMage	7c0378f56d	a64_exclusive_monitor: Loosen memory ordering requirements It is not necessary to be as strict as it was.	2020-04-22 20:55:50 +01:00
Lioncash	0ea99b7d59	constant_propagation_pass: Fold EOR operations It's possible to fold cases of exclusive OR operations if they can be known to be an identity operation, or if both operands happen to be known immediates, in which case we can just store the result of the exclusive-OR directly.	2020-04-22 20:55:50 +01:00
MerryMage	f0920c0ded	Fix VShift terminology An arithmetic shift is by definition a signed shift, and a logical shift is by definition an unsigned shift. - Rename VectorLogicalVShiftS* -> VectorArithmeticVShift* - Rename VectorLogicalVShiftU* -> VectorLogicalVShift*	2020-04-22 20:55:50 +01:00
MerryMage	b51dae790d	emit_x64_vector: AVX512 implementation of EmitVectorLogicalVShiftS16	2020-04-22 20:55:50 +01:00
MerryMage	bd47f2ca8f	emit_x64_vector: AVX512 implementation of EmitVectorLogicalVShiftS64	2020-04-22 20:55:50 +01:00
MerryMage	3bf183d7e8	emit_x64_vector: AVX2 implementation of EmitVectorLogicalVShiftS32	2020-04-22 20:55:50 +01:00
MerryMage	94f9d402eb	emit_x64_vector: AVX512 implementation of EmitVectorLogicalVShiftU16()	2020-04-22 20:55:50 +01:00
MerryMage	6d9639e3b0	emit_x64_vector: AVX2 implementation of EmitVectorLogicalVShiftU64()	2020-04-22 20:55:50 +01:00

1 2 3 4 5 ...

1457 commits