dynarmic

Author	SHA1	Message	Date
Lioncash	25b4e463d3	ir_opt/a64_get_set_elimination_pass: Remove redundant return This lambda function has a void return type, so we don't need to explicitly return at the end of it.	2020-04-22 21:04:22 +01:00
Lioncash	182ceb2807	General: Make parameter names from declarations and implementations consistent Most of the time when this occurs, it's a bug. Thankfully this isn't the case. However, we can resolve these cases to make the codebase more consistent.	2020-04-22 21:04:22 +01:00
MerryMage	3513ed1c60	CMakeLists: Define FMT_USE_USER_DEFINED_LITERALS=0 This disables a fmtlib feature that depends on a non-standard feature for its implementation.	2020-04-22 21:04:22 +01:00
Lioncash	b301fcd520	A32/translate/translate: Add missing doxygen parameter string	2020-04-22 21:04:22 +01:00
Lioncash	44b61212e5	Revert "CMakeLists: Handle DYNARMIC_NO_BUNDLED_FMT in relation to export()" I was being silly. This isn't required. This reverts commit 00b79cbb72c61744470e0aa1a96b673702b33931.	2020-04-22 21:04:22 +01:00
Lioncash	6b9bf7868a	General: Correct typos in code comments	2020-04-22 21:04:22 +01:00
Lioncash	acd7ac5ed3	CMakeLists: Handle DYNARMIC_NO_BUNDLED_FMT in relation to export() This is pretty gross, but until DYNARMIC_NO_BUNDLED_FMT is eliminated, this fixes the use of it in existing libraries or applications making use of dynarmic.	2020-04-22 21:04:22 +01:00
Lioncash	6187de7ca7	a32_interface: std::move UserConfig where applicable UserConfig instances contain up to 16 std::shared_ptr<Coprocessor> instances. We can std::move here to avoid performing 16 redundant atomic reference increment and decrement operations. Mostly inconsequential on x64, but we may as well signify intent.	2020-04-22 21:04:22 +01:00
MerryMage	7d20f3b861	A32/translate_thumb: Split off implementation into thumb16 and thumb32	2020-04-22 21:04:22 +01:00
Lioncash	b79ce71b0f	ir/basic_block: std::move Terminal within SetTerminal and ReplaceTerminal A terminal isn't a trivial type (and boost::variant is allowed to heap allocate), so we can std::move it here to avoid a redundant copy.	2020-04-22 21:04:22 +01:00
MerryMage	e639aa1583	A32/translate: Rename translate_arm directory to impl Mirror what the A64 frontend does.	2020-04-22 21:04:22 +01:00
Lioncash	63eff4e7cc	ir/terminal: std::move constructor parameters where applicable Allows the compiler to choose the most suitable code in this scenario, given a Terminal isn't a trivial type.	2020-04-22 21:04:22 +01:00
Lioncash	b13b6610b5	a32_interface: Default destructor in the cpp file Makes it more consistent with code throughout the codebase.	2020-04-22 21:04:22 +01:00
MerryMage	5f8eb7c51c	A32/location_descriptor: Add CPSR.IT to A32::LocationDescriptor	2020-04-22 21:04:22 +01:00
MerryMage	13f65f55eb	PSR: Use Common::ModifyBit{,s}	2020-04-22 21:04:22 +01:00
MerryMage	74633301c1	A32: Add ITState	2020-04-22 21:04:22 +01:00
MerryMage	6e2cd35e4f	a32_jitstate: Optimize runtime location descriptor calculation Calculation is now one unaligned 64-bit load.	2020-04-22 21:04:22 +01:00
MerryMage	0de3993373	a32_jitstate: Remove fpsr_idc We do not really have accurate FPSR state in any case.	2020-04-22 21:04:22 +01:00
MerryMage	6f49c0ef8e	{a32,a64}_jitstate: Rename CPSR_* to cpsr_*	2020-04-22 21:04:22 +01:00
MerryMage	8cd7837839	a32_jitstate: Remove old_FPSCR	2020-04-22 21:04:22 +01:00
MerryMage	b3bb544bca	a32_jitstate: Rename FPSCR_nzcv to fpsr_nzcv	2020-04-22 21:04:22 +01:00
MerryMage	76f986979d	a32_jitstate: Rename FPSCR_mode to fpcr_mode	2020-04-22 21:04:22 +01:00
MerryMage	49fca15f90	{a32,a64}_jitstate: Rename FPSCR_IDC to fpsr_idc	2020-04-22 21:04:21 +01:00
MerryMage	622c02f537	{a32,a64}_jitstate: Remove FPSCR_UFC	2020-04-22 21:04:21 +01:00
MerryMage	366d63f4b4	a32_jitstate: Enable SSE FTZ and DAZ	2020-04-22 21:04:21 +01:00
MerryMage	f178562ee7	a32_jitstate: Remove exception trap enables from FPSCR_MODE_MASK We don't currently use this for anything (we do not currently trap floating point exceptions). This frees these bits up for other purposes.	2020-04-22 21:04:21 +01:00
Merry	fd6222f0a1	Merge pull request #500 from lioncash/cbz A32: Implement Thumb-1's CBZ/CBNZ instructions	2020-04-22 21:04:21 +01:00
Merry	bab4e29075	Merge pull request #498 from lioncash/ahp A32/location_descriptor: Add AHP bit to the FPSCR mask	2020-04-22 21:04:21 +01:00
Lioncash	2d695e3c7c	common/fp/info: Make formatting of FPInfo struct member functions consistent Organizes the functions to all be consistent with the half-precision specialization.	2020-04-22 21:04:21 +01:00
Lioncash	05b330906e	common/fp/util: Make ProcessNaN utility functions constexpr Nothing in particular prevents these from being constexpr. Do so to make them consistent with the bulk of other functions in this header that are constexpr.	2020-04-22 21:04:21 +01:00
Lioncash	6b9a40bdc4	common/fp/op/FPNeg: Make FPNeg constexpr Negation in (standard IEEE) floating-point is simply flipping the sign-bit, so this operation will never be more complex than what is presented here, making constexpr a reasonable allowance.	2020-04-22 21:04:21 +01:00
Lioncash	ef95e0fa7d	CMakeLists: Add FPNeg.h to the library target sources Ensures that the header shows up in IDE generated projects.	2020-04-22 21:04:21 +01:00
Lioncash	87083af733	general: Remove trailing spaces General code-related cleanup. Gets rid of trailing spaces in the codebase.	2020-04-22 21:04:21 +01:00
Lioncash	fdbafbc1ae	x64/reg_alloc: Remove reference qualifier to variable in GetArgumentInfo() The result of GetArg() is returned by value, so this is essentially still a copy. While the previous code is valid, this communicates what is actually happening a little more explicitly.	2020-04-22 21:04:20 +01:00
Lioncash	1f6878fb46	ir_opt/verification_pass: Add include for std::puts Ensures that the header dependency is always satisfied directly, and not through other project headers. While we're at it, we can qualify the call with the std:: namespace.	2020-04-22 21:03:38 +01:00
Lioncash	fc9c59d056	ir_opt/verification_pass: Eliminate redundant GetArg() Given the same argument is used inside the condition's body if it's true, we can just utilize the local to cut out a GetArg() operation. Avoids redundant internal assertion checking.	2020-04-22 21:03:35 +01:00
Lioncash	03e6899fd7	A32: Implement Thumb-1's CBZ/CBNZ instructions Introduced in ARMv6T2, this allows for short forward branches.	2020-04-22 21:02:47 +01:00
Lioncash	d02a4e6fc9	A32/location_descriptor: Add AHP bit to the FPSCR mask Ensures the alternate half-precision state is preserved within the location descriptors, which will be necessary when implementing the half-precision extensions for VFP and NEON.	2020-04-22 21:02:47 +01:00
Merry	f4990a5f6b	Merge pull request #499 from lioncash/movw A32: Implement ARM-mode MOVW	2020-04-22 21:02:47 +01:00
Lioncash	bd755ae494	frontend/ir/ir_emitter: Add A32 equivalent to A64's SetCheckBit This will be used in a subsequent change to implement ARMv6T2's CBZ/CBNZ Thumb-1 instructions.	2020-04-22 21:02:47 +01:00
Lioncash	c6e1fd1416	a64_emit_x64: Use const on locals where applicable Normalizes the use of const in the source file.	2020-04-22 21:02:47 +01:00
Merry	f6f0b6da65	Merge pull request #497 from lioncash/boost A32/coprocessor: Remove boost from public interface	2020-04-22 21:02:47 +01:00
Lioncash	106c8c2473	A32: Implement ARM-mode MOVW Introduced to the ISA in ARMv6T2	2020-04-22 21:02:47 +01:00
Lioncash	fb437080be	a32_emit_x64: Use const on locals where applicable Normalizes the use of const in the source file.	2020-04-22 21:02:47 +01:00
Lioncash	9935f3aa28	A32: Implement Thumb-1 variant of SEVL While we're at it, also add the Thumb-2 encoding to the encoding table to make sure it isn't forgotten about in the future.	2020-04-22 21:02:47 +01:00
Lioncash	5f9ba970b9	emit_x64: Use const on locals where applicable	2020-04-22 21:02:47 +01:00
Lioncash	92daae9513	A32/coprocessor: Remove boost from public interface Removes a boost header from the public includes in favor of using the standard-provided std::variant. The use of boost in public interfaces is often a dealbreaker for some people. Given we use std::optional in the header already, we can transition over to std::variant from boost::variant. With this removal, this makes all of our dependencies internal to the library itself.	2020-04-22 21:02:47 +01:00
Lioncash	9a097e307f	A32: Implement the ARM-mode variant of SEVL	2020-04-22 21:02:47 +01:00
Lioncash	a40b921cb5	emit_x64: Remove unnecessary typename in GetBasicBlock() This can be deduced from the name alone.	2020-04-22 21:02:47 +01:00
Lioncash	e89ca42048	A32: Implement Thumb-1 variant of YIELD	2020-04-22 21:02:47 +01:00
Lioncash	675f67e41d	emit_x64_vector: Use const on locals where applicable Normalizes the use of const in the source file.	2020-04-22 21:02:47 +01:00
Lioncash	ebab7ede55	A32: Implement Thumb-1 variant of WFI	2020-04-22 21:02:47 +01:00
Lioncash	cccbc7fd0e	emit_x64_saturation: Use const on locals where applicable Normalizes the use of const in the source file.	2020-04-22 21:02:47 +01:00
Lioncash	b4110af22a	A32: Implement Thumb-1 variant of WFE	2020-04-22 21:02:47 +01:00
Lioncash	7316fa47b3	emit_x64_packed: Use const on locals where applicable Normalizes the use of const across the source file.	2020-04-22 21:02:47 +01:00
Lioncash	3a9c2f81d0	block_of_code: Use variable template variants of type traits Now all type traits are using the variable template variants where applicable.	2020-04-22 21:02:47 +01:00
Lioncash	57675fe592	A32: Implement Thumb-1 variant of SEV	2020-04-22 21:02:47 +01:00
Lioncash	9b783a5527	emit_x64_data_processing: Use const on locals where applicable Normalizes the use of const across the source file.	2020-04-22 21:02:47 +01:00
Lioncash	07699b47ba	A32/translate_thumb: Add helper function for raising exceptions Similar to the variant within the ARM-mode translator visitor. This will be used in subsequent changes to implement the hint instructions introduced in ARMv7.	2020-04-22 21:02:47 +01:00
Lioncash	f74762ae4e	frontend/decoder/decoder_detail: Replace std::is_same with std::is_same_v Same thing, same readability, fewer characters.	2020-04-22 21:02:47 +01:00
Lioncash	64879396f6	A32: Implement Thumb-1 variant of NOP	2020-04-22 21:02:47 +01:00
Merry	6a67da1225	Merge pull request #493 from lioncash/ir frontend/ir/ir_emitter: Remove unnecessary logical shift overloads	2020-04-22 21:02:47 +01:00
Merry	81b908b077	Merge pull request #495 from lioncash/bkpt A32: Implement Thumb-16's variant of BKPT	2020-04-22 21:02:47 +01:00
Merry	30d28029a8	Merge pull request #492 from lioncash/vfp A32: Rename vfp2-related files to vfp	2020-04-22 21:02:47 +01:00
Merry	c4fb7cf540	Merge pull request #494 from lioncash/pldw A32: Handle PLDW	2020-04-22 21:02:47 +01:00
Lioncash	b17a5d3365	A32: Implement Thumb-16's variant of BKPT	2020-04-22 21:02:47 +01:00
Lioncash	b902f72001	A32/disassembler_arm: Remove <unimplemented> from hint instruction output Given we now support hooking these hint instructions, we can consider them implemented.	2020-04-22 21:02:47 +01:00
Lioncash	0fa0bca22a	A32: Handle different variants of PLD	2020-04-22 21:02:47 +01:00
Lioncash	c6f99235e1	frontend/ir/ir_emitter: Remove unnecessary logical shift overloads These aren't necessary anymore, now that the U32U64 overload already exists.	2020-04-22 21:02:46 +01:00
Merry	9ba503e394	Merge pull request #491 from lioncash/hint A32: Allow hooking of hint instructions in ARM mode.	2020-04-22 21:02:46 +01:00
Lioncash	97277c598b	A32: Rename vfp2-related files to vfp Now that we fuzz against Unicorn, we aren't just restricted to VFPv2. VFPv3 and VFPv4 facilities can now be implemented. This renames constructs mentioning VFPv2 to just refer to VFP.	2020-04-22 21:02:46 +01:00
Lioncash	8c3122ff46	A64/translate/impl/impl: Mark locals const where applicable in DecodeBitMasks() Follows the convention of making immutable state explicit.	2020-04-22 21:02:46 +01:00
Merry	a132b56d57	Merge pull request #490 from lioncash/crc32 A32: Implement ARM-mode CRC32 instructions	2020-04-22 21:02:46 +01:00
Lioncash	966e04d03d	A32: Allow hooking of hint instructions in ARM mode. Mirrors the hooking functionality from the AArch64 frontend to make the behavior of both consistent.	2020-04-22 21:02:46 +01:00
Lioncash	134b586c5c	frontend/ir/ir_emitter: Amend arguments to conversion opcodes Accidentally caused within 967d1fcc8d6f60749a162a96b997439450fed687. That one's on me. My bad.	2020-04-22 21:02:46 +01:00
MerryMage	5debb411cc	block_of_code: Explicitly delete copy constructor	2020-04-22 21:02:46 +01:00
Lioncash	e37689315d	A32: Implement ARM-mode CRC32 instructions Implements the ARM-mode variants of the CRC32 instructions introduced within ARMv8. This is also one of the instruction cases where there is UNPREDICTABLE behavior that is constrained (we must do one of the options indicated by the reference manual). In both documented cases of constrained unpredictable behavior, we treat the instructions as unpredictable in order to allow library users to hook the unpredictable exception to provide the intended behavior they desire.	2020-04-22 21:02:46 +01:00
Lioncash	95d9baea67	{A32, A64}/types: Use std::array deduction guides where applicable We also make the arrays static here, as MSVC tends to load the whole array every time the function is called, instead of storing the data within rodata. This also line breaks the elements a little earlier for readability.	2020-04-22 21:02:46 +01:00
MerryMage	605a43d23e	Suppress MSVC warning C4702: unreachable code	2020-04-22 21:02:46 +01:00
Lioncash	bac945f2d8	A32: Resolve parameter discrepancies discovered via use of the Imm template	2020-04-22 21:02:46 +01:00
Lioncash	e4c65721fe	frontend/ir/type: Generify std::array declaration With deduction guides, we can eliminate the need to explicitly size the array. Also newlines the elements based off their relation, making it slightly nicer to read.	2020-04-22 21:02:46 +01:00
Lioncash	4ba2318b2e	A32: Replace immediate type aliases with the Imm template Replaces type aliases of raw integral types with the more type-safe Imm template, like how the AArch64 frontend has been using it. This makes the two frontends more consistent with one another.	2020-04-22 21:02:46 +01:00
Lioncash	f01dc9192a	CMakeLists: Add a namespace to the export Avoids potentially dumping boost, fmt, and xbyak targets into a top-level namespace without any qualification, which can lead to build errors in projects that already make use of them.	2020-04-22 21:02:46 +01:00
Lioncash	f96036b3f1	A32/barrier: Correct PC assignment within ISB The SetRegister() IR function doesn't allow specifying the PC as a register. This is a discrepancy that slipped through (my bad). Instead, we can use BranchWritePC(), like how the other similar PC modifying locations do it.	2020-04-22 21:02:46 +01:00
Lioncash	196e7b5e35	frontend/A32/ir_emitter: Mark locals as const where applicable Makes const usage consistent within the source file.	2020-04-22 21:02:46 +01:00
Lioncash	511613c736	frontend/A32/types: Use helper function in operator+ overload Allows deduplicating an assert and a cast.	2020-04-22 21:02:46 +01:00
Lioncash	8103652a91	frontend: Move imm.h to the top-level directory of the frontends Preparation to utilize the immediate type within the A32 backend as well, which will allow eliminating numerous type aliases like Imm4, Imm5, etc.	2020-04-22 21:02:46 +01:00
Lioncash	796bb8a7f7	frontend/A64/types: Make RegNumber() and VecNumber() constexpr Given they simply perform casting, they can be safely made constexpr.	2020-04-22 21:02:46 +01:00
Lioncash	64e51a6d4d	A32/disassembler_arm: Mark utility functions as static where applicable These don't depend on class state and can be marked static to make that explicit.	2020-04-22 21:02:46 +01:00
Lioncash	0c43228ad5	frontend/A64/types: Use helper functions in operator+ overloads Allows us to get rid of another explicit cast.	2020-04-22 21:02:46 +01:00
Lioncash	a1cace21a9	frontend/ir/ir_emitter: Apply const to locals where applicable Makes const usage consistent with all other functions in the source file.	2020-04-22 21:02:46 +01:00
Lioncash	0a35836998	frontend/ir/ir_emitter: Use switch constructs in floating point opcodes where applicable This'll reduce the amount of noise necessary in changes implementing half-precision instructions, as the type can just be prepended to the switch cases, instead of rewriting the whole if/else branch.	2020-04-22 21:02:46 +01:00
Lioncash	8316d231e9	A32: Implement barrier instructions introduced in ARMv7 Provides basic implementations of the barrier instructions introduced within ARMv7. Currently these simply mirror the behavior of the AArch64 equivalents.	2020-04-22 21:02:46 +01:00
Lioncash	7fc3bd689d	A32: Implement ARM-mode MLS	2020-04-22 21:02:46 +01:00
Lioncash	8b338b7def	A32: Implement ARM-mode MOVT	2020-04-22 21:02:46 +01:00
Lioncash	877fa0f8c3	A32: Implement ARM-mode SBFX	2020-04-22 21:02:46 +01:00
Lioncash	47218ee65d	A32: Implement ARM-mode UBFX	2020-04-22 21:02:46 +01:00
Lioncash	2970b34e3c	A32: Implement ARM-mode BFI	2020-04-22 21:02:46 +01:00
Lioncash	fab3a59e05	A32: Implement ARM-mode BFC	2020-04-22 21:02:46 +01:00
Lioncash	7305d13221	A32: Implement ARM-mode RBIT	2020-04-22 21:02:46 +01:00
Lioncash	b2f7a0e7ba	A32: Implement ARM-mode SDIV/UDIV Now that we have Unicorn in place, we can freely implement instructions introduced in newer versions of the ARM architecture.	2020-04-22 21:02:46 +01:00
Lioncash	c0ae23bbb7	A32/translate_thumb: Clean up formatting Performs a similar tidying up of the Thumb translator, like what was done with the regular ARM translator to make it consistent with the rest of the codebase. The A32 backend (both Thumb and ARM) will likely see more changes to it in the near future, so this just acts as a "dusting off".	2020-04-22 21:02:46 +01:00
Merry	837c23a8ec	Merge pull request #483 from lioncash/invert frontend/ir/cond: Remove unused invert() function	2020-04-22 21:02:46 +01:00
Lioncash	d12e375481	common/fp/op/FPConvert: Remove unnecessary casts in FPConvert() These were made unnecessary in 2c2fdb435cf8e358a0c5b907ce8131e434df3f22, but were missed during the initial removal.	2020-04-22 21:02:46 +01:00
Merry	09ee64ea98	Merge pull request #482 from lioncash/fixedfp A64: Handle half-precision variants of FP->Fixed instructions	2020-04-22 21:02:45 +01:00
MerryMage	1e1e9c17c7	emit_x64_data_processing: Remove INVALID_REG INVALID_REG.cvt8() now throws	2020-04-22 21:02:45 +01:00
Lioncash	06ec6ab0da	frontend/ir/cond: Remove unused invert() function This is no longer used by anything in the codebase, so it can be removed.	2020-04-22 21:01:46 +01:00
Merry	d71f51b0da	Merge pull request #481 from lioncash/alloc ir/basic_block: Forward declare headers where applicable	2020-04-22 21:01:46 +01:00
Lioncash	64e3d233f4	A64: Handle half-precision variants of FP->Fixed-point instructions	2020-04-22 21:01:45 +01:00
Lioncash	4fc531f71b	ir/basic_block: Forward declare headers where applicable Now that the constructor and destructors have been placed within the cpp file, we can forward declare the memory pool data structures. Now, a change to the memory pool code won't ripple across the entirety of the IR emitter.	2020-04-22 21:01:45 +01:00
Lioncash	427b7afd66	frontend/ir/microinstruction: Add missing fixed-point opcodes to ReadsFromAndWritesToFPSRCumulativeExceptionBits()	2020-04-22 21:01:45 +01:00
Lioncash	c9777ef997	common/fp/info: Make half-precision info struct functions return correctly sized types While initially done to potentially prevent creating bugs due to C++ having a silly type-promotion mechanism involving types < sizeof(int) and unsignedness, given that the bulk of these functions' usages are on exit paths, these can return the correct type to avoid the need to cast at every usage point.	2020-04-22 21:01:45 +01:00
Lioncash	9309d95b17	ir/block: Default ctor and dtor in the cpp file Prevents potentially inlining allocation code everywhere. While we're at it, also explicitly delete/default the copy/move constructor/assignment operators to be explicit about them.	2020-04-22 21:01:45 +01:00
Lioncash	604f39f00a	frontend/ir_emitter: Add half-precision->fixed-point opcodes	2020-04-22 21:01:45 +01:00
Lioncash	4ecfbc14de	common/fp/op/FPToFixed: Add half-precision specialization of FPToFixed	2020-04-22 21:01:45 +01:00
Lioncash	471eb77bc9	A64: Implement FRSQRTS' half-precision vector variant	2020-04-22 21:01:45 +01:00
Lioncash	f9b2862217	A64: Implement FRSQRTS' half-precision scalar variant With the necessary machinery in place, we can now handle the half-precision variant.	2020-04-22 21:01:45 +01:00
Lioncash	96356fac93	frontend/ir_emitter: Add half-precision opcode variant of FPVectorRSqrtStepFused	2020-04-22 21:01:45 +01:00
Merry	45864133f5	Merge pull request #478 from lioncash/stepfused A64: Handle half-precision variants of FRECPE and FRECPS	2020-04-22 21:01:44 +01:00
Lioncash	824c551ba2	frontend/ir_emitter: Add half-precision opcode variant of FPRSqrtStepFused	2020-04-22 21:01:44 +01:00
Lioncash	3739d92097	A64: Implement half-precision vector variant of FRECPE	2020-04-22 21:01:44 +01:00
Lioncash	e3b2eb57b5	common/fp/op/FPRSqrtStepFused: Add half-precision specialization for FPRSqrtStepFused	2020-04-22 21:01:44 +01:00
Lioncash	7b212ec8ae	A64: Implement half-precision variant of FRSQRTE's vector variant	2020-04-22 21:01:44 +01:00
Lioncash	0945a491bd	A64: Implement half-precision scalar variant of FRECPE	2020-04-22 21:01:44 +01:00
Lioncash	77c84bcf9b	A64: Implement half-precision variant of FRSQRTE's scalar variant	2020-04-22 21:01:44 +01:00
Lioncash	86b7626a2f	A64: Implement half-precision vector variant of FRECPS	2020-04-22 21:01:44 +01:00
Lioncash	037acb17b9	frontend/ir_emitter: Add half-precision opcode variant for FPVectorRSqrtEstimate	2020-04-22 21:01:44 +01:00
Lioncash	de43f011a7	A64: Implement half-precision scalar variant of FRECPS	2020-04-22 21:01:44 +01:00
Lioncash	5dba99b4f4	frontend/ir_emitter: Add half-precision opcode variant for FPRSqrtEstimate	2020-04-22 21:01:44 +01:00
Lioncash	825a3ea16f	frontend/ir_emitter: Add half-precision opcode for FPVectorRecipEstimate	2020-04-22 21:01:44 +01:00
Lioncash	726b9914c5	common/fp/op/FPRSqrtEstimate: Add half-precision specialization for FPRSqrtEstimate	2020-04-22 21:01:44 +01:00
Lioncash	2184d24e8f	frontend/ir_emitter: Add half-precision opcode for FPRecipEstimate	2020-04-22 21:01:44 +01:00
Lioncash	af2e5afed6	common/fp/op: Add half-precision specialization for FPRecipEstimate	2020-04-22 21:01:44 +01:00
Lioncash	d7f394fc1a	A64: Enable half-precision vector FRINT* variants	2020-04-22 21:01:44 +01:00
Lioncash	5d5c9f149f	frontend/ir_emitter: Add half-precision opcode for FPVectorRecipStepFused	2020-04-22 21:01:44 +01:00
Lioncash	24f583c498	A64: Enable half-precision variants of floating-point FRINT* variants With all the backing machinery in place, we can remove the fallback check for half-precision.	2020-04-22 21:01:44 +01:00
Lioncash	6da0411111	frontend/ir_emitter: Add half-precision opcode for FPRecipStepFused	2020-04-22 21:01:44 +01:00
Lioncash	fb829b9525	frontend/microinstruction: Add FPVectorRoundInt types to ReadsFromAndWritesToFPSRCumulativeExceptionBits() All variants were previously missing from this.	2020-04-22 21:01:44 +01:00
Lioncash	68d8cd2b13	common/fp/op: Add half-precision specialization for FPRecipStepFused	2020-04-22 21:01:44 +01:00
Lioncash	5b4673da4b	frontend/ir_emitter: Add half-precision variant of FPVectorRoundInt	2020-04-22 21:01:44 +01:00
Lioncash	ad0c698f89	frontend/ir_emitter: Add half-precision variant of FPRoundInt	2020-04-22 21:01:44 +01:00
Lioncash	61cec94a19	fp/op/FPRoundInt: Add half-precision specialization of FPRoundInt	2020-04-22 21:01:44 +01:00
Merry	cb9a1b18b6	Merge pull request #475 from lioncash/muladd A64: Enable half-precision variants of floating-point multiply-add instructions	2020-04-22 21:01:44 +01:00
Merry	d6db7ad46c	Merge pull request #474 from lioncash/bracing load_store_*: Make bracing consistent and variables const where applicable	2020-04-22 21:01:44 +01:00
Merry	1b6520f5dd	A64/location_descriptor: Ensure FZ16 is included in the FPCR mask	2020-04-22 21:01:44 +01:00
Merry	13f421c27d	Merge pull request #473 from lioncash/sqshlu A64: Implement SQSHLU	2020-04-22 21:01:44 +01:00
Lioncash	b5bf890584	load_store_*: Make bracing consistent and variables const where applicable Makes bracing consistent, and variables const where applicable to be consistent with the rest of the codebase. In most bracing cases, they'd need to be added to conditionals that would involve checking stack pointer alignment in the future anyways.	2020-04-22 21:01:44 +01:00
Lioncash	9a58c3f1c7	A64: Implement FMLA/FMLS' half-precision vector indexed variants	2020-04-22 21:01:44 +01:00
Merry	d7da53a74b	Merge pull request #472 from lioncash/exception general: Mark hash functions as noexcept	2020-04-22 21:01:44 +01:00
Lioncash	9dcc04e106	A64: Implement SQSHLU's scalar variant	2020-04-22 21:01:44 +01:00
Merry	b91c6c8bae	Merge pull request #471 from lioncash/sqrdmulh A64: Implement SQRDMULH's scalar vector variant	2020-04-22 21:01:44 +01:00
Lioncash	1fdd3ef8a0	A64: Implement FMLA/FMLS' half-precision scalar indexed variants	2020-04-22 21:01:44 +01:00
Lioncash	2d59d10ac8	A64: Implement SQSHLU's vector variant The vector shift by immediate category is now fully implemented.	2020-04-22 21:01:44 +01:00
Merry	b5e25959d9	Merge pull request #470 from lioncash/assert general: Replace unreachable-imitating assertions with UNREACHABLE()	2020-04-22 21:01:44 +01:00
Lioncash	d6606deda2	A64: Implement half-precision vector variants of FMLA/FMLS	2020-04-22 21:01:44 +01:00
Lioncash	a4cadf1cd9	frontend/ir_emitter: Add opcodes for signed saturated left shifts with unsigned saturation	2020-04-22 21:01:44 +01:00
Lioncash	ec6b3ae084	ir/frontend: Add half-precision opcode for FPVectorMulAdd	2020-04-22 21:01:44 +01:00
Lioncash	5f74d25bf7	A64: Enable half-precision floating point variants of FP data-processing three register instructions This handles half-precision floating point for: - FMADD - FMSUB - FNMADD - FNMSUB	2020-04-22 21:01:44 +01:00
Lioncash	bd82513199	frontend/ir_emitter: Add half-precision opcode for FPMulAdd	2020-04-22 21:01:44 +01:00
Lioncash	79a892d23c	fp/op/FPMulAdd: Add half-precision floating-point specialization	2020-04-22 21:01:44 +01:00
Lioncash	7bb5440507	general: Mark hash functions as noexcept Generally hash functions shouldn't throw exceptions. It's also a requirement for the standard library-provided hash functions to not throw exceptions. An exception to this rule is made for user-defined specializations, however we can just be consistent with the standard library on this to allow it to play nicer with it. While we're at it, we can also make the std::less specializations noexcept as well, since they also can't throw.	2020-04-22 21:01:43 +01:00
Lioncash	3b46b4a37d	A64: Implement SQRDMULH's scalar vector variant Implements the scalar variant in terms of the vector variant for the time being.	2020-04-22 21:01:43 +01:00
Lioncash	fe95575b95	general: Replace unreachable-imitating assertions with UNREACHABLE() We can just use the self-documenting assertion for indicating unreachable paths, instead of manually passing false and providing a message.	2020-04-22 21:01:43 +01:00
Merry	4a3d808354	Merge pull request #468 from lioncash/const ir_opt: Mark locals as const where applicable	2020-04-22 21:01:43 +01:00
Lioncash	64de80839e	A64/impl: Reorganize peculiar void use in V_scalar To a reader this might look particularly strange, given the function itself has a void return value, but this is actually valid, given the function in the return statement also has a void return value. This instead alters it to be a little easier to parse and potentially be a little less confusing at a glance.	2020-04-22 21:01:43 +01:00
Merry	9a4e3b24e4	Merge pull request #467 from lioncash/reserved A64: Handle reserved instruction cases more specifically where applicable	2020-04-22 21:01:43 +01:00
Merry	0b794cbcea	Merge pull request #466 from lioncash/fcmla A64: Implement FCMLA's indexed element variant	2020-04-22 21:01:43 +01:00
Merry	994349d154	Merge pull request #465 from neobrain/master CMakeLists: Allow importing dynarmic build trees into other CMake projects	2020-04-22 21:01:43 +01:00
Lioncash	cfd7513a7d	ir_opt/verification_pass: Mark locals as const where applicable Makes our immutable state a little more explicit.	2020-04-22 21:01:40 +01:00
Lioncash	8309d49588	A64: Handle reserved instruction cases more specifically where applicable These are cases that are defined as reserved within the ARMv8 reference manual, so we can handle them as such instead of as unallocated encodings. While this doesn't actually change emulated behavior, it does at least allow the JIT to generate the more appropriate exception.	2020-04-22 21:00:47 +01:00
Lioncash	6c2c68bce6	A64: Implement FCMLA's indexed element variant With this, all of the instructions introduced with ARMv8.3-CompNum have an implementation.	2020-04-22 21:00:47 +01:00
Tony Wasserka	7d99a6c00f	CMakeLists: Allow importing dynarmic build trees into other CMake projects	2020-04-22 21:00:47 +01:00
Lioncash	1a45f35b9c	ir_opt/a64_callback_config_pass: Mark locals as const where applicable Makes our immutable state a little more explicit.	2020-04-22 21:00:47 +01:00
Lioncash	7bc7042104	simd_scalar_shift_by_immediate: Change UnallocatedEncoding() path in SaturatingShiftLeft to ReservedValue() Strictly speaking, immh being zero is defined as reserved in the ARMv8 reference manual. This was just an error on my part when introducing the SQSHL immediate scalar variant.	2020-04-22 21:00:47 +01:00
Lioncash	dc97977576	ir_opt/a32_get_set_elimination_pass: Mark local variables as const where applicable Makes our intended immutable state slightly more explicit.	2020-04-22 21:00:47 +01:00
Lioncash	b1b4487e4d	A64: Implement UQSHL (immediate)'s scalar variant Like SQSHL's immediate scalar variant, we can also implement UQSHL's immediate scalar variant in terms of the vector variant for the time being.	2020-04-22 21:00:47 +01:00
Lioncash	3649dc6d9a	A64: Implement scalar variant of SQSHL (immediate) This can be handled in terms of the vector variant for the time being.	2020-04-22 21:00:47 +01:00
Lioncash	7d535eaba6	ir_opt/a32_constant_memory_reads_pass: Apply const where applicable to locals Makes immutable state just slightly more explicit.	2020-04-22 21:00:47 +01:00
Lioncash	e1b4ff1068	simd_scalar_shift_by_immediate: Migrate SQSHL implementation to file-scope function This will allow it to be reused for the implementation of UQSHL.	2020-04-22 21:00:47 +01:00
Lioncash	b37279f65c	backend/x64/emit_x64_vector: Prevent undefined behavior within VectorSignedSaturatedShiftLeft Avoids undefined behavior by potentially left-shifting a signed negative value.	2020-04-22 21:00:47 +01:00
Lioncash	46eae8cf2f	common/fp/op/FPRecipExponent: Prevent undefined behavior from shifting a negative value Due to promotion rules (types < int, even if unsigned, get promoted to int when arithmetic is performed on them), this is a potential spot for undefined behavior.	2020-04-22 21:00:47 +01:00
MerryMage	13e8b7b516	emit_x64_floating_point: F16C implementation of FPSingleToHalf	2020-04-22 20:58:17 +01:00
MerryMage	d32d6fe598	emit_x64_floating_point: F16C implementation of FPHalfToSingle and FPHalfToDouble	2020-04-22 20:58:12 +01:00
MerryMage	a53ba12be2	emit_x64_floating_point: Factor out ConvertRoundingModeToX64Immediate	2020-04-22 20:58:12 +01:00
MerryMage	5a2adc6629	backend/x64: Expose FPCR in EmitContext instead of its subcomponents	2020-04-22 20:58:12 +01:00
Merry	01bb1cdd88	Merge pull request #458 from lioncash/float-op A64: Handle half-precision floating point in FABS, FNEG, and scalar FMOV	2020-04-22 20:58:12 +01:00
Lioncash	28a8b4d210	A64: Handle half-precision floating point in scalar FMOV This is simply performing a scalar value transfer between registers without conversions, so this is trivial to handle as-is.	2020-04-22 20:58:12 +01:00
Lioncash	d7ac5a664f	A64: Handle half-precision floating point in FCVTL Like FCVTN, now that we have half-precision floating point conversion functions available, we can go ahead and use those to eliminate the interpreter fallback.	2020-04-22 20:58:12 +01:00
Lioncash	fe84ecb780	A64: Handle half-precision floating point in scalar FABS Now that we have the half-precision variant of the opcode added, we can simply handle the instruction instead of treating it as undefined.	2020-04-22 20:58:12 +01:00
Lioncash	fac9224d5e	A64: Handle half-precision floating point in FCVTN Now that we have IR instructions for performing conversions with half-precision floating point, we can also handle half-precision values within FCVTN.	2020-04-22 20:58:12 +01:00
Lioncash	8309ec7a9f	frontend/ir_emitter: Add half-precision variant of FPAbs	2020-04-22 20:58:12 +01:00
Lioncash	16de99d3e3	A64: Enable FCVT floating-point conversions for half-precision With this, we no longer have to fall back to the interpreter in any of the FCVT floating-point conversion instructions.	2020-04-22 20:58:12 +01:00
Lioncash	10abc77fad	A64: Handle half-precision floating point in scalar FNEG With the half-precision variant of the FPNeg opcode added, we can utilize it here to emulate the half-precision variant of FNEG.	2020-04-22 20:58:12 +01:00
Lioncash	e4c259d69f	frontend/ir_emitter: Add half->{single, double} and {double, single}->half conversion opcodes	2020-04-22 20:58:12 +01:00
Lioncash	c97efcb978	frontend/ir_emitter: Add half-precision variant of FPNeg	2020-04-22 20:58:12 +01:00
Lioncash	dff5da1063	common/fp/unpacked: Amend behavior of FPUnpackCV This is supposed to call FPUnpackBase instead of FPUnpack. This would result in alternate half-precision representations being misinterpreted when it comes to dealing with NaNs.	2020-04-22 20:58:12 +01:00
Merry	f01afc5ae6	Merge pull request #456 from lioncash/mov A64: Enable FMOV (general) for half-precision floating point	2020-04-22 20:58:12 +01:00
Lioncash	03bc2334fe	common/fp/op/FPConvert: Amend off-by-one in double NaN case in FPConvertNaN Avoids potentially clobbering the intended sign bit value during conversions to double-precision values. The other conversion types are already properly handled, so those don't need to be addressed.	2020-04-22 20:58:12 +01:00
Lioncash	c57b146fb2	common/fp/op/FPConvert: Add half-precision instantiations to FPConvert	2020-04-22 20:58:12 +01:00
Merry	c1ce94872d	Merge pull request #455 from lioncash/sqrdmulh-scalar A64: Implement SQRDMULH and SQDMULL's scalar indexed variants	2020-04-22 20:58:11 +01:00
Lioncash	25a7256ee1	A64: Enable FMOV (general) for half-precision floating point This just transfers values between vector registers and general-purpose registers with no conversions performed, so this is trivial to add support for half-precision to.	2020-04-22 20:58:11 +01:00
Lioncash	97dd3d0596	A64: Implement SQRDMULH's scalar indexed element variant	2020-04-22 20:58:11 +01:00
Lioncash	49b51e34f1	simd_vector_x_indexed_element: Deduplicate index and Vm operand construction	2020-04-22 20:58:11 +01:00
Lioncash	692aba91b6	A64: Implement SQDMULL{2}'s scalar indexed element variant	2020-04-22 20:58:11 +01:00
Lioncash	c043b831d5	A64: Implement SQDMULL{2}'s by-element variant	2020-04-22 20:58:11 +01:00
Lioncash	72af5a3dff	simd_scalar_x_indexed_element: Factor out index and Vm argument construction This will be useful in the implementations of SQRDMULH and SQDMULL{2} as well.	2020-04-22 20:58:11 +01:00
Lioncash	224ff0afaa	A64: Implement SQRDMULH's by-index vector variant	2020-04-22 20:58:11 +01:00
Lioncash	3a3542414b	A64: Implement FRECPX's half-precision floating point variant	2020-04-22 20:58:11 +01:00
Lioncash	bd892ec4ef	frontend/ir/ir_emitter: Amend FPRecipExponent to handle half-precision floating point	2020-04-22 20:58:11 +01:00
Lioncash	974fbf0677	frontend/ir/value: Add U16U32U64 type to represent floating point types	2020-04-22 20:58:11 +01:00
Lioncash	eb3e0d5908	common/fp/op/FPRecipExponent: Add half-precision floating point specialization	2020-04-22 20:58:11 +01:00
Lioncash	a829c93406	common/fp/unpacked: Correct edge-cases within FPUnpack for half-precision floating point This corrects one case where floating-point exceptions could be set when they're not supposed to be. This also corrects a case where values were being treated as NaNs when they weren't supposed to be.	2020-04-22 20:58:11 +01:00
Lioncash	7030b9af95	common/fp/process_nan: Add half-precision instantiations for NaN processing functions	2020-04-22 20:58:11 +01:00
Lioncash	14f55d7476	common/fp/unpacked: Add half-precision instantiation of FPRoundBase	2020-04-22 20:58:11 +01:00
Lioncash	7e814de445	common/fp/unpacked: Handle half-precision unpacking in FPUnpackBase	2020-04-22 20:58:11 +01:00
Lioncash	8f9fe8690a	common/fp/unpacked: Adjust FPUnpack to operate like ARM pseudocode This function is defined as always disabling the AHP bit in the fpcr before performing any operations. At the same time, rename the original FPUnpack function to FPUnpackBase to match the pseudocode in the ARM reference manual.	2020-04-22 20:58:11 +01:00
Merry	37c4c39d62	Merge pull request #448 from lioncash/saturate A64: Implement SQSHRN, SQSHRUN, and UQSHRN's scalar variants	2020-04-22 20:58:11 +01:00
Merry	f5d774bdbd	Merge pull request #449 from lioncash/hp common/fp/info: Add specialization of FPInfo for half-precision floating point	2020-04-22 20:58:11 +01:00
Lioncash	126c29a9e9	A64: Implement SQSHRN, SQSHRUN, and UQSHRN's scalar variants These can just be implemented in terms of the vector variants for the time being.	2020-04-22 20:58:11 +01:00
Lioncash	0b67b94b6c	common/fp/info: Add specialization of FPInfo for half-precision floating point Puts the necessary info struct in place for further use.	2020-04-22 20:58:11 +01:00
Lioncash	dd7433f9d3	A64: Amend prototypes of some SIMD scalar shift by immediate opcodes These take a vector for a destination.	2020-04-22 20:58:11 +01:00
Lioncash	99c494bae9	common/fp/unpacked: Add FPRoundCV Corresponds to the equivalent pseudocode within the ARMv8 reference manual. This will be necessary for supporting half-precision floating-point. This also makes use of it within FPConvert.	2020-04-22 20:58:11 +01:00
Merry	bbd5330ad2	Merge pull request #447 from lioncash/flag A64: Implement CFINV, RMIF, AXFlag and XAFlag	2020-04-22 20:58:11 +01:00
Lioncash	490bebbd9a	common/fp/unpacked: Add FPUnpackCV Adds a template function that performs the same behavior as in the ARM pseudocode, and utilizes it in FPConvert, which will be necessary for half-float support.	2020-04-22 20:58:11 +01:00
Merry	fb039e232c	Merge pull request #442 from lioncash/fcvtxn A64: Implement scalar and vector variants of FCVTXN	2020-04-22 20:58:11 +01:00
Lioncash	6aed4036ef	ir_opt/a64_get_set_elimination_pass: Add handling for NZCV raw get and set operations	2020-04-22 20:58:11 +01:00
Merry	4f937c1ee1	Merge pull request #446 from lioncash/sqshl A64: Implement scalar variants of SQSHL (register) and UQSHL (register)	2020-04-22 20:58:11 +01:00
Lioncash	aa22db534b	A64: Implement AXFlag and XAFlag	2020-04-22 20:58:11 +01:00
Merry	d74cccbc84	Merge pull request #445 from lioncash/sqrt A64: Implement single and double-precision vector variant of FSQRT	2020-04-22 20:58:11 +01:00
Lioncash	20ffe568d0	A64: Implement RMIF	2020-04-22 20:58:11 +01:00
Merry	6d7e7c3269	Merge pull request #443 from lioncash/flag A64: Rearrange flag format/manipulation instructions	2020-04-22 20:58:11 +01:00
Lioncash	51b526e453	A64: Implement CFINV	2020-04-22 20:58:11 +01:00
Merry	5d01f1b462	Merge pull request #441 from lioncash/constexpr common/bit_util: Mark a few functions as constexpr	2020-04-22 20:58:11 +01:00
Lioncash	597a8be5d5	ir: Add A64-specific opcodes for getting and setting raw NZCV values This will be necessary to implement the flag manipulation and flag format instructions.	2020-04-22 20:58:11 +01:00
Merry	743c52fdc5	Merge pull request #440 from lioncash/include common/fp: Remove unnecessary includes	2020-04-22 20:58:11 +01:00
Lioncash	d3515279df	A64: Implement the vector version of FCVTXN	2020-04-22 20:58:10 +01:00
Lioncash	17aea0b997	A64: Implement UQSHL (register)'s scalar variant This can be implemented in terms of the vector variant.	2020-04-22 20:58:10 +01:00
Lioncash	c99d4b762e	A64: Implement single and double-precision vector variant of FSQRT	2020-04-22 20:58:10 +01:00
Lioncash	54e0b487f3	A64: Rearrange flag format/manipulation instructions Gives these instructions better categorical labeling.	2020-04-22 20:58:10 +01:00
Lioncash	88d1977cb9	common/bit_util: Mark a few functions as constexpr These four functions can be made constexpr with no issue.	2020-04-22 20:58:10 +01:00
Lioncash	f33e5939b7	common/fp: Remove unnecessary includes	2020-04-22 20:58:10 +01:00
Lioncash	302f56b36a	A64: Fall back to interpreting for FCADD and FCMLA half-precision variants Rather than straight-up treating them as undefined, we can fall back to an interpreter in this case.	2020-04-22 20:58:10 +01:00
Lioncash	4339a8fff6	A64: Implement the scalar version of FCVTXN	2020-04-22 20:58:10 +01:00
Lioncash	35ddf68ad5	A64: Implement SQSHL (register)'s scalar variant We can implement this in terms of the vector variant.	2020-04-22 20:58:10 +01:00
Lioncash	5cf1478620	frontend/ir: Add opcodes for vector square roots	2020-04-22 20:58:10 +01:00
Lioncash	36027ebef5	frontend/ir/microinstruction: Add missing cases for FPRecipExponent{32,64} for ReadsFromAndWritesToFPSRCumulativeExceptionBits() This was intended to be added within #437, but was missed	2020-04-22 20:58:10 +01:00
Merry	40b081438a	Merge pull request #439 from lioncash/fcmla A64: Implement FCADD and FCMLA	2020-04-22 20:58:10 +01:00
Lioncash	7c81a58ed3	frontend/ir/ir_emitter: Alter parameters of FPDoubleToSingle() and FPSingleToDouble() to pass along desired rounding mode This will be necessary to special-case the non-IEEE Von Neumann rounding to odd rounding mode.	2020-04-22 20:58:10 +01:00
Merry	d91192681a	Merge pull request #438 from lioncash/fmulx A64: Implement scalar double/single precision FMULX (by element)	2020-04-22 20:58:10 +01:00
Lioncash	ed29ef8cca	A64: Implement FCMLA	2020-04-22 20:58:10 +01:00

1932 commits