dynarmic/src
Lioncash e739624296 ir: Add opcodes for vector CLZ operations
We can optimize these cases further for with the use of a fair bit of
shuffling via pshufb and the use of masks, but given the uncommon use of
this instruction, I wouldn't consider it to be beneficial in terms of
amount of code to be worth it over a simple manageable naive solution
like this.

If we ever do hit a case where vectorized CLZ happens to be a
bottleneck, then we can revisit this. At least with AVX-512CD, this can
be done with a single instruction for the 32-bit word case.
2020-04-22 20:55:05 +01:00
..
backend/x64 ir: Add opcodes for vector CLZ operations 2020-04-22 20:55:05 +01:00
common ir: Add opcodes for performing unsigned reciprocal square root estimates 2020-04-22 20:55:05 +01:00
frontend ir: Add opcodes for vector CLZ operations 2020-04-22 20:55:05 +01:00
ir_opt ir/value: Replace includes with forward declarations 2020-04-22 20:55:05 +01:00
CMakeLists.txt CMakeLists: Add missing cond.h header to file listing 2020-04-22 20:55:05 +01:00