Commit graph

2934 commits

Author SHA1 Message Date
Merry
6633089a44 Squashed 'externals/fmt/' changes from 9e8b86fd2..b6f4ceaed
b6f4ceaed Update version
15f812dae Update changelog
6884aab49 Update changelog
88ec4e706 Bump version
dd3d2490e Update changelog
739055ae7 Fix apidocs
dbbd711f4 Suppress a warning
98cbb6a43 Fix ABI compatiblity issue
214cf13f1 Fix endianness bug in write_digit2_separated (#2699)
17a5c808d Restore FMT_API on error_handler::on_error() (#2696)
fc1783fcc Avoid undefined symbols with mingw-w64 (#2692)
1b193e7b3 Deprecate more
8e59744b8 Switch to new github auth mechanism
7081a6aa3 Update version
64dc8fbad Bump version
fc8e3de7d Fix manage.py script
57bee9fcd Fix formating
dce52e491 Update changelog
9405a4724 Update changelog
495b8bf12 Update changelog
e221166fa Update changelog
035cab8da Update changelog
89c6ed12b Clarify in comments (for now) deprecated map functions
e462da828 Add some noexcept (#2684)
79c66d66b Update changelog
5d37f705f Update changelog
6bb370cec Update changelog
bb6920157 Fix tuple join
4fac7daae Cleanup bit_cast
3617c2795 Update changelog
9c0c1bcdb Simplify tuple formatting
187e8db1b Update changelog
c7f88180f add tests for format string compile-time checks
8a2c3fb88 add reverse tests to compile-error-test
1164eda5a disable compile-error-test on Windows
4482f6f1f rewrite compile-error-test to use non-header-only library
796662a61 Escape range items convertible to std::string_view
33ee4cc51 Improve noexception test
3bbf2c673 Fix throw with exceptions disabled
074c9c52e Update changelog
3110ec5a2 Update changelog
3014b3d77 Clarify that C strings must be null-terminated
eab2ea9fc Replace an assert with an exception
21ed92a6e Update changelog
04111dd1e Fix issue #2670 (#2671)
817788fbf remove incorrect C++20 check from test/CMakeLists.txt (#2663)
4511030af Minor code style tweaks for consistency
7812813a3 Don't explicitly delete copy ctor of dynamic_format_arg_store (#2664)
664cd6067 Remove std-format-test
784e2a7b4 Fix an overflow when formatting very large durations
fc2a376d8 Remove two expressions which had no effect (reported by LGTM)
c5aafd8f9 expose headers as SYSTEM depending on special configuration option
eaddd1e3c Fix handling of byte
2d4457758 Try fixing byte regression
e46392ea2 deprecate _format UDL in code using FMT_DEPRECATED
c882790a2 Add a set formatter
121002d70 Add a map formatter
be51ee1ce Disable broken copy ctor of dynamic_format_arg_store
659de779e Fix a UB in parse_format_specs when begin is null
51b14b6c0 remove commented out lines
223a0fa55 move gtest-specific check into gtest/CMakeLists.txt
ef72b471f enable named arguments check in compile-time checks (#2649)
82246b876 fix throw with exceptions disabled (#2647)
35f60377a Update ChangeLog.rst
3a951a66c Avoid qualifying by inline namespace. Fixes #2642. (#2643)
e0136fc8b Qualify calls to make_wformat_args. Fixes #2639. (#2641)
ac1b5f3da Refactor problematic trailing returns in arg_mapper
fd62fba98 Don't convert scoped enums to integers
c652f8243 Make header guard consistent with header name
a9c7b9b8f Clarify that _format is deprecated
e4f0564aa Disable is_streamable for string[_view]
91533d3c3 Minor tweaks to chrono subsecond formatting
0bbc9708f Implement c++20 std::chrono::duration subsecond formatting (#2623)
9d5b9defd Enable tzset only on Windows desktop app (#2633)
215f21a03 Detect overflow on large precision
c240d98ff Optimize tm formatting (Non C-locales and %Z) (#2617)
6ab73113f Mark grow as FMT_CONSTEXPR20 (#2630)
713c7c7c6 Cleanup os.cc
9b1807a8a fix int -> uint warning (#2611)
ec3b097cb [doc] FMT_STRING supports C++14 and no-op in C++11 (#2620)
c472a2781 Fix handling of very large precision in fixed format
201971e29 Make MSVC use [[nodiscard]] (#2615)
acad8cfab Reformat all source code; no functional changes
491ba2dda Annotate fmt::format and fmt::formatted_size as [[nodiscard]]
5abe9e826 Add platform-specific 'z' formatter
be3a3a5ae Use predefined formats for C-locale
a3ab36c80 Formatting of function pointers, member function pointers, member object pointers... (#2610)
19cac63fe Broken link in README.rst
43419a4ad Workaround a bug in gcc
c089f7d49 Simplify std::tm formatter
aa5517f6b Reuse tm_writer in chrono_formatter
50140be7a Reuse tm_writer in weekday formatter
8b8945499 Improve consistency
5380ff4d8 Detect types convertible to unformattable pointers
094b66e81 changed locale retrieval way to a fancy one
b69ae4854 Reorder classes (#2591)
0b843af56 sped up chrono.h formatting for cases without providing locale (#2576)
12b1d8b14 Fix precision 0 with std::chrono::duration and added additional tests. (#2588)
e67f92c55 Cleanup warnings with nvhpc/21.9. (#2582)
812733cc9 const qualify format function for systen_clock
028f22775 Handle implicit conversions in write
5b0aa638c Minor grammar fix
6eaceb5f7 Fix incompatible between docutils 1.18.0 and sphinx 3.3.0 (#2575)
0697c5edb FMT_USE_FCNTL can be predefined (#2573)
1031eedf2 Replacing strftime with std::time_put (#2550)
90034e4c4 Add FMT_ASSERT and validation of values of struct tm members (#2564)
df40e9467 Upgrade `module-test` to msvc 16.11.5 and 17.0-pre5 (#2558)
e6d5059cb Simplify js tag in basic-bootstrap theme (#2562)
3b6e409cd Enable `consteval` for msvc 17.0-pre5 (#2559)
249f03bbb do not detect LLVM based IBMXL compiler (on ppc) as clang (#2555)
7463c8320 Fix overflow for very bigger years (>2*10^9) (#2551)
1266c2b60 Fix handling of exotic character types
684e2fdc9 Minor cleanup
a1d586302 Minor cleanup
7a604cdd9 Cleanup
aeb54b0dd Fix bug on '%Y' and '%C' formats with negative years Requested changes
f88c020fc Generalization of strftime/wcsftime function calls in tests
2eeddba75 Renaming, splitting of functions
275454608 Fix errors in ISO week-base-year formatter
218cecb6d Fix error in test
e9f4453b0 Fix Microsoft Visual Studio 14.0 build
27c3674ce Improve performance
5dc3dd3d4 New tests
f8542cd98 Unified formatters for std::chrono::time_point<std::chrono::system_clock, Duration> and std::tm
4707373d3 Fix year formatter
79c00ad8f Improve ISO week-base-year formatter
fbaaa5906 Improve week of the year formatter
cde44ddb7 Improve year formatter
b04601b91 Switch from std::strftime/std::wcsftime to internal implementation for locale independent formats
d3d30a46f New tests
7911d8d3f Add format spec checker
fbbfc3b03 Reorder formatters
509eac957 Workarounds for implementation-defined std::strftime behavior
85b38190d New tests for all C++11 std::strftime format specifiers
7aca36bca Extending fmt::join to support C++20-only ranges. (#2549)
f5371a75f locale.h -> format.h
febdef43f fix: add workaround for intel parameter pack bug
f56756986 fix: check to make sure both 'if constexpr' and return type deduction are available
dcd282bb2 Namespace qualify calls to get
9c14474d3 Include `<bit>` when using `std::bit_cast`
1e96e0176 Fix compiler flag check (#2540)
7e4bc9451 Speeding up write_significand() (#2499)
26c1ca4c3 Replaced default spec with equivalent one, which is potentially more optimizable (#2537)
1e865b353 Fix docs
4a85db1ce Change default open mode to -rw-r--r-- (#2530)
0a985fd4c Move size_ initialization to initializer list (#2529)
012cc709d Workaround gcc _Pragma bug 59884
d6590e3bd Fix compiler check
134aec40f Fix search in docs
48a476ae0 Update example (#2522)
023c2018f Don't use strlen in constexpr
800d4c8ac Refactor Windows workarounds
32865aeaa changed detection of Intel Compiler Classic to distinguish MS-Windows (#2510)
7b339795a Describe a better approach of how to use {fmt} as a depency in a Bazel project (#2516)
ae9bbe116 Suppress warning C4127 in chrono.h (conditional expression is constant) (#2518)
927dbd134 Misplaced comma in README.rst (#2515)
2a9a77dd8 Remove misplaced comment
1aee4bc90 Refactor FP formatting
e1bd6cc91 Refactor FP formatting
027fcaf05 Replace use_grisu with fallback since Grisu is only one of multiple implemented algorithms
716d69f27 Refactor FP formatting
ff7e73af6 Always run grisu_gen_digits before fallback_format
2976e31ac Refactor format_float
807ee5ec3 Disable consteval in Apple clang
d9a731d48 Add basic support for Bazel (#2505)
9c57357e0 Add `static` to a table (#2509)
2742611ca Fix formatting
5092b198b Document group_digits
b4d9d82e1 make FP formatting available to be used at compile-time (#2426)
d9fd695ac Fix wchar_t tm formatting
92614ecbf Optimize %T in tm formatting
aaeca12d8 Move FMT_MAYBE_UNUSED to format.h where it is used
3d0c7ae38 Move data to format.cc
04e3a79f7 Use memcpy in more cases in copy2
e47e99bb0 Simplify format_decimal (#2498)
9b6b0e403 Remove data
4d1c6034e Deprecate basic_data
a3348eccd Deprecate most of basic_data
3a0448148 Remove data::hex_digits
ad77331c0 Move log10_2_significand to format-inl.h
d9ebc4e82 Add a function to get sign char
c00eb4f4c Add missing inline
25af02f21 positive -> nonnegative (#2493)
67cb2dad3 Optimize %F in tm formatting
1aa98f8b9 Eliminate double copying in vformat_to_n (#2489)
a58c13382 Improve code_point_length codegen on older gcc
aeee70a81 Remove unnecessary cast
c771ba361 Fix build for the clang-10 / libstdc++-9 couple (#2491)
ab6e2272c Clarify shifts encoding
e4728409e Use throw_format_error in more places to reduce bloat
e3ebf366a Inline padding shifts
894faf3fe Refactor presentation types
4eb97fa4e Reduce code bloat
6b55c8325 is_const_formattable -> has_const_formatter
2fe94ad7e Make specifiers support in tuple_join an opt-in
3940de595 thousands -> group_digits
c4d0f96a6 Implement format specs in fmt::thousands
3b9c44268 Implement thousands separators without locales
08f98c7fa Simplify get_arg_index_by_name
a151f955a Remove FMT_OVERRIDE
42a225cbd Remove redundand final
bf20d1990 Simplify the core API
fc0884037 Move FMT_GCC_VISIBILITY_HIDDEN to format.h
1aeed2dbc Require inline namespaces
799bea473 Remove FMT_HAS_GXX_CXX11
60cd5ea3f Add support for more formattable types in ranges
4fd9a00f3 Simplify ostream interface
568156389 Cleanup ostream interface
20931baf1 Disable fallback_formatter for arrays
d58d19ba3 Fix an odr violation in ranges.h (#2483)
ee0659f8b Fix formatting of abstract classes via ostream
8029bf955 Fix copy_str performance (#2477)
2520f410c Workaround for #2478 (#2482)
ee63f5f04 Workaround to MSVC bug (#2474) (#2476)
1aaf72fb6 Add an example to fmt::runtime
c1313c205 Clarify that format_to[_n] do not append a terminating null
cb0f177c3 Improve docs
71677e520 Improve docs
4db572352 add fuzzers for chrono timepoint and localtime,gmtime (#2469)
dc7f3ef2b Fix header name
419ba86a9 Improve docs
6a5b4d5fa Document format_string
2599163b8 Document format_string
8ef22f774 Update docs
c0c4d1ada Update docs
729a44e67 Depreate strtod and remove problematic tests
74c111896 Apply force inline
596508a92 Cleanup
043e3b342 Remove static_assert from arg_mapper
8b0cb944d Fix error reporting when mixing character types
117fc6707 CI: replace g++ C++20 build to test FP formatting at compile-time
c79a3841e make detail::fp and detail::bigit constexpr
5888de9f3 make detail::make_checked() constexpr
04b4b69b1 make detail::bit_cast() constexpr with C++20
fd34a3d24 make detail::basic_memory_buffer constexpr with C++20
6d597e39c Fix overload ambiguity in arg_mapper
b9ce56d93 Improve comments
f889e52a1 Improve error reporting
34caecd6b Use consistent initialization style
a44c8f651 reimplement `formatter<tuple_join_view>` (#2457)
4b8bda25c Fix 2462
6b5e6119e set clang in one place
7af1dc1d2 fix UB in fuzzer common (memcpy on nullptr)
e77686f7a clang format
2207ea0b3 More escaping
a212ff757 Escape invalid code points
a76031e11 check -> is_printable
a7f280765 Improve naming
07d033ecb Fix is_printable
cdb4299ac Add Unicode support to is_printable
7df2c82a8 Rewrite printable.py codegen to emit C++
6cf90d7ce Add script license and fix python version
2f1ad8ed3 Add printable codegen from Rust
371d8e2ee Escape Unicode
6397095ca More escaping
f69a57253 Don't overescape wide strings
11b07a56b We should escape
b559cfd4c Implement basic escaping
11d49491c Handle global locale
6ea6cf946 Add decimal separator support to float
9730a2af0 Update ChangeLog.rst
c2ed5f686 Update ChangeLog.rst
7b66e72e2 Use builtin intrinsics on intel (#2450)
d57b2a652 Suppress a warning
bdfbd794e Cleanup begin/end usage
111de881f Don't copy non-const-iterable ranges
d6e882ed8 Undo the move because the doc is not a GH template
f488eed10 Resolve default constructor error in Xcode 7.2.1 and 8.2.1
652c3653b Move CONTRIBUTING.md to .github
fb19faa31 Improves README with svg badge (#2446)
07211701f Disable the -Wstringop-overflow warning from GCC 7 (#2442)
bba0a9d96 Make flush public
f1794a885 Switch to threadsafe death test style
0544a2279 Exclude fallback functions when FMT_BUILTIN_CLZ(LL) is not defined (#2434)
5c222f056 Add support for nonconst formattable types
3def950b8 Set FMT_CAN_MODULE=OFF for MSVC 19.29.30035+
63fe2d5bd Add copy constructor for dynamic_format_arg_store, and test
561834650 Improve digit count
f20f50368 Replace `throw` with `FMT_THROW` (#2427)
00235d8a9 fix module test odr violations (#2414)
2038bf618 Update format_to usage
e41ac1f87 Don't use deprecated API in docs
8465869d7 Move ignore_unused to detail
3d53d1539 Warning removals in test code (#2399)
20e4ef8b4 Pass significand_size by value
c4a3c2342 Refactor locale handling
7a0d30175 Update README.rst
f2b03facd Include test sources to pick up functions and classes from the module rather than from the non-modular library which is baked into the `test-main` library. (#2356)
02ad5e11d Add faint, blink, reverse and conceal to the emphases (#2394)
d141cdbeb Update version
cfc05e05f Bump version
8ea312633 Update changelog
e461f3dbb Minor consitency and comment tweaks
54014e42e silence warning C4100 on MSVC 2019 when exceptions are disabled (#2397)
3e7a29cc9 Workaround clang/gcc incompatibility
00a57a9f8 Update changelog
1d7384530 Add missing presentation type checks for std::string (#2402)
889bbf27a Fix missing std::get overload in MSVC (#2407)
5f8473914 Remove outdated apidoc
785908ee3 Fix warnings
fbb70eec5 suppress unused variable warnings (#2381)
002bb759f Remove unneeded `num_result_bigits` decrement
a3f762c5a [doc] Minor: fix ``code``.
c3c27e5ab Fix MSVC warning C4819
c6b1f181a Fix docs
94564b058 Fix docs
0fc73a2a8 Merge branch 'master' of github.com:fmtlib/fmt
3156fcf5f Switch to older breathe version
f85fb9fdf Adjust definition for FMT_HAS_INCLUDE
0bc3d664e Fix docs
e5c46e13e Fix docs: breathe 18 and earlier corrupts trailing return type
49a3b58c8 Specify size for static data arrays
d0c8d45a2 apt update before install
c9a10631c format: do not use udl_{arg,formatter} return types when UDL is not in use
3bd806f12 Eliminate intel compiler warning
fd16bcb20 Fix bug in cmake join function
5221242f6 Instruct msvc to report the _true_ value in `__cplusplus` and force _full_ C++ conformance
31a5f0d39 Bump version in inline namespace
102a4d492 Bump version in inline namespace
f68508b6c Update ChangeLog.rst

git-subtree-dir: externals/fmt
git-subtree-split: b6f4ceaed0a0a24ccf575fab6c56dd50ccf6f1a9
2022-02-15 11:15:34 +00:00
Merry
8b3bef13dc externals: Update fmt to 8.1.1
Merge commit '6633089a44b13022ddb37a44229c9d9a88a9096f'
2022-02-15 11:15:34 +00:00
merry
76ec1afdad fuzz_arm: Ensure that FPSCR.QC is tested 2022-02-12 22:07:26 +00:00
merry
b8dd1c7510 emit_x64_floating_point: Correct dead-code warning in MSVC 2019 2022-02-12 22:07:26 +00:00
merry
95a1ebfb97 backend/x64: Bugfix: A32 frontent also uses FPSCR.QC 2022-02-12 21:46:45 +00:00
merry
473bbd422e test_arm_instructions: Add vmsr/vcmp/vmrs test 2022-02-12 21:43:05 +00:00
Fernando Sahmkow
a8cbfd9af4 X86_Backend: set fences correctly for memory barriers and synchronization. 2022-02-01 14:27:54 +00:00
Alexandre Bouvier
0cafcc1af9 cmake: Always build static externals 2022-01-08 14:23:34 +00:00
Mai M
1635958d06
Merge pull request #658 from liushuyu/master
disassembler_thumb: fix formatting issues with fmt 8.1.x
2022-01-06 00:17:16 -05:00
liushuyu
40afbe1927
disassembler_thumb: fix formatting issues with fmt 8.1.x ...
... fmt 8.1.0 added more formatting checks and Cond can't be formatted
directly now
2022-01-05 21:49:51 -07:00
Wunkolo
ad5465d6ce constant_pool: Use tsl::robin_map rather than unordered_map
Finding a much more drastic improvement with `robin_map`.

`map`:
```
[master] % hyperfine -r 100 "./dynarmic_tests --durations yes"
Benchmark 1: ./dynarmic_tests --durations yes
  Time (mean ± σ):     567.0 ms ±   6.9 ms    [User: 513.1 ms, System: 53.2 ms]
  Range (min … max):   554.4 ms … 588.1 ms    100 runs
```

`unordered_map`:
```
[opt_const_pool] % hyperfine -r 100 "./dynarmic_tests --durations yes"
Benchmark 1: ./dynarmic_tests --durations yes
  Time (mean ± σ):     561.1 ms ±   4.5 ms    [User: 508.1 ms, System: 52.3 ms]
  Range (min … max):   552.6 ms … 574.2 ms    100 runs
```

`tsl::robin_map`:
```
[opt_const_pool] % hyperfine -r 100 "./dynarmic_tests --durations yes"
Benchmark 1: ./dynarmic_tests --durations yes
  Time (mean ± σ):     553.5 ms ±   5.6 ms    [User: 500.7 ms, System: 52.1 ms]
  Range (min … max):   545.7 ms … 569.3 ms    100 runs
```
2022-01-01 12:13:13 +00:00
Wunkolo
e57bb0569a constant_pool: Convert hashtype from tuple to pair 2022-01-01 12:13:13 +00:00
Wunkolo
befc22a61e constant_pool: Use unordered_map rather than map
`map` is an ordinal structure with log(n) time searches.
`unordered_map` uses O(1) average-time searches and O(n) in the worst
case where a bucket has a to a colliding hash and has to start chaining.
The unordered version should speed up our general-case when looking up
constants.

I've added a trivial order-dependent(_(0,1) and (1,0) will return a
different hash_) hash to combine a 128-bit constant into a
64-bit hash that generally will not collide, using a bit-rotate to
preserve entropy.
2022-01-01 12:13:13 +00:00
Morph
28714ee75a general: Rename files with duplicate names
In MSVC, having files with identical filenames will result into massive slowdowns when compiling.
The approach I have taken to resolve this is renaming the identically named files in frontend/(A32, A64) to (a32, a64)_filename.cpp/h
2021-12-23 11:38:58 +00:00
Andrea Pappacoda
4dcebc1822 build(cmake): add install target
This makes dynarmic installable, and also adds a CMake package config
file, that allows projects to use `find_package(dynarmic)` to import the
library.

I know #636 adds the same thing, but while experimenting with the
different install options in
https://github.com/merryhime/dynarmic/pull/636#discussion_r725656034
I ended up with a working patch, so I'm proposing this as well. This
implements solution 2.
2021-10-30 19:03:23 +01:00
Mai M
cce7e4ee5d
Merge pull request #651 from ameerj/fmt-cmake
externals/cmake: Fix fmt target check
2021-10-12 14:33:36 -04:00
ameerj
4cfbbe3df2 externals/cmake: Fix fmt target check 2021-10-11 13:44:19 -04:00
Andrea Pappacoda
b87a889d98 build(cmake): add version and soversion to the library
This adds versioning information to the built library.

When building the shared library on Linux systems, a new object will
be created: libdynarmic.so.5

This is really useful when talking about ABI compatibility.

The variables dynarmic_VERSION and dynarmic_VERSION_MAJOR
are implicitly created when calling project(dynarmic VERSION x.y.z)
2021-10-11 06:53:05 +01:00
ameerj
55bede81f8 CMake: Fix fmt target check 2021-10-11 06:52:52 +01:00
Fernando S
e4146ec3a1
x64 Interface: Allow for asynchronous invalidation (#647)
* x64 Interface: Make Invalidation asynchronous.

* Apply suggestions from code review
2021-10-05 15:06:41 +01:00
Wunkolo
5e7d2afe0f IR: Introduce VectorReduceAdd{8,16,32,64} opcode
Adds all elements of vector and puts the result into the lowest element.
Accelerates the `addv` instruction into a vectorized implementation
rather than a serial one.
2021-09-27 19:54:11 +01:00
Wunkolo
69b831d7d2 tests: Add {S,V}ADD{V,P} tests
These are the instructions emitted for each variant of the `vaddv{q}_{s}{8,16,32,64}` intrinsic.
2021-09-27 19:54:11 +01:00
Marshall Mohror
0b8fd755d8 Fix signal_stack_size for glibc 2.34
`SIGSTKSZ` is now defined as `sysconf(_SC_SIGSTKSZ)` which is not constexpr, and returns a long which throws off the `std::max` template deduction.
2021-09-22 20:38:11 +01:00
Ben
6ce8bfaf32
Add API function to retrieve dissassembly as vector of strings (#644)
Co-authored-by: ben <Avuxo@users.noreply.github.com>
2021-09-16 16:45:20 -04:00
Macchiarch
f88aa570a3
cpu_info: remove tSSE4a and tSSE5 (#643)
tSSE4a and tSSE5 have been removed from xbyak
2021-09-06 20:49:10 +01:00
merry
1697902948
Merge pull request #641 from abouvier/unbundle
CMakeLists: Add options to unbundle most external libraries
2021-08-25 07:56:12 +01:00
Alexandre Bouvier
352898e88b cmake: Add options to unbundle Zydis 2021-08-24 12:28:44 +02:00
Merry
517e35f845 decoder_detail: Avoid MSVC ICE
MSVC has an internal compiler error when assume is present in this constexpr function
2021-08-15 19:32:05 +01:00
Merry
2e4f99ae3d CMakeLists: Expose DYNARMIC_IGNORE_ASSERTS option 2021-08-15 16:09:37 +01:00
Merry
3b4459d112 CMakeLists: Enable C++20 support 2021-08-15 15:17:01 +01:00
Merry
4988d9fab3 disassembler_arm: Fix format strings for vfp_VMOV_from_i{8,16} 2021-08-15 15:16:53 +01:00
Merry
615ce8c7c5 IR: Remove A32 IR instructions Get{N,Z,V}Flag 2021-08-12 13:06:15 +01:00
Alexandre Bouvier
04b1c78166 cmake: Add checks for projects using dynarmic as subproject 2021-08-10 16:16:02 +02:00
Alexandre Bouvier
33b89cca08 cmake: Add options to unbundle some externals 2021-08-10 16:05:38 +02:00
Merry
72f8abe11d externals: Update mp to latest
Merge commit '163b59390c32745f95838b121be3ef5e2cf08e8c'
2021-08-10 12:30:46 +01:00
Merry
163b59390c Squashed 'externals/mp/' changes from 649fde1e..b50053ce
b50053ce function_info: Implement equivalent_function_type_with_class

git-subtree-dir: externals/mp
git-subtree-split: b50053cef50385419c59fb3aebb78974547318bc
2021-08-10 12:30:46 +01:00
Merry
2bc86209bd catch: Correct include directory 2021-08-08 12:52:55 +01:00
Wunkolo
1e94acff66 ir: Add VectorBroadcastElement{Lower} IR instruction
The lane-splatting variant of `FMUL` and `FMLA` is very
common in instruction streams when implementing things like
matrix multiplication. When used, they are used very densely.

https://community.arm.com/developer/ip-products/processors/b/processors-ip-blog/posts/coding-for-neon---part-3-matrix-multiplication

The way this is currently implemented is by grabbing the particular lane
into a general purpose register and then broadcasting it into a simd
register through `VectorGetElement` and `VectorBroadcast`.

```cpp
    const IR::U128 operand2 = v.ir.VectorBroadcast(esize, v.ir.VectorGetElement(esize, v.V(idxdsize, Vm), index));
```

What could be done instead is to keep it within
the vector-register and use a permute/shuffle to "splat" the particular
lane across all other lanes, removing the GPR-round-trip.

This is implemented as the new IR instruction `VectorBroadcastElement`:

```cpp
    const IR::U128 operand2 = v.ir.VectorBroadcastElement(esize, v.V(idxdsize, Vm), index);
```
2021-08-07 23:03:57 +01:00
Wunkolo
46b8cfabc0 bit_util: Protect Replicate from automatic up-casting
Recursive calls to `Replicate` beyond the first call might
cause an unintentional up-casting to an `int` type due
to `|` and `<<` operations on types such as `uint8_t` and `uint16_t`

This makes sure calls such as `Recursive<u8>` stay as the `u8` type
through-out.
2021-08-07 23:03:57 +01:00
Wunkolo
f171ce7859 tests: Add FMLA(lane) test
Math operations such as Matrix multiplication utilize these particular
instructions enough that there should be some unit tests for thesein particular.
The lane-splatting form of FMUL and FMLA instructions are of particular
interest and I've found them to be very common in retail game binaries
such as Pokemon Sword.

https://community.arm.com/developer/ip-products/processors/b/processors-ip-blog/posts/coding-for-neon---part-3-matrix-multiplication

I'm primarily adding this unit test so that I can ensure compatibility
while I tune and optimize them.
2021-08-07 23:03:57 +01:00
Merry
d41bc492fe {a32,a64}_jitstate: Remove unnecessary headers 2021-08-07 19:35:33 +01:00
Merry
07b5734fb0 xbyak: Correct xbyak include directory
xbyak is intended to be installed in /usr/local/include/xbyak.
Since we desire not to install xbyak before using it, we copy the headers
to the appropriate directory structure and use that instead
2021-08-07 15:13:49 +01:00
Merry
31cefb22a0 fuzz_with_unicorn: Correct printing of vectors 2021-08-06 15:29:43 +01:00
Merry
59fb568b27 tests: Use Zydis for disassembly 2021-08-06 15:29:43 +01:00
Wunkolo
f33bd69ec2 emit_x64_vector_floating_point: AVX512 implementation of EmitFPVectorToFixed
AVX512 introduces the _unsigned_ variant of float-to-integer conversion
functions via `vcvttp{sd}2u{dq}q`. In the case that a value is not
representable as an unsigned integer, it will result in `0xFFFFF...`
which can be utilized to get "free" saturation when the floating point
value exceeds the unsigned range, after masking away negative values.

https://www.felixcloutier.com/x86/vcvttps2udq
https://www.felixcloutier.com/x86/vcvttpd2uqq

This PR also speeds up the _signed_ conversion function for fp64->int64
https://www.felixcloutier.com/x86/vcvttpd2qq
2021-07-17 22:13:11 +01:00
SachinVin
048da372e9 block_of_code.cpp: remove redundant align() 2021-07-17 22:12:31 +01:00
Kappamalone
6ca6461450
docs/Design: Fix links (#633) 2021-07-11 19:22:46 +01:00
Merry
65309eb6bc gitignore: Update mig path 2021-07-11 11:38:43 +01:00
Wunkolo
5971361160 IR: Add AndNot{32,64} IR instruction
Also includes BMI1-acceleration for x64, when available
2021-07-02 22:27:29 +01:00
Wunkolo
49d00634f9 IR: Add VectorAndNot IR instruction
And(a, Not(b)) is a common enough operation that this can
be fused into a single `AndNot` operation. On x64 this is also
a single `pandn` instruction rather than two.
2021-07-02 22:27:29 +01:00