Calculate with error-free, faithful, and compensated transforms and extended significands.
This release is the first to be compatible with Julia v1.6
Changes:
Closed issues:
Merged pull requests:
sum_mixed
& dot_mixed
: mixed-precision implementations of summation and dot product for 32-bit numbers (#20)four_sum
(#18, #21)Merged pull requests:
four_sum
(#21) (@JeffreySarnoff)This release introduces a @fusible
counterpart to SIMDops.@explicit
, which allows using SIMD instructions from SIMDPirates.jl
in all cases, whether exact instructions are wanted, or fused operations are allowed to happen. EFTs now use this when possible. The API for @explicit
changed: it now affects the following expressions (instead of the whole scope inside which it is placed).
This release also adds support for cache prefetching, which significantly improves performance for large vectors. This was actually the explanation for the performance discrepancies observed for large vectors and mentioned in #7. Default values for the cache prefetching mechanism are probably not optimal for all architectures. Users interested in the last 10% performance outside the cache are invited to customize this parameter.
Closed issues:
Merged pull requests:
This version adjusts the SIMDPirates compatibility bounds. Hopefully it fixes issue gh-11 once and for all.
Closed issues:
Merged pull requests:
This version fixes the broken compatibility bounds affecting v0.3.1 (which is currently uninstallable, see gh-11).
Closed issues:
Merged pull requests:
Closed issues:
Merged pull requests: