Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/auburn/fastsimd
Low level generic SIMD wrapper for x86, ARM, WASM with dynamic dispatch
https://github.com/auburn/fastsimd
arm64 avx2 avx512 simd sse2 wasm x86
Last synced: about 1 month ago
JSON representation
Low level generic SIMD wrapper for x86, ARM, WASM with dynamic dispatch
- Host: GitHub
- URL: https://github.com/auburn/fastsimd
- Owner: Auburn
- License: mit
- Created: 2017-10-11T23:20:04.000Z (about 7 years ago)
- Default Branch: master
- Last Pushed: 2024-08-27T22:28:47.000Z (4 months ago)
- Last Synced: 2024-10-31T22:52:00.102Z (about 2 months ago)
- Topics: arm64, avx2, avx512, simd, sse2, wasm, x86
- Language: C++
- Homepage:
- Size: 194 KB
- Stars: 28
- Watchers: 7
- Forks: 3
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
- Funding: .github/FUNDING.yml
- License: LICENSE
Awesome Lists containing this project
README
# FastSIMD
FastSIMD is an SIMD abstraction layer that allows easy development of SIMD code. Using the generic SIMD register types included with FastSIMD provides access to standard C++ operators and various mathematic functions. Using template parameters or a define the generic types can be compiled to various SIMD feature sets (SSE, AVX, NEON...)
FastSIMD also provides an easy to use class template for runtime SIMD feature set detection and appropriate function dispatch. The generic register types mentioned above make it easy to write code that always makes use of the highest SIMD feature set available.
The original version of FastSIMD was developed along side FastNoise2. This new "1.0" version of FastSIMD is being developed as a working standalone library, although the development is still primarily driven to support FastNoise2. Compared to the "old" version of FastSIMD supplied with FastNoise2 this "1.0" version of FastSIMD has these improvements:
- Removed all uses of C++ macros in favour of templated types/functions
- Variable sized generic register types. For example when using operators on a register of 8xInt32 when targeting SSE, the intrinsics output will get doubled up transparently.
- Moved from SIMD levels to FeatureFlags to allow more verbose specialisation of templated types and more readable code.