Projects in Awesome Lists tagged with simd-instructions
A curated list of projects in awesome lists tagged with simd-instructions .
https://github.com/google/highway
Performance-portable, length-agnostic SIMD with runtime dispatch
avx avx-512 avx-instructions avx2 avx512 intrinsics neon simd simd-instructions simd-intrinsics simd-library simd-parallelism simd-programming sse42 wasm
Last synced: 14 May 2025
https://github.com/xtensor-stack/xsimd
C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE))
avx avx512 c-plus-plus-11 cpp mathematical-functions neon simd simd-instructions simd-intrinsics sse sve vectorization
Last synced: 03 Oct 2025
https://github.com/lakshayg/tensorflow-build-archived
TensorFlow binaries supporting AVX, FMA, SSE
machine-learning simd-instructions tensorflow
Last synced: 28 Sep 2025
https://github.com/vcdevel/vc
SIMD Vector Classes for C++
avx avx2 avx512 c-plus-plus cpp cpp11 cpp14 cpp17 data-parallel neon parallel parallel-computing portable simd simd-instructions simd-programming simd-vector sse vectorization
Last synced: 14 Apr 2025
https://github.com/VcDevel/Vc
SIMD Vector Classes for C++
avx avx2 avx512 c-plus-plus cpp cpp11 cpp14 cpp17 data-parallel neon parallel parallel-computing portable simd simd-instructions simd-programming simd-vector sse vectorization
Last synced: 15 Mar 2025
https://github.com/ashvardanian/simsimd
Up to 200x Faster Dot Products & Similarity Metrics — for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for both AVX2, AVX-512, NEON, SVE, & SVE2 📐
arm-neon arm-sve assembly avx2 avx512 bfloat16 blas blas-libraries distance-calculation float16 information-retrieval metrics neon numpy scipy simd simd-instructions similarity-measures similarity-search vector-search
Last synced: 13 May 2025
https://github.com/ashvardanian/SimSIMD
Up to 200x Faster Dot Products & Similarity Metrics — for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for both AVX2, AVX-512, NEON, SVE, & SVE2 📐
arm-neon arm-sve assembly avx2 avx512 bfloat16 blas blas-libraries distance-calculation float16 information-retrieval metrics neon numpy scipy simd simd-instructions similarity-measures similarity-search vector-search
Last synced: 23 Mar 2025
https://github.com/gnuradio/volk
The Vector Optimized Library of Kernels
sdr simd simd-instructions simd-programming
Last synced: 14 May 2025
https://github.com/fast-pack/simdcomp
A simple C library for compressing lists of integers using binary packing
c compression simd simd-instructions
Last synced: 18 Dec 2025
https://github.com/lemire/SIMDCompressionAndIntersection
A C++ library to compress and intersect sorted lists of integers using SIMD instructions
algorithms compression integer-compression intersection simd simd-instructions
Last synced: 20 Apr 2025
https://github.com/fast-pack/simdcompressionandintersection
A C++ library to compress and intersect sorted lists of integers using SIMD instructions
algorithms compression integer-compression intersection simd simd-instructions
Last synced: 05 Apr 2025
https://github.com/fast-pack/SIMDCompressionAndIntersection
A C++ library to compress and intersect sorted lists of integers using SIMD instructions
algorithms compression integer-compression intersection simd simd-instructions
Last synced: 15 Mar 2025
https://github.com/agenium-scale/nsimd
Agenium Scale vectorization library for CPUs and GPUs
aarch64 avx avx2 avx512 cpp20 cpp20-library cuda hpc neon neon128 rocm simd simd-instructions simd-library simd-programming sse2 sse42 sve vectorization-library
Last synced: 09 Apr 2025
https://github.com/fast-pack/MaskedVByte
Fast decoder for VByte-compressed integers
compression integer-compression simd simd-instructions vbyte vbyte-compressed-integers
Last synced: 10 May 2025
https://github.com/fast-pack/maskedvbyte
Fast decoder for VByte-compressed integers
compression integer-compression simd simd-instructions vbyte vbyte-compressed-integers
Last synced: 21 Aug 2025
https://github.com/lemire/simdxorshift
Fast random number generators: Vectorized (SIMD) version of xorshift128+
prng simd simd-instructions xorshift
Last synced: 16 Mar 2025
https://github.com/fast-pack/dictionary
High-performance dictionary coding
integer-compression simd simd-instructions
Last synced: 11 Jul 2025
https://github.com/cloudflare/sliceslice-rs
A fast implementation of single-pattern substring search using SIMD acceleration.
avx2 search-in-text simd simd-instructions simd-programming substring-search text-processing
Last synced: 09 Apr 2025
https://github.com/edanor/umesimd
UME::SIMD A library for explicit simd vectorization.
altivec avx avx2 avx512 benchmark code-generation cpp cpp11 cpp14 cpp17 instruction-set-architecture neon performance-tuning scalar-types simd simd-instructions simd-programming ume vector vectorization
Last synced: 15 Apr 2025
https://github.com/lsp-plugins/lsp-dsp-lib
DSP library for signal processing
aarch64 algorithms architectures armv7 assembly convolution-algorithms dsp dsp-library fft fma3 lsp-dsp-lib processing-algorithms simd simd-instructions simd-library x86-32 x86-64
Last synced: 14 May 2025
https://github.com/bgin/radar-electrooptical-simulation
(REOS) Radar and Electro-Optical Simulation Framework written in C++.
amd-gpu atmosphere-model avx avx2 avx512 control-theory cuda-kernels fortran90 gpu-acceleration high-performance-computing infrared-sensors modelling radar radar-signal-processing radiative-transfer simd-instructions simulation vectorization
Last synced: 10 Apr 2025
https://github.com/realtimechris/jsonifier
A few classes for parsing and serializing objects from/into JSON, in C++ - very rapidly.
cpp jasonparser json json-parsing json-parsing-library json-simd jsonifier parsing serialization simd-instructions simd-json
Last synced: 20 Aug 2025
https://github.com/badamczewski/simpleintrinsics
This project aims to rename all C# intrinsic names to their more compact C/C++ counterparts that the industry uses.
dotnet dotnet-core intrinsics simd simd-instructions
Last synced: 01 May 2025
https://github.com/lemire/fastdifferentialcoding
Fast differential coding functions (using SIMD instructions)
compressed integer-compression prefix-sum simd simd-instructions
Last synced: 21 Mar 2025
https://github.com/nulidangxueshen/ALBUS
A Method for efficiently processing SpMV using SIMD and load balancing
albus csr load-balancing simd simd-instructions sparse-matrix-vector-multiplication spmv
Last synced: 21 Apr 2025
https://github.com/lemire/vectorclass
Random number generator for large applications using vector instructions
performance prng simd simd-instructions
Last synced: 18 Jun 2025
https://github.com/aminya/minijson
Minify JSON files fast! Supports Comments. Uses D, C, and AVX2 and SSE4_1 SIMD.
avx avx2 benchmark build-tool dlang dlanguage javascript json minify minify-javascript minify-json minifying nodejs simd simd-instructions sse sse41 sse42
Last synced: 01 Aug 2025
https://github.com/m3y54m/sobel-simd-opencv
Using SIMD instructions in image processing using OpenCV
cpp intel-intrinsics opencv simd-instructions sobel-edge-detector
Last synced: 14 Mar 2025
https://github.com/z1skgr/simd-instruction-mpi-pthreads-parallism
Parallelism standards for accelerating performance on calculations for detection of positive DNA selection
accelerated-computing intel intel-intrinsics linux memory-layout mpi parallel-programming pthreads simd-instructions sse
Last synced: 16 Mar 2025
https://github.com/12acorns/portfolio-simdextensions
A, Source-Generated, library to add easier processing of SIMD instructions whilst maintaing a performance expected for each platform.
csharp csharp-lib csharp-libarary csharp-library simd simd-instructions simd-intrinsics simd-library simd-vector source-gen source-generated source-generation source-generator
Last synced: 23 Jul 2025
https://github.com/rsusik/cf2
Approximate pattern matching with Counting Filter on q-grams using SSE instructions (CF2)
algorithm approximate approximate-pattern-matching counting-filter dna dna-inversion dna-sequences dna-translocation edit-distance filtration hamming levenshtein matching pattern research simd simd-instructions sse text-matching text-search
Last synced: 15 Jun 2025
https://github.com/randomhashtags/swift-intrinsics
Unlock SIMD intrinsics for Swift.
intrinsics simd simd-instructions simd-intrinsics swift
Last synced: 28 Dec 2025
https://github.com/jacek13/findprimes
A program with a graphical interface designed to search for prime numbers. The application uses vector instructions (SIMD) from the x64 assembler level.
assembly cpp dear-imgui sdl2 simd simd-instructions threads visual-studio x64-assembly
Last synced: 15 May 2025
https://github.com/hunyadi/simdparse
High-speed parser with vector instructions
avx2-instructions datetime-parser parser-library simd-instructions uuid-parser
Last synced: 05 Jul 2025
https://github.com/korbolkoinc/uuids
High performance C++ uuid generator
aes clang clang-format clang-tidy cmake cpp cpp20 hardware-acceleration random-generation rdseed simd simd-instructions simd-intrinsics uuid uuid-generator uuid-v4 uuids uuidv4
Last synced: 12 Apr 2025
https://github.com/ndoll1998/fairpt
A fairly optimized cpu-only path tracer
bvh bvh-tree pathtracer pathtracing raytracer raytracing realist rendering simd simd-instructions
Last synced: 26 Feb 2025
https://github.com/bsgbryan/roc
A thoroughly-modern real-time simulation engine
assemblyscript bun entity-component-system game-dev game-development game-engine gamedev simd simd-instructions simd-intrinsics simd-programming simulation typescript webassembly webgpu
Last synced: 08 Oct 2025
https://github.com/mtumilowicz/java17-mesi-false-sharing-processor-optimisations-workshop
Introduction to cache coherence: false sharing, MESI protocol and vectorization
cache-coherence cache-coherency cache-invalidation cache-line cache-line-padding false-sharing jmh jmh-benchmarks mesi mesi-protocol multi-core-architectures processor-architecture simd simd-instructions simd-programming vectorization workshop workshop-materials writeback writethrough
Last synced: 23 Feb 2025