Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists tagged with simd
A curated list of projects in awesome lists tagged with simd .
https://github.com/Tencent/ncnn
ncnn is a high-performance neural network inference framework optimized for the mobile platform
android arm-neon artificial-intelligence caffe darknet deep-learning high-preformance inference ios keras mlir mxnet ncnn neural-network onnx pytorch riscv simd tensorflow vulkan
Last synced: 25 Oct 2024
https://github.com/tencent/ncnn
ncnn is a high-performance neural network inference framework optimized for the mobile platform
android arm-neon artificial-intelligence caffe darknet deep-learning high-preformance inference ios keras mlir mxnet ncnn neural-network onnx pytorch riscv simd tensorflow vulkan
Last synced: 16 Dec 2024
https://github.com/simdjson/simdjson
Parsing gigabytes of JSON per second : used by Facebook/Meta Velox, the Node.js runtime, ClickHouse, WatermelonDB, Apache Doris, Milvus, StarRocks
aarch64 arm arm64 avx2 avx512 c-plus-plus clang clang-cl cpp11 gcc-compiler json json-parser json-pointer loongarch neon simd sse42 vs2019 x64
Last synced: 16 Dec 2024
https://github.com/questdb/questdb
QuestDB is an open source time-series database for fast ingest and SQL queries
analytics big-data cpp database financial-analysis grafana hacktoberfest iot java low-latency postgres postgresql questdb simd sql time-series time-series-database tsdb
Last synced: 16 Dec 2024
https://github.com/openwall/john
John the Ripper jumbo - advanced offline password cracker, which supports hundreds of hash and cipher types, and runs on many operating systems, CPUs, GPUs, and even some FPGAs
assembler c cracker crypt fpga gpgpu gpu hash john jtr mpi opencl openmp password ripper simd
Last synced: 16 Dec 2024
https://github.com/g-truc/glm
OpenGL Mathematics (GLM)
cpp cpp-library glm header-only mathematics matrix opengl quaternion simd sycl vector vulkan
Last synced: 29 Sep 2024
https://github.com/unity-technologies/entitycomponentsystemsamples
auto-vectorisation auto-vectorization burst component containers csharp documentation ecs entity high jobs multicore multicore-processors multicore-programming native performance simd system tutorials unity3d
Last synced: 17 Dec 2024
https://github.com/Unity-Technologies/EntityComponentSystemSamples
auto-vectorisation auto-vectorization burst component containers csharp documentation ecs entity high jobs multicore multicore-processors multicore-programming native performance simd system tutorials unity3d
Last synced: 14 Nov 2024
https://github.com/bytedance/sonic
A blazingly fast JSON serializing & deserializing library
high-performance jit json simd
Last synced: 16 Dec 2024
https://github.com/google/highway
Performance-portable, length-agnostic SIMD with runtime dispatch
avx avx-512 avx-instructions avx2 avx512 intrinsics neon simd simd-instructions simd-intrinsics simd-library simd-parallelism simd-programming sse42 wasm
Last synced: 16 Dec 2024
https://github.com/ARM-software/ComputeLibrary
The Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologies.
aarch64 android arm armv7 armv8 computer-vision cpp linux machine-learning neon neural-network opencl simd sve
Last synced: 26 Oct 2024
https://github.com/turbo/js
turbo.js - perform massive parallel computations in your browser with GPGPU.
calculations glsl gpgpu gpu parallel shaders simd vector
Last synced: 26 Sep 2024
https://github.com/hora-search/hora
๐ efficient approximate nearest neighbor search algorithm collections library written in Rust ๐ฆ .
algorithm approximate-nearest-neighbor-search artificial-intelligence data-structures high-performance hnsw image-search k-nearest-neighbors machine-learning neural-network numeric recommender-system rust rust-sci search-engine simd similarity-search vector-search
Last synced: 19 Dec 2024
https://github.com/ispc/ispc
Intelยฎ Implicit SPMD Program Compiler
compiler intel ispc programming-language simd spmd
Last synced: 17 Dec 2024
https://github.com/guillaumeblanc/ozz-animation
Open source c++ skeletal animation library and toolset
animation collada data-oriented fbx game mit-license simd soa sse
Last synced: 19 Dec 2024
https://github.com/recp/cglm
๐ฝ Highly Optimized 2D / 3D Graphics Math (glm) for C
3d 3d-math affine-transform-matrices avx bezier bounding-boxes c euler frustum marix-inverse math matrix matrix-decompositions neon opengl opengl-math simd sse vector wasm
Last synced: 17 Dec 2024
https://github.com/ashvardanian/stringzilla
Up to 10x faster strings for C, C++, Python, Rust, and Swift, leveraging NEON, AVX2, AVX-512, and SWAR to accelerate search, sort, edit distances, alignment scores, etc ๐ฆ
beautifulsoup common-crawl csv dataset html information-retrieval json laion ndjson parser pattern-recognition simd sorting-algorithms string string-manipulation string-matching string-parsing string-search substring
Last synced: 19 Dec 2024
https://github.com/unum-cloud/usearch
Fast Open-Source Search & Clustering engine ร for Vectors & ๐ Strings ร in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram ๐
approximate-nearest-neighbor-search clustering database faiss full-text-search fuzzy-search image-search kann nearest-neighbor-search recommender-system search search-engine semantic-search simd similarity-search text-search vector-search webassembly
Last synced: 01 Nov 2024
https://github.com/xtensor-stack/xsimd
C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE))
avx avx512 c-plus-plus-11 cpp mathematical-functions neon simd simd-instructions simd-intrinsics sse sve vectorization
Last synced: 17 Dec 2024
https://github.com/tairov/llama2.mojo
Inference Llama 2 in one file of pure ๐ฅ
inference llama llama2 modular mojo parallelize performance simd tensor transformer-architecture vectorization
Last synced: 22 Dec 2024
https://github.com/ermig1979/simd
C++ image processing and machine learning library with using of SIMD: SSE, AVX, AVX-512, AMX for x86/x64, NEON for ARM.
amx arm avx avx512 c-plus-plus haar-cascade image-processing lbp machine-learning neon neural-network simd simd-library sse
Last synced: 19 Dec 2024
https://github.com/ermig1979/Simd
C++ image processing and machine learning library with using of SIMD: SSE, AVX, AVX-512, AMX for x86/x64, VMX(Altivec) and VSX(Power7) for PowerPC, NEON for ARM.
altivec amx arm avx avx512 c-plus-plus haar-cascade image-processing lbp machine-learning neon neural-network powerpc simd simd-library sse vsx
Last synced: 26 Oct 2024
https://github.com/google/xnnpack
High-efficiency floating-point neural network inference operators for mobile, server, and Web
convolutional-neural-network convolutional-neural-networks cpu inference inference-optimization matrix-multiplication mobile-inference multithreading neural-network neural-networks simd
Last synced: 13 Nov 2024
https://github.com/google/XNNPACK
High-efficiency floating-point neural network inference operators for mobile, server, and Web
convolutional-neural-network convolutional-neural-networks cpu inference inference-optimization matrix-multiplication mobile-inference multithreading neural-network neural-networks simd
Last synced: 24 Oct 2024
https://github.com/ashvardanian/StringZilla
Up to 10x faster strings for C, C++, Python, Rust, and Swift, leveraging SWAR and SIMD on Arm Neon and x86 AVX2 & AVX-512-capable chips to accelerate search, sort, edit distances, alignment scores, etc ๐ฆ
beautifulsoup common-crawl csv dataset html information-retrieval json laion ndjson parser pattern-recognition simd sorting-algorithms string string-manipulation string-matching string-parsing string-search substring
Last synced: 28 Oct 2024
https://github.com/maratyszcza/nnpack
Acceleration package for neural networks on multi-core CPUs
convolutional-layers cpu fast-fourier-transform high-performance high-performance-computing inference matrix-multiplication multithreading neural-network neural-networks simd winograd-transform
Last synced: 19 Dec 2024
https://github.com/kfrlib/kfr
Fast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)
audio audio-processing avx avx512 clang cplusplus cplusplus-14 cplusplus-17 cpp14 cpp17 cxx dft digital-signal-processing discrete-fourier-transform dsp fast-fourier-transform fft header-only simd
Last synced: 19 Dec 2024
https://github.com/Maratyszcza/NNPACK
Acceleration package for neural networks on multi-core CPUs
convolutional-layers cpu fast-fourier-transform high-performance high-performance-computing inference matrix-multiplication multithreading neural-network neural-networks simd winograd-transform
Last synced: 27 Oct 2024
https://github.com/fastfloat/fast_float
Fast and exact implementation of the C++ from_chars functions for number types: 4x to 10x faster than strtod, part of GCC 12, Chromium, Redis and WebKit/Safari
cpp-library cpp11 cpp17 freebsd high-performance linux macos neon simd sse2 visual-studio
Last synced: 17 Dec 2024
https://github.com/adamniederer/faster
SIMD for humans
cross-platform intrinsics optimization simd
Last synced: 19 Dec 2024
https://github.com/AdamNiederer/faster
SIMD for humans
cross-platform intrinsics optimization simd
Last synced: 28 Oct 2024
https://github.com/microsoft/directxmath
DirectXMath is an all inline SIMD C++ linear algebra library for use in games and graphics apps
avx avx2 clang cpp-library desktop directx directxmath microsoft msvc neon simd sse uwp xbox
Last synced: 18 Dec 2024
https://github.com/timeplus-io/proton
A streaming SQL engine, a fast and lightweight alternative to ksqlDB and Apache Flink, ๐ powered by ClickHouse.
analytics clickhouse confluent cpp flink-alternative high-performance kakfa ksqldb-alternative redpanda simd single-binary sql stream-processing streaming-sql udf
Last synced: 18 Dec 2024
https://github.com/bitshifter/glam-rs
A simple and fast linear algebra library for games and graphics
3d-math-libraries rust simd sse2
Last synced: 16 Dec 2024
https://github.com/microsoft/DirectXMath
DirectXMath is an all inline SIMD C++ linear algebra library for use in games and graphics apps
avx avx2 clang cpp-library desktop directx directxmath microsoft msvc neon simd sse uwp xbox
Last synced: 25 Oct 2024
https://github.com/Microsoft/DirectXMath
DirectXMath is an all inline SIMD C++ linear algebra library for use in games and graphics apps
avx avx2 clang cpp-library desktop directx directxmath microsoft msvc neon simd sse uwp xbox
Last synced: 24 Oct 2024
https://github.com/vcdevel/vc
SIMD Vector Classes for C++
avx avx2 avx512 c-plus-plus cpp cpp11 cpp14 cpp17 data-parallel neon parallel parallel-computing portable simd simd-instructions simd-programming simd-vector sse vectorization
Last synced: 19 Dec 2024
https://github.com/VcDevel/Vc
SIMD Vector Classes for C++
avx avx2 avx512 c-plus-plus cpp cpp11 cpp14 cpp17 data-parallel neon parallel parallel-computing portable simd simd-instructions simd-programming simd-vector sse vectorization
Last synced: 26 Oct 2024
https://github.com/ada-url/ada
WHATWG-compliant and fast URL parser written in modern C++, part of Node.js, Clickhouse, Redpanda, Kong, Telegram and Cloudflare Workers.
cpp neon parser performance simd sse2 url whatwg-url
Last synced: 19 Dec 2024
https://github.com/daniel-liu-c0deb0t/uwu
fastest text uwuifier in the west
Last synced: 19 Dec 2024
https://github.com/Daniel-Liu-c0deb0t/uwu
fastest text uwuifier in the west
Last synced: 06 Nov 2024
https://github.com/satdump/satdump
A generic satellite data processing software.
baseband ccsds digital-signal-processing satellite sdr simd volk
Last synced: 19 Dec 2024
https://github.com/SatDump/SatDump
A generic satellite data processing software.
baseband ccsds digital-signal-processing satellite sdr simd volk
Last synced: 05 Nov 2024
https://github.com/dltcollab/sse2neon
A translator from Intel SSE intrinsics to Arm/Aarch64 NEON implementation
aarch64 apple-silicon arm arm64 armv7l armv8 armv8-a biilabs intel-intrinsics intel-sse-intrinsics neon neon-intrinsics simd sse sse-intrinsics sse2neon x86
Last synced: 19 Dec 2024
https://github.com/DLTcollab/sse2neon
A translator from Intel SSE intrinsics to Arm/Aarch64 NEON implementation
aarch64 apple-silicon arm arm64 armv7l armv8 armv8-a biilabs intel-intrinsics intel-sse-intrinsics neon neon-intrinsics simd sse sse-intrinsics sse2neon x86
Last synced: 27 Oct 2024
https://github.com/simdutf/simdutf
Unicode routines (UTF8, UTF16, UTF32) and Base64: billions of characters per second using SSE2, AVX2, NEON, AVX-512, RISC-V Vector Extension, LoongArch64. Part of Node.js, WebKit/Safari, Ladybird, Chromium, Cloudflare Workers and Bun.
avx-512 avx2 base64 neon risc-v simd sse2 transcoding unicode utf16 utf8
Last synced: 19 Dec 2024
https://github.com/apache/incubator-gluten
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
arrow clickhouse simd spark-sql vectorization velox
Last synced: 19 Dec 2024
https://github.com/simd-lite/simd-json
Rust port of simdjson
hacktoberfest json rust rust-crate simd
Last synced: 17 Dec 2024
https://github.com/unum-cloud/ucall
Web Serving and Remote Procedure Calls at 50x lower latency and 70x higher bandwidth than FastAPI, implementing JSON-RPC & REST over io_uring โ๏ธ
backend cpython dpdk epoll fast-api flask http http-server io-uring json json-rpc liburing linux-kernel python rest-api rpc rpc-framework simd tcp tcp-ip
Last synced: 18 Dec 2024
https://github.com/rustgd/cgmath
A linear algebra and mathematics library for computer graphics.
computer-graphics linear-algebra mathematics-library matrix rust simd simd-vector vector
Last synced: 17 Dec 2024
https://github.com/ashvardanian/simsimd
Up to 200x Faster Dot Products & Similarity Metrics โ for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for both AVX2, AVX-512, NEON, SVE, & SVE2 ๐
arm-neon arm-sve assembly avx2 avx512 bfloat16 blas blas-libraries distance-calculation float16 information-retrieval metrics neon numpy scipy simd simd-instructions similarity-measures similarity-search vector-search
Last synced: 17 Dec 2024
https://github.com/auburn/fastnoise2
Modular node graph based noise generation library using SIMD, C++17 and templates
cross-platform fastnoise magnum node-graph noise noise-algorithms noise-generator perlin-noise procedural-generation simd simplex terrain-generation texture-generation
Last synced: 20 Dec 2024
https://github.com/SnellerInc/sneller
World's fastest log analysis: ฮป + SQL + JSON + S3
avx512 go high-performance indexless json log query-engine s3 schemaless serverless simd sql vectorized
Last synced: 28 Oct 2024
https://github.com/burntsushi/memchr
Optimized string search routines for Rust.
bytes memchr rabin-karp rust simd string string-searching twoway
Last synced: 18 Dec 2024
https://github.com/BurntSushi/memchr
Optimized string search routines for Rust.
bytes memchr rabin-karp rust simd string string-searching twoway
Last synced: 06 Nov 2024
https://github.com/netfabric/netfabric.hyperlinq
High performance LINQ implementation with minimal heap allocations. Supports enumerables, async enumerables, arrays and Span<T>.
array async-enumerable buffer-pools csharp csharp-library dotnet dotnet-core dotnet-standard enumeration heap-allocations linq nuget-package performance reduced-heap-allocations simd span
Last synced: 16 Dec 2024
https://github.com/NetFabric/NetFabric.Hyperlinq
High performance LINQ implementation with minimal heap allocations. Supports enumerables, async enumerables, arrays and Span<T>.
array async-enumerable buffer-pools csharp csharp-library dotnet dotnet-core dotnet-standard enumeration heap-allocations linq nuget-package performance reduced-heap-allocations simd span
Last synced: 11 Nov 2024
https://github.com/segmentio/asm
Go library providing algorithms optimized to leverage the characteristics of modern CPUs
arm assembler assembly avo branch-prediction go golang simd x86
Last synced: 18 Dec 2024
https://github.com/jfalcou/eve
Expressive Vector Engine - SIMD in C++ Goes Brrrr
aarch64 altivec avx avx2 cpp cpp-library hpc neon simd simd-library simd-parallelism simd-programming sse2 ssse3
Last synced: 26 Oct 2024
https://github.com/libxsmm/libxsmm
Library for specialized dense and sparse matrix operations, and deep learning primitives.
amx avx avx2 avx512 bfloat16 blas convolution fortran intel jit machine-learning matrix matrix-multiplication simd sparse sse tensor transpose vector
Last synced: 26 Oct 2024
https://github.com/ogxd/gxhash
The fastest hashing algorithm ๐
cryptography hash hashing hashmap ilp no-std performance simd
Last synced: 19 Nov 2024
https://github.com/jackmott/LinqFaster
Linq-like extension functions for Arrays, Span<T>, and List<T> that are faster and allocate less.
allocation csharp-library gamedev-library linq perfromance simd
Last synced: 13 Nov 2024
https://github.com/jackmott/linqfaster
Linq-like extension functions for Arrays, Span<T>, and List<T> that are faster and allocate less.
allocation csharp-library gamedev-library linq perfromance simd
Last synced: 21 Dec 2024
https://github.com/ashvardanian/SimSIMD
Up to 200x Faster Inner Products and Vector Similarity โ for Python, JavaScript, Rust, and C, supporting f64, f32, f16 real & complex, i8, and binary vectors using SIMD for both x86 AVX2 & AVX-512 and Arm NEON & SVE ๐
arm-neon arm-sve assembly avx2 avx512 blas blas-libraries distance-calculation distance-measures float16 information-retrieval metrics neon numpy scipy simd simd-instructions similarity-measures similarity-search vector-search
Last synced: 28 Oct 2024
https://github.com/jeremyong/klein
P(R*_{3, 0, 1}) specialized SIMD Geometric Algebra Library
3d 3d-graphics algebra animation dual-quaternions geometric-algebra inverse-kinematics pga projective-geometry quaternion-algebra quaternions simd sse
Last synced: 20 Nov 2024
https://github.com/powturbo/TurboPFor-Integer-Compression
Fastest Integer Compression
avx2 compression compressor encoding floating-point integer-compression intersection inverted-index library simd sse2 time-series
Last synced: 26 Oct 2024
https://github.com/JuliaSIMD/LoopVectorization.jl
Macro(s) for vectorizing loops.
Last synced: 06 Nov 2024
https://github.com/nfrechette/rtm
Realtime Math
c-plus-plus cpp game-development game-engine math simd
Last synced: 20 Dec 2024
https://github.com/romeric/Fastor
A lightweight high performance tensor algebra framework for modern C++
fpga hpc multidimensional-arrays simd small-blas tensor-contraction tensors
Last synced: 11 Nov 2024
https://github.com/redorav/hlslpp
Math library using HLSL syntax with multiplatform SIMD support
arm arm64 avx c-plus-plus-11 cpp game-development hlsl math math-library matrix neon quaternion shaders simd sse sse41 vector wasm
Last synced: 21 Dec 2024
https://github.com/shibatch/sleef
SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT
aarch64 android arm avx avx512 cuda elementary-functions fft ios math-library neon powerpc quadruple-precision s390x simd sse2 sve vector-math vectorization vsx
Last synced: 19 Dec 2024
https://github.com/tjake/jlama
Jlama is a modern LLM inference engine for Java
ai genai gpt huggingface java llama llm openai simd transformers
Last synced: 20 Dec 2024
https://github.com/piotte13/SIMD-Visualiser
A tool to graphically visualize SIMD code
compilers intrinsics simd vectorized-computation visualisation
Last synced: 28 Oct 2024
https://github.com/egorbo/simdjsonsharp
C# bindings for lemire/simdjson (and full C# port)
Last synced: 21 Dec 2024
https://github.com/tktech/pysimdjson
Python bindings for the simdjson project.
json pysimdjson python python-bindings simd simdjson
Last synced: 18 Dec 2024
https://github.com/EgorBo/SimdJsonSharp
C# bindings for lemire/simdjson (and full C# port)
Last synced: 13 Nov 2024
https://github.com/TkTech/pysimdjson
Python bindings for the simdjson project.
json pysimdjson python python-bindings simd simdjson
Last synced: 31 Oct 2024
https://github.com/pikkr/pikkr
JSON parser which picks up values directly without performing tokenization in Rust
json json-parser pikkr rust simd
Last synced: 27 Oct 2024
https://github.com/rust-lang/stdarch
Rust's standard library vendor-specific APIs and run-time feature detection
Last synced: 19 Dec 2024
https://github.com/Auburn/FastNoiseSIMD
C++ SIMD Noise Library
avx2 cellular fastnoise fastnoise-simd fractal neon noise noise-3d noise-library perlin perlin-noise simd simplex simplex-noise sse white-noise
Last synced: 26 Oct 2024
https://github.com/Auburns/FastNoiseSIMD
C++ SIMD Noise Library
avx2 cellular fastnoise fastnoise-simd fractal neon noise noise-3d noise-library perlin perlin-noise simd simplex simplex-noise sse white-noise
Last synced: 14 Dec 2024
https://github.com/mukel/llama3.java
Practical Llama 3 inference in Java
chatgpt genai gguf huggingface java llama llama3 llamacpp llm llm-inference llms openai simd transformers
Last synced: 21 Dec 2024
https://github.com/gnuradio/volk
The Vector Optimized Library of Kernels
sdr simd simd-instructions simd-programming
Last synced: 15 Dec 2024
https://github.com/soedinglab/hh-suite
Remote protein homology detection suite.
alignment bioinformatics cpp hh-suite hhblits hhpred hhsearch opensource profile-profile-search profile-search protein-structure sequence-search simd viterbi
Last synced: 20 Dec 2024
https://github.com/rusticstuff/simdutf8
SIMD-accelerated UTF-8 validation for Rust.
aarch64 arm64 avx2 neon rust rust-crate simd simd-extensions sse41 unicode utf-8 wasm
Last synced: 07 Nov 2024
https://github.com/cloudwego/sonic-rs
A fast Rust JSON library based on SIMD.
Last synced: 15 Dec 2024
https://github.com/fast-pack/simdcomp
A simple C library for compressing lists of integers using binary packing
c compression simd simd-instructions
Last synced: 21 Dec 2024
https://github.com/mosra/corrade
C++11 multiplatform utility library
android c-plus-plus c-plus-plus-11 cmake corrade emscripten ios linux macos magnum simd windows
Last synced: 20 Dec 2024
https://github.com/lemire/simdcomp
A simple C library for compressing lists of integers using binary packing
c compression simd simd-instructions
Last synced: 14 Dec 2024
https://github.com/lemire/SIMDcomp
A simple C library for compressing lists of integers using binary packing
c compression simd simd-instructions
Last synced: 28 Oct 2024
https://github.com/seqan/seqan
SeqAn's official repository.
alignment bioinfomatics blast bwt cpp14 fasta fastq-format high-performance htslib indexing sam-bam seqan sequence-alignments simd suffixarray
Last synced: 20 Dec 2024
https://github.com/Volcomix/virtual-background
Demo on adding virtual background to a live video stream in the browser
background bodypix demo mediapipe mlkit react segmentation selfie shaders simd stream tensorflow tfjs tflite typescript video wasm webgl
Last synced: 29 Oct 2024
https://github.com/fast-pack/simdcompressionandintersection
A C++ library to compress and intersect sorted lists of integers using SIMD instructions
algorithms compression integer-compression intersection simd simd-instructions
Last synced: 21 Dec 2024