Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with simd

A curated list of projects in awesome lists tagged with simd.

https://github.com/tencent/ncnn

ncnn is a high-performance neural network inference framework optimized for the mobile platform

android arm-neon artificial-intelligence caffe darknet deep-learning high-preformance inference ios keras mlir mxnet ncnn neural-network onnx pytorch riscv simd tensorflow vulkan

Last synced: 29 Sep 2024

https://github.com/simdjson/simdjson

Parsing gigabytes of JSON per second: used by Facebook/Meta Velox, the Node.js runtime, ClickHouse, WatermelonDB, Apache Doris, Milvus, StarRocks

aarch64 arm arm64 avx2 avx512 c-plus-plus clang clang-cl cpp11 gcc-compiler json json-parser json-pointer loongarch neon simd sse42 vs2019 x64

Last synced: 29 Sep 2024

https://github.com/openwall/john

John the Ripper jumbo - advanced offline password cracker, which supports hundreds of hash and cipher types, and runs on many operating systems, CPUs, GPUs, and even some FPGAs

assembler c cracker crypt fpga gpgpu gpu hash john jtr mpi opencl openmp password ripper simd

Last synced: 01 Oct 2024

https://github.com/magnumripper/JohnTheRipper

John the Ripper jumbo - advanced offline password cracker, which supports hundreds of hash and cipher types, and runs on many operating systems, CPUs, GPUs, and even some FPGAs

assembler c cracker crypt fpga gpgpu gpu hash john jtr mpi opencl openmp password ripper simd

Last synced: 14 Aug 2024

https://github.com/bytedance/sonic

A blazingly fast JSON serializing & deserializing library

high-performance jit json simd

Last synced: 29 Sep 2024

https://github.com/ARM-software/ComputeLibrary

The Compute Library is a set of computer vision and machine learning functions optimised for both Arm CPUs and GPUs using SIMD technologies.

aarch64 android arm armv7 armv8 computer-vision cpp linux machine-learning neon neural-network opencl simd sve

Last synced: 30 Jul 2024

https://github.com/turbo/js

turbo.js - perform massive parallel computations in your browser with GPGPU.

calculations glsl gpgpu gpu parallel shaders simd vector

Last synced: 26 Sep 2024

https://github.com/guillaumeblanc/ozz-animation

Open source c++ skeletal animation library and toolset

animation collada data-oriented fbx game mit-license simd soa sse

Last synced: 30 Sep 2024

https://github.com/ispc/ispc

Intel® Implicit SPMD Program Compiler

compiler intel ispc programming-language simd spmd

Last synced: 26 Sep 2024

https://github.com/simd-everywhere/simde

Implementations of SIMD instruction sets for systems which don't natively support them.

altivec arm arm64 avx avx2 avx512 fma gfni mmx neon powerpc simd simd-intrinsics sse sse2 sse3 sse41 sse42 ssse3 vectorization

Last synced: 01 Aug 2024

https://github.com/xtensor-stack/xsimd

C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE)

avx avx512 c-plus-plus-11 cpp mathematical-functions neon simd simd-instructions simd-intrinsics sse sve vectorization

Last synced: 30 Sep 2024

https://github.com/ermig1979/Simd

C++ image processing and machine learning library using SIMD: SSE, AVX, AVX-512, AMX for x86/x64, VMX (Altivec) and VSX (Power7) for PowerPC, NEON for ARM.

altivec amx arm avx avx512 c-plus-plus haar-cascade image-processing lbp machine-learning neon neural-network powerpc simd simd-library sse vsx

Last synced: 30 Jul 2024

https://github.com/unum-cloud/usearch

Fast Open-Source Search & Clustering engine × for Vectors & 🔜 Strings × in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram 🔍

approximate-nearest-neighbor-search clustering database faiss full-text-search fuzzy-search image-search kann nearest-neighbor-search recommender-system search search-engine semantic-search simd similarity-search text-search vector-search webassembly

Last synced: 31 Jul 2024

https://github.com/ashvardanian/StringZilla

Up to 10x faster strings for C, C++, Python, Rust, and Swift, leveraging SWAR and SIMD on Arm Neon and x86 AVX2 & AVX-512-capable chips to accelerate search, sort, edit distances, alignment scores, etc. 🦖

beautifulsoup common-crawl csv dataset html information-retrieval json laion ndjson parser pattern-recognition simd sorting-algorithms string string-manipulation string-matching string-parsing string-search substring

Last synced: 31 Jul 2024

https://github.com/agavrel/42_cheatsheet

A comprehensive guide to 50 years of evolution of strict C programming, a tribute to Dennis Ritchie's language

42 42born2code 42fremont 42madrid 42paris 42school 42seoul 42tokyo bitwise learning school sdl2 simd

Last synced: 30 Sep 2024

https://github.com/kfrlib/kfr

Fast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)

audio audio-processing avx avx512 clang cplusplus cplusplus-14 cplusplus-17 cpp14 cpp17 cxx dft digital-signal-processing discrete-fourier-transform dsp fast-fourier-transform fft header-only simd

Last synced: 01 Oct 2024

https://github.com/microsoft/DirectXMath

DirectXMath is an all inline SIMD C++ linear algebra library for use in games and graphics apps

avx avx2 clang cpp-library desktop directx directxmath microsoft msvc neon simd sse uwp xbox

Last synced: 30 Jul 2024

https://github.com/fastfloat/fast_float

Fast and exact implementation of the C++ from_chars functions for number types: 4x to 10x faster than strtod, part of GCC 12, Chromium, Redis and WebKit/Safari

cpp-library cpp11 cpp17 freebsd high-performance linux macos neon simd sse2 visual-studio

Last synced: 01 Oct 2024

https://github.com/timeplus-io/proton

A streaming SQL engine, a fast and lightweight alternative to ksqlDB and Apache Flink, 🚀 powered by ClickHouse.

analytics clickhouse confluent cpp flink-alternative high-performance kakfa ksqldb-alternative redpanda simd single-binary sql stream-processing streaming-sql udf

Last synced: 30 Sep 2024

https://github.com/bitshifter/glam-rs

A simple and fast linear algebra library for games and graphics

3d-math-libraries rust simd sse2

Last synced: 30 Sep 2024

https://github.com/daniel-liu-c0deb0t/uwu

fastest text uwuifier in the west

owo simd uwu

Last synced: 30 Sep 2024

https://github.com/ada-url/ada

WHATWG-compliant and fast URL parser written in modern C++, part of Node.js, Redpanda, Kong, Telegram and Cloudflare Workers.

cpp neon parser performance simd sse2 url whatwg-url

Last synced: 30 Sep 2024

https://github.com/p12tic/libsimdpp

Portable header-only C++ low level SIMD library

altivec avx2 avx512 msa neon simd sse vsx

Last synced: 01 Oct 2024

https://github.com/SatDump/SatDump

A generic satellite data processing software.

baseband ccsds digital-signal-processing satellite sdr simd volk

Last synced: 01 Aug 2024

https://github.com/rustgd/cgmath

A linear algebra and mathematics library for computer graphics.

computer-graphics linear-algebra mathematics-library matrix rust simd simd-vector vector

Last synced: 30 Sep 2024

https://github.com/simdutf/simdutf

Unicode routines (UTF8, UTF16, UTF32) and Base64: billions of characters per second using SSE2, AVX2, NEON, AVX-512, RISC-V Vector Extension. Part of Node.js, WebKit/Safari and Bun.

avx-512 avx2 base64 neon risc-v simd sse2 transcoding unicode utf16 utf8

Last synced: 30 Sep 2024

https://github.com/simd-lite/simd-json

Rust port of simdjson

hacktoberfest json rust rust-crate simd

Last synced: 30 Sep 2024

https://github.com/apache/incubator-gluten

Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.

arrow clickhouse simd spark-sql vectorization velox

Last synced: 30 Sep 2024

https://github.com/SnellerInc/sneller

World's fastest log analysis: λ + SQL + JSON + S3

avx512 go high-performance indexless json log query-engine s3 schemaless serverless simd sql vectorized

Last synced: 31 Jul 2024

https://github.com/unum-cloud/ucall

Remote Procedure Calls - 50x lower latency and 70x higher bandwidth than FastAPI, implementing JSON-RPC & 🔜 REST over io_uring and SIMDJSON ☎️

backend cpython dpdk epoll fast-api flask http http-server io-uring json json-rpc liburing linux-kernel python rpc rpc-framework simd simdjson tcp tcp-ip

Last synced: 30 Sep 2024

https://github.com/netfabric/netfabric.hyperlinq

High performance LINQ implementation with minimal heap allocations. Supports enumerables, async enumerables, arrays and Span<T>.

array async-enumerable buffer-pools csharp csharp-library dotnet dotnet-core dotnet-standard enumeration heap-allocations linq nuget-package performance reduced-heap-allocations simd span

Last synced: 30 Sep 2024

https://github.com/segmentio/asm

Go library providing algorithms optimized to leverage the characteristics of modern CPUs

arm assembler assembly avo branch-prediction go golang simd x86

Last synced: 29 Sep 2024

https://github.com/jfalcou/eve

Expressive Vector Engine - SIMD in C++ Goes Brrrr

aarch64 altivec avx avx2 cpp cpp-library hpc neon simd simd-library simd-parallelism simd-programming sse2 ssse3

Last synced: 31 Jul 2024

https://github.com/libxsmm/libxsmm

Library for specialized dense and sparse matrix operations, and deep learning primitives.

amx avx avx2 avx512 bfloat16 blas convolution fortran intel jit machine-learning matrix matrix-multiplication simd sparse sse tensor transpose vector

Last synced: 30 Jul 2024

https://github.com/jackmott/linqfaster

Linq-like extension functions for Arrays, Span<T>, and List<T> that are faster and allocate less.

allocation csharp-library gamedev-library linq perfromance simd

Last synced: 28 Sep 2024

https://github.com/ashvardanian/SimSIMD

Up to 200x Faster Inner Products and Vector Similarity for Python, JavaScript, Rust, and C, supporting f64, f32, f16 real & complex, i8, and binary vectors using SIMD for both x86 AVX2 & AVX-512 and Arm NEON & SVE 📐

arm-neon arm-sve assembly avx2 avx512 blas blas-libraries distance-calculation distance-measures float16 information-retrieval metrics neon numpy scipy simd simd-instructions similarity-measures similarity-search vector-search

Last synced: 31 Jul 2024

https://github.com/JuliaSIMD/LoopVectorization.jl

Macro(s) for vectorizing loops.

loops simd vectorizing-loops

Last synced: 01 Aug 2024

https://github.com/ogxd/gxhash

The fastest hashing algorithm 📈

cryptography hash hashing hashmap ilp no-std performance simd

Last synced: 04 Aug 2024

https://github.com/romeric/Fastor

A lightweight high performance tensor algebra framework for modern C++

fpga hpc multidimensional-arrays simd small-blas tensor-contraction tensors

Last synced: 02 Aug 2024

https://github.com/BurntSushi/memchr

Optimized string search routines for Rust.

bytes memchr rabin-karp rust simd string string-searching twoway

Last synced: 01 Aug 2024

https://github.com/piotte13/SIMD-Visualiser

A tool to graphically visualize SIMD code

compilers intrinsics simd vectorized-computation visualisation

Last synced: 31 Jul 2024

https://github.com/egorbo/simdjsonsharp

C# bindings for lemire/simdjson (and full C# port)

avx2 json netcore3 simd

Last synced: 01 Oct 2024

https://github.com/TkTech/pysimdjson

Python bindings for the simdjson project.

json pysimdjson python python-bindings simd simdjson

Last synced: 31 Jul 2024

https://github.com/pikkr/pikkr

JSON parser which picks up values directly without performing tokenization in Rust

json json-parser pikkr rust simd

Last synced: 31 Jul 2024

https://github.com/rust-lang/stdarch

Rust's standard library vendor-specific APIs and run-time feature detection

rust simd

Last synced: 28 Sep 2024

https://github.com/gnuradio/volk

The Vector Optimized Library of Kernels

sdr simd simd-instructions simd-programming

Last synced: 04 Aug 2024

https://github.com/rusticstuff/simdutf8

SIMD-accelerated UTF-8 validation for Rust.

aarch64 arm64 avx2 neon rust rust-crate simd simd-extensions sse41 unicode utf-8 wasm

Last synced: 01 Aug 2024

https://github.com/tjake/jlama

Jlama is a modern LLM inference engine for Java

ai gpt huggingface java llama llama2 llm openai simd transformers

Last synced: 27 Sep 2024

https://github.com/lemire/SIMDcomp

A simple C library for compressing lists of integers using binary packing

c compression simd simd-instructions

Last synced: 31 Jul 2024

https://github.com/Volcomix/virtual-background

Demo on adding virtual background to a live video stream in the browser

background bodypix demo mediapipe mlkit react segmentation selfie shaders simd stream tensorflow tfjs tflite typescript video wasm webgl

Last synced: 31 Jul 2024

https://github.com/aff3ct/MIPP

MIPP is a portable wrapper for SIMD instructions written in C++11. It supports NEON, SSE, AVX, AVX-512 and SVE (length specific).

avx avx-512 neon portable simd sse sve vector wrapper

Last synced: 31 Jul 2024

https://github.com/lemire/fastbase64

SIMD-accelerated base64 codecs

avx2 simd

Last synced: 31 Jul 2024

https://github.com/lemire/SIMDCompressionAndIntersection

A C++ library to compress and intersect sorted lists of integers using SIMD instructions

algorithms compression integer-compression intersection simd simd-instructions

Last synced: 30 Jul 2024

https://github.com/lemire/streamvbyte

Fast integer compression in C using the StreamVByte codec

arm compression integer-compression neon simd ssse3 x64

Last synced: 02 Aug 2024

https://github.com/p-ranav/fccf

fccf: A command-line tool that quickly searches through C/C++ source code in a directory based on a search string and prints relevant code snippets that match the query.

abstract-syntax-tree c-language c-programming clang code-search-engine command-line-tool cpp cpp11 cpp17 fast find libclang needle search simd sse2

Last synced: 29 Sep 2024

https://github.com/cloudwego/sonic-rs

A fast Rust JSON library based on SIMD.

json rust serde simd

Last synced: 01 Aug 2024

https://github.com/kangkaisen/olap-performance

OLAP Database Performance Tuning Guide

book cpp database olap performance query simd

Last synced: 31 Jul 2024

https://github.com/mratsim/laser

The HPC toolbox: fused matrix multiplication, convolution, data-parallel strided tensor primitives, OpenMP facilities, SIMD, JIT Assembler, CPU detection, state-of-the-art vectorized BLAS for floats and integers

assembler blas compiler-optimization convolution deep-learning gemm high-performance-computing jit matrix-multiplication openmp parallel runtime-cpu-detection simd tensor

Last synced: 29 Sep 2024

https://github.com/turnerj/quickenshtein

Making the quickest and most memory efficient implementation of Levenshtein Distance with SIMD and Threading support

edit-distance hardware-intrinsics levenshtein levenshtein-distance simd string-distance threading

Last synced: 03 Oct 2024

https://github.com/quickwit-oss/bitpacking

SIMD algorithms for integer compression via bitpacking. This crate is a port of a C library called simdcomp.

compression rust simd

Last synced: 01 Aug 2024

https://github.com/powturbo/Turbo-Base64

Turbo Base64 - Fastest Base64 SIMD:SSE/AVX2/AVX512/Neon/Altivec - Faster than memcpy!

arm avx avx2 avx512 base64 base64-decoding base64-encoding benchmark encoding encoding-library library neon simd sse

Last synced: 04 Aug 2024

https://github.com/epi5131/patch.aul

Fixes bugs in AviUtl, speeds it up, and adds features

aviutl aviutl-plugin boost cpp cpp20 monkey-patching opencl simd x86-assembly

Last synced: 29 Sep 2024