Projects in Awesome Lists tagged with fast-inference
A curated list of projects in awesome lists tagged with fast-inference .
https://github.com/foolwood/pytorch-slimming
Learning Efficient Convolutional Networks through Network Slimming, In ICCV 2017.
deep-learning fast-inference l1-regularization pytorch weight-pruning
Last synced: 05 Apr 2025
https://github.com/aredden/flux-fp8-api
Flux diffusion model implementation using quantized fp8 matmul & remaining layers use faster half precision accumulate, which is ~2x faster on consumer devices.
diffusion fast-inference flux fp8 pytorch quantization
Last synced: 12 Jan 2025
https://github.com/kssteven418/biglittledecoder
[NeurIPS'23] Speculative Decoding with Big Little Decoder
decoding efficient-inference fast-inference llm speculative-decoding speculative-execution
Last synced: 05 Dec 2024
https://github.com/meopbk/fast_inference_classifiers
Multilable fast inference classifiers (Ridge Regression and MLP) for NLPs with Sentence Embedder, K-Fold, Bootstrap and Boosting. NOTE: since the MLP (fully connected NN) Classifier was too heavy to be loaded, you can just compile it with the script.
boosting bootstrap classification classifiers classifiers-comparison embedders fast-inference learning-curve mlp-classifier mlp-networks nlp nn python python3 ridge-classifier ridge-regression ridge-regression-model transformers
Last synced: 07 May 2025