Projects in Awesome Lists tagged with fast-inference

A curated list of projects in awesome lists tagged with fast-inference.

https://github.com/foolwood/pytorch-slimming

Learning Efficient Convolutional Networks through Network Slimming, in ICCV 2017.

deep-learning fast-inference l1-regularization pytorch weight-pruning

Last synced: 05 Apr 2025

https://github.com/aredden/flux-fp8-api

Flux diffusion model implementation that uses quantized fp8 matmul, with the remaining layers using faster half-precision accumulation; it is ~2x faster on consumer devices.

diffusion fast-inference flux fp8 pytorch quantization

Last synced: 12 Jan 2025

https://github.com/kssteven418/biglittledecoder

[NeurIPS'23] Speculative Decoding with Big Little Decoder

decoding efficient-inference fast-inference llm speculative-decoding speculative-execution

Last synced: 05 Dec 2024

https://github.com/meopbk/fast_inference_classifiers

Multilabel fast-inference classifiers (Ridge Regression and MLP) for NLP, with a sentence embedder, K-fold, bootstrap, and boosting. NOTE: because the MLP (fully connected NN) classifier was too heavy to be loaded, you can compile it with the script instead.

boosting bootstrap classification classifiers classifiers-comparison embedders fast-inference learning-curve mlp-classifier mlp-networks nlp nn python python3 ridge-classifier ridge-regression ridge-regression-model transformers

Last synced: 07 May 2025