Projects in Awesome Lists by aredden
A curated list of projects in awesome lists by aredden.
https://github.com/aredden/flux-fp8-api
Flux diffusion model implementation that uses quantized fp8 matmul, with the remaining layers using faster half-precision accumulate; it is ~2x faster on consumer devices.
diffusion fast-inference flux fp8 pytorch quantization
Last synced: 12 Jan 2025