https://github.com/flaykky/llm-benchmarks
largest llm coding benchmark
https://github.com/flaykky/llm-benchmarks
ai ai-code-generation aicoding benchmark claude-code deepseek-r1 deepseek-v3 llm llms llms-benchmarking openai qwen3
Last synced: about 1 month ago
JSON representation
largest llm coding benchmark
- Host: GitHub
- URL: https://github.com/flaykky/llm-benchmarks
- Owner: Flaykky
- License: mit
- Created: 2025-08-30T12:33:35.000Z (10 months ago)
- Default Branch: main
- Last Pushed: 2026-01-29T19:09:21.000Z (4 months ago)
- Last Synced: 2026-01-30T07:48:38.695Z (4 months ago)
- Topics: ai, ai-code-generation, aicoding, benchmark, claude-code, deepseek-r1, deepseek-v3, llm, llms, llms-benchmarking, openai, qwen3
- Language: C
- Homepage:
- Size: 233 KB
- Stars: 1
- Watchers: 0
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# LLM Benchmarks
**The Coding, IT activities, censorship, jailbreakability Benchmark for Large Language Models [LLM]**
## Basic info about project
benchmarks of many models from 13 llm providers in coding and other, coding promots including all popular themes and languages like frontend, backend and etc
## tests
- IT activities (14+ topics of coding provided at [docs/prompts.md](docs/prompts.md))
- censorship and jailbrekability
## Advanced information of project
all info about project, which llms were used in benchmarks, prompts and etc is in [docs/prompts.md](docs/prompts.md)
## License
Distributed under the MIT License. See [LICENSE](LICENSE) file for details.