Projects in Awesome Lists by devo8604
A curated list of projects in awesome lists by devo8604 .
https://github.com/devo8604/cicd_llm_data_scraper
Automated pipeline for generating high-quality Q&A training data from Git repositories. Processes source code with LLMs to create fine-tuning datasets. Features smart caching, resume support, MLX (Apple Silicon) & llama.cpp backends, multiple export formats (Alpaca, ChatML, etc).
alpaca code-analysis data-pipeline dataset-generation fine-tuning instruction-tuning llamacpp llm machine-learning mlx python question-answering sqlite synthetic-data training-data
Last synced: 11 Apr 2026