{"id":25556437,"url":"https://github.com/ramcovasu/monolithic","last_synced_at":"2026-02-13T06:41:42.549Z","repository":{"id":277600789,"uuid":"932944389","full_name":"ramcovasu/Monolithic","owner":"ramcovasu","description":"Leverage LLM (SLM like phi4) to generate documentation for very large SQL file which has 100's of procedures..","archived":false,"fork":false,"pushed_at":"2025-02-14T20:22:32.000Z","size":59,"stargazers_count":1,"open_issues_count":0,"forks_count":1,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-05-23T23:37:14.611Z","etag":null,"topics":["ai","llm","phi4","slm"],"latest_commit_sha":null,"homepage":"","language":"HTML","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"apache-2.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/ramcovasu.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2025-02-14T20:13:28.000Z","updated_at":"2025-02-14T20:22:35.000Z","dependencies_parsed_at":"2025-02-14T21:25:36.752Z","dependency_job_id":"878386d5-ffc4-4830-ac4a-084d8339d3ff","html_url":"https://github.com/ramcovasu/Monolithic","commit_stats":null,"previous_names":["ramcovasu/monolithic"],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/ramcovasu/Monolithic","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ramcovasu%2FMonolithic","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ramcovasu%2FMonolithic/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ramcovasu%2FMonolithic/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ramcovasu%2FMonolithic/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/ramcovasu","download_url":"https://codeload.github.com/ramcovasu/Monolithic/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/ramcovasu%2FMonolithic/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":29398144,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-02-13T06:24:03.484Z","status":"ssl_error","status_checked_at":"2026-02-13T06:23:12.830Z","response_time":78,"last_error":"SSL_connect returned=1 errno=0 peeraddr=140.82.121.5:443 state=error: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["ai","llm","phi4","slm"],"created_at":"2025-02-20T14:26:38.736Z","updated_at":"2026-02-13T06:41:42.533Z","avatar_url":"https://github.com/ramcovasu.png","language":"HTML","funding_links":[],"categories":[],"sub_categories":[],"readme":"# SQL Code Analysis and Documentation Generator\n\nA powerful tool for analyzing, documenting, and visualizing SQL codebase structure and dependencies. This project combines modern language models with vector storage to provide comprehensive insights into SQL code architecture.\n\n## Features\n\n- **Intelligent SQL Parsing**: Automatically breaks down SQL files into logical chunks (packages, procedures, functions)\n- **Dependency Analysis**: Identifies and visualizes relationships between different SQL objects\n- **Vector-Based Storage**: Uses ChromaDB for efficient storage and retrieval of code chunks\n- **LLM-Powered Analysis**: Leverages language models to provide detailed code analysis and insights\n- **Interactive Documentation**: Generates comprehensive HTML documentation with interactive components\n- **Streamlit Interface**: User-friendly web interface for uploading and analyzing SQL files\n- **Monolithic helps to generate a SQL file which can then be used for this project\n- **Output is created as a HTML , sample shown sql_documentation.html\n\n## Architecture\n\nThe project consists of several key components:\n\n- `sqldataeng.py`: SQL parsing and chunk extraction\n- `vectorstore.py`: Vector storage implementation using ChromaDB\n- `llmprocessor.py`: Language model integration for code analysis\n- `docgenerator.py`: Documentation generation and formatting\n- `main.py`: Streamlit web interface\n\n## Prerequisites\n\n- Python 3.8+\n- CUDA-capable GPU (optional, for faster processing)\n\n## Installation\n\n1. Clone the repository:\n```bash\ngit clone https://github.com/ramcovasu/monolithic.git\ncd monolithic\n```\n\n2. Create a virtual environment:\n```bash\npython -m venv venv\nsource venv/bin/activate  # On Windows: venv\\Scripts\\activate\n```\n\n3. Install dependencies:\n```bash\npip install -r requirements.txt\n```\n\n## Usage\n\n1. Start the Streamlit application:\n```bash\nstreamlit run main.py --server.fileWatcherType none\n```\n\n2. Open your browser and navigate to `http://localhost:8501`\n\n3. Upload your SQL file through the web interface\n\n4. Follow the step-by-step process:\n   - Parse SQL code\n   - Process and store chunks\n   - Generate analysis\n   - View and download documentation\n\n## Key Features\n\n### SQL Parsing\n- Intelligent package and procedure detection\n- Accurate dependency tracking\n- Support for complex SQL structures\n\n### Vector Storage\n- Efficient code chunk storage\n- Semantic similarity search\n- Dependency graph construction\n\n### Documentation Generation\n- Comprehensive HTML reports\n- Interactive visualizations\n- Detailed code analysis\n- Dependency diagrams\n\n## Technical Details\n\n### Embedding Model\n- Uses BAAI/bge-small-en-v1.5 for embeddings\n- Supports GPU acceleration when available\n- Efficient batch processing\n\n### Vector Storage\n- ChromaDB for persistent storage\n- Optimized for code similarity search\n- Efficient metadata handling\n\n### LLM Integration\n- Local LLM support via LM Studio\n- Batched processing for large codebases\n- Error handling and retry logic\n\n## Contributing\n\n1. Fork the repository\n2. Create your feature branch (`git checkout -b feature/amazing-feature`)\n3. Commit your changes (`git commit -m 'Add amazing feature'`)\n4. Push to the branch (`git push origin feature/amazing-feature`)\n5. Open a Pull Request\n\n## License\n\nThis project is licensed under the MIT License - see the LICENSE file for details.\n\n## Acknowledgments\n\n- ChromaDB for vector storage\n- Sentence Transformers for embeddings\n- Streamlit for the web interface\n- SQLParse for SQL parsing\n\n## Project Structure\n\n```\nmonolithic/\n├── main.py              # Streamlit application\n├── sqldataeng.py       # SQL parsing engine\n├── vectorstore.py      # Vector storage management\n├── llmprocessor.py     # LLM integration\n├── docgenerator.py     # Documentation generator\n├── requirements.txt    # Project dependencies\n└── README.md          # This file\n```\n\n## Future Enhancements\n\n- Support for additional SQL dialects\n- Enhanced visualization options\n- Code quality metrics\n- Performance optimization suggestions\n- Batch processing for multiple files\n\n## Contact\n\nCreate an issue in the repository for bug reports, feature requests, or general questions.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Framcovasu%2Fmonolithic","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Framcovasu%2Fmonolithic","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Framcovasu%2Fmonolithic/lists"}