Projects in Awesome Lists tagged with python-dataset
A curated list of projects in awesome lists tagged with python-dataset .
https://github.com/anonym0uswork1221/python-code-docstring-scraper
A multi-threaded GitHub scraper to collect Python code with docstrings from public repositories, creating a well-documented dataset for the JaraConverse LLM model.
causal-language-modeling data-scraping dataset dataset-generation dataset-scripts docst docstring-generator github-scraper llm llm-training nlp nlp-machine-learning python-code python-dataset python3 scraper script
Last synced: 11 Jan 2025