Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/guntas-13/cs613-nlp-telugu-team1
Collecting data for Telugu LLM. Group Project in Natural Language Processing Course CS613
https://github.com/guntas-13/cs613-nlp-telugu-team1
beautifulsoup4 corpus crawling deduplication large-language-models llm preprocessing scraping selenium-webdriver telugu-language
Last synced: 2 days ago
JSON representation
Collecting data for Telugu LLM. Group Project in Natural Language Processing Course CS613
- Host: GitHub
- URL: https://github.com/guntas-13/cs613-nlp-telugu-team1
- Owner: guntas-13
- License: apache-2.0
- Created: 2024-09-06T13:27:20.000Z (4 months ago)
- Default Branch: main
- Last Pushed: 2024-12-22T10:40:49.000Z (12 days ago)
- Last Synced: 2024-12-22T11:32:27.657Z (12 days ago)
- Topics: beautifulsoup4, corpus, crawling, deduplication, large-language-models, llm, preprocessing, scraping, selenium-webdriver, telugu-language
- Language: Jupyter Notebook
- Homepage:
- Size: 83.4 MB
- Stars: 2
- Watchers: 1
- Forks: 2
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE