Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/guntas-13/cs613-nlp-telugu-team1
Collecting data for a Telugu LLM. Group project for the Natural Language Processing course (CS613).
- Host: GitHub
- URL: https://github.com/guntas-13/cs613-nlp-telugu-team1
- Owner: guntas-13
- License: apache-2.0
- Created: 2024-09-06T13:27:20.000Z (2 months ago)
- Default Branch: main
- Last Pushed: 2024-11-04T22:54:41.000Z (9 days ago)
- Last Synced: 2024-11-04T23:30:28.430Z (9 days ago)
- Topics: beautifulsoup4, corpus, crawling, deduplication, large-language-models, llm, preprocessing, scraping, selenium-webdriver, telugu-language
- Language: Jupyter Notebook
- Homepage:
- Size: 57.4 MB
- Stars: 1
- Watchers: 1
- Forks: 0
- Open Issues: 2