https://github.com/bodrovis-learning/scrapingbee-load-site-data-model-training
How to scrape all text from a website for LLM training
python3 scraping web-scraping
- Host: GitHub
- URL: https://github.com/bodrovis-learning/scrapingbee-load-site-data-model-training
- Owner: bodrovis-learning
- Created: 2024-06-23T14:26:33.000Z (almost 2 years ago)
- Default Branch: master
- Last Pushed: 2024-07-04T12:22:13.000Z (almost 2 years ago)
- Last Synced: 2025-05-15T11:50:40.334Z (about 1 year ago)
- Topics: python3, scraping, web-scraping
- Language: Python
- Homepage: https://www.scrapingbee.com/blog/how-to-scrape-all-text-from-a-website-for-llm-ai-training/
- Size: 23.4 KB
- Stars: 0
- Watchers: 1
- Forks: 4
- Open Issues: 0
Metadata Files:
- Readme: README.md
README
# How to scrape all text from a website for LLM training
https://www.scrapingbee.com/blog/how-to-scrape-all-text-from-a-website-for-llm-ai-training/