https://github.com/puureya2/llm-powered-web-scraper
Big Data Web Scraper Framework, Internship Project
https://github.com/puureya2/llm-powered-web-scraper
asyncio crawl4ai crawler crush-cli csv deepseek-r1 gemini-2-5-flash gemini-api gzip json llm pandas pydantic python selenium-webdriver seleniumwire web-scraper
Last synced: about 2 months ago
JSON representation
Big Data Web Scraper Framework, Internship Project
- Host: GitHub
- URL: https://github.com/puureya2/llm-powered-web-scraper
- Owner: puureya2
- Created: 2025-08-15T16:46:21.000Z (about 2 months ago)
- Default Branch: main
- Last Pushed: 2025-08-15T17:27:21.000Z (about 2 months ago)
- Last Synced: 2025-08-15T19:31:18.970Z (about 2 months ago)
- Topics: asyncio, crawl4ai, crawler, crush-cli, csv, deepseek-r1, gemini-2-5-flash, gemini-api, gzip, json, llm, pandas, pydantic, python, selenium-webdriver, seleniumwire, web-scraper
- Language: Python
- Homepage: https://training.gov.au/
- Size: 604 KB
- Stars: 1
- Watchers: 0
- Forks: 0
- Open Issues: 0