Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/spider-rs/web-crawling-guides

How to guides on web-crawling or scraping
https://github.com/spider-rs/web-crawling-guides

agents ai-agents ai-scraping clean-markdown crawler fast-webcrawler html-to-markdown llm-webcrawler scraper web-scraping

Last synced: 1 day ago
JSON representation

How to guides on web-crawling or scraping

Awesome Lists containing this project

README

        




Spider Logo

# Spider Web Crawling and Scraping Guides
This repos contains a collection of guides on how to effectively use the Spider service to crawl or scrape. Contributors are welcome! 😁

## Collection

- [Using the Spider API](spider-api.md)
- [How to Use Proxy Mode](proxy-mode.md)
- [LangChain + Groq + Spider = 🚀 (Integration Guide)](langchain-groq.md)
- [CrewAI Spider Stock Research](crewai-spider-research-agent.md)
- [Extracting Contacts](extracting-contacts.md)
- [Automated Cold Email Outreach Using Spider](auto-email-response-outreach.md)
- [How to Archive Full Website](website-archiving.md)
- Building A Speedy Resilient Web Scraper for RAG AI ([Part 1](building-a-speedy-resilient-web-scraper-for-rag-ai-part1-preparing.md), [Part 2](building-a-speedy-resilient-web-scraper-for-rag-ai-part2-scaling-up.md))
- [Agents from Scrach](ai-agent-from-scratch.md)

## Contribute

We're happy to accept requests in the issue tracker, improvements to the content, and additional guides.