Ecosyste.ms: Awesome

https://github.com/crwlrsoft/crawler

Library for Rapid (Web) Crawler and Scraper Development

crawler crawling hacktoberfest php scraper scraping scraping-websites web-crawler web-crawling web-scraper web-scraping

Last synced: 04 Jul 2024

https://github.com/kunalnagarco/imdb-scraper

🎬 An attempt at the most complete IMDb API

imdb imdb-api imdb-dataset imdb-information imdb-movies imdb-webscrapping scraper scraping-api scraping-websites

Last synced: 03 Jul 2024

https://github.com/kennethreitz/requests-html

Pythonic HTML Parsing for Humans™

html requests scraping scraping-framework scraping-websites

Last synced: 30 Jun 2024

https://github.com/baptisteArno/tinking

🧶 Extract data from any website without code, just clicks.

harvesting puppeteer scraping scraping-websites scrapper scrapping

Last synced: 27 Jun 2024

https://github.com/alash3al/scraply

Scraply a simple dom scraper to fetch information from any html based website

crawler crawling dom golang scraper scrapers scraping-websites scrapy server

Last synced: 21 Jun 2024

https://github.com/pavlovtech/WebReaper

Web scraper, crawler and parser in C#. Designed as simple, declarative and scalable web scraping solution.

crawler datamining parser parsing scraper scraping scraping-api scraping-data scraping-tool scraping-web scraping-websites webcrawler webscraping

Last synced: 15 Jun 2024

https://github.com/KTZgraph/sarenka

OSINT tool - gets data from services like shodan, censys etc. in one app

censys-api cve cve-databases cve-search cwe cwe-search django django-rest-framework docker osint osint-python python3 react react-persist react-redux react-thunk reactjs reconnaissance scraping-websites shodan-api

Last synced: 14 Jun 2024

https://github.com/unixfox/pupflare

A webpage proxy that request through Chromium (puppeteer) - can be used to bypass Cloudflare anti bot / anti ddos on any application (like curl)

anti-bot-page chromium cloudflare cloudflare-bypass cloudflare-scrape docker koa protected-page proxy puppeteer scrape scraping-websites

Last synced: 12 Jun 2024

https://github.com/html2rss/html2rss

📰 Build RSS 2.0 feeds from websites (and JSON APIs) with a few CSS selectors.

atom-feed extract feed feed-configs html html2rss json rss rss-aggregator rss-bridge rss-builder rss-feed rss-feed-scraper rss-generator ruby scrape scraper scraping scraping-websites yahoo-pipes

Last synced: 02 Jun 2024

https://github.com/SilentDemonSD/FZBypassBot

A Elegant Fast Multi Threaded Bypass Bot for Bigger Deeds. Try Now !!

bypass bypasscaptcha bypassing link-bypasser link-shortener scrape scrapers scraping-websites telegram-bot

Last synced: 28 May 2024

https://github.com/driscoll42/ebayMarketAnalyzer

Scrape all eBay sold listings to determine average/median pricing, plot listings over time with trend lines, and extract to excel

ebay python scraping-websites webscraping

Last synced: 26 May 2024

https://github.com/codecadre/imt-school-addresses

pulls addresses from IMT

etl-pipeline public-datasets scraping-websites

Last synced: 25 May 2024

https://github.com/MontFerret/ferret

Declarative web scraping

cdp chrome cli crawler crawling data-mining dsl go golang hacktoberfest library query-language scraper scraping scraping-websites tool

Last synced: 08 May 2024

https://github.com/elixir-crawly/crawly

Crawly, a high-level web crawling & scraping framework for Elixir.

crawler crawling elixir erlang extract-data scraper scraping scraping-websites spider

Last synced: 05 May 2024

https://github.com/slotix/dataflowkit

Extract structured data from web sites. Web sites scraping.

cdp chrome-fetcher crawling extract-data go golang golang-library headless scraper scraping scraping-websites

Last synced: 04 May 2024

https://github.com/DiegoCaraballo/Email-extractor

The main functionality is to extract all the emails from one or several URLs - La funcionalidad principal es extraer todos los correos electrónicos de una o varias Url

email email-extractor email-marketing emails extraction python scraper scrapers scraping scraping-websites scrapper scrapping scrapy scrapy-spider spyder stractor

Last synced: 01 May 2024

https://github.com/brahma-dev/metafetch

NodeJS package that fetches a given URL's title, description, images, links etc.

javascript meta-tags scraper scraping-websites

Last synced: 27 Apr 2024

https://github.com/Anorov/cloudflare-scrape

A Python module to bypass Cloudflare's anti-bot page.

anti-bot-page cloudflare protected-page scrape scraping-websites

Last synced: 26 Apr 2024

https://github.com/its0x08/duckduckgo

A simple DuckDuckGo URL scraper.

ddg duckduckgo python python3 scrape scraper scraping scraping-websites scrapper scrapping search url url-parsing url-scraper urls

Last synced: 19 Apr 2024

https://github.com/Python-World/Python_and_the_Web

Build Bots, Scrape a website or use an API to solve a problem.

api bots fun hacktoberfest python scraping-websites

Last synced: 19 Apr 2024

https://github.com/AmmeySaini/Edu-Mail-Generator

Generate Free Edu Mail(s) within minutes

auto-install-webdriver edu edu-account edu-generator edumail install-webdriver mail python python3 scraping scraping-websites selenium selenium-python student-mail

Last synced: 16 Apr 2024

https://github.com/strugee/fulldom-server

Proxy-like server that will show you the DOM of a page after JS runs

daemon dom hacktoberfest nodejs scraping scraping-websites server

Last synced: 15 Apr 2024

https://github.com/RuthGnz/SpyScrap

CLI and GUI for OSINT. Are you very exhibited on the Internet? Check it! Twitter, Tinder, Facebook, Google, Yandex, BOE. It uses facial recognition to provide more accurate results.

boe cli docker facebook facial-recognition google osint reverse-image-search scraping-websites social-networks socmint tinder twitter