Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists tagged with web-scraper
A curated list of projects in awesome lists tagged with web-scraper .
https://github.com/getmaxun/maxun
π₯ Open-source no-code web data extraction platform. Turn websites to APIs and spreadsheets with no-code robots in minutes! [In Beta]
agents api automation browser browser-automation data-extraction no-code no-code-web-scraper playwright robotic-process-automation rpa scraper self-hosted web-agent web-automation web-scraper web-scraping web-scraping-agent webscraping website-to-api
Last synced: 02 Jan 2025
https://github.com/php-curl-class/php-curl-class
PHP Curl Class makes it easy to send HTTP requests and integrate with web APIs
api api-client class client curl framework http http-client http-proxy json php php-curl php-curl-library proxy requests restful web-scraper web-scraping web-service xml
Last synced: 30 Dec 2024
https://github.com/arpit-omprakash/100projectsofcode
A list of practical knowledge-building projects.
c cpp11 csharp java javascript music-player processing programming projects python search-engine web-scraper
Last synced: 03 Jan 2025
https://github.com/arpit-omprakash/100ProjectsOfCode
A list of practical knowledge-building projects.
c cpp11 csharp java javascript music-player processing programming projects python search-engine web-scraper
Last synced: 08 Nov 2024
https://github.com/anaskhan96/soup
Web Scraper in Go, similar to BeautifulSoup
beautifulsoup go golang html-node web-scraper webscraper webscraping
Last synced: 02 Jan 2025
https://github.com/d4vinci/scrapling
Undetectable, Lightning-Fast, and Adaptive Web Scraping for Python
ai ai-scraping automation crawler crawling crawling-python data data-extraction hacktoberfest playwright python python3 scraping selectors stealth web-scraper web-scraping web-scraping-python webscraping xpath
Last synced: 29 Dec 2024
https://github.com/dipu-bd/lightnovel-crawler
Generate and download e-books from online sources.
console-app discord kindle-books lightnovel lightnovel-crawler python telegram termux web-scraper
Last synced: 31 Dec 2024
https://github.com/juancarlospaco/faster-than-requests
Faster requests on Python 3
curl cython download-file faster-than-requests high-performance http-requests ndjson open-data python python-library python-requests python3 requests-toolbelt requests3 scrapy speed urllib urllib3 web-scraper web-scraping
Last synced: 02 Jan 2025
https://github.com/tholian-network/stealth
:rocket: Stealth - Secure, Peer-to-Peer, Private and Automateable Web Browser/Scraper/Proxy
anonymity browser-automation privacy-protection web-browser web-filter web-proxy web-scraper
Last synced: 29 Dec 2024
https://github.com/gosom/google-maps-scraper
scrape data data from Google Maps. Extracts data such as the name, address, phone number, website URL, rating, reviews number, latitude and longitude, reviews,email and more for each place
distributed-scraper distributed-scraping golang google-maps google-maps-scraping web-scraper web-scraping
Last synced: 05 Nov 2024
https://github.com/oshan96/monkey-dl
Bulk download your favourite anime episodes from your favourite anime websites
4anime 9anime anime anime-downloader anime-fans anime-scraper anime-search animepahe animepahe-downloader animeultima ffmpeg hls-downloader monkey-dl web-scraper
Last synced: 03 Jan 2025
https://github.com/Oshan96/monkey-dl
Bulk download your favourite anime episodes from your favourite anime websites
4anime 9anime anime anime-downloader anime-fans anime-scraper anime-search animepahe animepahe-downloader animeultima ffmpeg hls-downloader monkey-dl web-scraper
Last synced: 20 Nov 2024
https://github.com/postmodern/spidr
A versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.
crawler ruby scraper spider spider-links web web-crawler web-scraper web-scraping web-spider
Last synced: 02 Jan 2025
https://github.com/k0rnh0li0/onlyfans-dl
OnlyFans content downloader
media-downloader onlyfans python web-scraper
Last synced: 29 Oct 2024
https://github.com/je-suis-tm/web-scraping
Detailed web scraping tutorials for dummies with financial data crawlers on Reddit WallStreetBets, CME (both options and futures), US Treasury, CFTC, LME, MacroTrends, SHFE and alternative data crawlers on Tomtom, BBC, Wall Street Journal, Al Jazeera, Reuters, Financial Times, Bloomberg, CNN, Fortune, The Economist
bloomberg data-scraper data-scraping financial-data financial-times futures futures-historical-data news-scraper news-websites newsletter options-data python-web-scraper reuters scrapper sraping wall-street-journal wallstreetbets web-scraper web-scrapers web-scraping
Last synced: 03 Jan 2025
https://github.com/gildas-lormeau/single-file-cli
CLI tool for saving a faithful copy of a complete web page in a single HTML file (based on SingleFile)
archiving cli crawler deno dockerfile nodejs scraping-websites single-file web-archiving web-crawler web-scraper web-scraping
Last synced: 03 Jan 2025
https://github.com/cassidoo/scrapers
A list of scrapers from around the web.
list scrape-websites scraper web-scraper
Last synced: 10 Dec 2024
https://github.com/oxylabs/quick-start-guide
Python quick start guides to get the most out of Oxylabs' Web Scraper API free trial.
oxylabs scraper scraper-api scraper-python scrapers scraping scraping-websites web-scraper web-scraping
Last synced: 17 Nov 2024
https://github.com/spekulatius/phpscraper
A universal web-util for PHP.
beautifulsoup chromium headless-chrome php php-crawler php-scraper php-spider php-spiders puppeteer pyppeteer scraper scraping scraping-websites scrapy web-scraper web-scraping
Last synced: 03 Jan 2025
https://github.com/spekulatius/PHPScraper
A universal web-util for PHP.
beautifulsoup chromium headless-chrome php php-crawler php-scraper php-spider php-spiders puppeteer pyppeteer scraper scraping scraping-websites scrapy web-scraper web-scraping
Last synced: 25 Oct 2024
https://github.com/AlexMathew/scrapple
A framework for creating semi-automatic web content extractors
beautifulsoup crawler css-selector extractor lxml python scrapers scraping scrapy selector selector-expression tutorial web-scraper web-scraping xpath-expression
Last synced: 31 Oct 2024
https://github.com/austinoboyle/scrape-linkedin-selenium
`scrape_linkedin` is a python package that allows you to scrape personal LinkedIn profiles & company pages - turning the data into structured json.
linkedin python scrape scraper scraping selenium selenium-webdriver web-scraper web-scraping
Last synced: 04 Jan 2025
https://github.com/shaikhsajid1111/social-media-profile-scrapers
Fetch user's data across social media
facebook-scraper instagram-scraper medium-scraper pinterest pinterest-scrapper python quora-scraper reddit-scraper request scrapping-python selenium-python social-media tiktok-scraper twitter-scraper web-scraper web-scraping
Last synced: 04 Jan 2025
https://github.com/jaebradley/basketball_reference_web_scraper
NBA Stats API via Basketball Reference
basketball-reference nba python web-scraper web-scraping
Last synced: 03 Jan 2025
https://github.com/oxylabs/how-to-scrape-google-scholar
A guide for extracting titles, authors, and citations from Google Scholar using Python and Oxylabs SERP Scraper API.
google-scholar google-scholar-scraper google-scholar-scrapper google-search-scraper python python-scraper scraper-api web-scraper web-scraping
Last synced: 30 Dec 2024
https://github.com/crwlrsoft/crawler
Library for Rapid (Web) Crawler and Scraper Development
crawler crawling hacktoberfest php scraper scraping scraping-websites web-crawler web-crawling web-scraper web-scraping
Last synced: 04 Jan 2025
https://github.com/paulpierre/markdown-crawler
A multithreaded πΈοΈ web crawler that recursively crawls a website and creates a π½ markdown file for each page, designed for LLM RAG
html-to-markdown html-to-markdown-converter html2md llm llmops markdown markdown-crawler markdown-parser markdown-scraper md-crawler rag web-scraper
Last synced: 04 Jan 2025
https://github.com/oxylabs/web-unblocker
Free trial Web Unblocker - an AI-powered proxy solution that can bypass even the most sophisticated anti-bot systems.
amazon-captcha bypass bypasscaptcha captcha captcha-breaking captcha-bypass captcha-solving rotate-captcha school-unblocker unblocked-websites unblocker unblocker-website unblocker-websites web-proxy-server web-scraper web-scraping-api web-unblocker webiste-unblocker-github website-unblocker website-unlocker
Last synced: 29 Dec 2024
https://github.com/phantominsights/summarizer
A Reddit bot that summarizes news articles written in Spanish or English. It uses a custom built algorithm to rank words and sentences.
nlp praw python3 reddit-bot spacy web-scraper wordcloud
Last synced: 01 Jan 2025
https://github.com/PhantomInsights/summarizer
A Reddit bot that summarizes news articles written in Spanish or English. It uses a custom built algorithm to rank words and sentences.
nlp praw python3 reddit-bot spacy web-scraper wordcloud
Last synced: 12 Nov 2024
https://github.com/epiqueras/getsy
A simple browser/client-side web scraper.
browser client-side scraper web-scraper
Last synced: 01 Jan 2025
https://github.com/shaikhsajid1111/facebook_page_scraper
Scrapes facebook's pages front end with no limitations & provides a feature to turn data into structured JSON or CSV
csv facebook facebook-apis facebook-page facebook-page-post facebook-page-post-scraper facebook-page-scraper facebook-scraper fb fb-scrapper hacktoberfest open-source python scraper selenium social-media web-scraper web-scraping
Last synced: 03 Jan 2025
https://github.com/senzmaki/senpwai
A desktop app for tracking and batch downloading anime
anime anime-downloader anime-scraper anime-tracker automation batch-downloader beautifulsoup bulk-downloader cross-platform pyqt pyqt6-desktop-application python tracker web-scraper
Last synced: 29 Dec 2024
https://github.com/wikimedia/html-metadata
MetaData html scraper and parser for Node.js (supports Promises and callback style)
javascript metadata-extraction metadata-extractor node-module nodejs web-scraper web-scraping
Last synced: 04 Jan 2025
https://github.com/suntong/cascadia
Go cascadia package command line CSS selector
cascadia command-line command-line-tool css-selector csv-table curl extract html-source html-text tsv web-scraper web-scraping
Last synced: 19 Nov 2024
https://github.com/areed1192/python-sec
A simple python library that allows for easy access of the SEC website so that someone can parse filings, collect data, and query documents.
finance python sec securities-and-exchange-commission web-scraper
Last synced: 01 Jan 2025
https://github.com/fytex/instagram-giveaways-winner
Instagram Bot which when given a post url will spam mentions to increase the chances of winning. Win Instagram Giveaways!
cheating comments giveaways hacktoberfest hacktoberfest2020 instagram instagram-bot instagram-scraper instagram-script macros mentions posts python python3 selenium selenium-python selenium-webdriver spam web-scraper winning
Last synced: 11 Oct 2024
https://github.com/passivebot/facebook-marketplace-scraper
This repository contains a script to scrape Facebook Marketplace data using Playwright, BeautifulSoup and Streamlit.
database facebook facebook-marketing-automation facebook-marketplace playwright playwright-python python sqlite3 web-automation web-scraper web-scraping
Last synced: 19 Nov 2024
https://github.com/khuyentran1401/top-github-scraper
Scape top GitHub repositories and users based on keywords
github github-api python scraping web-scraper web-scraping
Last synced: 19 Dec 2024
https://github.com/scrapehero/yellowpages-scraper
Yellowpages.com Web Scraper written in Python and LXML to extract business details available based on a particular category and location.
business-directory extract html lxml parsing python scraper web-scraper yellow-pages yellow-pages-scraper
Last synced: 04 Nov 2024
https://github.com/oxylabs/playwright-web-scraping
A tutorial for web scraping using Playwright headless browser
playwright web-scraper web-scraping
Last synced: 17 Nov 2024
https://github.com/ankitmathur3193/song-cli
A command line interface for downloading Bollywood and punjabi songs
bollywood download-songs hollywood mp3 mr-jatt music music-download music-download-script python python-script singer song-download song-download-script song-downloader song-pypi songs top-songs tqdm web-scraper youtube
Last synced: 01 Nov 2024
https://github.com/D4Vinci/Scrapling
Lightning-Fast, Adaptive Web Scraping for Python
automation crawler crawling crawling-python css dom-manipulation hacktoberfest lxml playwright python python3 scraping selectors selenium stealth web-scraper web-scraping web-scraping-python webscraping xpath
Last synced: 18 Nov 2024
https://github.com/sanjaysunil/email-scraper
Generate thousands of temporary emails within seconds!
automation email email-generator email-scraper email-scrapping email-service python scrape scraper temp-email temporary web-scraper web-scraping
Last synced: 10 Nov 2024
https://github.com/cobalt-uoft/uoft-scrapers
Public web scraping scripts for the University of Toronto.
open-data toronto uoft web-scraper
Last synced: 03 Nov 2024
https://github.com/nasdin/videorecognition-realtime-autotrainer-alerts
State of the art object detection in real-time using YOLOV3 algorithm. Augmented with a process that allows easy training of the classifier as a plug & play solution . Provides alert if an item in an alert list is detected.
alerts automatic convolutional-neural-networks darknet deep-learning google-image-search image-processing image-recognition machine-learning object-detection real-time tensorflow video-recognition web-scraper webcam webscraping yolo yolo2 yolov2 yolov2-model
Last synced: 07 Nov 2024
https://github.com/serpapi/public-roadmap
Public Roadmap for SerpApi, LLC (https://serpapi.com)
baidu-scraper google-image-scraper google-maps-scraping google-search-scraper scraper scraping serp-api serpapi web-scraper web-scraping webscraping yahoo-scraper
Last synced: 20 Nov 2024
https://github.com/jlospinoso/abrade
A fast Web API scraper written in C++ and built on Boost ASIO
boost-asio boost-beast cpp web-scraper
Last synced: 30 Oct 2024
https://github.com/mawrkus/jason-the-miner
β A versatile Web scraper for Node.js
crawler crawling javascript scraper scraping web-scraper
Last synced: 13 Nov 2024
https://github.com/janchaloupka/web-scraper-nabidek-pronajmu
NΓ‘stroj pro hlΓdΓ‘nΓ novΓ½ch nabΓdek nemovitostΓ na populΓ‘rnΓch realitnΓch serverech. NabΓdky jsou vypisovΓ‘ny do Discord roomky.
apartment-finder discord discord-bot docker python renting web-scraper
Last synced: 06 Nov 2024
https://github.com/milahu/opensubtitles-scraper
scrape subtitles from opensubtitles.org
opensubtitles subtitles web-scraper
Last synced: 13 Dec 2024
https://github.com/jetkai/proxy-scraper
This is an application that scrapes various Proxy API Endpoints, then compiles the proxies into files within the "/proxies/" directory.
exe gradle httpclient jackson-json jar java jdk11 kotlin launch4j proxies proxy proxy-scrape proxy-scraper scraper scraping selenium-java web-scraper web-scraping
Last synced: 30 Dec 2024
https://github.com/phantominsights/reddit-bots
A collection of Reddit bots that I use to enhance the subreddits I manage.
beautifulsoup praw python3 reddit-bot requests rss web-scraper
Last synced: 11 Nov 2024
https://github.com/milahu/aiohttp_chromium
aiohttp-like interface to chromium. based on selenium_driverless to bypass cloudflare
aiohttp asyncio bypass-cloudflare chromium gui-scripting headful-chromium headful-scraper headful-web-scraper headful-webscraper selenium-driverless web-scraper web-scraping
Last synced: 12 Oct 2024
https://github.com/phantominsights/mexican-jobs-2020
Data ETL & Analysis on thousands of job listings from the official Mexican job board (2020 edition).
hacktoberfest job-offers lxml pandas plotly python3 selenium web-scraper
Last synced: 11 Nov 2024
https://github.com/michaeluno/php-simple-web-scraper
A PHP application which runs on Heroku and dumps web site outputs including JavaScript generated contents.
cross-domain cross-domain-request cross-domain-solution cross-origin cross-origin-resource-sharing cross-site cross-site-scripting crowler heroku heroku-application phantomjs php proxy scraper web-scraper
Last synced: 20 Oct 2024
https://github.com/azogue/esiosdata
Web Scraper para datos de demanda, producciΓ³n y coste de la energΓa elΓ©ctrica en EspaΓ±a, y simulador de facturaciΓ³n elΓ©ctrica segΓΊn el PVPC
energy energy-monitor esios python-3 scraper web-scraper
Last synced: 09 Nov 2024
https://github.com/PhantomInsights/tweet-transcriber
A Reddit bot that transcribes tweets from comments and submissions links, mirrors their images and replies back with a formatted Markdown message.
beautifulsoup imgur praw python3 reddit-bot web-scraper
Last synced: 12 Nov 2024
https://github.com/phantominsights/tweet-transcriber
A Reddit bot that transcribes tweets from comments and submissions links, mirrors their images and replies back with a formatted Markdown message.
beautifulsoup imgur praw python3 reddit-bot web-scraper
Last synced: 11 Nov 2024
https://github.com/shobrook/git-pull
Parallelized web scraper for Github
github github-api github-scraper parallel scraper web-scraper
Last synced: 28 Oct 2024
https://github.com/omkarcloud/botasaurus-starter
π OFFICIAL STARTER TEMPLATE FOR BOTASAURUS SCRAPING FRAMEWORK π€
beautifulsoup crawler crawling crawling-framework crawling-python crawling-tool headless node-crawler python-crawler scraper scraping scraping-framework scraping-python scraping-tool selenium web-crawler web-crawling web-scraper web-scraping webscraping
Last synced: 08 Nov 2024
https://github.com/nuzulul/telegram-scraper
A simple Telegram channel scraper
bot html-parser scraper telegram telegram-bot telegram-channel-scraper telegram-channel-to-json telegram-scraper telegram-to-json web-scraper web-scraping
Last synced: 12 Nov 2024
https://github.com/dotnize/moodle-scrape
Easily scrape data from Moodle LMS sites
javascript lms moodle moodle-scrape moodle-scraper nodejs scraper web-scraper webscraper
Last synced: 10 Dec 2024
https://github.com/earowang/rwalkr
R package to provide API to Melbourne pedestrian data
Last synced: 14 Oct 2024
https://github.com/yasharmaster/fk-review-scraper
Python script for scraping product reviews
beautifulsoup python python-script scraping-product-reviews selenium selenium-webdriver web-scraper
Last synced: 18 Nov 2024
https://github.com/omkarcloud/omkar-temp-mail
π OMKAR TEMP MAIL HELPS YOU USE TEMPORARY EMAILS. π€
10minute 10minutemail beautifulsoup crawling disposable-email disposable-email-addresses free-mail mail-api scraper scraping scraping-framework selenium temp-mail tempmail temporary-email web-crawler web-crawling web-scraper web-scraping webscraping
Last synced: 27 Oct 2024
https://github.com/deep5050/abosar
ঠবসর π A collection of short Bengali stories web scraped from various Bengali eMagazines and eNewspapers.
bengali cron-jobs stories web-scraper web-scraping webcrawler
Last synced: 09 Nov 2024
https://github.com/Alnyz/NewApkPure
Search and download applications from apkpure.com
android api api-wrapper apkpure apkpure-scraper downloader python python3 scraping web web-scraper web-scraping wrapper
Last synced: 20 Nov 2024
https://github.com/thewebscraping/tls-requests
TLS Requests is a powerful Python library for secure HTTP requests, offering browser-like TLS fingerprinting, anti-bot page bypass, and high performance.
anti-bot anti-bot-detection anti-bot-page captcha-bypass cf-clearance cloudflare cloudflare-bypass cloudflare-scraper cloudflare-turnstile python-tls-client scraper tls tls-client tls-fingerprinting tls-requests tls-web-scraping web-crawling web-scraper web-scraping web-scraping-api
Last synced: 31 Dec 2024
https://github.com/faheel/youtube-scraper-api
A web API that scrapes a YouTube video's data and returns it as JSON
api json json-api python python3 scraper web-scraper youtube youtube-data
Last synced: 12 Oct 2024
https://github.com/metalwarrior665/actor-rust-scraper
Experimental scraper in Rust suited for running locally or on the Apify platform. Inspired by Apify SDK.
Last synced: 30 Dec 2024
https://github.com/knlnks/uber_eats_scraper
An Uber Eats scraper written in python.
python restaurant selenium uber-eats uber-eats-scraper web-scraper
Last synced: 12 Nov 2024
https://github.com/metalwarrior665/actor-article-extractor-smart
Combines Apify's crawling system and article parsing with unfluff library.
actor apify article-extractor scraper web-scraper
Last synced: 30 Dec 2024
https://github.com/j4asper/dmr.py
Pull data from the danish vehicle registry with dmr.py
denmark dmr motorregister nummerplade python python-library python3 web-scraper
Last synced: 25 Nov 2024
https://github.com/sgtfloyd/mtg-db
Ruby gem containing structured data for all Magic: The Gathering cards
card-database magic-the-gathering mtg ruby-gem web-scraper
Last synced: 22 Dec 2024
https://github.com/oxylabs/golang-web-scraper
A tutorial for building a web scraper in Golang
go golang url-scraper web-scraper web-scraping
Last synced: 17 Nov 2024
https://github.com/leviobrabo/g1-news-scraping
Web scraping do site G1 News
g1 g1-news globo globo-news news scraper scraping scraping-websites web-scraper web-scraping
Last synced: 20 Nov 2024
https://github.com/jlumbroso/princeton-scraper-seas-faculty
This is a web scraper that produces publicly accessible, static JSON feeds directly and automatically from the public SEAS directory website.
directory faculty princeton princeton-university web-scraper
Last synced: 02 Dec 2024
https://github.com/developerjosh/nekonode-site
Watch high-quality, ad-free anime streaming on NekoNode β your ultimate anime destination!
anime anime-api anime-scraper anime-streaming api nextjs streaming-video web-scraper website
Last synced: 16 Nov 2024
https://github.com/vmussa/sdbase
Scripts BΓ‘sicos para Sociologia Digital
computational-social-science computational-sociology digital-humanities digital-sociology qualitative-analysis qualitative-data-analysis qualitative-research web-scraper
Last synced: 13 Oct 2024
https://github.com/deadsec-security/easy-scraper
Create easy workflows for web scraping using the web and drag and drop features. Making scraping easy and fast!
docker easy-to-use selfhostable selfhosted web-scraper web-scraping web-scraping-software web-scrapper-python
Last synced: 22 Oct 2024
https://github.com/palahsu/ScraperImages
Simple Web Images Scraper From Websites!
google-photos-download google-scraper google-scraping image-scraper imagescraper imagescraping scraper scraping scraping-python web-scraper webscraper webscraper-website
Last synced: 20 Nov 2024
https://github.com/anlisha-maharjan/laravel-web-scraping
Web Scraping With PHP. A Laravel REST API to fetch content of any website.
laravel8 php php-crawler spatie-crawler web-scraper
Last synced: 11 Oct 2024
https://github.com/amey-thakur/covid19-web-scraper
Scraping and Visualising India's Real-Time COVID-19 Data From The MOHFW Dataset.
amey ameythakur bar-plots beautifulsoup beautifulsoup4 bs4 covid-19 covid19-data covid19-graph covid19-live-tracker covid19india donut-chart prettytable project python python3 web-scraper web-scraping
Last synced: 09 Nov 2024
https://github.com/palahsu/scraperimages
Simple Web Images Scraper From Websites!
google-photos-download google-scraper google-scraping image-scraper imagescraper imagescraping scraper scraping scraping-python web-scraper webscraper webscraper-website
Last synced: 10 Nov 2024
https://github.com/korniichuk/rightcall
Right call center quality assurance monitoring written in Python
aws aws-comprehend callcenter comprehend contactcenter monitoring python requestium requests selenium web-scraper web-scraping
Last synced: 16 Nov 2024
https://github.com/rija/ghost-ssg
A Docker-based pipeline to publish the content of a local Ghost 4 server as static pages.
bash blogging cli docker docker-compose ghost ghost-cms gitlab integration jamstack nginx nodejs publishing scraping-tool self-hosted static-site-generator web-scraper wget workflow
Last synced: 27 Oct 2024
https://github.com/omkarcloud/web-scraping-template
π THIS WEB SCRAPING TEMPLATE PROVIDES YOU WITH A GREAT STARTING POINT WHEN CREATING WEB SCRAPING BOTS. π€
beautifulsoup crawler crawling crawling-framework crawling-python crawling-tool headless node-crawler python-crawler scraper scraping scraping-framework scraping-python scraping-tool selenium web-crawler web-crawling web-scraper web-scraping webscraping
Last synced: 02 Jan 2025
https://github.com/pps-22-scooby/pps-22-scooby
Scala application that allows web crawling and web scraping of web pages given as input with the use of special rules passed to it through the use of a DSL.
crawler crawlers internal-dsl scala scraper scrapers web web-crawler web-crawling web-scraper web-scrapers
Last synced: 14 Oct 2024
https://github.com/jlumbroso/princeton-scraper-cos-people
This is a web scraper that produces publicly accessible, static JSON feeds directly and automatically from the public COS directory website.
directory faculty princeton princeton-cs princeton-university web-scraper
Last synced: 02 Dec 2024
https://github.com/breadrock1/socialnetworkscraper
Web scraping is simply the process of using a social media web scraper to gather data automatically. It saves users time, effort and sometimes money since itβs an automatic process performed by bots. You could take the time to search the web for all mentions of a certain word or find all prices for a certain product, but that would take a lot of time.
facebook facebook-scraping flake8 mailru osint osint-python python python3 scraper scraping site-scraper social-network social-network-analysis twitter vk-api vkontakte web-scraper web-scraping
Last synced: 11 Nov 2024
https://github.com/omkarcloud/selenium-2captcha-recaptcha-solver-demo
π FINAL CODE FOR TUTORIAL ON HOW TO SOLVE CAPTCHA IN SELENIUM USING 2CAPTCHA π€
2captcha captcha captcha-break captcha-breaker captcha-breaking captcha-bypass captcha-generator captcha-image captcha-library captcha-solver captcha-solving crack-captcha scraping scraping-framework selenium web-crawler web-crawling web-scraper web-scraping webscraping
Last synced: 02 Jan 2025
https://github.com/mkearney/r-bloggers
[Tweet bot] R script tweeting new links to R-bloggers posts
r r-bloggers r-rtweet tweetbot tweets twitter web-scraper
Last synced: 15 Nov 2024
https://github.com/ganevdev/actor-webdesignernews-scraper
Scraper for www.webdesignernews.com, using Apify.
actor apify scrap scraper scraping web-scraper
Last synced: 27 Oct 2024
https://github.com/beautifulmoon211/onthemarket-scraping
Web scraping tool used to extract real estate information from OnTheMarket.com, a leading property portal in the United Kingdom.
cheerio data-extraction onthemarket onthemarket-scraper real-estate requests typescript web-scraper
Last synced: 14 Nov 2024
https://github.com/madhurimarawat/web-scrapper-functions
Streamlit-based Python web scraper for text, images, and PDFs. User-friendly interface for quick data extraction from websites. Simplify your web scraping tasks effortlessly.
automation beautifulsoup complete-pdf-text-data complete-text-downloader image-downloader-python pdf-data-extraction pdf-downloader python requests streamlit-deployment streamlit-webapp text-data-website text-file-rendering user-input-link web-scraper web-scraping web-scraping-automated web-scraping-functions zip-file-download zip-file-rendering
Last synced: 14 Nov 2024
https://github.com/adzo261/codechef-activity-notifier
A python script to notify recent submission of a desired user on Codechef
codechef codechef-crawler competitive-programming competitive-programming-contests python3 web-scraper web-scraping windows-notifications
Last synced: 30 Nov 2024
https://github.com/aaryanrr/DownDetector-CLI
CLI Client for DownDetector.com
cli downdetector python3 web-scraper web-scraping
Last synced: 06 Nov 2024
https://github.com/dori-dev/quotes-crawler
Quotes crawler using scrapy and python.
crawler crawling python scraping-python scraping-websites scrapy scrapy-crawler scrapy-spider web-scraper
Last synced: 09 Nov 2024
https://github.com/sahilbansal17/moodletracker
This is a simple script which will check whether there are any updates on a registered moodle course and print them in the terminal.
beautiful-soup python script web-scraper
Last synced: 12 Nov 2024