Projects in Awesome Lists tagged with web-scraper

https://github.com/getmaxun/maxun

🔥 Open-source no-code web data extraction platform. Turn websites to APIs and spreadsheets with no-code robots in minutes! [In Beta]

agents api automation browser browser-automation data-extraction no-code no-code-web-scraper playwright robotic-process-automation rpa scraper self-hosted web-agent web-automation web-scraper web-scraping web-scraping-agent webscraping website-to-api

Last synced: 02 Jan 2025

https://github.com/php-curl-class/php-curl-class

PHP Curl Class makes it easy to send HTTP requests and integrate with web APIs

api api-client class client curl framework http http-client http-proxy json php php-curl php-curl-library proxy requests restful web-scraper web-scraping web-service xml

Last synced: 30 Dec 2024

https://github.com/arpit-omprakash/100projectsofcode

A list of practical knowledge-building projects.

c cpp11 csharp java javascript music-player processing programming projects python search-engine web-scraper

Last synced: 03 Jan 2025

https://github.com/arpit-omprakash/100ProjectsOfCode

A list of practical knowledge-building projects.

c cpp11 csharp java javascript music-player processing programming projects python search-engine web-scraper

Last synced: 08 Nov 2024

https://github.com/anaskhan96/soup

Web Scraper in Go, similar to BeautifulSoup

beautifulsoup go golang html-node web-scraper webscraper webscraping

Last synced: 02 Jan 2025

https://github.com/d4vinci/scrapling

Undetectable, Lightning-Fast, and Adaptive Web Scraping for Python

ai ai-scraping automation crawler crawling crawling-python data data-extraction hacktoberfest playwright python python3 scraping selectors stealth web-scraper web-scraping web-scraping-python webscraping xpath

Last synced: 29 Dec 2024

https://github.com/dipu-bd/lightnovel-crawler

Generate and download e-books from online sources.

console-app discord kindle-books lightnovel lightnovel-crawler python telegram termux web-scraper

Last synced: 31 Dec 2024

https://github.com/juancarlospaco/faster-than-requests

Faster requests on Python 3

curl cython download-file faster-than-requests high-performance http-requests ndjson open-data python python-library python-requests python3 requests-toolbelt requests3 scrapy speed urllib urllib3 web-scraper web-scraping

Last synced: 02 Jan 2025

https://github.com/tholian-network/stealth

:rocket: Stealth - Secure, Peer-to-Peer, Private and Automateable Web Browser/Scraper/Proxy

anonymity browser-automation privacy-protection web-browser web-filter web-proxy web-scraper

Last synced: 29 Dec 2024

https://github.com/gosom/google-maps-scraper

scrape data data from Google Maps. Extracts data such as the name, address, phone number, website URL, rating, reviews number, latitude and longitude, reviews,email and more for each place

distributed-scraper distributed-scraping golang google-maps google-maps-scraping web-scraper web-scraping

Last synced: 05 Nov 2024

https://github.com/oshan96/monkey-dl

Bulk download your favourite anime episodes from your favourite anime websites

4anime 9anime anime anime-downloader anime-fans anime-scraper anime-search animepahe animepahe-downloader animeultima ffmpeg hls-downloader monkey-dl web-scraper

Last synced: 03 Jan 2025

https://github.com/Oshan96/monkey-dl

Bulk download your favourite anime episodes from your favourite anime websites

4anime 9anime anime anime-downloader anime-fans anime-scraper anime-search animepahe animepahe-downloader animeultima ffmpeg hls-downloader monkey-dl web-scraper

Last synced: 20 Nov 2024

https://github.com/postmodern/spidr

A versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.

crawler ruby scraper spider spider-links web web-crawler web-scraper web-scraping web-spider

Last synced: 02 Jan 2025

https://github.com/k0rnh0li0/onlyfans-dl

OnlyFans content downloader

media-downloader onlyfans python web-scraper

Last synced: 29 Oct 2024

https://github.com/je-suis-tm/web-scraping

Detailed web scraping tutorials for dummies with financial data crawlers on Reddit WallStreetBets, CME (both options and futures), US Treasury, CFTC, LME, MacroTrends, SHFE and alternative data crawlers on Tomtom, BBC, Wall Street Journal, Al Jazeera, Reuters, Financial Times, Bloomberg, CNN, Fortune, The Economist

bloomberg data-scraper data-scraping financial-data financial-times futures futures-historical-data news-scraper news-websites newsletter options-data python-web-scraper reuters scrapper sraping wall-street-journal wallstreetbets web-scraper web-scrapers web-scraping

Last synced: 03 Jan 2025

https://github.com/gildas-lormeau/single-file-cli

CLI tool for saving a faithful copy of a complete web page in a single HTML file (based on SingleFile)

archiving cli crawler deno dockerfile nodejs scraping-websites single-file web-archiving web-crawler web-scraper web-scraping

Last synced: 03 Jan 2025

https://github.com/cassidoo/scrapers

A list of scrapers from around the web.

list scrape-websites scraper web-scraper

Last synced: 10 Dec 2024

https://github.com/oxylabs/quick-start-guide

Python quick start guides to get the most out of Oxylabs' Web Scraper API free trial.

oxylabs scraper scraper-api scraper-python scrapers scraping scraping-websites web-scraper web-scraping

Last synced: 17 Nov 2024

https://github.com/spekulatius/phpscraper

A universal web-util for PHP.

beautifulsoup chromium headless-chrome php php-crawler php-scraper php-spider php-spiders puppeteer pyppeteer scraper scraping scraping-websites scrapy web-scraper web-scraping

Last synced: 03 Jan 2025

https://github.com/spekulatius/PHPScraper

A universal web-util for PHP.

beautifulsoup chromium headless-chrome php php-crawler php-scraper php-spider php-spiders puppeteer pyppeteer scraper scraping scraping-websites scrapy web-scraper web-scraping

Last synced: 25 Oct 2024

https://github.com/AlexMathew/scrapple

A framework for creating semi-automatic web content extractors

beautifulsoup crawler css-selector extractor lxml python scrapers scraping scrapy selector selector-expression tutorial web-scraper web-scraping xpath-expression

Last synced: 31 Oct 2024

https://github.com/austinoboyle/scrape-linkedin-selenium

`scrape_linkedin` is a python package that allows you to scrape personal LinkedIn profiles & company pages - turning the data into structured json.

linkedin python scrape scraper scraping selenium selenium-webdriver web-scraper web-scraping

Last synced: 04 Jan 2025

https://github.com/shaikhsajid1111/social-media-profile-scrapers

Fetch user's data across social media

facebook-scraper instagram-scraper medium-scraper pinterest pinterest-scrapper python quora-scraper reddit-scraper request scrapping-python selenium-python social-media tiktok-scraper twitter-scraper web-scraper web-scraping

Last synced: 04 Jan 2025

https://github.com/jaebradley/basketball_reference_web_scraper

NBA Stats API via Basketball Reference

basketball-reference nba python web-scraper web-scraping

Last synced: 03 Jan 2025

https://github.com/oxylabs/how-to-scrape-google-scholar

A guide for extracting titles, authors, and citations from Google Scholar using Python and Oxylabs SERP Scraper API.

google-scholar google-scholar-scraper google-scholar-scrapper google-search-scraper python python-scraper scraper-api web-scraper web-scraping

Last synced: 30 Dec 2024

https://github.com/crwlrsoft/crawler

Library for Rapid (Web) Crawler and Scraper Development

crawler crawling hacktoberfest php scraper scraping scraping-websites web-crawler web-crawling web-scraper web-scraping

Last synced: 04 Jan 2025

https://github.com/paulpierre/markdown-crawler

A multithreaded 🕸️ web crawler that recursively crawls a website and creates a 🔽 markdown file for each page, designed for LLM RAG

html-to-markdown html-to-markdown-converter html2md llm llmops markdown markdown-crawler markdown-parser markdown-scraper md-crawler rag web-scraper

Last synced: 04 Jan 2025

https://github.com/oxylabs/web-unblocker

Free trial Web Unblocker - an AI-powered proxy solution that can bypass even the most sophisticated anti-bot systems.

amazon-captcha bypass bypasscaptcha captcha captcha-breaking captcha-bypass captcha-solving rotate-captcha school-unblocker unblocked-websites unblocker unblocker-website unblocker-websites web-proxy-server web-scraper web-scraping-api web-unblocker webiste-unblocker-github website-unblocker website-unlocker

Last synced: 29 Dec 2024

https://github.com/phantominsights/summarizer

A Reddit bot that summarizes news articles written in Spanish or English. It uses a custom built algorithm to rank words and sentences.

nlp praw python3 reddit-bot spacy web-scraper wordcloud

Last synced: 01 Jan 2025

https://github.com/PhantomInsights/summarizer

A Reddit bot that summarizes news articles written in Spanish or English. It uses a custom built algorithm to rank words and sentences.

nlp praw python3 reddit-bot spacy web-scraper wordcloud

Last synced: 12 Nov 2024

https://github.com/epiqueras/getsy

A simple browser/client-side web scraper.

browser client-side scraper web-scraper

Last synced: 01 Jan 2025

https://github.com/shaikhsajid1111/facebook_page_scraper

Scrapes facebook's pages front end with no limitations & provides a feature to turn data into structured JSON or CSV

csv facebook facebook-apis facebook-page facebook-page-post facebook-page-post-scraper facebook-page-scraper facebook-scraper fb fb-scrapper hacktoberfest open-source python scraper selenium social-media web-scraper web-scraping

Last synced: 03 Jan 2025

https://github.com/senzmaki/senpwai

A desktop app for tracking and batch downloading anime

anime anime-downloader anime-scraper anime-tracker automation batch-downloader beautifulsoup bulk-downloader cross-platform pyqt pyqt6-desktop-application python tracker web-scraper

Last synced: 29 Dec 2024

https://github.com/wikimedia/html-metadata

MetaData html scraper and parser for Node.js (supports Promises and callback style)

javascript metadata-extraction metadata-extractor node-module nodejs web-scraper web-scraping

Last synced: 04 Jan 2025

https://github.com/suntong/cascadia

Go cascadia package command line CSS selector

cascadia command-line command-line-tool css-selector csv-table curl extract html-source html-text tsv web-scraper web-scraping

Last synced: 19 Nov 2024

https://github.com/areed1192/python-sec

A simple python library that allows for easy access of the SEC website so that someone can parse filings, collect data, and query documents.

finance python sec securities-and-exchange-commission web-scraper

Last synced: 01 Jan 2025

https://github.com/fytex/instagram-giveaways-winner

Instagram Bot which when given a post url will spam mentions to increase the chances of winning. Win Instagram Giveaways!

cheating comments giveaways hacktoberfest hacktoberfest2020 instagram instagram-bot instagram-scraper instagram-script macros mentions posts python python3 selenium selenium-python selenium-webdriver spam web-scraper winning

Last synced: 11 Oct 2024

https://github.com/passivebot/facebook-marketplace-scraper

This repository contains a script to scrape Facebook Marketplace data using Playwright, BeautifulSoup and Streamlit.

database facebook facebook-marketing-automation facebook-marketplace playwright playwright-python python sqlite3 web-automation web-scraper web-scraping

Last synced: 19 Nov 2024

https://github.com/khuyentran1401/top-github-scraper

Scape top GitHub repositories and users based on keywords

github github-api python scraping web-scraper web-scraping

Last synced: 19 Dec 2024

https://github.com/scrapehero/yellowpages-scraper

Yellowpages.com Web Scraper written in Python and LXML to extract business details available based on a particular category and location.

business-directory extract html lxml parsing python scraper web-scraper yellow-pages yellow-pages-scraper

Last synced: 04 Nov 2024

https://github.com/oxylabs/playwright-web-scraping

A tutorial for web scraping using Playwright headless browser

playwright web-scraper web-scraping

Last synced: 17 Nov 2024

https://github.com/ankitmathur3193/song-cli

A command line interface for downloading Bollywood and punjabi songs

bollywood download-songs hollywood mp3 mr-jatt music music-download music-download-script python python-script singer song-download song-download-script song-downloader song-pypi songs top-songs tqdm web-scraper youtube

Last synced: 01 Nov 2024

https://github.com/D4Vinci/Scrapling

Lightning-Fast, Adaptive Web Scraping for Python

automation crawler crawling crawling-python css dom-manipulation hacktoberfest lxml playwright python python3 scraping selectors selenium stealth web-scraper web-scraping web-scraping-python webscraping xpath

Last synced: 18 Nov 2024

https://github.com/sanjaysunil/email-scraper

Generate thousands of temporary emails within seconds!

automation email email-generator email-scraper email-scrapping email-service python scrape scraper temp-email temporary web-scraper web-scraping

Last synced: 10 Nov 2024

https://github.com/cobalt-uoft/uoft-scrapers

Public web scraping scripts for the University of Toronto.

open-data toronto uoft web-scraper

Last synced: 03 Nov 2024

https://github.com/nasdin/videorecognition-realtime-autotrainer-alerts

State of the art object detection in real-time using YOLOV3 algorithm. Augmented with a process that allows easy training of the classifier as a plug & play solution . Provides alert if an item in an alert list is detected.

alerts automatic convolutional-neural-networks darknet deep-learning google-image-search image-processing image-recognition machine-learning object-detection real-time tensorflow video-recognition web-scraper webcam webscraping yolo yolo2 yolov2 yolov2-model

Last synced: 07 Nov 2024

https://github.com/serpapi/public-roadmap

Public Roadmap for SerpApi, LLC (https://serpapi.com)

baidu-scraper google-image-scraper google-maps-scraping google-search-scraper scraper scraping serp-api serpapi web-scraper web-scraping webscraping yahoo-scraper

Last synced: 20 Nov 2024

https://github.com/jlospinoso/abrade

A fast Web API scraper written in C++ and built on Boost ASIO

boost-asio boost-beast cpp web-scraper

Last synced: 30 Oct 2024

https://github.com/mawrkus/jason-the-miner

⛏ A versatile Web scraper for Node.js

crawler crawling javascript scraper scraping web-scraper

Last synced: 13 Nov 2024

https://github.com/janchaloupka/web-scraper-nabidek-pronajmu

Nástroj pro hlídání nových nabídek nemovitostí na populárních realitních serverech. Nabídky jsou vypisovány do Discord roomky.

apartment-finder discord discord-bot docker python renting web-scraper

Last synced: 06 Nov 2024

https://github.com/milahu/opensubtitles-scraper

scrape subtitles from opensubtitles.org

opensubtitles subtitles web-scraper

Last synced: 13 Dec 2024

https://github.com/jetkai/proxy-scraper

This is an application that scrapes various Proxy API Endpoints, then compiles the proxies into files within the "/proxies/" directory.

exe gradle httpclient jackson-json jar java jdk11 kotlin launch4j proxies proxy proxy-scrape proxy-scraper scraper scraping selenium-java web-scraper web-scraping

Last synced: 30 Dec 2024

https://github.com/phantominsights/reddit-bots

A collection of Reddit bots that I use to enhance the subreddits I manage.

beautifulsoup praw python3 reddit-bot requests rss web-scraper

Last synced: 11 Nov 2024

https://github.com/milahu/aiohttp_chromium

aiohttp-like interface to chromium. based on selenium_driverless to bypass cloudflare

aiohttp asyncio bypass-cloudflare chromium gui-scripting headful-chromium headful-scraper headful-web-scraper headful-webscraper selenium-driverless web-scraper web-scraping

Last synced: 12 Oct 2024

https://github.com/phantominsights/mexican-jobs-2020

Data ETL & Analysis on thousands of job listings from the official Mexican job board (2020 edition).

hacktoberfest job-offers lxml pandas plotly python3 selenium web-scraper

Last synced: 11 Nov 2024

https://github.com/michaeluno/php-simple-web-scraper

A PHP application which runs on Heroku and dumps web site outputs including JavaScript generated contents.

cross-domain cross-domain-request cross-domain-solution cross-origin cross-origin-resource-sharing cross-site cross-site-scripting crowler heroku heroku-application phantomjs php proxy scraper web-scraper

Last synced: 20 Oct 2024

https://github.com/azogue/esiosdata

Web Scraper para datos de demanda, producción y coste de la energía eléctrica en España, y simulador de facturación eléctrica según el PVPC

energy energy-monitor esios python-3 scraper web-scraper

Last synced: 09 Nov 2024

https://github.com/PhantomInsights/tweet-transcriber

A Reddit bot that transcribes tweets from comments and submissions links, mirrors their images and replies back with a formatted Markdown message.

beautifulsoup imgur praw python3 reddit-bot web-scraper

Last synced: 12 Nov 2024

https://github.com/phantominsights/tweet-transcriber

A Reddit bot that transcribes tweets from comments and submissions links, mirrors their images and replies back with a formatted Markdown message.

beautifulsoup imgur praw python3 reddit-bot web-scraper

Last synced: 11 Nov 2024

https://github.com/shobrook/git-pull

Parallelized web scraper for Github

github github-api github-scraper parallel scraper web-scraper

Last synced: 28 Oct 2024

https://github.com/omkarcloud/botasaurus-starter

🚀 OFFICIAL STARTER TEMPLATE FOR BOTASAURUS SCRAPING FRAMEWORK 🤖

beautifulsoup crawler crawling crawling-framework crawling-python crawling-tool headless node-crawler python-crawler scraper scraping scraping-framework scraping-python scraping-tool selenium web-crawler web-crawling web-scraper web-scraping webscraping

Last synced: 08 Nov 2024

https://github.com/nuzulul/telegram-scraper

A simple Telegram channel scraper

bot html-parser scraper telegram telegram-bot telegram-channel-scraper telegram-channel-to-json telegram-scraper telegram-to-json web-scraper web-scraping

Last synced: 12 Nov 2024

https://github.com/dotnize/moodle-scrape

Easily scrape data from Moodle LMS sites

javascript lms moodle moodle-scrape moodle-scraper nodejs scraper web-scraper webscraper

Last synced: 10 Dec 2024

https://github.com/earowang/rwalkr

R package to provide API to Melbourne pedestrian data

r web-scraper

Last synced: 14 Oct 2024

https://github.com/yasharmaster/fk-review-scraper

Python script for scraping product reviews

beautifulsoup python python-script scraping-product-reviews selenium selenium-webdriver web-scraper

Last synced: 18 Nov 2024

https://github.com/omkarcloud/omkar-temp-mail

🚀 OMKAR TEMP MAIL HELPS YOU USE TEMPORARY EMAILS. 🤖

10minute 10minutemail beautifulsoup crawling disposable-email disposable-email-addresses free-mail mail-api scraper scraping scraping-framework selenium temp-mail tempmail temporary-email web-crawler web-crawling web-scraper web-scraping webscraping

Last synced: 27 Oct 2024

https://github.com/deep5050/abosar

অবসর 📚 A collection of short Bengali stories web scraped from various Bengali eMagazines and eNewspapers.

bengali cron-jobs stories web-scraper web-scraping webcrawler

Last synced: 09 Nov 2024

https://github.com/Alnyz/NewApkPure

Search and download applications from apkpure.com

android api api-wrapper apkpure apkpure-scraper downloader python python3 scraping web web-scraper web-scraping wrapper

Last synced: 20 Nov 2024

https://github.com/thewebscraping/tls-requests

TLS Requests is a powerful Python library for secure HTTP requests, offering browser-like TLS fingerprinting, anti-bot page bypass, and high performance.

anti-bot anti-bot-detection anti-bot-page captcha-bypass cf-clearance cloudflare cloudflare-bypass cloudflare-scraper cloudflare-turnstile python-tls-client scraper tls tls-client tls-fingerprinting tls-requests tls-web-scraping web-crawling web-scraper web-scraping web-scraping-api

Last synced: 31 Dec 2024

https://github.com/faheel/youtube-scraper-api

A web API that scrapes a YouTube video's data and returns it as JSON

api json json-api python python3 scraper web-scraper youtube youtube-data

Last synced: 12 Oct 2024

https://github.com/metalwarrior665/actor-rust-scraper

Experimental scraper in Rust suited for running locally or on the Apify platform. Inspired by Apify SDK.

apify rust web-scraper

Last synced: 30 Dec 2024

https://github.com/knlnks/uber_eats_scraper

An Uber Eats scraper written in python.

python restaurant selenium uber-eats uber-eats-scraper web-scraper

Last synced: 12 Nov 2024

https://github.com/metalwarrior665/actor-article-extractor-smart

Combines Apify's crawling system and article parsing with unfluff library.

actor apify article-extractor scraper web-scraper

Last synced: 30 Dec 2024

https://github.com/j4asper/dmr.py

Pull data from the danish vehicle registry with dmr.py

denmark dmr motorregister nummerplade python python-library python3 web-scraper

Last synced: 25 Nov 2024

https://github.com/sgtfloyd/mtg-db

Ruby gem containing structured data for all Magic: The Gathering cards

card-database magic-the-gathering mtg ruby-gem web-scraper

Last synced: 22 Dec 2024

https://github.com/oxylabs/golang-web-scraper

A tutorial for building a web scraper in Golang

go golang url-scraper web-scraper web-scraping

Last synced: 17 Nov 2024

https://github.com/leviobrabo/g1-news-scraping

Web scraping do site G1 News

g1 g1-news globo globo-news news scraper scraping scraping-websites web-scraper web-scraping

Last synced: 20 Nov 2024

https://github.com/jlumbroso/princeton-scraper-seas-faculty

This is a web scraper that produces publicly accessible, static JSON feeds directly and automatically from the public SEAS directory website.

directory faculty princeton princeton-university web-scraper

Last synced: 02 Dec 2024

https://github.com/developerjosh/nekonode-site

Watch high-quality, ad-free anime streaming on NekoNode – your ultimate anime destination!

anime anime-api anime-scraper anime-streaming api nextjs streaming-video web-scraper website

Last synced: 16 Nov 2024

https://github.com/vmussa/sdbase

Scripts Básicos para Sociologia Digital

computational-social-science computational-sociology digital-humanities digital-sociology qualitative-analysis qualitative-data-analysis qualitative-research web-scraper

Last synced: 13 Oct 2024

https://github.com/deadsec-security/easy-scraper

Create easy workflows for web scraping using the web and drag and drop features. Making scraping easy and fast!

docker easy-to-use selfhostable selfhosted web-scraper web-scraping web-scraping-software web-scrapper-python

Last synced: 22 Oct 2024

https://github.com/palahsu/ScraperImages

Simple Web Images Scraper From Websites!

google-photos-download google-scraper google-scraping image-scraper imagescraper imagescraping scraper scraping scraping-python web-scraper webscraper webscraper-website

Last synced: 20 Nov 2024

https://github.com/anlisha-maharjan/laravel-web-scraping

Web Scraping With PHP. A Laravel REST API to fetch content of any website.

laravel8 php php-crawler spatie-crawler web-scraper

Last synced: 11 Oct 2024

https://github.com/amey-thakur/covid19-web-scraper

Scraping and Visualising India's Real-Time COVID-19 Data From The MOHFW Dataset.

amey ameythakur bar-plots beautifulsoup beautifulsoup4 bs4 covid-19 covid19-data covid19-graph covid19-live-tracker covid19india donut-chart prettytable project python python3 web-scraper web-scraping

Last synced: 09 Nov 2024

https://github.com/palahsu/scraperimages

Simple Web Images Scraper From Websites!

google-photos-download google-scraper google-scraping image-scraper imagescraper imagescraping scraper scraping scraping-python web-scraper webscraper webscraper-website

Last synced: 10 Nov 2024

https://github.com/korniichuk/rightcall

Right call center quality assurance monitoring written in Python

aws aws-comprehend callcenter comprehend contactcenter monitoring python requestium requests selenium web-scraper web-scraping

Last synced: 16 Nov 2024

https://github.com/rija/ghost-ssg

A Docker-based pipeline to publish the content of a local Ghost 4 server as static pages.

bash blogging cli docker docker-compose ghost ghost-cms gitlab integration jamstack nginx nodejs publishing scraping-tool self-hosted static-site-generator web-scraper wget workflow

Last synced: 27 Oct 2024

https://github.com/omkarcloud/web-scraping-template

🚀 THIS WEB SCRAPING TEMPLATE PROVIDES YOU WITH A GREAT STARTING POINT WHEN CREATING WEB SCRAPING BOTS. 🤖

beautifulsoup crawler crawling crawling-framework crawling-python crawling-tool headless node-crawler python-crawler scraper scraping scraping-framework scraping-python scraping-tool selenium web-crawler web-crawling web-scraper web-scraping webscraping

Last synced: 02 Jan 2025

https://github.com/pps-22-scooby/pps-22-scooby

Scala application that allows web crawling and web scraping of web pages given as input with the use of special rules passed to it through the use of a DSL.

crawler crawlers internal-dsl scala scraper scrapers web web-crawler web-crawling web-scraper web-scrapers

Last synced: 14 Oct 2024

https://github.com/jlumbroso/princeton-scraper-cos-people

This is a web scraper that produces publicly accessible, static JSON feeds directly and automatically from the public COS directory website.

directory faculty princeton princeton-cs princeton-university web-scraper

Last synced: 02 Dec 2024

https://github.com/breadrock1/socialnetworkscraper

Web scraping is simply the process of using a social media web scraper to gather data automatically. It saves users time, effort and sometimes money since it’s an automatic process performed by bots. You could take the time to search the web for all mentions of a certain word or find all prices for a certain product, but that would take a lot of time.

facebook facebook-scraping flake8 mailru osint osint-python python python3 scraper scraping site-scraper social-network social-network-analysis twitter vk-api vkontakte web-scraper web-scraping

Last synced: 11 Nov 2024

https://github.com/omkarcloud/selenium-2captcha-recaptcha-solver-demo

🚀 FINAL CODE FOR TUTORIAL ON HOW TO SOLVE CAPTCHA IN SELENIUM USING 2CAPTCHA 🤖

2captcha captcha captcha-break captcha-breaker captcha-breaking captcha-bypass captcha-generator captcha-image captcha-library captcha-solver captcha-solving crack-captcha scraping scraping-framework selenium web-crawler web-crawling web-scraper web-scraping webscraping

Last synced: 02 Jan 2025

https://github.com/mkearney/r-bloggers

[Tweet bot] R script tweeting new links to R-bloggers posts

r r-bloggers r-rtweet tweetbot tweets twitter web-scraper

Last synced: 15 Nov 2024

https://github.com/ganevdev/actor-webdesignernews-scraper

Scraper for www.webdesignernews.com, using Apify.

actor apify scrap scraper scraping web-scraper

Last synced: 27 Oct 2024

https://github.com/beautifulmoon211/onthemarket-scraping

Web scraping tool used to extract real estate information from OnTheMarket.com, a leading property portal in the United Kingdom.

cheerio data-extraction onthemarket onthemarket-scraper real-estate requests typescript web-scraper

Last synced: 14 Nov 2024

https://github.com/madhurimarawat/web-scrapper-functions

Streamlit-based Python web scraper for text, images, and PDFs. User-friendly interface for quick data extraction from websites. Simplify your web scraping tasks effortlessly.

automation beautifulsoup complete-pdf-text-data complete-text-downloader image-downloader-python pdf-data-extraction pdf-downloader python requests streamlit-deployment streamlit-webapp text-data-website text-file-rendering user-input-link web-scraper web-scraping web-scraping-automated web-scraping-functions zip-file-download zip-file-rendering