Projects in Awesome Lists tagged with scrapping-python
A curated list of projects in awesome lists tagged with scrapping-python .
https://github.com/shaikhsajid1111/social-media-profile-scrapers
Fetch user's data across social media
facebook-scraper instagram-scraper medium-scraper pinterest pinterest-scrapper python quora-scraper reddit-scraper request scrapping-python selenium-python social-media tiktok-scraper twitter-scraper web-scraper web-scraping
Last synced: 05 Apr 2025
https://github.com/lkuffo/web-scraping
Más de 50 ejemplos de web scraping utilizando: Requests | Scrapy | Selenium | LXML | BeautifulSoup
beautifulsoup beautifulsoup4 lxml-etree scraping scraping-python scraping-websites scrapping-python scrapy scrapy-crawler scrapy-spider selenium selenium-python selenium-webdriver web-scraping webscraping
Last synced: 07 Apr 2025
https://github.com/luismr/the-pudim-hunter
The Pudim Hunter 🍮 is a Proof of Concept (PoC) tool to scrape job listings from SimplyHired, analyze them against your resume, and assign a relevance score. Get insights into how well each job matches your skills. Automate your job search smarter! 🚀
openai openai-api playwright playwright-python scrapper scrapping scrapping-python
Last synced: 15 Apr 2025
https://github.com/suman-kr/facebook-automation
:gem: Facebook login Automation using Selenium webdriver
automation facebook python scrapping scrapping-python selenium
Last synced: 17 Dec 2024
https://github.com/abougouffa/arabic-fonts-scraper
A simple script to download all Arabic fonts from the arfonts.net website
arabic arabic-font arabic-language fonts scrapper scrapping-python
Last synced: 14 Apr 2025
https://github.com/adityajn105/cricket_data_extracter
A set of python scripts to extract cricket data from https://cricbuzz.com for analytics purpose.
cricket cricket-data cricket-data-extracter cricket-stats python python-scripts scrapping scrapping-python
Last synced: 16 Jan 2025
https://github.com/rexshijaku/facebookpageaboutscrapper
Scrappes About section of any Facebook Page
facebook facebook-page facebook-scraper facebook-scraping scraper scrapping scrapping-python
Last synced: 15 Apr 2025
https://github.com/volkansah/the_extractor.py
The Extractor is a Python script that extracts Google dorks from the official Google Hacking Database (GHDB) XML file and saves them in a CSV file. The script only extracts dorks that contain the "inurl:" operator because they are more specific and useful for targeted web scanning.
attack black-hat-python cybersecurity example-codes-python ghdb google-hacking-database hacking hacking-exploits hacking-tool hacking-tools information-gathering infosec payload penetration-testing pentest pentesting python redteam-tools scrapper-script scrapping-python
Last synced: 12 Apr 2025
https://github.com/shishirshekhar/github-web-scrapping
This app scrap the GitHub and show summary table of all the trading repositories.
beatifulsoup4 beautifulsoup github python python3 scraping scrapping-python streamlit
Last synced: 20 Feb 2025
https://github.com/zadigo/zineb
An advanced web scrapping framework for Python
python python3 scrapping scrapping-python web-scraping
Last synced: 07 May 2025
https://github.com/guanana/vanir
Django Crypto Manager easily expandable with custom functionalities
crypto crypto-bot crypto-portfolio crypto-portfolio-management cryptocurrency django portfolio portfolio-website python scrapping-python tracker tracking
Last synced: 04 Apr 2025
https://github.com/berru-g/price-comparator
Scrapping e-commerce
aliexpress amazon ebay price-comparison-calculator price-comparison-website price-comparisons scrapping-python work-in-progress
Last synced: 22 Mar 2025
https://github.com/naveenaidu/ieee-webscraper
A cli Scrapping tool to gather all the IEEE Journals
beautifulsoup ieee scrapping-python webscraper webscraper-website
Last synced: 11 Mar 2025
https://github.com/ali7haider/classification_of_documents_using_graph-based-features_and_knn_gt
Classification of Documents Using Graph-Based Features and KNN This project offers hands-on experience with graph theory and machine learning, fostering skills in data representation, algorithm implementation, and analytical thinking in the context of document classification.
document-classification graph-construction graph-theory knn-classification machine-learning scrapping-python
Last synced: 23 Feb 2025
https://github.com/charleslf2/pixel_collect
simple google image scrapper for your computer vision tasks
computer dataset dataset-generation images scrapping scrapping-python
Last synced: 01 Mar 2025
https://github.com/osiris-unnefer/backup_scrapper
Backup downloader / scrapper
backup french pentest pentesting python scrapper scrapping scrapping-python web website
Last synced: 29 Mar 2025
https://github.com/whoisjayd/imdb-scrapper
IMDB Movie Scrapping
dataset imdb python scrape scrapping scrapping-python scrappy selenium selenium-python tmdb
Last synced: 06 Apr 2025
https://github.com/nevilparmar11/webscrapping
Web scrapping tool built in selenium and BeautifulSoup.
beautifulsoup4 python3 scrapping-python selenium
Last synced: 28 Feb 2025
https://github.com/jisha-tr/web-scrapping-imdb
Web Scrapping Imdb Top 250 using BeautifulSoup
data-science hacktoberfest python scrapping scrapping-python web
Last synced: 04 Mar 2025
https://github.com/loglux-lab/usedcarsni
Used Cars NI catalogue scrapping tool
Last synced: 25 Feb 2025
https://github.com/wildonion/checrapper
a dead simple scrapper bot to check that if there is an appointment is available in site or not, alert user if there is a new one
Last synced: 11 Apr 2025
https://github.com/wahidpanda/scrappy-ai
Studio Dev Hub-Technical Test for Machine Learning Engineer
amazonscraping llm machine-learning scrapper-bot scrapper-script scrapping-python streamlit webapp webscrapping
Last synced: 15 Mar 2025
https://github.com/zadigo/zacoby
A web driver for Python with a focus for data mining and web scrapping
data-mining data-science scrapping scrapping-python webdriver
Last synced: 07 May 2025
https://github.com/zadigo/wta_scrapper
A scrapper that retrieves matches from the a player's WTA page
beautifulsoup4 data-mining python scraper scrapping-python sports tennis
Last synced: 07 May 2025
https://github.com/hmshb/scraping-agent-ai
AI-powered web scraping agent built with LangGraph, LangSmith, Firecrawl, and Anthropic AI. Automates intelligent crawling, structured data extraction, and LLM-powered content formatting. Efficiently handles anti-bot mechanisms, error recovery, and batch processing. 🚀
agentic-ai ai ai-agent ai-agents anthropic-claude bots firecrawl generative-ai langchain langgraph llms nlp scraper scrapping-php scrapping-python web web-scraper web-scraping workflow
Last synced: 27 Mar 2025
https://github.com/sonwaneshivani/webscrapper
Flask Application
css flask html mongodb python scrapping-data scrapping-python website
Last synced: 04 Mar 2025
https://github.com/loglux/blackbincollection
The script takes the next Black Bin collection date information from the Belfast City Hall Website and then publishes the event in your Outlook Calendar.
automation belfast bin-collection-app docker outlook-calendar scrapping-python selenium selenium-grid
Last synced: 25 Feb 2025
https://github.com/muhfalihr/pygdtelebot
PyGDTelebot is a versatile Telegram bot meticulously crafted in Python programming language, designed to streamline the process of sharing media content from Instagram to Telegram. This bot offers users a unique experience by simplifying the interaction between Instagram and Telegram.
instagram-api python3 scraping scrapping-python telegram-bot telegram-bot-api
Last synced: 06 Apr 2025
https://github.com/felipemuller20/candidates-data-search
Projeto desenvolvido em Python que busca informações de candidatos aprovados em uma universidade e salva os dados num banco de dados MySQL.
mysql-database python3 scrapping-python
Last synced: 14 May 2025
https://github.com/goran-milenkovic/scrape-american-university-sport-staff
Command line tool for scraping an different American University Staff Directory pages and printing out the staff contact information for the specified sport
beautifulsoup beautifulsoup4 python3 requests scrapper scrapping-python
Last synced: 26 Feb 2025
https://github.com/nedu-m/pywebscan
Python web scrapper - returns searches from Google with some keyed-in paramenters.
python scrapper-script scrapping-python
Last synced: 06 Mar 2025
https://github.com/joannescode/soup-fakepyjobs
beautifulsoup4 python requests scrapping-python webscraping
Last synced: 26 Feb 2025
https://github.com/codeterrayt/code-geass_traveloai
Code-Geass_TraveloAI is a personalized travel recommendation system based on the destination, budget, special occasions, number of people, trip type, food preferences, etc. It utilizes Laravel, FastAPI, LangChain and MySQL for the backend and React for the frontend.
ai-project ai-web-app ai-website codegeass fastapi-project hackathon-project langchain-python laravel-project opensource-projects reactjs-project recommendation-system scrapping-python travel-app travel-recommender travel-website traveloai
Last synced: 23 Mar 2025
https://github.com/kaustubhpatil02/urls_extract
extractor nlp-machine-learning python scrapping scrapping-python
Last synced: 15 Mar 2025
https://github.com/eticialima/image-url-import
Data import from json file and create instance for products image
django python scrapping-python script-python
Last synced: 24 Mar 2025
https://github.com/codeasarjun/web-scraping
This repo contains working example for web scraping
beutifulsoup data-mining data-mining-python python scrapper scrapper-bot scrapper-script scrappers scrapping scrapping-python scripts web-mining web-scapping xpath
Last synced: 16 Mar 2025
https://github.com/ssbaraar/caninecare-ai-vet-for-dog-breeds
This repository explores innovative ways to improve dog care using advanced technologies like computer vision and machine learning. It focuses on three main areas: dog breed classification, disease prediction, and breed recommendation.
classification flask-application jupyter-notebook prediction python3 recommendation-engine scrapping-python
Last synced: 11 Mar 2025
https://github.com/gill-singh-a/github-analytics-tool
A Program made in Python, that uses requests module to fetches and analysis publically available information of Github account
beautifulsoup beautifulsoup4 git github html-parser python requests scrapping scrapping-python
Last synced: 23 Feb 2025
https://github.com/srnascimento40/python-automation-rest-api
Just some exercises for praticing Python Automation, Scrapping and rest API.
automation python rest-api scrapping scrapping-python
Last synced: 26 Mar 2025
https://github.com/dms-codes/scrape_directory_itb
ITB Directory Scraper This Python script scrapes information from the Institut Teknologi Bandung (ITB) directory and saves it to a CSV file. It uses the BeautifulSoup library to parse the HTML content of the directory pages.
beautifulsoup4 csv python requests scrapping-python webscrapping
Last synced: 12 Mar 2025
https://github.com/robertpaulp/expenseadvisor
HackITall 2023- Hackathon
chatgpt-api data-analysis data-processing python scrapping-python
Last synced: 21 Feb 2025
https://github.com/mohnish88/web-scrapping
In this project, I used web scraping tools to extract data from daraz.pk, a popular e-commerce platform. Utilizing the BeautifulSoup and Selenium libraries in Python, I was able to efficiently navigate the website, extract valuable information on product listings, prices, and reviews, and store the data for further analysis.
beautifulsoup beautifulsoup4 html-parsing python-web-scraping scrapping-python selenium selenium-python selenium-webdriver web-scraping
Last synced: 23 Feb 2025
https://github.com/mohamedbsh/an-humble-suggestion-of-a-bot-from-the-mawaqit-app-prayers-times
An humble suggestion of a Telegram&WhatsApp Bot from the mawaqit's app prayers times. Feel free to contribute !
airflow mawaqit postgresql salat-timings scrapping-python telegram-api telegram-bot whatsapp-api whatsapp-bot
Last synced: 13 Mar 2025
https://github.com/tknishh/x-username-scraper
scraping usernames from message box of X handles.
scrapping-python twitter-api x-api
Last synced: 18 Mar 2025
https://github.com/javiercorado/river-compass
Selenium based amazon price scrapper
python scrapping-python selenium
Last synced: 03 Mar 2025
https://github.com/toofancodes/scrapperathleticscontacts
StaffScrapper is a smart and flexible web scraper built for one job: collecting staff contact details from athletics department websites. Whether you're pulling emails, job titles, or phone numbers — even from JavaScript-heavy or obfuscated sites — this tool handles it with ease. Designed for marketing teams, outreach coordinators, and data analys
beautifulsoup beautifulsoup4 python requests scrapping-python selenium selenium-webdriver webdriver-manager
Last synced: 11 Apr 2025