An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with scrapping-python

A curated list of projects in awesome lists tagged with scrapping-python .

https://github.com/luismr/the-pudim-hunter

The Pudim Hunter 🍮 is a Proof of Concept (PoC) tool to scrape job listings from SimplyHired, analyze them against your resume, and assign a relevance score. Get insights into how well each job matches your skills. Automate your job search smarter! 🚀

openai openai-api playwright playwright-python scrapper scrapping scrapping-python

Last synced: 15 Apr 2025

https://github.com/suman-kr/facebook-automation

:gem: Facebook login Automation using Selenium webdriver

automation facebook python scrapping scrapping-python selenium

Last synced: 17 Dec 2024

https://github.com/abougouffa/arabic-fonts-scraper

A simple script to download all Arabic fonts from the arfonts.net website

arabic arabic-font arabic-language fonts scrapper scrapping-python

Last synced: 14 Apr 2025

https://github.com/adityajn105/cricket_data_extracter

A set of python scripts to extract cricket data from https://cricbuzz.com for analytics purpose.

cricket cricket-data cricket-data-extracter cricket-stats python python-scripts scrapping scrapping-python

Last synced: 16 Jan 2025

https://github.com/volkansah/the_extractor.py

The Extractor is a Python script that extracts Google dorks from the official Google Hacking Database (GHDB) XML file and saves them in a CSV file. The script only extracts dorks that contain the "inurl:" operator because they are more specific and useful for targeted web scanning.

attack black-hat-python cybersecurity example-codes-python ghdb google-hacking-database hacking hacking-exploits hacking-tool hacking-tools information-gathering infosec payload penetration-testing pentest pentesting python redteam-tools scrapper-script scrapping-python

Last synced: 12 Apr 2025

https://github.com/shishirshekhar/github-web-scrapping

This app scrap the GitHub and show summary table of all the trading repositories.

beatifulsoup4 beautifulsoup github python python3 scraping scrapping-python streamlit

Last synced: 20 Feb 2025

https://github.com/zadigo/zineb

An advanced web scrapping framework for Python

python python3 scrapping scrapping-python web-scraping

Last synced: 07 May 2025

https://github.com/naveenaidu/ieee-webscraper

A cli Scrapping tool to gather all the IEEE Journals

beautifulsoup ieee scrapping-python webscraper webscraper-website

Last synced: 11 Mar 2025

https://github.com/ali7haider/classification_of_documents_using_graph-based-features_and_knn_gt

Classification of Documents Using Graph-Based Features and KNN This project offers hands-on experience with graph theory and machine learning, fostering skills in data representation, algorithm implementation, and analytical thinking in the context of document classification.

document-classification graph-construction graph-theory knn-classification machine-learning scrapping-python

Last synced: 23 Feb 2025

https://github.com/charleslf2/pixel_collect

simple google image scrapper for your computer vision tasks

computer dataset dataset-generation images scrapping scrapping-python

Last synced: 01 Mar 2025

https://github.com/nevilparmar11/webscrapping

Web scrapping tool built in selenium and BeautifulSoup.

beautifulsoup4 python3 scrapping-python selenium

Last synced: 28 Feb 2025

https://github.com/jisha-tr/web-scrapping-imdb

Web Scrapping Imdb Top 250 using BeautifulSoup

data-science hacktoberfest python scrapping scrapping-python web

Last synced: 04 Mar 2025

https://github.com/loglux-lab/usedcarsni

Used Cars NI catalogue scrapping tool

scrapping-python used-cars

Last synced: 25 Feb 2025

https://github.com/wildonion/checrapper

a dead simple scrapper bot to check that if there is an appointment is available in site or not, alert user if there is a new one

scrapping-python

Last synced: 11 Apr 2025

https://github.com/zadigo/zacoby

A web driver for Python with a focus for data mining and web scrapping

data-mining data-science scrapping scrapping-python webdriver

Last synced: 07 May 2025

https://github.com/zadigo/wta_scrapper

A scrapper that retrieves matches from the a player's WTA page

beautifulsoup4 data-mining python scraper scrapping-python sports tennis

Last synced: 07 May 2025

https://github.com/hmshb/scraping-agent-ai

AI-powered web scraping agent built with LangGraph, LangSmith, Firecrawl, and Anthropic AI. Automates intelligent crawling, structured data extraction, and LLM-powered content formatting. Efficiently handles anti-bot mechanisms, error recovery, and batch processing. 🚀

agentic-ai ai ai-agent ai-agents anthropic-claude bots firecrawl generative-ai langchain langgraph llms nlp scraper scrapping-php scrapping-python web web-scraper web-scraping workflow

Last synced: 27 Mar 2025

https://github.com/loglux/blackbincollection

The script takes the next Black Bin collection date information from the Belfast City Hall Website and then publishes the event in your Outlook Calendar.

automation belfast bin-collection-app docker outlook-calendar scrapping-python selenium selenium-grid

Last synced: 25 Feb 2025

https://github.com/muhfalihr/pygdtelebot

PyGDTelebot is a versatile Telegram bot meticulously crafted in Python programming language, designed to streamline the process of sharing media content from Instagram to Telegram. This bot offers users a unique experience by simplifying the interaction between Instagram and Telegram.

instagram-api python3 scraping scrapping-python telegram-bot telegram-bot-api

Last synced: 06 Apr 2025

https://github.com/felipemuller20/candidates-data-search

Projeto desenvolvido em Python que busca informações de candidatos aprovados em uma universidade e salva os dados num banco de dados MySQL.

mysql-database python3 scrapping-python

Last synced: 14 May 2025

https://github.com/goran-milenkovic/scrape-american-university-sport-staff

Command line tool for scraping an different American University Staff Directory pages and printing out the staff contact information for the specified sport

beautifulsoup beautifulsoup4 python3 requests scrapper scrapping-python

Last synced: 26 Feb 2025

https://github.com/nedu-m/pywebscan

Python web scrapper - returns searches from Google with some keyed-in paramenters.

python scrapper-script scrapping-python

Last synced: 06 Mar 2025

https://github.com/codeterrayt/code-geass_traveloai

Code-Geass_TraveloAI is a personalized travel recommendation system based on the destination, budget, special occasions, number of people, trip type, food preferences, etc. It utilizes Laravel, FastAPI, LangChain and MySQL for the backend and React for the frontend.

ai-project ai-web-app ai-website codegeass fastapi-project hackathon-project langchain-python laravel-project opensource-projects reactjs-project recommendation-system scrapping-python travel-app travel-recommender travel-website traveloai

Last synced: 23 Mar 2025

https://github.com/eticialima/image-url-import

Data import from json file and create instance for products image

django python scrapping-python script-python

Last synced: 24 Mar 2025

https://github.com/ssbaraar/caninecare-ai-vet-for-dog-breeds

This repository explores innovative ways to improve dog care using advanced technologies like computer vision and machine learning. It focuses on three main areas: dog breed classification, disease prediction, and breed recommendation.

classification flask-application jupyter-notebook prediction python3 recommendation-engine scrapping-python

Last synced: 11 Mar 2025

https://github.com/gill-singh-a/github-analytics-tool

A Program made in Python, that uses requests module to fetches and analysis publically available information of Github account

beautifulsoup beautifulsoup4 git github html-parser python requests scrapping scrapping-python

Last synced: 23 Feb 2025

https://github.com/srnascimento40/python-automation-rest-api

Just some exercises for praticing Python Automation, Scrapping and rest API.

automation python rest-api scrapping scrapping-python

Last synced: 26 Mar 2025

https://github.com/dms-codes/scrape_directory_itb

ITB Directory Scraper This Python script scrapes information from the Institut Teknologi Bandung (ITB) directory and saves it to a CSV file. It uses the BeautifulSoup library to parse the HTML content of the directory pages.

beautifulsoup4 csv python requests scrapping-python webscrapping

Last synced: 12 Mar 2025

https://github.com/mohnish88/web-scrapping

In this project, I used web scraping tools to extract data from daraz.pk, a popular e-commerce platform. Utilizing the BeautifulSoup and Selenium libraries in Python, I was able to efficiently navigate the website, extract valuable information on product listings, prices, and reviews, and store the data for further analysis.

beautifulsoup beautifulsoup4 html-parsing python-web-scraping scrapping-python selenium selenium-python selenium-webdriver web-scraping

Last synced: 23 Feb 2025

https://github.com/mohamedbsh/an-humble-suggestion-of-a-bot-from-the-mawaqit-app-prayers-times

An humble suggestion of a Telegram&WhatsApp Bot from the mawaqit's app prayers times. Feel free to contribute !

airflow mawaqit postgresql salat-timings scrapping-python telegram-api telegram-bot whatsapp-api whatsapp-bot

Last synced: 13 Mar 2025

https://github.com/tknishh/x-username-scraper

scraping usernames from message box of X handles.

scrapping-python twitter-api x-api

Last synced: 18 Mar 2025

https://github.com/javiercorado/river-compass

Selenium based amazon price scrapper

python scrapping-python selenium

Last synced: 03 Mar 2025

https://github.com/toofancodes/scrapperathleticscontacts

StaffScrapper is a smart and flexible web scraper built for one job: collecting staff contact details from athletics department websites. Whether you're pulling emails, job titles, or phone numbers — even from JavaScript-heavy or obfuscated sites — this tool handles it with ease. Designed for marketing teams, outreach coordinators, and data analys

beautifulsoup beautifulsoup4 python requests scrapping-python selenium selenium-webdriver webdriver-manager

Last synced: 11 Apr 2025