Projects in Awesome Lists tagged with webscraping-data
A curated list of projects in awesome lists tagged with webscraping-data.
https://github.com/seb943/scrapeop
A Python package for scraping oddsportal.com
analytics atp atp-matches baseball betting betting-odds champions-league datasets esports nba nfl odds-portal oddsportal rugby sports sports-data tennis tennis-dataset webscraping-data
Last synced: 09 Apr 2025
https://github.com/thewebscrapingclub/thescrapingclubfree
The Web Scraping Club Free Repository
webscraping webscraping-beautifulsoup webscraping-data
Last synced: 19 Feb 2025
https://github.com/boringppl/linkedin-profiles-scraping
Automatically scrape people's profile data on LinkedIn based on a specific search query
beautifulsoup beautifulsoup4 python python3 selenium selenium-webdriver webscraper webscraping webscraping-data webscrapper webscrapping
Last synced: 03 Dec 2024
https://github.com/seb943/scrapevin
A Python package for scraping Vinted, including all foreign versions
kleiderkreisel python r reselling selenium selenium-python vinted webscraping webscraping-data webscraping-search
Last synced: 10 Apr 2025
https://github.com/antonio-nicolau/chaleno
A Dart package for scraping data from websites easily and quickly, using fewer lines of code.
dart flutter-webscrap webscraping webscraping-data
Last synced: 13 Apr 2025
https://github.com/ijayabby/web-scraper-ruby-capstone-project
Web scraping, web harvesting, or web data extraction is data scraping used for extracting data from websites.
nokogiri-gem rubyonrails watir-webdriver webscraping-data
Last synced: 24 Jan 2025
https://github.com/prithivsakthiur/save-web-as-zip
Save any web URL as a zip (images + assets + HTML + CSS + JS)
beatifulsoup beautifulsoup4 huggingface spaces web webscraping webscraping-data website zip
Last synced: 06 May 2025
https://github.com/dimitryzub/webscraping-py
Web scraping scripts for Google, other search engines, and other websites (currently outdated; some scripts may not work).
api bs4 data google-maps-api googleapi googlescraping googlesearchapi lxml parsel playwright python requests scraper scraping scrapy selenium webscraper webscraping webscraping-data webscraping-search
Last synced: 02 Jan 2025
https://github.com/sakan811/find-the-best-place-to-stay-with-price-per-review
Finding the Best Place to Stay with Price per Review from Booking.com
booking css data data-science django docker flask hotel hotel-booking hotels html javascript react webapp webscraping webscraping-data webscrapping webscrapping-python website
Last synced: 11 Apr 2025
https://github.com/fahimfba/web-scraper
Extract data from websites using this web scraper. Made with Node.js, Express.js, axios, and cheerio.
axios cheerio cheeriojs javascript js npm npm-package webscrape webscraping webscraping-data webscraping-search webscrapper
Last synced: 14 Apr 2025
https://github.com/sakan811/sakuyado
Discover the ideal accommodation with a Review/Price analyzer.
booking css data data-science django docker flask hotel hotel-booking hotels html javascript react webapp webscraping webscraping-data webscrapping webscrapping-python website
Last synced: 21 Jun 2025
https://github.com/r3dhulk/web-scrapper-in-perl
Web Scraper in Perl
blackhat blackhathacking ethical-hacking ethical-hacking-tools hacking hacking-tool perl perl-for-ethical-hacker perl-for-ethical-hackers perl-for-ethical-hacking perl-script perl-scripts perl5 perlforethicalhackers perlforethicalhacking webscraper webscraping webscraping-data webscrapper webscrapping
Last synced: 13 May 2025
https://github.com/zakircodearchitect/web-scrapping
This project automates the web scraping of news articles from the Islamabad High Court website using Selenium. It continuously fetches dynamic content, categorizes it, and saves the data in a CSV file for easy access and analysis.
python selenium selenium-python selenium-webdriver webscraping-data
Last synced: 06 Apr 2025
https://github.com/sakan811/find-osaka-average-hotel-price
Showcases visualizations of the average hotel price in Osaka. The data was collected from Booking.com.
booking bookings data-science data-visualization database grahpql hotel hotel-booking hotels japan japanese sql sqlite tourism tourists webscraper webscraping webscraping-data
Last synced: 23 Feb 2025
https://github.com/riccardorevalor/ainvest
*DEV* AInvest is a Python tool that uses NLP, LLMs, and generative AI to create a personalized report about the stock the user wants to analyze. The data used to evaluate each stock is scraped from various high-quality sources. Disclaimer: This software is provided for educational purposes only. The author is not responsible for any misuse of this software.
genai genai-chatbot stock-market stock-price-prediction webscraping-data
Last synced: 02 Apr 2025
https://github.com/ng10op/tradesphere
TradeSphere is a web-based application designed for stock analysis, utilizing web scraping to collect, analyze, and visualize stock market data.
chromedriver express javascript jwt mongodb nodejs react selenium selenium-webdriver stock stock-analysis stocks tailwindcss webscrape webscraping webscraping-data yahoo-finance
Last synced: 06 Apr 2025
https://github.com/rajputrockstar/sarkarisphere-
Helps candidates applying for sarkari (government) exams
automation automation-selenium govt-jobs job-portal job-portal-website python python-3 python-script selenium selenium-python selenium-webdriver streamlit streamlit-webapp webscraping webscraping-data
Last synced: 09 Apr 2025
https://github.com/mdalamin5/machine-learning-2.0
Machine-Learning-2.0: A comprehensive repository documenting my journey to master ML from scratch. It includes core algorithms, advanced techniques, data preprocessing, feature engineering, and real-world projects. Follow my structured approach, inspired by "100 Days of ML," featuring Python implementations, tools, and insightful resources.
data-fetching-from-api datapreprocessing end-to-end-project feature-engineering gradient-descent-optimizers machine-learning-algorithms scikit-learn webscraping-data
Last synced: 25 Feb 2025
https://github.com/andreuvv/myl_scraper
tor.myl.cl web scraper for TCG Mitos y Leyendas (MyL)
leyendas mitos mitosyleyendas myl python selenium tcg tor webscraper webscraping webscraping-data webscrapping-python y
Last synced: 11 Mar 2025
https://github.com/hackfutsec/zonespy
ZoneSpy is a Python script for scraping Zone-H archives, checking notifiers and associated URLs, supporting bulk operations and session management.
bulk-data cyber-threat-intelligence cybersecurity defacement ethical-hacking hacking python python-webscraping security web webscraping webscraping-data webscrapping websecurity zone-h
Last synced: 02 Mar 2025
https://github.com/latip176/latipnime
Source code for an anime streaming website, with data from Otakudesu.
anime stream-anime webscraping-data website
Last synced: 27 Mar 2025
https://github.com/sakan811/find-osaka-and-japan-average-hotel-price
Showcases visualizations of the average hotel price in Osaka and Japan. The data was collected from Booking.com.
booking bookings data-science data-visualization database grahpql hotel hotel-booking hotels japan japanese sql sqlite tourism tourists webscraper webscraping webscraping-data
Last synced: 08 Apr 2025
https://github.com/datacollectionspecialist/web-scraping-tool
Top 5 web scraping tools: #1 Scrapeless, #2 Content Grabber, #3 Diffbot.
scrapingtool webscraping webscraping-data webscrapingtool
Last synced: 11 Mar 2025
https://github.com/sakan811/common-japanese-morphemes-in-yoasobi-lyrics
Showcases visualizations of common Japanese morphemes in YOASOBI lyrics
data-visualization music powerbi powerbi-visuals song webscraping webscraping-data yoasobi
Last synced: 23 Feb 2025
https://github.com/anandanraju/web_scrapping_github_top_repositories
Browse through different sites and pick one to scrape. Check the "Project Ideas" section for inspiration. Identify the information you'd like to scrape from the site. Decide the format of the output CSV file. Summarize your project idea and outline your strategy in a Jupyter Notebook.
csv-export github project repository topics webscraping-data website
Last synced: 24 Jun 2025
https://github.com/tinaland101/nasa-web-scraping-project
The project revolves around Mars exploration, focusing on Mars news articles and Mars weather data. The goal is to extract relevant data from a NASA webpage and provide analytics on Mars data such as average temperature, pressure, and years.
beautifulsoup4 html webscraping-data
Last synced: 03 Mar 2025
https://github.com/mahikshith/e-commerce-review-chatbot-end_to_end-aws-deployment
An AWS-deployed, LLM-powered chatbot for conversing freely with product reviews
astradb aws-ec2 flask-api gemini-pro-api langchain postman python streamlit webscraping-data
Last synced: 17 Dec 2024
https://github.com/quantumudit/test-store-data-analysis
This repository showcases a web scraper with a pipeline structure for efficient data extraction and transformation from websites. The tool can be tailored for data analysis that supports informed decision-making.
data data-visualization dataanalytics python python-webscraping webscraper webscraping-data
Last synced: 17 Feb 2025
https://github.com/abhishekraina7/web-crawler
A JavaScript application that generates an "internal links" report for any website on the internet by crawling each page of the site.
http javascipt node-js webscraping-data
Last synced: 29 Mar 2025
https://github.com/ababaee1/simplescrapertool
Python tool for extracting contents from websites using Selenium
python selenium-webdriver webscraping webscraping-data
Last synced: 06 Apr 2025
https://github.com/asishgouda/web_scrapping_project_amazon
Amazon Web Scraping Project
Last synced: 21 Jan 2025
https://github.com/thepravin/amazon-web-scripting
amazon jupyter-notebook python web webscraper webscraping webscraping-data
Last synced: 26 Feb 2025
https://github.com/sinha532/review-sentiment-analysis
Performs sentiment analysis with the BERT model on Apple mobile phone reviews collected from a website.
bert-fine-tuning transformers webscraping-beautifulsoup webscraping-data
Last synced: 14 Mar 2025
https://github.com/asishkumar-gouda/web_scrapping_project_amazon
Amazon Web Scraping Project
Last synced: 14 Mar 2025
https://github.com/zefrenchwan/monocle
Tool to list what a website is about
information-gathering nlp osint python3 spacy-nlp webscraping-data webscrapping
Last synced: 13 Mar 2025
https://github.com/drkbluescience/ibm-datascience-spacex
In this project, we predict whether the Falcon 9 first stage will land successfully by following the data science methodology.
data-visualization data-wrangling machine-learning-algorithms sql-query sqlite webscraping-data
Last synced: 04 Mar 2025
https://github.com/revmax-creator/web-scrapper
A powerful and flexible Python-based web scraper designed to extract data from websites efficiently. This repository is ideal for developers, data analysts, and enthusiasts who need a robust solution for web scraping tasks, ranging from basic static pages to complex, JavaScript-rendered content.
webscrape webscraper webscraping webscraping-beautifulsoup webscraping-data webscrapper webscrapping webscrapping-python
Last synced: 30 Mar 2025
https://github.com/srking501/uk-groceries-images
Repository Containing UK Groceries Images
data groceries grocery images links playwright playwright-python webscraping-data webscrapper
Last synced: 23 Mar 2025
https://github.com/zaibten/zaibten-scrapper-backend-server
ZAIBTEN is a cutting-edge customer reviews analysis software powered by web scraping, Natural Language Processing (NLP), and machine learning. It is designed to scrape, analyze, and visualize customer feedback from major e-commerce and review platforms such as Amazon, eBay, and Yelp. With ZAIBTEN, you can automatically extract real-time products.
app beautifulsoup4 expressjs flask jupyter-notebook machine-learning mongodb mongoose nlp-machine-learning nodejs python smtp smtp-server webapplication webscapping webscraping-data
Last synced: 08 Apr 2025
https://github.com/sakan811/honkai-star-rail-a-few-fun-insights-with-data-analysis
The project gives insights into the stats of all available Honkai Star Rail characters as of the given date.
data data-analysis data-science data-visualization game honkai honkai-star-rail honkai-starrail webscraping webscraping-data webscraping-selenium
Last synced: 23 Feb 2025
https://github.com/muzamilofficial/zaibten-scrapper-backend-server
ZAIBTEN is a cutting-edge customer reviews analysis software powered by web scraping, Natural Language Processing (NLP), and machine learning. It is designed to scrape, analyze, and visualize customer feedback from major e-commerce and review platforms such as Amazon, eBay, and Yelp. With ZAIBTEN, you can automatically extract real-time products.
app beautifulsoup4 expressjs flask jupyter-notebook machine-learning mongodb mongoose nlp-machine-learning nodejs python smtp smtp-server webapplication webscapping webscraping-data
Last synced: 14 Feb 2025
https://github.com/dishaaggarwal31/scraping-github-top-repositories-topicwise
This project aims to scrape and store details of the top repositories from featured topics on GitHub. Using Python, Requests, BeautifulSoup, OS, and Pandas, we extract key repository and user details. The collected data is then structured and saved into CSV files for easy access and further analysis
beautifulsoup4 html-css jupyter-notebook os-python pandas-library python3 requests-library-python webscraping-data
Last synced: 02 Apr 2025
https://github.com/vishal815/flight_details_scraper_and_visualization
This project focuses on scraping flight details from Google Flights, processing the data, and performing cleaning and visualization for future use in analytics or predictive modeling.
beautifulsoup end-to-end-project flight-data flight-data-analysis google-flights google-flights-scraper pandas pyhon scraping-project scraping-python selenium vishal-lazrus vishallazrus visualization webscraping webscraping-data
Last synced: 25 Mar 2025