An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with webscraping-data

A curated list of projects in awesome lists tagged with webscraping-data .

https://github.com/boringppl/linkedin-profiles-scraping

Automatically scrape the web data of people profiles on Linkedin based on a specific search query

beautifulsoup beautifulsoup4 python python3 selenium selenium-webdriver webscraper webscraping webscraping-data webscrapper webscrapping

Last synced: 03 Dec 2024

https://github.com/seb943/scrapevin

A python package for scraping vinted - all foreign versions aswell!

kleiderkreisel python r reselling selenium selenium-python vinted webscraping webscraping-data webscraping-search

Last synced: 10 Apr 2025

https://github.com/antonio-nicolau/chaleno

A Dart package to web scraping data from websites easily and faster using less code lines.

dart flutter-webscrap webscraping webscraping-data

Last synced: 13 Apr 2025

https://github.com/ijayabby/web-scraper-ruby-capstone-project

Web scraping, web harvesting, or web data extraction is data scraping used for extracting data from websites.

nokogiri-gem rubyonrails watir-webdriver webscraping-data

Last synced: 24 Jan 2025

https://github.com/prithivsakthiur/save-web-as-zip

Save any web url as zip ( image + assets + html + css + js )

beatifulsoup beautifulsoup4 huggingface spaces web webscraping webscraping-data website zip

Last synced: 06 May 2025

https://github.com/dimitryzub/webscraping-py

Web Scraping scripts for all Google, other search engines, and other websites (currently outdated, something may not be working).

api bs4 data google-maps-api googleapi googlescraping googlesearchapi lxml parsel playwright python requests scraper scraping scrapy selenium webscraper webscraping webscraping-data webscraping-search

Last synced: 02 Jan 2025

https://github.com/fahimfba/web-scraper

Extract data from websites using the web-scrapper. Made with nodejs, ExpressJS, axios & cheerio.

axios cheerio cheeriojs javascript js npm npm-package webscrape webscraping webscraping-data webscraping-search webscrapper

Last synced: 14 Apr 2025

https://github.com/zakircodearchitect/web-scrapping

This project automates the web scraping of news articles from the Islamabad High Court website using Selenium. It continuously fetches dynamic content, categorizes it, and saves the data in a CSV file for easy access and analysis.

python selenium selenium-python selenium-webdriver webscraping-data

Last synced: 06 Apr 2025

https://github.com/riccardorevalor/ainvest

*DEV* AInvest is a Python tool that empowers NLP, LLMs and Gen-AI to create personalized report about the stock the user wants to analyze. Data used to evaluate each stock are scraped from various high-quality sources. Disclaimer: This software is provided for educational purposes only. The author is not responsible for any misuse of this software

genai genai-chatbot stock-market stock-price-prediction webscraping-data

Last synced: 02 Apr 2025

https://github.com/ng10op/tradesphere

TradeSphere is a web-based application designed for stock analysis, utilizing web scraping to collect, analyze, and visualize stock market data.

chromedriver express javascript jwt mongodb nodejs react selenium selenium-webdriver stock stock-analysis stocks tailwindcss webscrape webscraping webscraping-data yahoo-finance

Last synced: 06 Apr 2025

https://github.com/mdalamin5/machine-learning-2.0

Machine-Learning-2.0: A comprehensive repository documenting my journey to master ML from scratch. It includes core algorithms, advanced techniques, data preprocessing, feature engineering, and real-world projects. Follow my structured approach, inspired by "100 Days of ML," featuring Python implementations, tools, and insightful resources.

data-fetching-from-api datapreprocessing end-to-end-project feature-engineering gradient-descent-optimizers machine-learning-algorithms scikit-learn webscraping-data

Last synced: 25 Feb 2025

https://github.com/hackfutsec/zonespy

ZoneSpy is a Python script for scraping Zone-H archives, checking notifiers and associated URLs, supporting bulk operations and session management.

bulk-data cyber-threat-intelligence cybersecurity defacement ethical-hacking hacking python python-webscraping security web webscraping webscraping-data webscrapping websecurity zone-h

Last synced: 02 Mar 2025

https://github.com/latip176/latipnime

Source Code web Streaming Anime - data from Otakudesu.

anime stream-anime webscraping-data website

Last synced: 27 Mar 2025

https://github.com/datacollectionspecialist/web-scraping-tool

Top 5 web scraping tools:#1.scrapeless. #2.Content Grabber.#3.Diffbot.

scrapingtool webscraping webscraping-data webscrapingtool

Last synced: 11 Mar 2025

https://github.com/sakan811/common-japanese-morphemes-in-yoasobi-lyrics

Showcase visualizations about Common Japanese Morphemes in YOASOBI Lyrics

data-visualization music powerbi powerbi-visuals song webscraping webscraping-data yoasobi

Last synced: 23 Feb 2025

https://github.com/anandanraju/web_scrapping_github_top_repositories

Browse through different sites and pick on to scrape. Check the "Project Ideas" section for inspiration. Identify the information you'd like to scrape from the site. Decide the format of the output CSV file. Summarize your project idea and outline your strategy in a Jupyter Notebook.

csv-export github project repository topics webscraping-data website

Last synced: 24 Jun 2025

https://github.com/tinaland101/nasa-web-scraping-project

The project revolves around Mars exploration, specifically focusing on Mars news articles and Mars weather data. The goal is to extract relevant data from NASA webpage and provide data analytics based on Mars data average temperature, pressure and years

beautifulsoup4 html webscraping-data

Last synced: 03 Mar 2025

https://github.com/quantumudit/test-store-data-analysis

This repository showcases a web scraper with a pipeline structure for efficient data extraction and transformation from websites. The tool can be tailored to leverage its capabilities for insightful data analysis, providing valuable insights and informed decision-making.

data data-visualization dataanalytics python python-webscraping webscraper webscraping-data

Last synced: 17 Feb 2025

https://github.com/abhishekraina7/web-crawler

A JavaScript application that generates an "internal links" report for any website on the internet by crawling each page of the site.

http javascipt node-js webscraping-data

Last synced: 29 Mar 2025

https://github.com/ababaee1/simplescrapertool

Python tool for extracting contents from websites using Selenium

python selenium-webdriver webscraping webscraping-data

Last synced: 06 Apr 2025

https://github.com/asishgouda/web_scrapping_project_amazon

Amazon Web Scrapping Project

python webscraping-data

Last synced: 21 Jan 2025

https://github.com/sinha532/review-sentiment-analysis

performing sentiment analysis on reviews of Apple mobile collecting info from website using the BERT model.

bert-fine-tuning transformers webscraping-beautifulsoup webscraping-data

Last synced: 14 Mar 2025

https://github.com/drkbluescience/ibm-datascience-spacex

In this project, we predict whether the Falcon 9 first stage will land successfully by following the data science methodology.

data-visualization data-wrangling machine-learning-algorithms sql-query sqlite webscraping-data

Last synced: 04 Mar 2025

https://github.com/revmax-creator/web-scrapper

A powerful and flexible Python-based web scraper designed to extract data from websites efficiently. This repository is ideal for developers, data analysts, and enthusiasts who need a robust solution for web scraping tasks, ranging from basic static pages to complex, JavaScript-rendered content.

webscrape webscraper webscraping webscraping-beautifulsoup webscraping-data webscrapper webscrapping webscrapping-python

Last synced: 30 Mar 2025

https://github.com/zaibten/zaibten-scrapper-backend-server

ZAIBTEN is a cutting-edge customer reviews analysis software powered by web scraping, Natural Language Processing (NLP), and machine learning. It is designed to scrape, analyze, and visualize customer feedback from major e-commerce and review platforms such as Amazon, eBay, and Yelp. With ZAIBTEN, you can automatically extract real-time products.

app beautifulsoup4 expressjs flask jupyter-notebook machine-learning mongodb mongoose nlp-machine-learning nodejs python smtp smtp-server webapplication webscapping webscraping-data

Last synced: 08 Apr 2025

https://github.com/sakan811/honkai-star-rail-a-few-fun-insights-with-data-analysis

The project gives insights that delve into the Honkai Star Rail's character's stats of all available characters as of the given date.

data data-analysis data-science data-visualization game honkai honkai-star-rail honkai-starrail webscraping webscraping-data webscraping-selenium

Last synced: 23 Feb 2025

https://github.com/muzamilofficial/zaibten-scrapper-backend-server

ZAIBTEN is a cutting-edge customer reviews analysis software powered by web scraping, Natural Language Processing (NLP), and machine learning. It is designed to scrape, analyze, and visualize customer feedback from major e-commerce and review platforms such as Amazon, eBay, and Yelp. With ZAIBTEN, you can automatically extract real-time products.

app beautifulsoup4 expressjs flask jupyter-notebook machine-learning mongodb mongoose nlp-machine-learning nodejs python smtp smtp-server webapplication webscapping webscraping-data

Last synced: 14 Feb 2025

https://github.com/dishaaggarwal31/scraping-github-top-repositories-topicwise

This project aims to scrape and store details of the top repositories from featured topics on GitHub. Using Python, Requests, BeautifulSoup, OS, and Pandas, we extract key repository and user details. The collected data is then structured and saved into CSV files for easy access and further analysis

beautifulsoup4 html-css jupyter-notebook os-python pandas-library python3 requests-library-python webscraping-data

Last synced: 02 Apr 2025

https://github.com/vishal815/flight_details_scraper_and_visualization

This project focuses on scraping flight details from Google Flights, processing the data, and performing cleaning and visualization for future use in analytics or predictive modeling.

beautifulsoup end-to-end-project flight-data flight-data-analysis google-flights google-flights-scraper pandas pyhon scraping-project scraping-python selenium vishal-lazrus vishallazrus visualization webscraping webscraping-data

Last synced: 25 Mar 2025