Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists tagged with beautifulsoup
A curated list of projects in awesome lists tagged with beautifulsoup.
https://github.com/aka-sh11/live-weather-update
Get Live Weather Updates of your desired City.
beautifulsoup pillow tkinter-gui weather-app
Last synced: 08 Dec 2024
https://github.com/memirdogan/web-scraping-with-beautiful-soup-and-selenium
This repository offers a guide to web scraping with Beautiful Soup and Selenium. It covers data extraction, HTML basics, XPath, and CSS selectors. Includes a project for scraping Trendyol. Ideal for beginners wanting to learn web scraping.
beautifulsoup beautifulsoup4 html python selenium selenium-webdriver webscraping
Last synced: 11 Oct 2024
https://github.com/lucereal/autoresearcher
AI Agent to gather information on specific topics
ai-agents beautifulsoup google-search-api openai playwright python web-scraping
Last synced: 08 Dec 2024
https://github.com/situmorang-com/linkedin-bs4-beautiful-soup
Scraping LinkedIn using beautiful soup (BS4)
Last synced: 26 Nov 2024
https://github.com/sazuna/zscore_scraper
Scrape a web page's main content based on the z-score (HTML tag paths with abnormally long text are caught by this method)
beautifulsoup scraper selenium-webdriver webscraping zscore
Last synced: 05 Jan 2025
https://github.com/sakan811/find-common-japanese-character-from-news
Showcase visualizations about common Japanese characters that appear in the news
beautifulsoup beautifulsoup4 data-analysis dataanalysis japanese japanese-language language news powerbi requests sqlite sqlite3 visualization webscraper webscraping
Last synced: 05 Jan 2025
https://github.com/euphrasiologist/web_mining
Examples of methods to mine websites (among others) for their underlying databases.
beautifulsoup extraction plant-cvalue-database plants selenium
Last synced: 09 Jan 2025
https://github.com/yisaer/shuoj_crawler
Code crawler for the Shanghai University OJ (online judge)
beautifulsoup python requests spider
Last synced: 02 Dec 2024
https://github.com/ellache/nba-tipoff-scraper
Comparison of the predictive ability of the Elo, Glicko-2 and TrueSkill algorithms in forecasting winners of NBA tipoffs. Python/Beautiful Soup/XGBoost. Note: this was an exploratory project, made publicly available for discussion and display; it is not easily suited to reuse.
beautifulsoup elo glicko2 python trueskill xgboost
Last synced: 04 Jan 2025
https://github.com/siddharth-nandagopal/web-scraper
a simple web scraper in Python
asdf beautifulsoup bs4 poetry python ray requests webscraper
Last synced: 13 Jan 2025
https://github.com/antodata/homosexualityhistorycurrentstatus
In this project we use folium, pandas, and web scraping with requests and BeautifulSoup to create a map that shows the legal status of homosexuality around the world. We created an API with a function called buildMap that builds the map, combining number ranges and static string values to color it. We also use requests and BeautifulSoup to fetch and parse the Wikipedia pages on LGBT rights in each country; that HTML is displayed when a marker is opened (every country has a marker placed in its capital).
activism beautifulsoup beautifulsoup4 data-science decriminalization-homosexuality folium gay homosexuality html human-rights interactive-maps lgtb pandas python python3 requests web-scraper web-scraping webscraper webscraping
Last synced: 08 Dec 2024
https://github.com/marcusjhang/proceedings
beautifulsoup pandas pdfreader scraping-python
Last synced: 14 Dec 2024
https://github.com/filiprokita/prettyhtml
PrettyHTML is a Python program that prettifies HTML code using BeautifulSoup. With a GUI built using Tkinter, users can easily select an HTML file and apply the prettification process to it. PrettyHTML makes it quick and simple to clean up and organize HTML code.
beautifier beautifulsoup gui html html-beautifier html-css-javascript python python3 tkinter
Last synced: 02 Dec 2024
https://github.com/benitomartin/scraping-to-sql
Open Source Contribution to Justicio Project
beautifulsoup fitz mysql pymupdf python requests
Last synced: 31 Dec 2024
https://github.com/igorbrizack/crawler-web
Web data collection application with ReactJS and Python - REST API
beautifulsoup crawler docker fastapi mongodb nodejs python3 react scraper
Last synced: 28 Nov 2024
https://github.com/code-jl/nfl-point-kicker-data-scraper
A Python-based web scraping toolkit that extracts and processes NFL kicking statistics from Pro-Football-Reference. This project automates the collection of comprehensive game data, with a particular focus on field goal attempts and environmental conditions.
automation beautifulsoup csv data-analysis data-collection field-goals football-statistics kicking-stats nfl python selenium sports-analysis statistics weather-data web-scraping
Last synced: 18 Jan 2025
https://github.com/marcusjhang/scraper
Google Sheets -> Python Scraper -> Postgres DB -> React Frontend
beautifulsoup googlesheetsapi nodejs postgresql reactjs
Last synced: 14 Dec 2024
https://github.com/randika00/ism-web-automation-y23cp-web
Web scraping refers to the extraction of data from a website. Be it a spreadsheet or an API.
2captcha-api beautifulsoup regex scrapy selenium spacy webdriver
Last synced: 08 Dec 2024
https://github.com/gunh0/gamemeca-crawler
🎮 Scrapes game rankings data from a website and saves it as a CSV file using BeautifulSoup and requests
beautifulsoup parser parsing python-requests
Last synced: 03 Dec 2024
https://github.com/anjasfedo/web-scrap-beautiful-soup
Web Scraping Automation with Beautiful Soup.
automation beautifulsoup web-scraping
Last synced: 17 Jan 2025
https://github.com/gunh0/powerlink_ranking
Naver PowerLink Ranking Search Program made with PyQt
beautifulsoup pyqt python3 tkinter
Last synced: 03 Dec 2024
https://github.com/pb319/scrap_with_selenium
Let's dive deeper into the domain of web scraping using Selenium.
beautifulsoup pandas pandas-dataframe python python-script selenium
Last synced: 11 Oct 2024
https://github.com/kelcheone/chromestore
Chrome Web Store scraper (stars, users, emails, ...)
beautifulsoup chrome-extension chrome-web-store google-sheets playwright playwright-python web-scraping
Last synced: 08 Dec 2024
https://github.com/marlon-couto/web-utils
A set of helpful utilities for web development
Last synced: 22 Dec 2024
https://github.com/abhishek-k-git/web_scrapping_python
Get a notification email when the price of an Amazon product drops, through web scraping in Python.
beautifulsoup python3 requests
Last synced: 28 Nov 2024
https://github.com/mostafahima/price-tracker-bot
A price tracking tool that scrapes product prices using BeautifulSoup and sends email alerts when prices drop below a target.
Last synced: 28 Dec 2024
https://github.com/harmanveer-2546/implementing-web-scraping-in-python-with-beautifulsoup
Just implementing basic functions of Beautiful Soup
beautifulsoup extracting-data extraction-information extraction-methods handling-malformed-markup inspecting parsing-html-with-beautiful-soup python-package scraping-websites web-scraping
Last synced: 11 Jan 2025
https://github.com/mostafahima/billboard-hot-100-to-spotify-playlist
This Python script allows you to travel back in time and create a Spotify playlist of the Billboard Hot 100 songs for any specified date.
api api-client beautifulsoup billboards-hot-100 python-dotenv requests spotipy-library
Last synced: 28 Dec 2024
https://github.com/joe-mccarthy/harlow-bindicator
Sends a notification on bin day saying which bin needs putting out.
beautifulsoup essex github-actions harlow ntfy ntfysh python selenium-webdriver
Last synced: 10 Oct 2024
https://github.com/jhleeeme/bugschartscraper
Scraping the Bugs Music daily chart & taking a quick look at it
beautifulsoup pandas python3 requests scraper
Last synced: 17 Jan 2025
https://github.com/kaistullich/x-rates-currency
Requests currency from X-Rates to use for currency conversion
beautifulsoup python-3-5 requests sqlite
Last synced: 03 Dec 2024
https://github.com/bkataru/kayak-scrawler
selenium and bs4 crawler + scraper to obtain flight listing info from kayak.com
beautifulsoup kayak python scraping scrawler selenium
Last synced: 22 Dec 2024
https://github.com/inightjar/flibcartscraping
FlibCart: scraping some products with the Python Beautiful Soup library
beautifulsoup python webscraping
Last synced: 12 Jan 2025
https://github.com/harmanveer-2546/scraping-covid-19-statistics-using-beautifulsoup
Coronavirus, one of the biggest pandemics, has put the whole world in danger. It is also one of the top trending news topics these days. In this repo, I will be scraping data and printing Covid-19 statistics in human-readable form.
beautifulsoup covid-19 html insights python web-scrapping
Last synced: 11 Jan 2025
https://github.com/meherdeep/parallel-scraping
Scraping data from amazon and flipkart, a comparative analysis.
beautifulsoup beautifulsoup4 parallel-computing python3 webscraping
Last synced: 16 Dec 2024
https://github.com/harmanveer-2546/amazon-web-scraping
In this repo, I will be scraping product information from the Amazon website, taking a “PlayStation 5” as the target product.
amazon-product-data amazon-product-scraper beautifulsoup html playstation5 python scraping web
Last synced: 11 Jan 2025
https://github.com/fskydev/list-youtube
Automatically create and post YouTube playlists and videos
automation beautifulsoup pandas python python3 scraper selenium tkinter youtube
Last synced: 13 Dec 2024
https://github.com/fdauti/stock_performance
Analyzing Stock Performance with yfinance, bs4, pandas and plotly
beautifulsoup bs4 pandas plotly yfinance
Last synced: 13 Jan 2025
https://github.com/abdelhamid2c/web-scraping
Explore a powerful web scraping technique in this repository. From Python scripts using BeautifulSoup, find everything you need to extract data from websites effortlessly.
Last synced: 11 Jan 2025
https://github.com/brianj-4/vlr-scraper
Python vlr.gg scraper
beautifulsoup esports fastapi python vlrgg webscraper webscraping
Last synced: 06 Jan 2025
https://github.com/jdm79/basic-bs4
A quick intro to web scraping with Python and BeautifulSoup
beautifulsoup python webscraping
Last synced: 11 Jan 2025
https://github.com/nfo94/wswpbook
Code drafts for Web Scraping With Python book.
Last synced: 09 Dec 2024
https://github.com/atkaridarshan04/python-libraries
Python and its different Libraries
beautifulsoup jupyterlab matplotlib numpy pandas python seaborn
Last synced: 03 Dec 2024
https://github.com/germanpaul12/automating-hacker-news-and-weather-mails
Project for my Raspberry Pi to send me mails when it rains and to inform with hot tech news
beautifulsoup beautifulsoup4 data hacker-news openweather-api raspberry-pi requests
Last synced: 01 Jan 2025
https://github.com/germanpaul12/petrol-price-scraper-heidelberg-beautifulsoup-requests
Automatically scrape petrol prices for Heidelberg area and push them to Github
automation beautifulsoup heidelberg petrol requests scraper scraping
Last synced: 01 Jan 2025
https://github.com/timothyjan/housing-data-automation
Create a Google spreadsheet with housing prices/addresses/links using Google forms and webscraping on Zillow.
beautifulsoup python selenium-webdriver
Last synced: 04 Dec 2024
https://github.com/youngsong99/city-finder-web-scraping
CityFinder Project
beautifulsoup python web-scraping
Last synced: 09 Dec 2024
https://github.com/thisis-developer/news-scraping-using-beautyfulsoup-selenium-with-django
NewsScraper-Django: A powerful news scraping solution using BeautifulSoup and Selenium, seamlessly integrated with Django. Effortlessly extract articles, handle JavaScript-rendered content, and present results through a user-friendly web interface
beautifulsoup bs4 chromewebdriver django django-framework newspaper python scraping selenium selenium-webdriver
Last synced: 04 Dec 2024
https://github.com/jaiswalchitransh/historical-stock-and-revenue-data-analysis
This project focuses on extracting and visualizing stock data using Python libraries such as yfinance for historical stock prices and web scraping techniques to gather company revenue data. It provides a comprehensive analysis by plotting both stock prices and revenues over time for companies like Tesla and GameStop.
artificial-intelligence beautifulsoup bs4 data-science gamestop jupyter jupyter-notebook machine-learning matplotlib pandas plotly project python python-3 python3 requests stock-market tesla webscraping yfinance
Last synced: 12 Jan 2025
https://github.com/volkansah/python-xpath-tutorial
XPath is a query language used for selecting nodes in an XML or HTML document. Python supports XPath queries through various libraries such as BeautifulSoup, lxml, and more. In this tutorial, we will use BeautifulSoup to demonstrate how XPath works with Python.
beautifulsoup html python python-3 python-script python-tutorial python-tutorials python3 scraping tutorial tutorials xml xpath
Last synced: 09 Dec 2024
https://github.com/trafitto/careerowpy
A simple web scraper to retrieve some useful information regarding Overwatch's competitive matches
beautifulsoup career overwatch python python3 sqlite3 webscraping
Last synced: 19 Jan 2025
https://github.com/daggerpov/chamber-scraper
This is a simple GUI that will scrape https://www.uschamber.com/co/chambers to retrieve info from the chambers in an entered U.S. state. Then, a user can navigate through the chambers of that state by pressing on the next/previous buttons in the GUI. Once a state has been entered, its info will be put into its corresponding .csv file. The "chambers" folder is what would be zipped for the spreadsheet data.
Last synced: 04 Dec 2024
https://github.com/fazal-coder/imdb_scrape
This Python script scrapes data on the top 1000 movies from IMDb and saves it in CSV format
beautifulsoup pandas python requests
Last synced: 09 Dec 2024
https://github.com/luminati-io/python-scraping-libraries
The top Python web scraping libraries, comparing their features, categories, and use cases to find the best fit for your data extraction needs.
beautifulsoup curl playwright python python-requests requests scrapy selenium seleniumbase web-scraping
Last synced: 20 Jan 2025
https://github.com/diegoperea20/web_scraping
Web scraping practice using beautiful soup and selenium libraries
beautifulsoup python selenium web-scraping
Last synced: 13 Jan 2025
https://github.com/meatyite/python-steimatzky
A Python library used to Scrape data from steimatzky.co.il
beautifulsoup beautifulsoup4 pypi python python3 scraping web-scraping webscraping
Last synced: 23 Dec 2024
https://github.com/dahsie/sentiment_analysis
I chose to carry out this project to familiarize myself with transfer learning for pre-trained NLP models. I used the BERT model developed by Google to perform sentiment analysis on a dataset available on Kaggle. I also did web scraping to collect product reviews from the internet.
beautifulsoup bert-models deep-learning matplotlib pandas python pytorch scraping web
Last synced: 23 Dec 2024
https://github.com/amber-abuah/amazon-product-scraper
Python script that scrapes data from Amazon product pages and saves into a CSV.
beautifulsoup beautifulsoup4 csv csv-export python scraper scraping
Last synced: 10 Dec 2024
https://github.com/manasjadhav0086/data-scraping-using-beautiful-soup
This project demonstrates how to scrape movie data from IMDb using Python. The notebook contains code to extract specific information about movies, such as their title, genre, release year, and more, for data analysis or visualization.
Last synced: 20 Dec 2024
https://github.com/pushpakrai/web-crawler-
A scalable Python web crawler for efficient, polite extraction, analysis, and ranking of web content, ideal for SEO, data analysis, and development.
beautifulsoup pandas python3 seo web-crawler
Last synced: 20 Jan 2025
https://github.com/emirhansilsupur/web-scraper-news
Web Scraper for News Related to the Israel-Palestine Conflict
beautifulsoup python selenium-webdriver webscraping
Last synced: 10 Jan 2025
https://github.com/jaiswalchitransh/analyzing-historical-stock-data
This project focuses on extracting and visualizing stock data using Python libraries such as yfinance for historical stock prices and web scraping techniques to gather company revenue data. It provides a comprehensive analysis by plotting both stock prices and revenues over time for companies like Tesla and GameStop.
artificial-intelligence beautifulsoup bs4 data-science gamestop jupyter jupyter-notebook machine-learning matplotlib pandas plotly project python python-3 python3 requests stock-market tesla webscraping yfinance
Last synced: 01 Oct 2024
https://github.com/mevlutayilmaz/web-scraping
web scraping with python
beautifulsoup json python requests web-scraping
Last synced: 11 Jan 2025
https://github.com/gtkacz/tjsp-webscraper
beautifulsoup beautifulsoup4 selenium tribunal webscraper webscraping
Last synced: 11 Dec 2024
https://github.com/camara94/web-scraping-with-requests-beautifulsoup-and-selenium
In this tutorial, we will explore web scraping techniques using requests, Beautiful Soup and Selenium
beautifulsoup data-analysis-python requests requests-library-python selenium web-scraping
Last synced: 23 Dec 2024
https://github.com/crispengari/web-scrapping-python
💎 Web scraping in Python using `requests` and `bs4`.
beatifulsoup4 beautifulsoup bs4 datafetching datagather datascience jupyter-notebook pandas python requests webscapping
Last synced: 16 Dec 2024
https://github.com/kenanbek/tutorial-python-crawler
Crawling website data using Python with requests and Beautiful Soup libraries
beautifulsoup crawler crawling miner parser python python-requests requests
Last synced: 11 Dec 2024
https://github.com/mohd-tahzeeb-khan/reverse_dcf-calculator-and-vistualization
I developed this dashboard for an internship application, which required me to build it to showcase my skills and talent.
beautifulsoup bs4 dash dashboard dataframes flask numpy pandas plotly plotly-dash plotly-express python requests shares stock-market
Last synced: 11 Dec 2024
https://github.com/arya-io/flipkart-data-scraping
A data scraping project to extract product information such as names, prices, descriptions, and ratings from Flipkart using Selenium and BeautifulSoup.
beautifulsoup data-science data-scraping flipkart pandas python selenium web-scraping
Last synced: 20 Dec 2024
https://github.com/victoralessander/hashidentifier
A simple program that tries to discover a hash of type MD5
Last synced: 11 Dec 2024
https://github.com/kaustubh187/news-aggregator
A news aggregator web application built on Django. Web scraping is used to collect news headlines from different sources.
beautifulsoup django python webscraping
Last synced: 20 Jan 2025
https://github.com/xloupx/app_futebol
App_Futebol scrapes news from the site related to the Real Madrid team
beautifulsoup beautifulsoup4 futebol interface python python3 qt5 realmadrid resources webscraping
Last synced: 17 Jan 2025
https://github.com/rupav/scraping-from-imdb
Easy scraping of the IMDb website using Python
beautifulsoup beginner beginner-friendly imdb imdb-webscrapping
Last synced: 17 Dec 2024
https://github.com/abgeo/fine-bot
Scraper bot for videos.police.ge
beautifulsoup bot python scrapper
Last synced: 23 Dec 2024
https://github.com/jasocami/health-insurance-spider
Scrape health insurance data with BeautifulSoup and Scrapy; a playground
beautifulsoup django redis scrapy
Last synced: 14 Dec 2024
https://github.com/stephancill/github-trending-api
Trending page on GitHub served as REST API
beautifulsoup github github-trending sanic scraping
Last synced: 30 Dec 2024
https://github.com/hubtou/libgh
GitHub scraping tool and library
beautifulsoup github json library python scraper scraping scraping-python scraping-websites servier web-scraper web-scraping web-scraping-python webscraper webscraping xml
Last synced: 12 Dec 2024
https://github.com/phanirithvij/stackwordcloud
Generate a word cloud from a user's stackoverflow profile
beautifulsoup python3 stackoverflow wordcloud wordcloud-generator
Last synced: 13 Dec 2024
https://github.com/sarrabenyahia/tuto-webscraping
webscraping course tutorial
api beautifulsoup headers javascript proxy requests scrapy selenium user-agent webscraping
Last synced: 05 Jan 2025
https://github.com/lfmramos/tool-stock-visualization
This Python script creates a combined plot of stock price and revenue data for a given stock.
beautifulsoup pandas plotly python yfinance-api
Last synced: 19 Jan 2025
https://github.com/germanpaul12/scrape-stoic-quotes-with-requests-and-beautifulsoup
Scraping Stoic Quotes with BeautifulSoup and Requests
beautifulsoup beautifulsoup4 requests scraper scraping stoicism
Last synced: 01 Jan 2025
https://github.com/layerex/books-dl
A command-line utility for downloading books from a certain wonderful site.
beautifulsoup beautifulsoup4 books cli dl downloader
Last synced: 13 Dec 2024
https://github.com/navneetguptacse/image-scraper.img
Scrapify - Unleash the Power of Image Scraping with Flask and Machine Learning.
beautifulsoup development flask miniconda notebook python web-scraping
Last synced: 12 Jan 2025
https://github.com/austinjhunt/ibew-data-scraper
Automation of merged data collection and cleaning from ibew.org and unionfacts.com for an Upwork job.
beautifulsoup multithreading python scraper union upwork web
Last synced: 18 Oct 2024
https://github.com/mangesh-balkawade/advariskasm
This repository contains Python code used to scrape a website using Beautiful Soup and other libraries
beautifulsoup python webscrapping
Last synced: 01 Jan 2025
https://github.com/mangesh-balkawade/advariskassignment
This repository contains Python code for web scraping using Beautiful Soup
beautifulsoup pandas python3 webscraping
Last synced: 01 Jan 2025
https://github.com/bilalhassankhan007/scraping-movies-data-from-imdb-using-python
Created a python function to automate scraping of top movies from IMDB for any given genre using BeautifulSoup
beautifulsoup functions imdb imdb-webscrapping pandas python3
Last synced: 17 Dec 2024
https://github.com/devlocalhost/weathergr
weathergr - a wrapper for meteo.gr
beautifulsoup beautifulsoup4 python python-3 python3 weather weather-data weather-forecast weather-station web-scraping-python webscraping
Last synced: 16 Jan 2025
https://github.com/parth-paradkar/how_long
Python script to find the total duration of a YouTube playlist and other related information from its URL
beautifulsoup python web-scraping youtube-playlist
Last synced: 10 Jan 2025
https://github.com/jmoussa/movie-scraper
Scrape theater, movie and time from IMDB using python BeautifulSoup4 library
beautifulsoup beautifulsoup4 endpoint imdb json movie python rest-api scraper
Last synced: 20 Jan 2025
https://github.com/xkomil/filmweb-webscrapper
Collect the ratings of a chosen movie from the website, returned as JSON
Last synced: 24 Dec 2024
https://github.com/rampa2510/beautifulsoup4
All the data scraping programs I created using the beautifulsoup4 Python module
beautifulsoup beautifulsoup4 python python-3 python-library python3
Last synced: 20 Jan 2025
https://github.com/ireneflorez/vocab-web-scraper
Web scraping tool to extract data (word list) from websites. Data is downloaded into an indexed sqlite3 list using frequency count and length.
argparse beautifulsoup pycharm pytests python
Last synced: 11 Jan 2025
https://github.com/mohnish88/web-scrapping
In this project, I used web scraping tools to extract data from daraz.pk, a popular e-commerce platform. Utilizing the BeautifulSoup and Selenium libraries in Python, I was able to efficiently navigate the website, extract valuable information on product listings, prices, and reviews, and store the data for further analysis.
beautifulsoup beautifulsoup4 html-parsing python-web-scraping scrapping-python selenium selenium-python selenium-webdriver web-scraping
Last synced: 05 Jan 2025
https://github.com/michaelkinfu/hknews-headline-analysis
The Hong Kong News headline analysis project was conducted by the Chinese University of Hong Kong Library.
beautifulsoup deep-learning digital-scholarship folium historical-newspapers machine-learning spacy yolov5
Last synced: 18 Dec 2024
https://github.com/jai0212/course-selector-uoft
A program that suggests university courses based on your program and provides a custom GUI to customize your course plan. Created to help UofT students
beautifulsoup csv graphs graphs-algorithms networkx-graph numpy os parser pickle pillow plotly python tkinter-gui tkmacosx
Last synced: 19 Jan 2025
https://github.com/adagio/ivoox_categories
Extract categories and subcategories from iVoox, using concurrent processes
async beautifulsoup beautifulsoup4 concurrency concurrent futures
Last synced: 27 Dec 2024