Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists tagged with beautifulsoup
A curated list of projects in awesome lists tagged with beautifulsoup.
https://github.com/judahpaul16/social-media-analytics
A webscraper that grabs statistics from various social media sites based on user input and organizes the data into a CSV file.
beautifulsoup python requests selenium social-media tkinter webscraping
Last synced: 11 Jan 2025
https://github.com/005-bot/monitor
The service periodically scans a page listing current outages, detects changes, and sends them to Redis PubSub.
beautifulsoup httpx monitoring mvp pipenv pubsub python redis web-scraping
Last synced: 20 Dec 2024
https://github.com/amber-abuah/amazon-rating-predictor
MultinomialNB classifier for predicting Amazon review ratings.
beautifulsoup gradio imblearn machine-learning ml naive-bayes-classifier nlp nltk pandas scikit-learn sentiment-analysis sentiment-classification tf-idf
Last synced: 21 Dec 2024
https://github.com/miranizam/online-library
A script that helps you parse books and their information from the site tululu.org. You can also deploy your own site with the books and make it available offline.
argparse beautifulsoup css-selectors github-pages parser python3 website
Last synced: 20 Nov 2024
https://github.com/sadafasad/banks-mc-etl-pipeline
Banks' market capital ETL data pipeline
apache-airflow beautifulsoup docker numpy pandas python requests sqlite
Last synced: 10 Jan 2025
https://github.com/ser-arthur/rent-finder-bot
A web scraper that collects rental listings from Zillow and submits them to a Google Form, using BeautifulSoup and Selenium for automation.
beautifulsoup google-sheets-api python rentals webautomation webscraping
Last synced: 31 Dec 2024
https://github.com/bryanwilhite/guido-van-r-snake
my self-educational coils of Python (pronounced 🧶 “throat-wobbler mangrove” 🐍)
anaconda audio-analysis audio-processing beautifulsoup data-science data-visualization image-processing jupyter-notebook python python-3 visual-studio-code
Last synced: 31 Dec 2024
https://github.com/busraozdemir0/python_mini_projects
beautifulsoup json-api python webscraping
Last synced: 07 Dec 2024
https://github.com/thmslmr/shangai-ranking-scraper
:mortar_board: Work with Shanghai Ranking - Python
beautifulsoup python python-scraper scraper shangai-ranking
Last synced: 29 Dec 2024
https://github.com/aixerum/yelp-scraping
Web scraping tool used to record business addresses, phone numbers, website, supported area and other relevant information of companies from Yelp.com
beautifulsoup python scraper selenium yelp
Last synced: 20 Dec 2024
https://github.com/monarch1108/web_scraping
This project involves extracting textual data from 15 Wikipedia pages focused on various philosophies and philosophers, utilizing Beautiful Soup and Scrapy for efficient web scraping.
beautifulsoup python requests scraping-websites scrapy
Last synced: 22 Dec 2024
https://github.com/alich03/linkedin-scrapping-using-selenium
Scraping LinkedIn data using BeautifulSoup and Selenium
beautifulsoup bs4 googlemaps-api linkedin-scraper scrapping selenium webscrapping
Last synced: 20 Dec 2024
https://github.com/datvodinh/japan-stock-data-crawling
Japan data
beautifulsoup crawling requests selenium-python stock-market
Last synced: 17 Nov 2024
https://github.com/hayatiyrtgl/poems_scraper
beautifulsoup beautifulsoup4 poetry python scraper
Last synced: 22 Dec 2024
https://github.com/gogoasa-c/stayspotter-scraper
Web scraper for finding suitable stays on Booking.com, built with Python 3.9, Flask, and BeautifulSoup.
Last synced: 22 Dec 2024
https://github.com/lexhouk/goit-pyweb-hw-09
Module 9: Web scraping
beautiful-soup beautifulsoup beautifulsoup4 goit goit-pyweb-hw-09 lxml lxml-html parser parsing python-beautiful-soup python-beautifulsoup python-parser python-scraper python-scraping scraper scraping scrapy studying xpath xpath-queries
Last synced: 17 Nov 2024
https://github.com/soham2008xyz/putlocker-api
[DEPRECATED] Back-end parser to find & parse stream URLs from online video putlockers. JSON API exposed for consumption.
beautifulsoup flask json python
Last synced: 06 Dec 2024
https://github.com/pb319/scrapify
The repository contains some beginner-friendly resources to help you start web-scraping using Beautiful Soup.
beautifulsoup python webscraping
Last synced: 20 Nov 2024
https://github.com/thijswillemmoens/scraping_nba_stats
Trying to do some data scraping from the Basketball Reference website.
beautifulsoup beautifulsoup4 nba nba-stats pandas pandas-python python3 scraping scraping-websites sports-analytics
Last synced: 01 Jan 2025
https://github.com/dantesc03/web-scraping-trabajo-2
This project is a Python-based web scraper and data analyzer that extracts quotes, authors, and associated tags from the website Quotes to Scrape. It processes the data to create structured CSV files and includes functionalities for filtering and analyzing quotes by tags.
beautifulsoup jupyter-notebook learn nltk nltk-python nltk-tokenizer python web-scraping
Last synced: 06 Dec 2024
https://github.com/alvarovelascoprieto/scraperegela
This script handles the HTTP requests (cookies, tokens, redirects...) needed to log in to egela (Moodle) and downloads all of the course's PDFs. It also writes all available assignments, along with their due dates, to a CSV document.
beautifulsoup http requests scraping
Last synced: 22 Dec 2024
https://github.com/goyalyatin/web-crawler
Web crawler that fetches data from Wikipedia and stores it in MongoDB
beautifulsoup mongodb python3 webcrawler
Last synced: 03 Jan 2025
https://github.com/sadmanca/imdb-scraper
Scrapes IMDb's movie database and outputs the data to CSV files.
beautifulsoup data-scraping imdb numpy pandas python requests
Last synced: 06 Dec 2024
https://github.com/greed2411/soa
Stack Overflow answers: this Python script takes a question or error from the user, displays the top 15 results from Stack Overflow, and opens the user's chosen result in a browser.
beautifulsoup bs4 python-script requests scraping-websites stackoverflow stackoverflow-answer user-choice webbrowser
Last synced: 18 Jan 2025
https://github.com/thekartikeyamishra/book-scraper
A Python-based project that scrapes book details from the website Books to Scrape using Beautiful Soup and pandas. This project extracts book titles, prices, and availability statuses from the website and exports the data to a CSV file for further analysis or usage.
beautifulsoup beautifulsoup4 pandas python requests webscraper
Last synced: 03 Jan 2025
https://github.com/ojengwa/gfe
Web scrapy with Flask, BS4, RQ on Heroku
beautifulsoup flask heroku python rq scheduled-tasks scraper scraping
Last synced: 24 Nov 2024
https://github.com/antoncoding/html_content_crawler
A smart HTML crawler that automatically detects the main content of an HTML page.
beautifulsoup content crawlers html nlp-parsing parser
Last synced: 15 Dec 2024
https://github.com/i-am-nolan25/python-for-data-science-ai-development
Jupyter Notebooks from Python for Data Science, AI & Development Course
api beautifulsoup data-science data-visualization datacollection datastructures matplotlib numpy pandas python rest-api webscraping
Last synced: 16 Jan 2025
https://github.com/sarthakjshetty/red
Developing a database of species threats and stresses from the IUCN Red List. Published in Conservation Letters 2021.
beautifulsoup bots iucn-red-list python3 scrapper selenium
Last synced: 06 Jan 2025
https://github.com/thekamik/web-scraping-lotto-data
Scraping lotto data using selenium
beautifulsoup beautifulsoup4 python python3 selenium selenium-python selenium-webdriver selenium-webdriver-python web-scraping
Last synced: 23 Dec 2024
https://github.com/semoal/pythoncrawler
Python crawler with XMLRPC & BeautifulSoup
beautifulsoup crawler python wordpress xmlrpc
Last synced: 15 Dec 2024
https://github.com/malek-logh/realestatemappertool
This project involves scraping real estate data from https://www.mubawab.tn, cleaning it and inserting it into a PostgreSQL database, and performing clustering analysis with visualization on a map of Tunisia.
beautifulsoup docker folium postgres python selenium
Last synced: 06 Dec 2024
https://github.com/jacknormand/tfrrs-topqualifer
A data collection program for college track
beautifulsoup postgresql python webscraping
Last synced: 09 Jan 2025
https://github.com/jimmymugendi/british-airways-virtual-internship
This repo focuses on web scraping data from British Airways customer reviews and analysing the data to uncover new insights. We get to work with the famous BeautifulSoup and requests, as well as pandas for dataframe manipulation and matplotlib for better visualizations of our data.
beautifulsoup matplotlib pandas requests
Last synced: 16 Jan 2025
https://github.com/eliasbnk/scrapethissite-forms
Hockey Teams: Forms, Searching and Pagination - Walkthrough
beautifulsoup python requests webscraping
Last synced: 20 Nov 2024
https://github.com/eliasbnk/scrapethissite-simple
Countries of the World: A Simple Example - Walkthrough
beautifulsoup python requests webscraping
Last synced: 20 Nov 2024
https://github.com/limagabri/nba-player-statistics-scraper
This is a Python script to scrape NBA player statistics data from the official NBA website.
beautifulsoup json pyhton selenium webscraper webscraping
Last synced: 03 Jan 2025
https://github.com/mananag-1784/nse-web-scrapper
Creating a web scraper powered by Selenium and Beautiful Soup to extract stock data
beautifulsoup stocks webscraping
Last synced: 03 Jan 2025
https://github.com/keikoro/c3speakers
A program to fetch Twitter handles of CCC speakers for a given year. :neckbeard: :speech_balloon::bird:
beautifulsoup python python3 twitter webscraping
Last synced: 24 Nov 2024
https://github.com/rmncldyo/google-reverse-image-search
A simple Python wrapper designed for leveraging Google's search by image capabilities to perform reverse image searches programmatically.
beautifulsoup beautifulsoup4 crawler google google-image google-image-crawler google-image-scraper google-image-search google-images google-reverse-image-crawler google-reverse-image-scraper google-reverse-image-search image image-search python python3 requests reverse-image-search scraper search-by-image
Last synced: 04 Jan 2025
https://github.com/winterwind/moviereccomendationsystem
My personal summer project: a program that prompts the user to enter the desired genre(s) and keyword(s) and outputs a list of movies that match that query; results print in the terminal
beautifulsoup beautifulsoup-library beautifulsoup4 cinemagoer filesystem html-parser imdb imdb-dataset imdb-webscrapping imdbpy lxml python python3 regex requests-library terminal-based try-catch txt-files webscraping webscrapping-python
Last synced: 07 Dec 2024
https://github.com/martachesnova/web-scraping-indeed
Web scraped Data-related jobs (including descriptions) from Indeed.com for a group project. I used Python libraries Splinter and BeautifulSoup.
beautifulsoup python splinter web-scraping
Last synced: 06 Jan 2025
https://github.com/leftcoastnerdgirl/webscraping_and_beautifulsoup
This project uses Beautiful Soup to scrape data from a news website.
beautifulsoup data-visualization jupyter-notebook splinter webscraping
Last synced: 04 Jan 2025
https://github.com/martachesnova/web-scraping-usajobs.gov
Web scraped job listings (including the job descriptions) from USAjobs.gov for a team project. Used Python libraries Splinter and BeautifulSoup.
beautifulsoup python splinter web-scraping
Last synced: 06 Jan 2025
https://github.com/ondrejhruby/datascrapper
A Python script for scraping and extracting data from specific sources, designed to automate data collection and streamline information retrieval.
automation beautifulsoup data-cleaning data-extraction data-processing error-handling python requests scripting web-scraping
Last synced: 21 Nov 2024
https://github.com/pddiii/season-ticket-prices
SeatGeek MLB Home Team Season Ticket Prices
beautifulsoup json mlb pandas python requests-python seatgeek seatgeek-api
Last synced: 04 Jan 2025
https://github.com/qzcool/uscis-case-status-estimation-system-stat-ez
Estimates time of case results arrival, for applicants who are waiting for their USCIS case results with the receipt numbers at hand.
beautifulsoup crawler immigration web
Last synced: 21 Nov 2024
https://github.com/miguel-a-barbosa/talento-tech-intermedio
Talento Tech Data Analysis and Visualization, Intermediate level. 📊
beautifulsoup data-analytics data-visualisation excel flourish google-colab matplotlib numpy pandas powerbi scikit-learn seaborn sqlite
Last synced: 07 Dec 2024
https://github.com/guermoud98/cuisinerecipeanalysis
beautifulsoup jupiter-notebook pandas python requests-library-python
Last synced: 25 Nov 2024
https://github.com/mindful-ai-assistants/sp2024-election-analysis
🗳️ An analysis of voting patterns in São Paulo's 2024 elections, focusing on voter behavior, absenteeism, and geographic trends.
beautifulsoup data-analysis data-science datavisualization geolocation power-bi python
Last synced: 21 Nov 2024
https://github.com/mulaza/anime-movie-list-web-scraper
🍥 A Python web scraper that scrapes the IMDb Top 150 anime movie list page for movie ranks, titles, runtimes, and descriptions and saves them to a .json file.
automation beautifulsoup beautifulsoup4 python web-scraper webscraper webscrapers webscraping webscrapping
Last synced: 25 Nov 2024
https://github.com/ejw-data/web-scraping-builtin-jobs
Simple web scrape using requests and Beautiful Soup to collect the best workplaces from Builtin Chicago.
Last synced: 21 Nov 2024
https://github.com/kvdomingo/autocomeback
Web scraper to pull comeback/debut data from dbkpop and r/kpop
beautifulsoup cloud-functions kpop pubsub python web-scraping
Last synced: 04 Jan 2025
https://github.com/margaretkhendre/mars-articles-vs-data-scraping-challenge
In this repository, titles and preview text are scraped from Mars news articles in the part_1 Jupyter notebook. In the part_2 Jupyter notebook, Mars weather data is scraped and analyzed.
beautifulsoup html splinter webscraping
Last synced: 09 Jan 2025
https://github.com/ashwinsomi/news-analyzer
Scrapes news from a given publisher on the internet and analyzes its emotion using a sentiment analyzer.
beautifulsoup python3 vader-sentiment-analysis webscraping
Last synced: 21 Nov 2024
https://github.com/chanmeng666/douban-elite-scraper
A Python scraper that collects and archives elite posts and images from Douban groups, organizing content into Markdown format.
beautifulsoup content-archiving data-collection douban image-downloader markdown python web-scraping
Last synced: 25 Nov 2024
https://github.com/asier-ortiz/recipe-scraper
Web scraper using Beautiful Soup
Last synced: 07 Dec 2024
https://github.com/gill-singh-a/github-analytics-tool
A program made in Python that uses the requests module to fetch and analyze publicly available information about a GitHub account
beautifulsoup beautifulsoup4 git github html-parser python requests scrapping scrapping-python
Last synced: 04 Jan 2025
https://github.com/gauravmcode/medicine-info
A Flutter app to fetch information of medicines
beautifulsoup dart flutter webscraping
Last synced: 21 Nov 2024
https://github.com/ltfschoen/aind-nlp-bookworm
Term 2 Project 3: NLP using the IBM Watson Discovery and Conversation services and the IBM Bluemix platform
beautifulsoup bluemix natural-language-processing pillow sentiment-analysis watson-conversation watson-developer-cloud watson-discovery wordcloud wordcloud-library
Last synced: 04 Jan 2025
https://github.com/thunchanokbow/extract-data-from-website
beautifulsoup python3 selenium selenium-webdriver
Last synced: 09 Jan 2025
https://github.com/jwc20/fcisapi
A scraper API for fetching workplace fatality and catastrophe reports from the United States Department of Labor website.
accidents-analysis beautifulsoup beautifulsoup4 fatality labor python requests scraper scraper-api workplace
Last synced: 25 Nov 2024
https://github.com/aadewunmi/scraping-numbers-from-html-using-beautifulsoup
Scraping Numbers from HTML using BeautifulSoup. This program uses urllib to read the HTML from the data files below, parses the data, extracts the numbers, and computes the sum of the numbers in the file.
beautifulsoup mysql python3 web-scraping
Last synced: 25 Nov 2024
https://github.com/nataliabeltranarg/nlp-booking-scraping-sentimentanalysis
Natural Language Processing sentiment analysis of Booking.com. Rental price impact of the SONAR festival in Barcelona using difference-in-differences and OLS.
beautifulsoup data-science natural-language-processing nltk python selenium text-mining webscraping
Last synced: 22 Nov 2024
https://github.com/hygull/pyrandocs
A repository (pyrandocs, i.e. Python v2/v3 random documents/files) containing various Python v2/v3 resources (files, documents, etc.) on random topics. A project for trying out different Python features.
base64 beautifulsoup class exception-handling hashlib json jwt-authentication loops matplotlib nltk numpy pandas python2 python27 python3 requests searching time-complexity urllib urllib2
Last synced: 22 Nov 2024
https://github.com/praneethravuri/newton-classroom-automation
A bot that utilizes webscraping techniques to navigate through the Google Meet interface, enter specified classes at the designated times, and exit the classes once the sessions are complete.
beautifulsoup bs4 python selenium webscraping
Last synced: 27 Nov 2024
https://github.com/barkerbg001/thiniversescraper
A Python tool to extract 3D printing models & data from Thingiverse. Customizable, automated, and user-friendly.
beautifulsoup bulkdata python thingiverse webscraping xlsx
Last synced: 22 Nov 2024
https://github.com/joeylemon/reviewtracker
A Python web service to extract review details from LendingTree
beautifulsoup flask pytest python venv
Last synced: 22 Nov 2024
https://github.com/anas436/html-web-scraping4-using-beautifulsoup
beautifulsoup jupyter-notebook lxml python3 requests
Last synced: 06 Dec 2024
https://github.com/godsonkoithodathu/musical-time-machine
Developed a Python program that utilizes spotipy to create or add a playlist to your Spotify account based on a selected year. The program leverages requests to retrieve data, BeautifulSoup for web scraping relevant music lists, and os for environment management.
beautifulsoup python3 requests spotipy-library
Last synced: 04 Jan 2025
https://github.com/anas436/html-web-scraping3-using-beautifulsoup
beautifulsoup html jupyter-notebook lxml python3 requests
Last synced: 06 Dec 2024
https://github.com/sourasishbasu/producttracker
Track price drops and check for product availability
beautifulsoup python script selenium
Last synced: 09 Jan 2025
https://github.com/areal060781/web-scraping
Web scraping application that fetches weather forecast information and presents it in a terminal.
beautifulsoup beautifulsoup4 selenium webdataextraction webharvesting webscraping
Last synced: 19 Jan 2025
https://github.com/anas436/wikipedia-web-scraping1-using-beautifulsoup
beautifulsoup html jupyter-notebook lxml pandas python3 requests
Last synced: 06 Dec 2024
https://github.com/anas436/html-web-scraping1-using-beautifulsoup
beautifulsoup html5 jupyter-notebook lxml python3 requests
Last synced: 06 Dec 2024
https://github.com/moenessgannouni/linkedin-scrapping
A project focused on web scraping to extract useful data such as profile names, job titles, and locations from LinkedIn.
beautifulsoup python selenium webdriver webscraping
Last synced: 11 Oct 2024
https://github.com/anas436/ibm-data-analyst-capstone-project-with-python
beautifulsoup html5lib http ibm-cognos-analytics ibm-watson-studio iframe image json jupyter-notebook jupyterlab matplotlib-pyplot openpyxl os pandas python3 requests requests-library-python seaborn sqlite3 wget
Last synced: 06 Dec 2024
https://github.com/yuji1702/valorant-data-scrapper
The Valorant Data Collector is a Python-based tool that scrapes and collects detailed player statistics from VLR.gg. It allows users to search for players, extract their performance data, and export the results into a CSV file. With support for multithreaded scraping, it efficiently gathers data on agents used, key performance metrics, and more.
beautifulsoup concurrent-scraping csv-export data-collection game-analytics machine-learning multithreading player-statistics python requests valorant vlrgg web-scraping
Last synced: 22 Nov 2024
https://github.com/alfastrek/textinsight-pro
Webscraper with Natural Language Processing
beautifulsoup jupyter-notebook nltk numpy pandas python webscraping
Last synced: 22 Nov 2024
https://github.com/comsavvy/scrapping_challenge
Web scraping with Beautiful Soup @10Academy
beautifulsoup twitter web-scraping
Last synced: 25 Nov 2024
https://github.com/ekocbiyik/python_scripts
Scripts to make things easier
beautifulsoup cpu-usage memory-usage python python-server ram-usage scraper scrapy simple-server
Last synced: 25 Nov 2024
https://github.com/aymane-maghouti/jumia-data-pipeline
This project focuses on extracting data from the Jumia website using Beautiful Soup, storing it in an Excel file with Pandas, and then transferring the data to a PostgreSQL database using SQLAlchemy and Pandas.
beautifulsoup data-pipeline etl pandas postgresql python web-scraping
Last synced: 17 Jan 2025
https://github.com/mattmoony/webcrawler.py
A very simple Python web crawler. This is just a fun little side project, which I used to gather some valuable experience with advanced Python and web techniques. 🐍
beautifulsoup crawler indexing mongodb multithreading pymongo python spider web webcrawler
Last synced: 19 Jan 2025
https://github.com/jdm79/newspaper-scraper-template
Basic version of my web scraper which scrapes newspaper headlines and sends the results to a PostgreSQL website
beautifulsoup beautifulsoup4 bs4 postgresql python scraping template webscraping
Last synced: 11 Jan 2025
https://github.com/tech-c-p/conversai
ConversAI is an innovative conversational AI framework designed for intelligent text extraction and querying across various document formats and web content, leveraging advanced natural language processing techniques.
beautifulsoup chatbot genai gradio groq langchain large-language-models llama3 mlops nlp ocr pymupdf python
Last synced: 08 Dec 2024
https://github.com/vgvr0/el-corte-ingles-supermarket-scraper
A Python script for web scraping various product categories from an online supermarket (elcorteingles.es) and saving product details into a CSV file
beautifulsoup chromedriver el-corte-ingles el-corte-ingles-scraping requests scraping-python selenium selenium-base selenium-python supermarket-scraping
Last synced: 26 Nov 2024
https://github.com/vgvr0/dia-supermarket-scraper
A Python script for web scraping various product categories from an online supermarket (dia.es) and saving product details into a CSV file
beautifulsoup chromedriver dia-scraper scraper scraping seleniumbase seleniun-python supermarket-scraper supermarket-scraping supermercado-dia-scraper undetected-chromedriver
Last synced: 26 Nov 2024
https://github.com/anas436/html-web-scraping2-using-beautifulsoup
beautifulsoup html jupyter-notebook lxml python3 requests
Last synced: 06 Dec 2024
https://github.com/aka-sh11/live-weather-update
Get live weather updates for your desired city.
beautifulsoup pillow tkinter-gui weather-app
Last synced: 08 Dec 2024
https://github.com/pizzart/jam-analysis
Python script to web scrape a Ludum Dare user's game ratings and show them as a graph
beautifulsoup ludum-dare matplotlib python selenium
Last synced: 20 Dec 2024