Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.


Projects in Awesome Lists tagged with beautifulsoup

A curated list of projects in awesome lists tagged with beautifulsoup.

https://github.com/judahpaul16/social-media-analytics

A webscraper that grabs statistics from various social media sites based on user input and organizes the data into a CSV file.

beautifulsoup python requests selenium social-media tkinter webscraping

Last synced: 11 Jan 2025

https://github.com/005-bot/monitor

The service periodically scans the page listing current outages, detects changes, and publishes them to Redis PubSub.

beautifulsoup httpx monitoring mvp pipenv pubsub python redis web-scraping

Last synced: 20 Dec 2024

https://github.com/miranizam/online-library

A script that helps you parse books and information about them from the site tululu.org. You can also deploy your own site with the books and make it available offline.

argparse beautifulsoup css-selectors github-pages parser python3 website

Last synced: 20 Nov 2024

https://github.com/ser-arthur/rent-finder-bot

A web scraper that collects rental listings from Zillow and submits them to a Google Form, using BeautifulSoup and Selenium for automation.

beautifulsoup google-sheets-api python rentals webautomation webscraping

Last synced: 31 Dec 2024

https://github.com/thmslmr/shangai-ranking-scraper

🎓 Work with Shanghai Ranking - Python

beautifulsoup python python-scraper scraper shangai-ranking

Last synced: 29 Dec 2024

https://github.com/aixerum/yelp-scraping

A web scraping tool that records business addresses, phone numbers, websites, supported areas, and other relevant information about companies from Yelp.com.

beautifulsoup python scraper selenium yelp

Last synced: 20 Dec 2024

https://github.com/monarch1108/web_scraping

This project involves extracting textual data from 15 Wikipedia pages focused on various philosophies and philosophers, utilizing Beautiful Soup and Scrapy for efficient web scraping.

beautifulsoup python requests scraping-websites scrapy

Last synced: 22 Dec 2024

https://github.com/gogoasa-c/stayspotter-scraper

Web scraper for finding suitable stays on Booking.com, built with Python 3.9, Flask, and BeautifulSoup.

beautifulsoup flask python

Last synced: 22 Dec 2024

https://github.com/soham2008xyz/putlocker-api

[DEPRECATED] Back-end parser to find & parse stream URLs from online video putlockers. JSON API exposed for consumption.

beautifulsoup flask json python

Last synced: 06 Dec 2024

https://github.com/pb319/scrapify

The repository contains some beginner-friendly resources to help you start web-scraping using Beautiful Soup.

beautifulsoup python webscraping

Last synced: 20 Nov 2024

https://github.com/dantesc03/web-scraping-trabajo-2

This project is a Python-based web scraper and data analyzer that extracts quotes, authors, and associated tags from the website Quotes to Scrape. It processes the data to create structured CSV files and includes functionalities for filtering and analyzing quotes by tags.

beautifulsoup jupyter-notebook learn nltk nltk-python nltk-tokenizer python web-scraping

Last synced: 06 Dec 2024

https://github.com/alvarovelascoprieto/scraperegela

This script handles the HTTP requests (cookies, tokens, redirects...) needed to log in to egela (Moodle) and downloads all the PDFs for the course. It also writes all available assignments, along with their due dates, to a CSV document.

beautifulsoup http requests scraping

Last synced: 22 Dec 2024

https://github.com/goyalyatin/web-crawler

A web crawler that fetches data from Wikipedia and stores it in MongoDB.

beautifulsoup mongodb python3 webcrawler

Last synced: 03 Jan 2025

https://github.com/sadmanca/imdb-scraper

Scrapes IMDb's movie database and outputs the data to CSV files.

beautifulsoup data-scraping imdb numpy pandas python requests

Last synced: 06 Dec 2024

https://github.com/greed2411/soa

Stack Overflow answers: this Python script takes a question or error from the user, displays the top 15 results from Stack Overflow, and opens the user's chosen result in a browser.

beautifulsoup bs4 python-script requests scraping-websites stackoverflow stackoverflow-answer user-choice webbrowser

Last synced: 18 Jan 2025

https://github.com/thekartikeyamishra/book-scraper

A Python-based project that scrapes book details from the website Books to Scrape using Beautiful Soup and pandas. This project extracts book titles, prices, and availability statuses from the website and exports the data to a CSV file for further analysis or usage.

beautifulsoup beautifulsoup4 pandas python requests webscraper

Last synced: 03 Jan 2025

https://github.com/ojengwa/gfe

Web scraping with Flask, BS4, and RQ on Heroku

beautifulsoup flask heroku python rq scheduled-tasks scraper scraping

Last synced: 24 Nov 2024

https://github.com/antoncoding/html_content_crawler

A smart HTML crawler that automatically detects the main content of an HTML page.

beautifulsoup content crawlers html nlp-parsing parser

Last synced: 15 Dec 2024

https://github.com/sarthakjshetty/red

Developing a database of species threats and stresses from the IUCN Red List. Published in Conservation Letters 2021.

beautifulsoup bots iucn-red-list python3 scrapper selenium

Last synced: 06 Jan 2025

https://github.com/semoal/pythoncrawler

Python crawler with XMLRPC & BeautifulSoup

beautifulsoup crawler python wordpress xmlrpc

Last synced: 15 Dec 2024

https://github.com/malek-logh/realestatemappertool

This project involves scraping real estate data from https://www.mubawab.tn, cleaning it and inserting it into a PostgreSQL database, and performing clustering analysis with visualization on a map of Tunisia.

beautifulsoup docker folium postgres python selenium

Last synced: 06 Dec 2024

https://github.com/jacknormand/tfrrs-topqualifer

A data collection program for college track

beautifulsoup postgresql python webscraping

Last synced: 09 Jan 2025

https://github.com/jimmymugendi/british-airways-virtual-internship

This repo focuses on web scraping data from British Airways customer reviews and analysing the data to uncover new insights. We get to work with the famous BeautifulSoup and requests, as well as pandas for dataframe manipulation and matplotlib for better visualizations of our data.

beautifulsoup matplotlib pandas requests

Last synced: 16 Jan 2025

https://github.com/eliasbnk/scrapethissite-forms

Hockey Teams: Forms, Searching and Pagination - Walkthrough

beautifulsoup python requests webscraping

Last synced: 20 Nov 2024

https://github.com/eliasbnk/scrapethissite-simple

Countries of the World: A Simple Example - Walkthrough

beautifulsoup python requests webscraping

Last synced: 20 Nov 2024

https://github.com/limagabri/nba-player-statistics-scraper

This is a Python script to scrape NBA player statistics data from the official NBA website.

beautifulsoup json pyhton selenium webscraper webscraping

Last synced: 03 Jan 2025

https://github.com/mananag-1784/nse-web-scrapper

A web scraper powered by Selenium and Beautiful Soup that extracts stock data.

beautifulsoup stocks webscraping

Last synced: 03 Jan 2025

https://github.com/keikoro/c3speakers

A program to fetch Twitter handles of CCC speakers for a given year. 💬🐦

beautifulsoup python python3 twitter webscraping

Last synced: 24 Nov 2024

https://github.com/winterwind/moviereccomendationsystem

My personal summer project: a program that prompts the user to enter the desired genre(s) and keyword(s) and outputs a list of movies that match that query; results print in the terminal.

beautifulsoup beautifulsoup-library beautifulsoup4 cinemagoer filesystem html-parser imdb imdb-dataset imdb-webscrapping imdbpy lxml python python3 regex requests-library terminal-based try-catch txt-files webscraping webscrapping-python

Last synced: 07 Dec 2024

https://github.com/martachesnova/web-scraping-indeed

Scraped data-related job listings (including descriptions) from Indeed.com for a group project. I used the Python libraries Splinter and BeautifulSoup.

beautifulsoup python splinter web-scraping

Last synced: 06 Jan 2025

https://github.com/leftcoastnerdgirl/webscraping_and_beautifulsoup

This project uses Beautiful Soup to scrape data from a news website.

beautifulsoup data-visualization jupyter-notebook splinter webscraping

Last synced: 04 Jan 2025

https://github.com/martachesnova/web-scraping-usajobs.gov

Scraped job listings (including the job descriptions) from USAjobs.gov for a team project. Used the Python libraries Splinter and BeautifulSoup.

beautifulsoup python splinter web-scraping

Last synced: 06 Jan 2025

https://github.com/ondrejhruby/datascrapper

A Python script for scraping and extracting data from specific sources, designed to automate data collection and streamline information retrieval.

automation beautifulsoup data-cleaning data-extraction data-processing error-handling python requests scripting web-scraping

Last synced: 21 Nov 2024

https://github.com/pddiii/season-ticket-prices

SeatGeek MLB Home Team Season Ticket Prices

beautifulsoup json mlb pandas python requests-python seatgeek seatgeek-api

Last synced: 04 Jan 2025

https://github.com/qzcool/uscis-case-status-estimation-system-stat-ez

Estimates time of case results arrival, for applicants who are waiting for their USCIS case results with the receipt numbers at hand.

beautifulsoup crawler immigration web

Last synced: 21 Nov 2024

https://github.com/mindful-ai-assistants/sp2024-election-analysis

🗳️ An analysis of voting patterns in São Paulo's 2024 elections, focusing on voter behavior, absenteeism, and geographic trends.

beautifulsoup data-analysis data-science datavisualization geolocation power-bi python

Last synced: 21 Nov 2024

https://github.com/mulaza/anime-movie-list-web-scraper

🍥 A Python web scraper that scrapes the IMDb top 150 anime movie list page for movie ranks, titles, runtimes, and descriptions and saves them to a .json file.

automation beautifulsoup beautifulsoup4 python web-scraper webscraper webscrapers webscraping webscrapping

Last synced: 25 Nov 2024

https://github.com/ejw-data/web-scraping-builtin-jobs

Simple web scrape using requests and Beautiful Soup to collect the best workplaces from Builtin Chicago.

beautifulsoup python requests

Last synced: 21 Nov 2024

https://github.com/kvdomingo/autocomeback

Web scraper to pull comeback/debut data from dbkpop and r/kpop

beautifulsoup cloud-functions kpop pubsub python web-scraping

Last synced: 04 Jan 2025

https://github.com/margaretkhendre/mars-articles-vs-data-scraping-challenge

In this repository, titles and preview text are scraped from Mars news articles in the part_1 Jupyter notebook. In the part_2 Jupyter notebook, Mars weather data is scraped and analyzed.

beautifulsoup html splinter webscraping

Last synced: 09 Jan 2025

https://github.com/ashwinsomi/news-analyzer

Scrapes news articles from a given publisher and analyzes their emotional tone using a sentiment analyzer.

beautifulsoup python3 vader-sentiment-analysis webscraping

Last synced: 21 Nov 2024

https://github.com/chanmeng666/douban-elite-scraper

A Python scraper that collects and archives elite posts and images from Douban groups, organizing content into Markdown format.

beautifulsoup content-archiving data-collection douban image-downloader markdown python web-scraping

Last synced: 25 Nov 2024

https://github.com/asier-ortiz/recipe-scraper

Web scraper using Beautiful Soup

beautifulsoup python

Last synced: 07 Dec 2024

https://github.com/gill-singh-a/github-analytics-tool

A program written in Python that uses the requests module to fetch and analyze publicly available information about a GitHub account.

beautifulsoup beautifulsoup4 git github html-parser python requests scrapping scrapping-python

Last synced: 04 Jan 2025

https://github.com/gauravmcode/medicine-info

A Flutter app that fetches information about medicines.

beautifulsoup dart flutter webscraping

Last synced: 21 Nov 2024

https://github.com/jwc20/fcisapi

A scraper API for fetching workplace fatality and catastrophe reports from the United States Department of Labor website.

accidents-analysis beautifulsoup beautifulsoup4 fatality labor python requests scraper scraper-api workplace

Last synced: 25 Nov 2024

https://github.com/aadewunmi/scraping-numbers-from-html-using-beautifulsoup

Scraping numbers from HTML using BeautifulSoup. This program uses urllib to read the HTML from the data files, parses the data, extracts the numbers, and computes their sum.

beautifulsoup mysql python3 web-scraping

Last synced: 25 Nov 2024

https://github.com/nataliabeltranarg/nlp-booking-scraping-sentimentanalysis

Natural Language Processing sentiment analysis of Booking.com. Rental price impact of SONAR festival in Barcelona using difference in difference and OLS.

beautifulsoup data-science natural-language-processing nltk python selenium text-mining webscraping

Last synced: 22 Nov 2024

https://github.com/hygull/pyrandocs

A repository (pyrandocs, i.e. Python v2/v3 random documents/files) containing various Python v2/v3 resources (files, documents, etc.) on random topics. A project for trying out different Python features.

base64 beautifulsoup class exception-handling hashlib json jwt-authentication loops matplotlib nltk numpy pandas python2 python27 python3 requests searching time-complexity urllib urllib2

Last synced: 22 Nov 2024

https://github.com/praneethravuri/newton-classroom-automation

A bot that utilizes webscraping techniques to navigate through the Google Meet interface, enter specified classes at the designated times, and exit the classes once the sessions are complete.

beautifulsoup bs4 python selenium webscraping

Last synced: 27 Nov 2024

https://github.com/barkerbg001/thiniversescraper

A Python tool to extract 3D printing models & data from Thingiverse. Customizable, automated, and user-friendly.

beautifulsoup bulkdata python thingiverse webscraping xlsx

Last synced: 22 Nov 2024

https://github.com/joeylemon/reviewtracker

A Python web service that extracts review details from LendingTree.

beautifulsoup flask pytest python venv

Last synced: 22 Nov 2024

https://github.com/godsonkoithodathu/musical-time-machine

Developed a Python program that utilizes spotipy to create or add a playlist to your Spotify account based on a selected year. The program leverages requests to retrieve data, BeautifulSoup for web scraping relevant music lists, and os for environment management.

beautifulsoup python3 requests spotipy-library

Last synced: 04 Jan 2025

https://github.com/sourasishbasu/producttracker

Track price drops and check for product availability.

beautifulsoup python script selenium

Last synced: 09 Jan 2025

https://github.com/areal060781/web-scraping

A web scraping application that fetches weather forecast information and presents it in a terminal.

beautifulsoup beautifulsoup4 selenium webdataextraction webharvesting webscraping

Last synced: 19 Jan 2025

https://github.com/moenessgannouni/linkedin-scrapping

A project focused on web scraping to extract useful data such as profile names, job titles, and locations from LinkedIn.

beautifulsoup python selenium webdriver webscraping

Last synced: 11 Oct 2024

https://github.com/yuji1702/valorant-data-scrapper

The Valorant Data Collector is a Python-based tool that scrapes and collects detailed player statistics from VLR.gg. It allows users to search for players, extract their performance data, and export the results into a CSV file. With support for multithreaded scraping, it efficiently gathers data on agents used, key performance metrics, and more.

beautifulsoup concurrent-scraping csv-export data-collection game-analytics machine-learning multithreading player-statistics python requests valorant vlrgg web-scraping

Last synced: 22 Nov 2024

https://github.com/alfastrek/textinsight-pro

Webscraper with Natural Language Processing

beautifulsoup jupyter-notebook nltk numpy pandas python webscraping

Last synced: 22 Nov 2024

https://github.com/comsavvy/scrapping_challenge

Web scraping with Beautiful Soup @10Academy

beautifulsoup twitter web-scraping

Last synced: 25 Nov 2024

https://github.com/aymane-maghouti/jumia-data-pipeline

This project focuses on extracting data from the Jumia website using Beautiful Soup, storing it in an Excel file with Pandas, and then transferring the data to a PostgreSQL database using SQLAlchemy and Pandas.

beautifulsoup data-pipeline etl pandas postgresql python web-scraping

Last synced: 17 Jan 2025

https://github.com/mattmoony/webcrawler.py

A very simple Python web crawler. This is just a fun little side project, which I used to gather some valuable experience with advanced Python and web techniques. 🐍

beautifulsoup crawler indexing mongodb multithreading pymongo python spider web webcrawler

Last synced: 19 Jan 2025

https://github.com/jdm79/newspaper-scraper-template

Basic version of my web scraper which scrapes newspaper headlines and sends the results to a PostgreSQL website

beautifulsoup beautifulsoup4 bs4 postgresql python scraping template webscraping

Last synced: 11 Jan 2025

https://github.com/tech-c-p/conversai

ConversAI is an innovative conversational AI framework designed for intelligent text extraction and querying across various document formats and web content, leveraging advanced natural language processing techniques.

beautifulsoup chatbot genai gradio groq langchain large-language-models llama3 mlops nlp ocr pymupdf python

Last synced: 08 Dec 2024

https://github.com/vgvr0/el-corte-ingles-supermarket-scraper

A Python script for web scraping various product categories from an online supermarket (elcorteingles.es) and saving product details into a CSV file

beautifulsoup chromedriver el-corte-ingles el-corte-ingles-scraping requests scraping-python selenium selenium-base selenium-python supermarket-scraping

Last synced: 26 Nov 2024

https://github.com/vgvr0/dia-supermarket-scraper

A Python script for web scraping various product categories from an online supermarket (dia.es) and saving product details into a CSV file

beautifulsoup chromedriver dia-scraper scraper scraping seleniumbase seleniun-python supermarket-scraper supermarket-scraping supermercado-dia-scraper undetected-chromedriver

Last synced: 26 Nov 2024

https://github.com/aka-sh11/live-weather-update

Get live weather updates for your desired city.

beautifulsoup pillow tkinter-gui weather-app

Last synced: 08 Dec 2024

https://github.com/pizzart/jam-analysis

A Python script that scrapes a Ludum Dare user's game ratings and shows them as a graph.

beautifulsoup ludum-dare matplotlib python selenium

Last synced: 20 Dec 2024