Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.


Projects in Awesome Lists tagged with beautifulsoup

A curated list of projects in awesome lists tagged with beautifulsoup.

https://github.com/nelcifranmagalhaes/web_crawler

A web crawler for all Naruto characters

anime beautifulsoup characters crawler naruto python

Last synced: 03 Dec 2024

https://github.com/tj2904/pp-api

An API providing sentiment-scored news for PositivePress using the Vader NLP algorithm. A consuming frontend can be found at: https://github.com/tj2904/positive-press

beautifulsoup fastapi feedparser nltk vader-sentiment-analysis

Last synced: 28 Dec 2024

https://github.com/r00tman/gmailnotify

Simple Gmail new-message notifications through an Atom feed

beautifulsoup gmail libnotify linux python requests

Last synced: 09 Dec 2024

https://github.com/nanorobocop/beautifulsoup-for-confluence

Patch for BeautifulSoup to work with Confluence's namespaced tags and attributes

beautifulsoup confluence docker python3 xml

Last synced: 09 Dec 2024

https://github.com/vincent-g-van/timed-web-scraper

The script takes a URL, class type, and class name and uses them to scrape a list. At set time intervals, it checks whether the site has updated that list.

beautifulsoup timer webscraper

Last synced: 03 Dec 2024
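
The check-at-intervals idea in the timed-web-scraper entry above can be sketched roughly as follows. This is a minimal illustration, not the project's code: `watch_list`, its parameters, and the canned snapshots are hypothetical names, and the scraping step (which the project does with BeautifulSoup) is replaced by a caller-supplied function so the sketch runs offline.

```python
import time
from typing import Callable, Iterable, List, Optional


def watch_list(
    fetch_items: Callable[[], Iterable[str]],
    interval_seconds: float,
    max_checks: int,
    sleep: Callable[[float], None] = time.sleep,
) -> List[List[str]]:
    """Poll fetch_items and return each distinct version of the list, in order seen."""
    versions: List[List[str]] = []
    previous: Optional[List[str]] = None
    for check in range(max_checks):
        current = list(fetch_items())
        # Record the list only when it differs from the last one we saw.
        if current != previous:
            versions.append(current)
            previous = current
        # Wait between checks, but not after the final one.
        if check < max_checks - 1:
            sleep(interval_seconds)
    return versions


# Hypothetical stand-in for the scraping step: one canned list per call.
snapshots = iter([["a", "b"], ["a", "b"], ["a", "b", "c"]])
history = watch_list(lambda: next(snapshots), interval_seconds=0, max_checks=3)
print(history)
```

In a real run, `fetch_items` would download the page and extract the list items, and `interval_seconds` would be the polling period.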

https://github.com/mdimado/medraga

MedRaga is a medical assistance application aimed at providing accurate and personalized medical information to healthcare professionals. It uses RAG technology to retrieve the latest medical research from trusted sources, augment it with patient data, and deliver personalized diagnoses and treatment plans.

arxiv beautifulsoup cohere fastapi firebase firebase-auth gemini-api langchain langchain-python playwright qdrant qdrant-vector-database rag reactjs selenium

Last synced: 22 Dec 2024

https://github.com/danhilse/web-scraper

A versatile Python-based web scraper that extracts content from single URLs or entire sitemaps, organizing data into structured text files. Features include sitemap parsing, content grouping by URL structure, and an easy-to-use command-line interface. Ideal for data extraction, content analysis, and web research tasks.

beautifulsoup cli-tool data-extraction python sitemap-parser web-scraping

Last synced: 10 Dec 2024

https://github.com/snehawk20/elasticsearch-search-engine

A search engine which takes keywords as queries and retrieves a ranked list of results

beautifulsoup elasticsearch flask okapi-bm25

Last synced: 11 Dec 2024

https://github.com/rahulvictor12/the-movie-database-data-scrapper

A Python web scraper that collects movie data from The Movie Database (TMDB). It uses `requests`, `BeautifulSoup`, and `pandas` to extract titles, ratings, genres, and cast details from multiple pages. The data is structured into DataFrames and saved as a CSV, perfect for analysis or integration into projects.

beautifulsoup colab-notebook dataframes numpy pandas python requests testing webscraping

Last synced: 30 Dec 2024

https://github.com/ubeydgur/job-posting-scraper

Scraping job postings from the website with BeautifulSoup.

beautifulsoup excel pandas python requests scraper web-scraping

Last synced: 13 Jan 2025

https://github.com/pustovitdmytro/parimatch

Parser for a famous bookmaker site

beautifulsoup parimatch parser python3 re scrapper selenium

Last synced: 10 Jan 2025

https://github.com/layerex/bandcamp_list_albums

List albums from bandcamp page of artist or label.

bandcamp beautifulsoup beautifulsoup4 cli

Last synced: 13 Dec 2024

https://github.com/dnyaneshvn/1mg-scrapper

This scraper extracts medicine details from the 1mg website using Selenium and BeautifulSoup. It collects data such as names, prices, and images and saves it to CSV files.

1mg 1mg-scrapper beautifulsoup onemg selenium-python webscraping

Last synced: 06 Jan 2025

https://github.com/discdiver/new-belgium-ratings

Find the most popular New Belgium beers of all time!

beautifulsoup data-analysis pandas python seaborn webscraping

Last synced: 10 Jan 2025

https://github.com/isatyamks/beautifulsoup-webscraper

my learning journey into web scraping using the BeautifulSoup library

beautifulsoup data-science python3 webscraping

Last synced: 18 Dec 2024

https://github.com/code-alchemist01/intellifist-ai

An artificial intelligence model that randomly generates questions according to class, course and subject for the educational application we will make.

api beautifulsoup fastapi gemini python random-generation scraping-python scraping-websites token tokenization

Last synced: 31 Oct 2024

https://github.com/shyjal/python-issuu-to-jpg

BeautifulSoup scraper for downloading Issuu documents as JPG images using an embed ID or document ID

beautifulsoup issuu python python-issuu

Last synced: 25 Dec 2024

https://github.com/hjsblogger/async-io-python

Demonstration of asyncio in Python using a series of different use cases (or test scenarios)

asyncio automation beautifulsoup beautifulsoup4 python-asynchronous python-asyncio requests selenium selenium-python web-scraping webscraping

Last synced: 11 Oct 2024

https://github.com/yusufcinarci/scrape_discord_bot

This project contains a Discord bot built with Python. The bot automates scraping data from specified websites and, through commands given on the Discord server, sends the scraped data to the specified email addresses.

beautifulsoup discord discord-bot fortytwofficial python scraper webscraping yusufcinarci

Last synced: 26 Dec 2024

https://github.com/eversmile12/bs-data-scraping

⚗️ Testing some web-scraping using BeautifulSoup, Python, and some SQLAlchemy

beautifulsoup scraper webscraping

Last synced: 27 Dec 2024

https://github.com/yjg30737/wiki-offline

Converts Wikipedia HTML into plain text so it can be read offline

beautifulsoup python python3 python37 python38 urllib web-crawler web-crawling wiki wikipedia

Last synced: 03 Jan 2025

https://github.com/ruanbekker/python-web-scraper

Web scraper that scrapes my blogs and ingests the data into Elasticsearch

beautifulsoup elasticsearch python scraper webscraper

Last synced: 07 Jan 2025

https://github.com/tanaybhadula/pricestark

A price comparison website that collects products and their prices from various websites. Utilised BeautifulSoup library to scrape data and Flask to develop the website.

beautifulsoup content-aggregator flask python

Last synced: 10 Jan 2025

https://github.com/drkenreid/deadlinkscraper

A short Python script built in Google colab for iteratively searching subdomains for dead links.

beautifulsoup scraper scraping scraping-websites

Last synced: 31 Dec 2024

https://github.com/preetvadaliya/weather-app

Simple python program to get real time weather data of your location.

beautifulsoup python python3 weather-app webscraping

Last synced: 05 Dec 2024

https://github.com/supercoolx/web-scraping-beautifulsoup-selenium

Web scraping using beautiful-soup module

beautifulsoup python scraping

Last synced: 11 Jan 2025

https://github.com/rikardfahlstrom/python_jumpstart_course

Code from "Python Jumpstart by Building 10 Apps" by Mike Kennedy.

beautifulsoup pycharm python requests

Last synced: 04 Jan 2025

https://github.com/moindalvs/web_scraping_amazon_product_reviews

Web Scraping Amazon Reviews with Multiple Pages loop till the Last Page

amazon beautifulsoup docker html nlp review scrapy splash webscraping

Last synced: 18 Jan 2025

https://github.com/gappeah/nike_web_crawler

This project involves web scraping Nike's product pages to extract product names, prices, and links. The project showcases three different implementations of the web crawler using Selenium and BeautifulSoup. It also includes visualisation of the scraped data using Matplotlib and Seaborn.

beautifulsoup data-analysis data-visualization python selenium web-crawler web-scraper webcrawler webscraper webscraping webscraping-beautifulsoup

Last synced: 10 Nov 2024

https://github.com/kenwuqianghao/internship-opportunity-web-scraper

A web scraper that finds opportunities and internships for high schoolers

beautifulsoup python webscraper

Last synced: 27 Dec 2024

https://github.com/qgelado/qgelado-machine-learning

🤖 Project developed to analyze the prices of ice creams in stock, helping to set competitive prices

api aprendizado-de-maquina beautifulsoup jupyter machine-learning matplotlib pandas requests selenium sklearn

Last synced: 12 Jan 2025

https://github.com/allendema/beauty_reichelt

Scrapes reichelt.de for products. With BeautifulSoup.

beautifulsoup bs4 e-commerce python reichelt requests scraper scraping

Last synced: 12 Jan 2025

https://github.com/navneetguptacse/web-scraping.scr

A web scraping tool built on Flask, Python's micro web framework, that extracts Flipkart product data (descriptions, ratings, and reviews) using Python and Beautiful Soup.

beautifulsoup flask flipkart-scraper-python micro-framework python scrapping website

Last synced: 12 Jan 2025

https://github.com/naveenaidu/google-crawler

Google Crawler - Curates the search results

beautifulsoup crawler scraper

Last synced: 18 Jan 2025

https://github.com/abhimanyu-gaurav/webscraping

Web scraping using Beautiful Soup.

beautifulsoup bs4 html5 python requests

Last synced: 20 Jan 2025

https://github.com/karantyagi/web-crawler

BFS and DFS implementations for a wikipedia crawler

beautifulsoup crawler

Last synced: 12 Jan 2025
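
As a rough illustration of the BFS and DFS crawl orders named in the web-crawler entry above, the sketch below walks a small in-memory link graph. The `LINKS` dictionary and the function names are hypothetical; a real crawler would fetch each page and extract its links (for example with BeautifulSoup) instead of looking them up in a dictionary.

```python
from collections import deque
from typing import Dict, List

# Hypothetical link graph standing in for pages and their outgoing links.
LINKS: Dict[str, List[str]] = {
    "A": ["B", "C"],
    "B": ["D"],
    "C": ["D", "E"],
    "D": [],
    "E": ["A"],
}


def crawl_bfs(start: str, links: Dict[str, List[str]], limit: int = 100) -> List[str]:
    """Visit pages level by level using a FIFO queue."""
    order: List[str] = []
    seen = {start}
    queue = deque([start])
    while queue and len(order) < limit:
        page = queue.popleft()
        order.append(page)
        for target in links.get(page, []):
            if target not in seen:
                seen.add(target)
                queue.append(target)
    return order


def crawl_dfs(start: str, links: Dict[str, List[str]], limit: int = 100) -> List[str]:
    """Follow each link chain as deep as possible using a LIFO stack."""
    order: List[str] = []
    seen = set()
    stack = [start]
    while stack and len(order) < limit:
        page = stack.pop()
        if page in seen:
            continue
        seen.add(page)
        order.append(page)
        # Push in reverse so the first link on the page is explored first.
        for target in reversed(links.get(page, [])):
            if target not in seen:
                stack.append(target)
    return order


bfs_order = crawl_bfs("A", LINKS)
dfs_order = crawl_dfs("A", LINKS)
print(bfs_order)  # ['A', 'B', 'C', 'D', 'E']
print(dfs_order)  # ['A', 'B', 'D', 'C', 'E']
```

The only structural difference between the two is the container: a queue gives breadth-first order, a stack gives depth-first order. The `limit` argument caps the number of pages visited.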

https://github.com/kinoute/scraper-allocine

Just playing with BeautifulSoup and Python to scrape some movies on Allocine.fr.

allocine beautifulsoup csv docker movies postgresql python scraping scraping-websites

Last synced: 09 Nov 2024

https://github.com/layer-se7en/web-scraping-sandbox

Python scripts for scraping data from Scrapethissite.com

aiohttp beautifulsoup beautifulsoup4 python sandbox webscraping

Last synced: 09 Jan 2025

https://github.com/harinadh76/finder-deck

Finder Desk is a Python-based web application where users can find internships related to a specific tech stack; it sends email updates based on previous searches. This project uses scraping tools to extract live data from different websites.

beautifulsoup flask flask-application python python3 smtplib webscraping

Last synced: 19 Jan 2025

https://github.com/saadarazzaq/sentimart

Amazon Reviews Sentiment Analysis Web App 🙂

beautifulsoup nltk pandas python streamlit-webapp

Last synced: 23 Nov 2024

https://github.com/somada141/mangastream-downloader

Simple scripts that download mangastream manga chapters through the URL of the first page.

beautifulsoup compression downloader manga mangastream python requests scraper zip

Last synced: 19 Jan 2025

https://github.com/russbiggs/italydemonyms

D3.js map of Italian comuni and their demonyms 🇮🇹

beautifulsoup d3v3 python

Last synced: 11 Jan 2025

https://github.com/dmickelson/newssentimentanalyzerproject

Analyzes top current headlines from major RSS news feeds using a DistilBERT model fine-tuned for sentiment analysis. Provides a Gradio UI for ease of use.

beautifulsoup gradio huggingface logging pinecone pytest python pytorch rss sentiment-analysis tqdm webscraping yaml

Last synced: 27 Nov 2024

https://github.com/kailuser/funpay-scrapper

This library provides convenient methods for scraping data from Funpay, a popular online marketplace for buying and selling game items

beautifulsoup bs4 funpay pypi pypi-package pypi-source requests scrapping

Last synced: 14 Nov 2024

https://github.com/othreecodes/isastudent

{{name}} Is a Student ?

api beautifulsoup request webscraping

Last synced: 20 Jan 2025

https://github.com/alimghmi/crypto-news-etl

A simple ETL data pipeline using python and sqlite3

beautifulsoup crawling etl-pipeline python scraper sqlite3

Last synced: 11 Jan 2025

https://github.com/pushpakrai/web-crawler-

A scalable Python web crawler for efficient, polite extraction, analysis, and ranking of web content, ideal for SEO, data analysis, and development.

beautifulsoup pandas python3 seo web-crawler

Last synced: 20 Jan 2025

https://github.com/mython-dev/parser-news

This bot can parse the news from the site kun.uz :)

aiogram beautifulsoup parser parsing python telegram-bot

Last synced: 22 Nov 2024

https://github.com/dmitry-zharinov/books-library

Parsing and creating an offline library

beautifulsoup django web-scraper

Last synced: 16 Jan 2025

https://github.com/abdelmajidlh/scrap_pubmed_articles_python

Downloading scientific articles using #beautifulsoup in #python (#Pubmed IDs). Private use for educational purposes only - The author has "no responsibility" for illegal use.

beautifulsoup beautifulsoup4 biology pubmed python scraping webscraping

Last synced: 28 Nov 2024

https://github.com/mohnish88/web-scrapping

In this project, I used web scraping tools to extract data from daraz.pk, a popular e-commerce platform. Utilizing the BeautifulSoup and Selenium libraries in Python, I was able to efficiently navigate the website, extract valuable information on product listings, prices, and reviews, and store the data for further analysis.

beautifulsoup beautifulsoup4 html-parsing python-web-scraping scrapping-python selenium selenium-python selenium-webdriver web-scraping

Last synced: 05 Jan 2025

https://github.com/rampa2510/beautifulsoup4

All the data scraping programs I created using the beautifulsoup4 Python module

beautifulsoup beautifulsoup4 python python-3 python-library python3

Last synced: 20 Jan 2025

https://github.com/jaiswalchitransh/analyzing-historical-stock-data

This project focuses on extracting and visualizing stock data using Python libraries such as yfinance for historical stock prices and web scraping techniques to gather company revenue data. It provides a comprehensive analysis by plotting both stock prices and revenues over time for companies like Tesla and GameStop.

artificial-intelligence beautifulsoup bs4 data-science gamestop jupyter jupyter-notebook machine-learning matplotlib pandas plotly project python python-3 python3 requests stock-market tesla webscraping yfinance

Last synced: 01 Oct 2024

https://github.com/fskydev/list-youtube

Automatically create and post YouTube playlists and videos

automation beautifulsoup pandas python python3 scraper selenium tkinter youtube

Last synced: 13 Dec 2024

https://github.com/moenessgannouni/linkedin-scrapping

A project focused on web scraping to extract useful data such as profile names, job titles, and locations from LinkedIn.

beautifulsoup python selenium webdriver webscraping

Last synced: 11 Oct 2024

https://github.com/anaritagomes/webscraping_beautifulsoup

Web Scraping + creation and connection to sqlite3 db with BeautifulSoup

beautifulsoup sqlite3 webscraping

Last synced: 05 Jan 2025

https://github.com/viiviiiix/scrape-this-site-sandbox

A collection of projects that is used to learn web scraping.

beautifulsoup python scrape-this-site web-scraping

Last synced: 13 Jan 2025

https://github.com/ahlem-phantom/django-travel-agency

Travel Agency website built using Django, designed for browsing and booking of travel packages.

beautifulsoup celery djagno django-rest-framework numpy rabbitmq weasyprint

Last synced: 19 Dec 2024

https://github.com/lazycatcoder/autoriadataparser

Python parser extracting data about cars and sellers from the AutoRia website

autoria autoriaparser autoriatool beautifulsoup carscraper parsing python pythonscraper selenium webscraping

Last synced: 27 Dec 2024

https://github.com/alibarkhordargeek/advanced_py_project

Final project for the advanced Python course

beautifulsoup bs4 machine-learning mariadb ml mysql python regex requests sklearn sql webscraping

Last synced: 23 Oct 2024

https://github.com/adagio/ivoox_categories

Extract categories and subcategories from iVoox, using concurrent processes

async beautifulsoup beautifulsoup4 concurrency concurrent futures

Last synced: 27 Dec 2024

https://github.com/ireneflorez/vocab-web-scraper

Web scraping tool to extract data (word list) from websites. Data is downloaded into an indexed sqlite3 list using frequency count and length.

argparse beautifulsoup pycharm pytests python

Last synced: 11 Jan 2025
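
The storage step described in the vocab-web-scraper entry above (a word list kept in sqlite3 with frequency count and length) might look something like the sketch below. The table name, column names, index, and `store_words` function are assumptions made for illustration, and the input text stands in for whatever the scraper would extract from a page.

```python
import re
import sqlite3
from collections import Counter


def store_words(text: str, conn: sqlite3.Connection) -> None:
    """Count the words in text and store each with its frequency and length."""
    words = re.findall(r"[a-z']+", text.lower())
    counts = Counter(words)
    # Hypothetical schema: one row per distinct word.
    conn.execute(
        "CREATE TABLE IF NOT EXISTS words ("
        "word TEXT PRIMARY KEY, frequency INTEGER NOT NULL, length INTEGER NOT NULL)"
    )
    # Index on frequency so 'most common words' queries stay cheap.
    conn.execute("CREATE INDEX IF NOT EXISTS idx_words_frequency ON words (frequency)")
    conn.executemany(
        "INSERT OR REPLACE INTO words VALUES (?, ?, ?)",
        [(word, count, len(word)) for word, count in counts.items()],
    )
    conn.commit()


conn = sqlite3.connect(":memory:")
store_words("The cat saw the other cat; the dog slept.", conn)
rows = conn.execute(
    "SELECT word, frequency, length FROM words ORDER BY frequency DESC, word"
).fetchall()
print(rows[:2])  # [('the', 3, 3), ('cat', 2, 3)]
```

Using an in-memory database keeps the sketch self-contained; passing a file path to `sqlite3.connect` would persist the list instead.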

https://github.com/jmoussa/movie-scraper

Scrape theater, movie, and time data from IMDb using the Python BeautifulSoup4 library

beautifulsoup beautifulsoup4 endpoint imdb json movie python rest-api scraper

Last synced: 20 Jan 2025

https://github.com/austinjhunt/ibew-data-scraper

Automation of merged data collection and cleaning from ibew.org and unionfacts.com for an Upwork job.

beautifulsoup multithreading python scraper union upwork web

Last synced: 18 Oct 2024

https://github.com/jasocami/health-insurance-spider

Scrape health insurance data with BeautifulSoup and Scrapy; a playground

beautifulsoup django redis scrapy

Last synced: 14 Dec 2024

https://github.com/abgeo/fine-bot

Scraper bot for videos.police.ge

beautifulsoup bot python scrapper

Last synced: 23 Dec 2024

https://github.com/arya-io/flipkart-data-scraping

A data scraping project to extract product information such as names, prices, descriptions, and ratings from Flipkart using Selenium and BeautifulSoup.

beautifulsoup data-science data-scraping flipkart pandas python selenium web-scraping

Last synced: 20 Dec 2024

https://github.com/emirhansilsupur/web-scraper-news

Web Scraper for News Related to the Israel-Palestine Conflict

beautifulsoup python selenium-webdriver webscraping

Last synced: 10 Jan 2025

https://github.com/manasjadhav0086/data-scraping-using-beautiful-soup

This project demonstrates how to scrape movie data from IMDb using Python. The notebook contains code to extract specific information about movies, such as their title, genre, release year, and more, for data analysis or visualization.

beautifulsoup pandas reques

Last synced: 20 Dec 2024

https://github.com/diegoperea20/web_scraping

Web scraping practice using beautiful soup and selenium libraries

beautifulsoup python selenium web-scraping

Last synced: 13 Jan 2025

https://github.com/jaiswalchitransh/historical-stock-and-revenue-data-analysis

This project focuses on extracting and visualizing stock data using Python libraries such as yfinance for historical stock prices and web scraping techniques to gather company revenue data. It provides a comprehensive analysis by plotting both stock prices and revenues over time for companies like Tesla and GameStop.

artificial-intelligence beautifulsoup bs4 data-science gamestop jupyter jupyter-notebook machine-learning matplotlib pandas plotly project python python-3 python3 requests stock-market tesla webscraping yfinance

Last synced: 12 Jan 2025

https://github.com/germanpaul12/petrol-price-scraper-heidelberg-beautifulsoup-requests

Automatically scrape petrol prices for Heidelberg area and push them to Github

automation beautifulsoup heidelberg petrol requests scraper scraping

Last synced: 01 Jan 2025

https://github.com/germanpaul12/automating-hacker-news-and-weather-mails

Project for my Raspberry Pi to send me emails when it rains and to keep me informed of hot tech news

beautifulsoup beautifulsoup4 data hacker-news openweather-api raspberry-pi requests

Last synced: 01 Jan 2025

https://github.com/jdm79/basic-bs4

A quick intro to web scraping with Python and BeautifulSoup

beautifulsoup python webscraping

Last synced: 11 Jan 2025

https://github.com/fdauti/stock_performance

Analyzing Stock Performance with yfinance, bs4, pandas and plotly

beautifulsoup bs4 pandas plotly yfinance

Last synced: 13 Jan 2025

https://github.com/inightjar/flibcartscraping

Scraping some FlibCart products with the Python Beautiful Soup library

beautifulsoup python webscraping

Last synced: 12 Jan 2025

https://github.com/joe-mccarthy/harlow-bindicator

Sends a notification on bin day saying which bin needs putting out.

beautifulsoup essex github-actions harlow ntfy ntfysh python selenium-webdriver

Last synced: 10 Oct 2024

https://github.com/mostafahima/billboard-hot-100-to-spotify-playlist

This Python script allows you to travel back in time and create a Spotify playlist of the Billboard Hot 100 songs for any specified date.

api api-client beautifulsoup billboards-hot-100 python-dotenv requests spotipy-library

Last synced: 28 Dec 2024

https://github.com/mostafahima/price-tracker-bot

A price tracking tool that scrapes product prices using BeautifulSoup and sends email alerts when prices drop below a target.

beautifulsoup scraping-python

Last synced: 28 Dec 2024
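
The alert rule in the price-tracker-bot entry above (notify when the price drops below a target) can be sketched as below. The function names, the US-style price format, and the example address are assumptions for illustration; the sketch only builds the email message and leaves scraping and sending out.

```python
import re
from email.message import EmailMessage
from typing import Optional


def parse_price(text: str) -> float:
    """Extract a number like 1,299.99 from scraped price text (US-style format assumed)."""
    match = re.search(r"\d[\d,]*(?:\.\d+)?", text)
    if match is None:
        raise ValueError(f"no price found in {text!r}")
    return float(match.group().replace(",", ""))


def build_alert(
    product: str, price_text: str, target: float, recipient: str
) -> Optional[EmailMessage]:
    """Return an alert email if the price is below target, otherwise None."""
    price = parse_price(price_text)
    if price >= target:
        return None
    message = EmailMessage()
    message["Subject"] = f"Price drop: {product}"
    message["To"] = recipient
    message.set_content(
        f"{product} is now {price:.2f}, below your target of {target:.2f}."
    )
    return message


alert = build_alert("Widget", "$89.50", 100.0, "me@example.com")
no_alert = build_alert("Widget", "$120.00", 100.0, "me@example.com")
print(alert["Subject"])  # Price drop: Widget
print(no_alert)  # None
```

Sending the message would typically go through `smtplib`, with the server details and credentials supplied by the user.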

https://github.com/pb319/scrap_with_selenium

Let's dive deeper into the domain of web scraping using Selenium.

beautifulsoup pandas pandas-dataframe python python-script selenium

Last synced: 11 Oct 2024

https://github.com/marcusjhang/scraper

Google Sheets -> Python Scraper -> Postgres DB -> React Frontend

beautifulsoup googlesheetsapi nodejs postgresql reactjs

Last synced: 14 Dec 2024