Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists tagged with beautifulsoup
A curated list of projects in awesome lists tagged with beautifulsoup .
https://github.com/nelcifranmagalhaes/web_crawler
A web crawler for all Naruto characters
anime beautifulsoup characters crawler naruto python
Last synced: 03 Dec 2024
https://github.com/tj2904/pp-api
An API providing sentiment-scored news for PositivePress using the Vader NLP algorithm. A consuming frontend can be found at: https://github.com/tj2904/positive-press
beautifulsoup fastapi feedparser nltk vader-sentiment-analysis
Last synced: 28 Dec 2024
https://github.com/r00tman/gmailnotify
Simple GMail new message notifications through Atom feed
beautifulsoup gmail libnotify linux python requests
Last synced: 09 Dec 2024
https://github.com/nanorobocop/beautifulsoup-for-confluence
Patch for BeautifulSoup to work with Confluence's namespaced tags and attributes
beautifulsoup confluence docker python3 xml
Last synced: 09 Dec 2024
https://github.com/vincent-g-van/timed-web-scraper
Script takes in a url, class type, and class name and uses this information to scrape a list. In time intervals, the script will check to see if the site has updated that particular list.
beautifulsoup timer webscraper
Last synced: 03 Dec 2024
https://github.com/mdimado/medraga
MedRaga is a medical assistance application aimed at providing accurate and personalized medical information to healthcare professionals. It uses RAG technology to retrieve the latest medical research from trusted sources, augment it with patient data, and deliver personalized diagnoses and treatment plans.
arxiv beautifulsoup cohere fastapi firebase firebase-auth gemini-api langchain langchain-python playwright qdrant qdrant-vector-database rag reactjs selenium
Last synced: 22 Dec 2024
https://github.com/danhilse/web-scraper
A versatile Python-based web scraper that extracts content from single URLs or entire sitemaps, organizing data into structured text files. Features include sitemap parsing, content grouping by URL structure, and an easy-to-use command-line interface. Ideal for data extraction, content analysis, and web research tasks.
beautifulsoup cli-tool data-extraction python sitemap-parser web-scraping
Last synced: 10 Dec 2024
https://github.com/snehawk20/elasticsearch-search-engine
A search engine which takes keywords as queries and retrieves a ranked list of results
beautifulsoup elasticsearch flask okapi-bm25
Last synced: 11 Dec 2024
https://github.com/rahulvictor12/the-movie-database-data-scrapper
A Python web scraper that collects movie data from The Movie Database (TMDB). It uses `requests`, `BeautifulSoup`, and `pandas` to extract titles, ratings, genres, and cast details from multiple pages. The data is structured into DataFrames and saved as a CSV, perfect for analysis or integration into projects.
beautifulsoup colab-notebook dataframes numpy pandas python requests testing webscraping
Last synced: 30 Dec 2024
https://github.com/ubeydgur/job-posting-scraper
Scraping job postings from the website with BeautifulSoup.
beautifulsoup excel pandas python requests scraper web-scraping
Last synced: 13 Jan 2025
https://github.com/pustovitdmytro/parimatch
parser for famous bookmaker site
beautifulsoup parimatch parser python3 re scrapper selenium
Last synced: 10 Jan 2025
https://github.com/gajendrasharma-github/web-scraping
Using Selenium and Beautiful Soup
beautifulsoup python scraping-websites selenium
Last synced: 11 Oct 2024
https://github.com/layerex/bandcamp_list_albums
List albums from bandcamp page of artist or label.
bandcamp beautifulsoup beautifulsoup4 cli
Last synced: 13 Dec 2024
https://github.com/dnyaneshvn/1mg-scrapper
This Scrapper extracts medicine details from the 1mg website using Selenium and BeautifulSoup. It efficiently collects and saves data like names, prices, images, etc. into CSV files.
1mg 1mg-scrapper beautifulsoup onemg selenium-python webscraping
Last synced: 06 Jan 2025
https://github.com/dreamjet31/python-web-scrapping
beautifulsoup college python web-scraping
Last synced: 18 Dec 2024
https://github.com/discdiver/new-belgium-ratings
Find the most popular New Belgium beers of all time!
beautifulsoup data-analysis pandas python seaborn webscraping
Last synced: 10 Jan 2025
https://github.com/isatyamks/beautifulsoup-webscraper
my learning journey into web scraping using the BeautifulSoup library
beautifulsoup data-science python3 webscraping
Last synced: 18 Dec 2024
https://github.com/code-alchemist01/intellifist-ai
An artificial intelligence model that randomly generates questions according to class, course and subject for the educational application we will make.
api beautifulsoup fastapi gemini python random-generation scraping-python scraping-websites token tokenization
Last synced: 31 Oct 2024
https://github.com/kartikeya443/jarvis_automation
Voice Controlled Virtual Assistant
ai automation beautifulsoup nlp ocr pytorch selenium tensorflow voice-assistant voice-recognition
Last synced: 25 Dec 2024
https://github.com/shyjal/python-issuu-to-jpg
Beautifulsoup scrapper for downloading issuu document to jpg images using embed id or document id
beautifulsoup issuu python python-issuu
Last synced: 25 Dec 2024
https://github.com/hjsblogger/async-io-python
Demonstration of asyncio in Python using a string of different usecases (or test scenarios)
asyncio automation beautifulsoup beautifulsoup4 python-asynchronous python-asyncio requests selenium selenium-python web-scraping webscraping
Last synced: 11 Oct 2024
https://github.com/yusufcinarci/scrape_discord_bot
Bu proje, Python kullanarak oluşturulan bir Discord botunu içerir. Bot, belirli web sitelerinden veri kazıma işlemini otomatize eder ve Discord sunucusunda verilen komutlarla kazılmış olan verileri belirtilen mail adreslerine gönderir.
beautifulsoup discord discord-bot fortytwofficial python scraper webscraping yusufcinarci
Last synced: 26 Dec 2024
https://github.com/eversmile12/bs-data-scraping
⚗️ Testing some web-scraping using BeautifulSoup, Python, and some SQLAlchemy
beautifulsoup scraper webscraping
Last synced: 27 Dec 2024
https://github.com/seeratawan01/webscraper
Simple Website Scraper using ElectionJS And Python
beautifulsoup electron electron-app eletronjs nodejs python requests scraper tldextract webscraping
Last synced: 02 Jan 2025
https://github.com/yjg30737/wiki-offline
convert Wikipedia html into txt which makes it able to read offline
beautifulsoup python python3 python37 python38 urllib web-crawler web-crawling wiki wikipedia
Last synced: 03 Jan 2025
https://github.com/ruanbekker/python-web-scraper
Web Scraper that Scrapes my Blogs and Ingest the Data into Elasticsearch
beautifulsoup elasticsearch python scraper webscraper
Last synced: 07 Jan 2025
https://github.com/oxylabs/beautiful-soup-parsing-tutorial
Using Python & Beautiful Soup to Parse Data
beautiful-soup-parsing beautifulsoup beautifulsoup-html-parser github-python how-to-use-beautiful-soup-in-python parser python python-beautiful-soup python-web-scraper
Last synced: 17 Jan 2025
https://github.com/tanaybhadula/pricestark
A price comparison website that collects products and their prices from various websites. Utilised BeautifulSoup library to scrape data and Flask to develop the website.
beautifulsoup content-aggregator flask python
Last synced: 10 Jan 2025
https://github.com/drkenreid/deadlinkscraper
A short Python script built in Google colab for iteratively searching subdomains for dead links.
beautifulsoup scraper scraping scraping-websites
Last synced: 31 Dec 2024
https://github.com/preetvadaliya/weather-app
Simple python program to get real time weather data of your location.
beautifulsoup python python3 weather-app webscraping
Last synced: 05 Dec 2024
https://github.com/supercoolx/web-scraping-beautifulsoup-selenium
Web scraping using beautiful-soup module
Last synced: 11 Jan 2025
https://github.com/rikardfahlstrom/python_jumpstart_course
Code from "Python Jumpstart by Building 10 Apps" by Mike Kennedy.
beautifulsoup pycharm python requests
Last synced: 04 Jan 2025
https://github.com/moindalvs/web_scraping_amazon_product_reviews
Web Scraping Amazon Reviews with Multiple Pages loop till the Last Page
amazon beautifulsoup docker html nlp review scrapy splash webscraping
Last synced: 18 Jan 2025
https://github.com/gappeah/nike_web_crawler
This project involves web scraping Nike's product pages to extract product names, prices, and links. The project showcases three different implementations of the web crawler using Selenium and BeautifulSoup. It also includes visualisation of the scraped data using Matplotlib and Seaborn.
beautifulsoup data-analysis data-visualization python selenium web-crawler web-scraper webcrawler webscraper webscraping webscraping-beautifulsoup
Last synced: 10 Nov 2024
https://github.com/srinivasrm/graphics_cards_analysis_and_application
In the current project I have extracted graphics card current prices from an authorizer retailer in India and performed analysis
beautifulsoup data-analysis data-science data-visualization etl graphic-card-price-prediction graphics-card graphics-card-analysis heroku-database machine-learning matplotlib pgsql python regression scikit-learn seaborn sql streamlit webapplication
Last synced: 08 Jan 2025
https://github.com/akshay-rajan/gsmarena-api
Fetch smartphone specifications from GSMArena
beautifulsoup django django-rest-framework gsmarena gsmarena-api python webscraping
Last synced: 06 Jan 2025
https://github.com/kenwuqianghao/internship-opportunity-web-scraper
A web scraper that finds opportunities and internships for highschoolers
beautifulsoup python webscraper
Last synced: 27 Dec 2024
https://github.com/qgelado/qgelado-machine-learning
🤖 Projeto desenvolvido para analisar os preços dos sorvetes em estoque, auxiliando na definição de preços competitivos
api aprendizado-de-maquina beautifulsoup jupyter machine-learning matplotlib pandas requests selenium sklearn
Last synced: 12 Jan 2025
https://github.com/allendema/beauty_reichelt
Scrapes reichelt.de for products. With BeautifulSoup.
beautifulsoup bs4 e-commerce python reichelt requests scraper scraping
Last synced: 12 Jan 2025
https://github.com/navneetguptacse/web-scraping.scr
Python's micro web framework (Flask) - based web scraping tool for `Flipkart's` product data extraction - descriptions, ratings, and reviews using Python and beautiful-soup.
beautifulsoup flask flipkart-scraper-python micro-framework python scrapping website
Last synced: 12 Jan 2025
https://github.com/naveenaidu/google-crawler
Google Crawler - Curates the search results
Last synced: 18 Jan 2025
https://github.com/abhimanyu-gaurav/webscraping
WebScraping using Beautiful Soup .
beautifulsoup bs4 html5 python requests
Last synced: 20 Jan 2025
https://github.com/karantyagi/web-crawler
BFS and DFS implementations for a wikipedia crawler
Last synced: 12 Jan 2025
https://github.com/kinoute/scraper-allocine
Just playing with BeautifulSoup and Python to scrap some movies on Allocine.fr.
allocine beautifulsoup csv docker movies postgresql python scraping scraping-websites
Last synced: 09 Nov 2024
https://github.com/layer-se7en/web-scraping-sandbox
Python scripts for scraping data from Scrapethissite.com
aiohttp beautifulsoup beautifulsoup4 python sandbox webscraping
Last synced: 09 Jan 2025
https://github.com/harinadh76/finder-deck
Finder Desk is a python based web application where user can find Internships related to specific tech stack and it sends email updates according to previous searches .This project uses scraping tools to extract live data from different websites.
beautifulsoup flask flask-application python python3 smtplib webscraping
Last synced: 19 Jan 2025
https://github.com/saadarazzaq/sentimart
Amazon Reviews Sentiment Analysis Web App 🙂
beautifulsoup nltk pandas python streamlit-webapp
Last synced: 23 Nov 2024
https://github.com/somada141/mangastream-downloader
Simple scripts that download mangastream manga chapters through the URL of the first page.
beautifulsoup compression downloader manga mangastream python requests scraper zip
Last synced: 19 Jan 2025
https://github.com/vishalshenoy2002/github-tool
A Github Tool which helps people extract repositories and see issues in github
beauti beautifulsoup beautifulsoup4 bs4 github github-automation github-issues issues python python-3 python3 repository request-module requests web-scraping
Last synced: 13 Jan 2025
https://github.com/russbiggs/italydemonyms
D3.js map of Italian comuni and their demonyms 🇮🇹
Last synced: 11 Jan 2025
https://github.com/dmickelson/newssentimentanalyzerproject
Analyzes top current headlines from major RSS news feed, Uses DistilBERT model fine-tuned for sentiment analysis. Provides a GradioUI for ease of use.
beautifulsoup gradio huggingface logging pinecone pytest python pytorch rss sentiment-analysis tqdm webscraping yaml
Last synced: 27 Nov 2024
https://github.com/kailuser/funpay-scrapper
This library provides convenient methods for scraping data from Funpay, a popular online marketplace for buying and selling game items
beautifulsoup bs4 funpay pypi pypi-package pypi-source requests scrapping
Last synced: 14 Nov 2024
https://github.com/othreecodes/isastudent
{{name}} Is a Student ?
api beautifulsoup request webscraping
Last synced: 20 Jan 2025
https://github.com/alimghmi/crypto-news-etl
A simple ETL data pipeline using python and sqlite3
beautifulsoup crawling etl-pipeline python scraper sqlite3
Last synced: 11 Jan 2025
https://github.com/pushpakrai/web-crawler-
A scalable Python web crawler for efficient, polite extraction, analysis, and ranking of web content, ideal for SEO, data analysis, and development.
beautifulsoup pandas python3 seo web-crawler
Last synced: 20 Jan 2025
https://github.com/mython-dev/parser-news
This bot can parse the news from the site kun.uz :)
aiogram beautifulsoup parser parsing python telegram-bot
Last synced: 22 Nov 2024
https://github.com/dmitry-zharinov/books-library
Парсинг и создание оффлайн-библиотеки
beautifulsoup django web-scraper
Last synced: 16 Jan 2025
https://github.com/abdelmajidlh/web_scraping
amazon-scraper beautifulsoup bs4 requests web-scraping
Last synced: 28 Nov 2024
https://github.com/abdelmajidlh/scrap_pubmed_articles_python
Downloading scientific articles using #beautifulsoup in #python (#Pubmed IDs). Private use for educational purposes only - The author has "no responsibility" for illegal use.
beautifulsoup beautifulsoup4 biology pubmed python scraping webscraping
Last synced: 28 Nov 2024
https://github.com/mohnish88/web-scrapping
In this project, I used web scraping tools to extract data from daraz.pk, a popular e-commerce platform. Utilizing the BeautifulSoup and Selenium libraries in Python, I was able to efficiently navigate the website, extract valuable information on product listings, prices, and reviews, and store the data for further analysis.
beautifulsoup beautifulsoup4 html-parsing python-web-scraping scrapping-python selenium selenium-python selenium-webdriver web-scraping
Last synced: 05 Jan 2025
https://github.com/rampa2510/beautifulsoup4
All the data scraping programs i created using the beautifulsoup4 python module
beautifulsoup beautifulsoup4 python python-3 python-library python3
Last synced: 20 Jan 2025
https://github.com/jaiswalchitransh/analyzing-historical-stock-data
This project focuses on extracting and visualizing stock data using Python libraries such as yfinance for historical stock prices and web scraping techniques to gather company revenue data. It provides a comprehensive analysis by plotting both stock prices and revenues over time for companies like Tesla and GameStop.
artificial-intelligence beautifulsoup bs4 data-science gamestop jupyter jupyter-notebook machine-learning matplotlib pandas plotly project python python-3 python3 requests stock-market tesla webscraping yfinance
Last synced: 01 Oct 2024
https://github.com/fskydev/list-youtube
Automatically create and post YouTube playlists and videos
automation beautifulsoup pandas python python3 scraper selenium tkinter youtube
Last synced: 13 Dec 2024
https://github.com/sakan811/find-common-japanese-character-from-news
Showcase visualizations about common Japanese characters that appear in the news
beautifulsoup beautifulsoup4 data-analysis dataanalysis japanese japanese-language language news powerbi requests sqlite sqlite3 visualization webscraper webscraping
Last synced: 05 Jan 2025
https://github.com/moenessgannouni/linkedin-scrapping
A project focused on web scraping to extract useful data such as profile names, job titles and Location from LinkedIn.
beautifulsoup python selenium webdriver webscraping
Last synced: 11 Oct 2024
https://github.com/thekamik/web-scraping-lotto-data
Scraping lotto data using selenium
beautifulsoup beautifulsoup4 python python3 selenium selenium-python selenium-webdriver selenium-webdriver-python web-scraping
Last synced: 23 Dec 2024
https://github.com/anaritagomes/webscraping_beautifulsoup
Web Scraping + creation and connection to sqlite3 db with BeautifulSoup
beautifulsoup sqlite3 webscraping
Last synced: 05 Jan 2025
https://github.com/viiviiiix/scrape-this-site-sandbox
A collection of projects that is used to learn web scraping.
beautifulsoup python scrape-this-site web-scraping
Last synced: 13 Jan 2025
https://github.com/ahlem-phantom/django-travel-agency
Travel Agency website built using Django, designed for browsing and booking of travel packages.
beautifulsoup celery djagno django-rest-framework numpy rabbitmq weasyprint
Last synced: 19 Dec 2024
https://github.com/vvipjain/hockey-tournament-analysis
Hockey Tournament Analysis
beautifulsoup data data-analysis data-visualization databases pandas pandas-dataframe powerbi python python-library python-script requests-library-python sql sql-server sqlalchemy
Last synced: 19 Dec 2024
https://github.com/lazycatcoder/autoriadataparser
Python parser extracting data about cars and sellers from the AutoRia website
autoria autoriaparser autoriatool beautifulsoup carscraper parsing python pythonscraper selenium webscraping
Last synced: 27 Dec 2024
https://github.com/alibarkhordargeek/advanced_py_project
پروژه پایان دوره پایتون پیشرفته
beautifulsoup bs4 machine-learning mariadb ml mysql python regex requests sklearn sql webscraping
Last synced: 23 Oct 2024
https://github.com/adagio/ivoox_categories
Extract categories and subcategories from iVoox, using concurrent processes
async beautifulsoup beautifulsoup4 concurrency concurrent futures
Last synced: 27 Dec 2024
https://github.com/ireneflorez/vocab-web-scraper
Web scraping tool to extract data (word list) from websites. Data is downloaded into an indexed sqlite3 list using frequency count and length.
argparse beautifulsoup pycharm pytests python
Last synced: 11 Jan 2025
https://github.com/jmoussa/movie-scraper
Scrape theater, movie and time from IMDB using python BeautifulSoup4 library
beautifulsoup beautifulsoup4 endpoint imdb json movie python rest-api scraper
Last synced: 20 Jan 2025
https://github.com/devlocalhost/weathergr
weathergr - a wrapper for meteo.gr
beautifulsoup beautifulsoup4 python python-3 python3 weather weather-data weather-forecast weather-station web-scraping-python webscraping
Last synced: 16 Jan 2025
https://github.com/austinjhunt/ibew-data-scraper
Automation of merged data collection and cleaning from ibew.org and unionfacts.com for an Upwork job.
beautifulsoup multithreading python scraper union upwork web
Last synced: 18 Oct 2024
https://github.com/sarrabenyahia/tuto-webscraping
webscraping course tutorial
api beautifulsoup headers javascript proxy requests scrapy selenium user-agent webscraping
Last synced: 05 Jan 2025
https://github.com/jasocami/health-insurance-spider
Scrap health insurances with BeautifulSoup and Scrapy, playground
beautifulsoup django redis scrapy
Last synced: 14 Dec 2024
https://github.com/abgeo/fine-bot
Scrapper bot for videos.police.ge
beautifulsoup bot python scrapper
Last synced: 23 Dec 2024
https://github.com/arya-io/flipkart-data-scraping
A data scraping project to extract product information such as names, prices, descriptions, and ratings from Flipkart using Selenium and BeautifulSoup.
beautifulsoup data-science data-scraping flipkart pandas python selenium web-scraping
Last synced: 20 Dec 2024
https://github.com/emirhansilsupur/web-scraper-news
Web Scraper for News Related to the Israel-Palestine Conflict
beautifulsoup python selenium-webdriver webscraping
Last synced: 10 Jan 2025
https://github.com/manasjadhav0086/data-scraping-using-beautiful-soup
This project demonstrates how to scrape movie data from IMDb using Python. The notebook contains code to extract specific information about movies, such as their title, genre, release year, and more, for data analysis or visualization.
Last synced: 20 Dec 2024
https://github.com/diegoperea20/web_scraping
Web scraping practice using beautiful soup and selenium libraries
beautifulsoup python selenium web-scraping
Last synced: 13 Jan 2025
https://github.com/jaiswalchitransh/historical-stock-and-revenue-data-analysis
This project focuses on extracting and visualizing stock data using Python libraries such as yfinance for historical stock prices and web scraping techniques to gather company revenue data. It provides a comprehensive analysis by plotting both stock prices and revenues over time for companies like Tesla and GameStop.
artificial-intelligence beautifulsoup bs4 data-science gamestop jupyter jupyter-notebook machine-learning matplotlib pandas plotly project python python-3 python3 requests stock-market tesla webscraping yfinance
Last synced: 12 Jan 2025
https://github.com/germanpaul12/petrol-price-scraper-heidelberg-beautifulsoup-requests
Automatically scrape petrol prices for Heidelberg area and push them to Github
automation beautifulsoup heidelberg petrol requests scraper scraping
Last synced: 01 Jan 2025
https://github.com/germanpaul12/automating-hacker-news-and-weather-mails
Project for my Raspberry Pi to send me mails when it rains and to inform with hot tech news
beautifulsoup beautifulsoup4 data hacker-news openweather-api raspberry-pi requests
Last synced: 01 Jan 2025
https://github.com/jdm79/basic-bs4
A quick intro to web scraping with Python and BeautifulSoup
beautifulsoup python webscraping
Last synced: 11 Jan 2025
https://github.com/brianj-4/vlr-scraper
Python vlr.gg scraper
beautifulsoup esports fastapi python vlrgg webscraper webscraping
Last synced: 06 Jan 2025
https://github.com/fdauti/stock_performance
Analyzing Stock Performance with yfinance, bs4, pandas and plotly
beautifulsoup bs4 pandas plotly yfinance
Last synced: 13 Jan 2025
https://github.com/inightjar/flibcartscraping
FlibCart scraping some products with pyhon beautiful soup library
beautifulsoup python webscraping
Last synced: 12 Jan 2025
https://github.com/joe-mccarthy/harlow-bindicator
Sends notification on binday and what bin needs putting out.
beautifulsoup essex github-actions harlow ntfy ntfysh python selenium-webdriver
Last synced: 10 Oct 2024
https://github.com/mostafahima/billboard-hot-100-to-spotify-playlist
This Python script allows you to travel back in time and create a Spotify playlist of the Billboard Hot 100 songs for any specified date.
api api-client beautifulsoup billboards-hot-100 python-dotenv requests spotipy-library
Last synced: 28 Dec 2024
https://github.com/mostafahima/price-tracker-bot
A price tracking tool that scrapes product prices using BeautifulSoup and sends email alerts when prices drop below a target.
Last synced: 28 Dec 2024
https://github.com/pb319/scrap_with_selenium
Let's dive deeper into the domain of web scraping using Selenium.
beautifulsoup pandas pandas-dataframe python python-script selenium
Last synced: 11 Oct 2024
https://github.com/marcusjhang/scraper
Google Sheets -> Python Scraper -> Postgres DB -> React Frontend
beautifulsoup googlesheetsapi nodejs postgresql reactjs
Last synced: 14 Dec 2024