Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Projects in Awesome Lists tagged with beautifulsoup

A curated list of projects in awesome lists tagged with beautifulsoup .

https://github.com/gamemann/selenium-and-beautifulsoup-lab

A full lab and guide on how to use Selenium paired with Beautiful Soup to parse and extract data from a website using Python.

beautifulsoup beautifulsoup4 bs4 firefox geckodriver node nodejs python react selenium selenium-python selenium-webdriver webscraper webscraping

Last synced: 10 Oct 2024

https://github.com/ptyadana/web-scraping-and-api-in-python

Web Scraping and API in Python using beautifulsoup, requests, requests-xml, etc for processing multiple APIs and scraping multple sites such as youtube, soundcloud and many more.

365datascience api beautifulsoup exchangeratesapi-io github-api itunes-api jokes-api jupyter-notebook juypter lxml python3 requests requests-html soundcloud steam urllib webscraping youtube

Last synced: 15 Nov 2024

https://github.com/thejoin95/register.it-dynamic-dns-updater

Allows the dynamic updating of all the existing record, of any type, that is managed by register.it

beautifulsoup dynamic-dns dynamic-dns-updater register

Last synced: 06 Nov 2024

https://github.com/circa10a/web-link-tester

Flask App to scrape hrefs from websites and validate links

beautifulsoup docker-image flask flask-application flask-restful python3 uwsgi-nginx

Last synced: 02 Oct 2024

https://github.com/chaitanyarahalkar/financial-info-extractor

Extract financial information in CSV format for companies compliant to the NSE

beautifulsoup csv-parser data-extraction data-scraping financial-data financial-services python selenium

Last synced: 17 Dec 2024

https://github.com/vigzmv/lyricaly

:musical_note: Lyricaly gets Lyrics delivered to your Terminal for any Song. Uses Python beautifulsoup4 to scrap lyrics. pypi: lyricaly

beautifulsoup lyrics musixmatch scrap songs

Last synced: 26 Dec 2024

https://github.com/omkarpathak/codechef-problem-statement-downloader

Script for downloading codechef problem statements

beautifulsoup codechef codechef-crawler python3 urllib

Last synced: 12 Dec 2024

https://github.com/jbris/vue-python-graphql

A simple search tool to retrieve git repo information from GitHub, GitLab, and Bitbucket. Uses aiohttp and Graphene for server-side API searches, and Vue.js for client-side rendering.

aiohttp apollo apollo-client asyncio beautifulsoup bitbucket bulma docker github gitlab graphene graphene-aiohttp graphql gunicorn python python3 vue vue-aiohttp-graphql vuex yapsy

Last synced: 13 Nov 2024

https://github.com/twtrubiks/google-play-store-spider-bs4-excel

Google-Play-Store-spider use Beautiful Soup on Python to EXCEL

beautifulsoup crawler google-play-store pyexcel python sql-database xlsx

Last synced: 16 Nov 2024

https://github.com/pysrc/bs

Use go to parse html, and just like BeautifulSoup

beautifulsoup go html parse

Last synced: 12 Dec 2024

https://github.com/nigeld3v/Tumblr_Image_scrape

Download ALL the images (JPEG/GIF/PNG) from any Tumblr website! This project employs Python3 and BeautifulSoup4 to scrape a Tumblr site (with the url provided by the user) to download, page by page, all the images from the Tumblr site's posts. Ideal for archiving other peoples' Tumblrs <3

archive art beautifulsoup beautifulsoup4 blog blogging comics design fashion gif gifs graphics graphics-library image images scraper tumblr tumblr-image-scrape webcomics website-scraper

Last synced: 04 Nov 2024

https://github.com/antoniojcosta/alibaba-auto-quote-request

A Python project to automatically request quotations and collect data from Alibaba sellers, using Pyautogui to make the contact and bs4 soup with requests library to make a webscrapping.

alibaba beautifulsoup dataclasses excel interface pyautogui pysimplegui python rpa ui webscraping

Last synced: 02 Nov 2024

https://github.com/ahmedbesbes/scraping-tutorial

code for data scraping Youtube tutorial

beautifulsoup python requests scraping

Last synced: 23 Nov 2024

https://github.com/ferhatgec/weathere

Fegeya WeaTHERE, CLI-based, colorized 'today's weather' client.

beautifulsoup bs4 cli client python3 unix weather without-api

Last synced: 08 Nov 2024

https://github.com/jashanj0tsingh/pythontutorials

Scrape Lyrics from genius.com to a text file. Educational purposes only.

beautifulsoup genius-lyrics genius-lyrics-search lyrics-scraping python tutorial webscraping

Last synced: 05 Nov 2024

https://github.com/twtrubiks/google-play-store-spider-selenium

Google-Play-Store-spider use Selenium +Beautiful Soup on Python

beautifulsoup chrome crawler firefox python selenium spider sqlite

Last synced: 16 Nov 2024

https://github.com/jordicorbilla/web-scraping

Web scraping scripts to extract financial data

balance-sheet beautifulsoup extract-financial-data names-scraping

Last synced: 06 Nov 2024

https://github.com/loopglitch26/stock-sentiment-analysis

Sentiment Analysis of Stock News Headlines using Natural Language Processing

beautifulsoup natural-language-processing sentiment-analysis

Last synced: 20 Nov 2024

https://github.com/praneethravuri/amazon-product-information-scraper

This Python web-scraping project retrieves product names, prices, review stars, and review counts for a specific product category.

amazon amazon-product-scraper beautifulsoup category csv executable online price products python ratings requests reviews scraper scrapes selenium webdriver webscraping website

Last synced: 13 Oct 2024

https://github.com/adityamangal1/image-scraping-python

In general, there are multiple ways that you can download images from a web page. There are even multiple Python packages that can help you with this task. They are both good libraries for pulling data out of HTML.

beautifulsoup bs4-modules functions image-scraper pip python3 requests-module webscrapping-python

Last synced: 08 Nov 2024

https://github.com/mrmimic/data-mining

With data, you can build a kingdom.

beautifulsoup data mining scraping

Last synced: 19 Nov 2024

https://github.com/haasr/indeed-jobs-searcher

Simple CLI-based tool for extracting and storing relevant job info. from bulk job searches on indeed.com

beautifulsoup beautifulsoup4 cli command-line command-line-tool indeed indeed-scraping job-search job-searcher python python-3 python-script python3 selenium selenium-python

Last synced: 12 Nov 2024

https://github.com/haoruilee/python_spider

🕷高考成绩爬虫+数据处理

beautifulsoup json python3 requests

Last synced: 06 Dec 2024

https://github.com/pptacher/web_scraper

book an appointment in city hall in website paris.fr to have your passport/id issued. Reservez en quelques minutes votre rendez-vous dans une mairie d'arrondissement a Paris pour déposer votre demande de passeport ou carte d'identité.

beautifulsoup browser-automation http libcurl paris re2 selenium webscraper webscraping

Last synced: 14 Oct 2024

https://github.com/rootkot/invader

Python simple module for data grabbing from websites with JavaScript support

beautifulsoup grabber javascript parsing python2-7 python3 scraper web

Last synced: 17 Nov 2024

https://github.com/inishchith/quora-sentiment-analysis

sentiment analysis on quora answers based on word frequency [Deprecated]

afinn beautifulsoup python quora quora-answers sentiment-analysis

Last synced: 19 Dec 2024

https://github.com/rtlee9/sic-list

List of SIC codes and descriptions from authoritative sources

beautifulsoup industry-classification web-scraping

Last synced: 09 Nov 2024

https://github.com/carlosulisesochoa/bookmarks-to-notion

An automated tool that generates Notion pages from bookmarks or just a list of URLs, using BeautifulSoup for web scraping, OpenAI's GPT model for refining page content, and the Notion API for page creation.

ai beautifulsoup notion notion-api openai openai-api python

Last synced: 08 Nov 2024

https://github.com/bpkaur/word-frequency-in-moby-dick

To find out the most frequent words in the novel Moby Dick using Python.

beautifulsoup data-analysis data-science moby-dick nltk notebook-jupyter python3

Last synced: 11 Dec 2024

https://github.com/thomasborgen/soup2dict

Transforms BeautifulSoup soup to python dict or json

beautifulsoup beautifulsoup4 dict json parser transformer

Last synced: 07 Nov 2024

https://github.com/tbrlpld/logtweet

Command line tool to create a tweet based on a #100DaysOfCode log message.

beautifulsoup mypy pytest python tweet twitter

Last synced: 13 Nov 2024

https://github.com/saptaks/mangadownloader-cli

It is a python script to download all the pages of a particular issue in a manga

beautifulsoup manga pillow python-script

Last synced: 24 Oct 2024

https://github.com/zensimilia/on-route-bot

Telegram bot will warn you about traffic jams on your route by schedule.

aiogram apscheduler beautifulsoup python telegram-bot

Last synced: 09 Nov 2024

https://github.com/hitesh22rana/cybernews

Python module for latest CyberSpace updates.

api-rest beautifulsoup cybernews latest-news news open-source python webscraping

Last synced: 27 Dec 2024

https://github.com/trainingbypackt/data-science-projects-with-python-elearning

A case study approach to successful data science projects using Python pandas and scikit learn

beautifulsoup datascience matplotlib numpy pandas python scipy

Last synced: 14 Nov 2024

https://github.com/rclement/flask-pretty

Flask extension to output prettified HTML pages

beautifulsoup flask flask-extension prettified-html-pages

Last synced: 14 Oct 2024

https://github.com/skullface/sceneland

🎸 automatically aggregates Cleveland-area concerts in one place

beautifulsoup cleveland concerts ohio scraping

Last synced: 17 Dec 2024

https://github.com/kishlayjeet/github-topics-data-scraping

This code is a web scraping script that extracts data from GitHub. It creates a CSV file with the top 100 topics from GitHub and the top 20 repositories for each topic.

beautifulsoup data-scraping featured-repo github pandas pandas-dataframe python python-script requests scraping-websites web-scraping

Last synced: 24 Dec 2024

https://github.com/onlyphantom/pricemate

A simple scraper for departure time and prices from Jakarta to Bandung from Tiket.com

beautifulsou beautifulsoup tiket-kereta-api web-scraper web-scraping

Last synced: 13 Dec 2024

https://github.com/chishui/jssoupselector

This is yet another CSS selector. It's designed to be used by JSSoup, but it can also be used in other projects.

beautifulsoup css html javascript jssoup selector

Last synced: 23 Dec 2024

https://github.com/yuvrajchandra/dsaquestionsscraper

This python project scrapes 500 Must Do Interview Questions for FAANG created by Pepcoding Team.

algorithms beautifulsoup data-structures dsa dsa-questions pepcoding python scraping selenium

Last synced: 14 Nov 2024

https://github.com/orkunaktas/sofascore-webscraping

⚽️I scraped the shot data of the Fenerbahçe - Adana Demirspor match from Sofascore⚽️

beautifulsoup data-analysis football-analytics football-data selenium webscraping

Last synced: 11 Oct 2024

https://github.com/leabs/web-scrape-ui

Python based web scraping in the browser

beautifulsoup beautifulsoup4 python requests webui

Last synced: 30 Nov 2024

https://github.com/aqafridi/data-analytics-specialization

In this Specialization of Data Analytics we will use systematic computational analysis of data or statistics using Python. (rarely Excel or Cognos)

beautifulsoup cognos-dashboard data-science data-visualization excel numpy pandas python sql

Last synced: 14 Nov 2024

https://github.com/sunshineplan/node

HTML parsing library, the alternative to BeautifulSoup in Golang.

beautifulsoup css-selectors generic go golang html-parser xpath xpath-query

Last synced: 06 Nov 2024

https://github.com/srinivasrm/mutual-funds-analysis-and-prediction

In this project I have performed analysis and prediction on 1,3,and 5 year returns on 1064 mutual funds in India. I have scraped data from a website which is the most visited website for mutual fund investments.I have tested regression models linear model,SGD Regressor , Random Forest Regressor,Decision Tree Regressor,Ridge,MLP Regressor and linear model (Lasso).After which I have selected the best perorming model and performed Hyper parameter tuning and then deployed an interactive application which can generate the visualization and send an email with the visualization to the users email address.

beautifulsoup data-analysis data-base data-cleaning data-science deployment etl finanace frontend funds machine-learning mutual mutual-funds pgsql python scikit-learn sql streamlit web webapplication

Last synced: 11 Oct 2024

https://github.com/ismaelfi/scrape-linkedin-posts

Automated LinkedIn Post Scraper using Python, Selenium, and BeautifulSoup. Extract content, reactions, and comments from LinkedIn user posts and save them to a CSV file.

beautifulsoup linkedin linkedin-scraper python3 script selenium selenium-python

Last synced: 29 Nov 2024

https://github.com/desgard/learning-in-shiyanlou.com

Learning Programming in shiyanlou.com.

argparse beautifulsoup python3 shiyanlou

Last synced: 07 Nov 2024

https://github.com/itielshwartz/python-station-backend

A full pipeline for downloading, cleaning and enriching the history of planetpython.org

backend beautifulsoup pipeline praw python python-station

Last synced: 04 Dec 2024

https://github.com/kalebu/worldmeter-coronavirus-scraper

A python program that tracks coronavirus statistics based on the worldometer website

beautifulsoup coronavirus data-extraction data-science python-tanzania tanzania webscraping worldmeter-coronavirus-scraper

Last synced: 14 Dec 2024

https://github.com/thelastgimbus/apis-scraper

Python web scraper for getting Polish political parties support percentage!

beautifulsoup beautifulsoup4 vote voter-engagement voting web-scraping

Last synced: 29 Nov 2024

https://github.com/k9mil/statum

🗺️ statum, a Twitch streamer-related website. Written in Python + Flask, with MongoDB. Current features include Twitch OAuth integration, personalized dashboard, unique streamer insights & much more.

app beautifulsoup bs4 css dashboard database flask html http jinja2 mongodb python python-flask python3 requests scraping streamer twitch web-app website

Last synced: 07 Nov 2024

https://github.com/dchan3/thoughtfulsoup

dchan3's thoughtful extension of BS4

beautifulsoup beautifulsoup4 bs4 extension python web-scraping

Last synced: 11 Nov 2024

https://github.com/hjsblogger/web-scraping-with-python

Demonstration of Web Scraping using Selenium Python (Pytest & Pyunit) and Beautiful Soup

beautiful-soup beautifulsoup beautifulsoup4 lambdatest selenium-python selenium-webdriver web-scraping youtube-scrapping

Last synced: 11 Oct 2024

https://github.com/shgopher/pythonfamily

python 知识体系,包括Python基础,Python AI, Python 爬虫,Python 脚本,Python 数据分析

beautifulsoup biopython biosppy django fastapi flask matplotlib numpy pandas pytest python3 pytorch robotframework scipy scrapy tensorflow

Last synced: 09 Oct 2024

https://github.com/Arsenic-ATG/I-am-Feeling-Lucky

Automation that opens all the top google searches in new tabs of your browser

beautifulsoup beautifulsoup4 google-search python python-script requests requests-module search-keywords webscraping

Last synced: 17 Nov 2024

https://github.com/ym496/storygraph-api

A python package to interact with and scrape the StoryGraph website.

api beautifulsoup scraper selenium storygraph

Last synced: 11 Oct 2024

https://github.com/arsenic-atg/i-am-feeling-lucky

Automation that opens all the top google searches in new tabs of your browser

beautifulsoup beautifulsoup4 google-search python python-script requests requests-module search-keywords webscraping

Last synced: 17 Nov 2024

https://github.com/fredcodee/scrape-google-play-store-reviews-of-any-android-application

Scrape/ exctract Google Play Store Reviews of any Android Application and save to a csv file

beautifulsoup python webscraping

Last synced: 10 Nov 2024

https://github.com/rainydevzz/spacehey-scanner

A webscraping tool for returning Spacehey profiles based on keywords.

automation beautifulsoup python spacehey webscraping

Last synced: 25 Dec 2024

https://github.com/oxylabs/mechanicalsoup-proxy-integration

Python tutorial for integrating Oxylabs' Residential Proxies with MechanicalSoup library

beautifulsoup bs4 github-python mechanicalsoup proxy-list proxy-list-github proxy-rotator proxy-site python requests rotating-proxy

Last synced: 17 Nov 2024

https://github.com/preetvadaliya/google-news

Simple News app built in Python using Beautiful-soup web scraping.

beautifulsoup googlenews python python3 webscraping

Last synced: 05 Dec 2024

https://github.com/leynier/comohay-api

API of ComoHay website implemented with FastAPI and BeautifulSoup

api beautifulsoup clasificados comohay fastapi opensource python scraper

Last synced: 20 Dec 2024

https://github.com/victoralessander/smith

A toolkit to make easy web scraping the world.

beautifulsoup bot extract-information python python3 telegram webcrawler webscraping

Last synced: 11 Dec 2024

https://github.com/farahibrar/programming-in-python

Explore a comprehensive collection of Python programming for diverse data analysis and data science projects. This repository covers data exploration, visualization, statistical analysis, machine learning, NLP, and model deployment. Perfect for enthusiasts looking to delve into practical examples and advanced techniques.

beautifulsoup dataanalysis docker flask folium jupyter-notebook machine-learning matplotlib nltk numpy pandas python pytorch scikit-learn scikitlearn scipy seaborn spacy statsmodels tensorflow

Last synced: 06 Dec 2024

https://github.com/camara94/analyse_semantique_latente

Cet article passe en revue l'analyse sémantique latente (LSA), une théorie de la signification ainsi qu'une méthode pour extraire ce sens de passages de texte, basée sur des statistiques calculs sur un ensemble de documents. LSA comme théorie du sens définit un espace sémantique latent où les documents et les mots individuels sont représentés sous forme de vecteurs. LSA en tant que technique de calcul utilise l'algèbre linéaire pour extraire les dimensions qui représentent cet espace. Cette représentation permet le calcul de la similarité entre les termes et les documents, la catégorisation des termes et documents, et résumé de grandes collections de documents en utilisant procédures automatisées qui imitent la façon dont les humains effectuent des tâches cognitives similaires. Nous présentons quelques détails techniques, divers exemples illustratifs et discutons d'un nombre de candidatures en linguistique, psychologie, sciences cognitives, éducation, sciences de l'information et analyse de données textuelles en général.

artificial-intelligence beautifulsoup lsa machine-learning nltk python requests text-classification text-mining-analysis textsummarization tf-idf

Last synced: 23 Dec 2024

https://github.com/shishirshekhar/github-web-scrapping

This app scrap the GitHub and show summary table of all the trading repositories.

beatifulsoup4 beautifulsoup github python python3 scraping scrapping-python streamlit

Last synced: 30 Dec 2024

https://github.com/quantumudit/analyzing-books

This project focuses on scraping data related to books by their genre from the "Books To Scrape" website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.

beautifulsoup data-analytics power-bi python webscraping

Last synced: 06 Nov 2024

https://github.com/reekrajroy/mini-projects-using-python

This repo contains a webscrapping project, automated message sending over whatsapp, text to speech generation, featching weather, amongst other projects. SMTP protocol client, beautiful soap, open weather api, pywhatkit amongst others were used. Made completely using python.

beautifulsoup openweatherapi python pywhatkit smtp-client

Last synced: 24 Dec 2024

https://github.com/bijoy-sust/simple-web-scraping-in-python

A list of resources and introductory notebooks for Web Scraping in Python using BeautifulSoup.

beautifulsoup machine-learning notebook python-3 web-scraping

Last synced: 25 Nov 2024

https://github.com/imvickykumar999/dark-web-scraping

https://www.upgrad.com/learn/

beautifulsoup bs4 webscraping

Last synced: 01 Dec 2024

https://github.com/carloocchiena/python_url_crawler

A script that starting from a webpage, iterate thru all its link, appending them in a list. Sort of proxy to get all pages in a website

beautifulsoup crawler python python3

Last synced: 28 Nov 2024

https://github.com/jwc20/waasuapi

A scraper API for fetching available jobs from YCombinator's workatastartup.com

beautifulsoup jobsearch python scraper scraper-api selenium

Last synced: 25 Nov 2024

https://github.com/nemeslaszlo/product_pricetracking

Simple script to check a product price, if the price went down to our border price send a mail about it and you can check it. (Beautiful Soup usage and practise)

beautifulsoup mail product-price product-pricetracking requests smtplib

Last synced: 01 Dec 2024

https://github.com/mousazourob/essayresearcher

A website that streamlines the research process by showing articles with excerpts based on a particular topic and group of keywords

beautifulsoup bootstrap css flask html javascript jquery python web-scraping

Last synced: 22 Dec 2024

https://github.com/easonlai/yahoo_news_searching_and_text_analytics

This is a demo repo to demonstrate how to use Python Requests and Beautiful Soup 4 libraries to search and scrape result data. And save scraping results into Pandas dataframe.

azure azure-cognitive-services azure-text-analytics beautifulsoup beautifulsoup4 microsoft-cognitive-services news-scraper news-scraping newsscraper python python3 requests sentiment-analysis web-scraping webscraping yahoo

Last synced: 10 Nov 2024

https://github.com/somada141/mangatown-downloader

Simple scripts that download mangatown manga chapters through the URL of the first page.

beautifulsoup compression downloader manga mangatown python requests scraper zip

Last synced: 18 Nov 2024

https://github.com/malhotra5/fake-news-filter

An application to detect fake and untrustworthy websites

beautifulsoup fakenewsdetection pyqt4 python3 urllib3

Last synced: 23 Nov 2024