Projects in Awesome Lists tagged with bs4
A curated list of projects in awesome lists tagged with bs4 .
https://github.com/krypton-byte/tiktok-downloader
Tiktok Downloader/Scraper using requests & bs4
asynchronous asyncio beautifulsoup bs4 crawler downloader flask krypton-byte lightweight nowm python python3 requests tiktok watermark web without
Last synced: 06 Apr 2025
https://github.com/lb2281075105/python-wechat-itchat
微信机器人,基于Python itchat接口功能实例展示:01-itchat获取微信好友或者微信群分享文章、02-itchat获取微信公众号文章、03-itchat监听微信公众号发送的文章、04 itchat监听微信群或好友撤回的消息、05 itchat获得微信好友信息以及表图对比、06 python打印出微信被删除好友、07 itchat自动回复好友、08 itchat微信好友个性签名词云图、09 itchat微信好友性别比例、10 微信群或微信好友撤回消息拦截、11 itchat微信群或好友之间转发消息
beautifulsoup4 bs4 echarts itchat jieba matplotlib matplotlib-live numpy os pandas pillow system time uuid wechat
Last synced: 28 Oct 2025
https://github.com/jimywork/djangohunter
Tool designed to help identify incorrectly configured Django applications that are exposing sensitive information.
beautifulsoup bs4 django hacking python python3 shodan tool
Last synced: 08 May 2025
https://github.com/pingcheng/bootstrap4-datetimepicker
datetimepicker for bootstrap 4
Last synced: 05 Oct 2025
https://github.com/berrysauce/ingredients
🥗 Determine the "ingredients" (or technologies) behind a website
bs4 fastapi httpx ingredients python sveltekit tool vercel web website-scanner
Last synced: 25 Oct 2025
https://github.com/tomoncle/python-notes
Python related technologies used in work: crawler, data analysis, timing tasks, RPC, page parsing, decorator, built-in functions, Python objects, multi-threading, multi-process, asynchronous, redis, mongodb, mysql, openstack, etc.
bs4 matplotlib mlab mongodb mysql numpy openstack python27 redis rpc scheduler xpath
Last synced: 09 Apr 2025
https://github.com/tomoncle/Python-notes
Python related technologies used in work: crawler, data analysis, timing tasks, RPC, page parsing, decorator, built-in functions, Python objects, multi-threading, multi-process, asynchronous, redis, mongodb, mysql, openstack, etc.
bs4 matplotlib mlab mongodb mysql numpy openstack python27 redis rpc scheduler xpath
Last synced: 19 Jul 2025
https://github.com/wazzabeee/copy-spotter
Make plagiarism detection easier. This script will find similar sentences between given files and highlight them in a side by side comparison.
beautifulsoup bs4 docx odt pdf plagiarism plagiarism-check plagiarism-checker plagiarism-detection plagiarism-detector python side-by-sidediff similarity similarity-detection similarity-score txt
Last synced: 15 Aug 2025
https://github.com/dhvcc/rss-parser
typed python RSS parsing module built using xmltodict and pydantic
atom atom-feed atom-parser bs4 gplv3 mit-license pydantic python python-3 python3 rss rss-feed-parser rss-feed-scraper rss-parser typed typed-python xml xml-parser
Last synced: 23 Apr 2025
https://github.com/irevenko/info-bot
🤖 A Versatile Telegram Bot
bot bots bs4 cryptocoins cryptocurrencies cryptocurrency news parse pytelegrambotapi python python3 requests scraping telegram telegram-bot telegram-bots time weather weather-api weather-forecast
Last synced: 01 May 2025
https://github.com/thearrival/IsmailScript
Is a tool written by using python programming language. Enable a penetration testers to save a time and performing a full Reconnaissance and Information Gathering on any Web Application Server.
bs4 cybersecurity dns dns-server information-gathering ip ismailscript nmap penetration-testers portscanner python python3 reconnaissance reconnaissance-framework requests serverbanner serverscan socket webapplication whois
Last synced: 01 May 2025
https://github.com/sungyeonu/animal-crossing-scraper
Web scraper for Animal Crossing - New Horizons data using bs4
animal-crossing-new-horizons bs4
Last synced: 17 Jul 2025
https://github.com/amanoteam/duckpy
A simple Python library for searching on DuckDuckGo.
asynchronous asyncio bs4 duckduckgo duckduckgo-search framework httpx module pypi python python3 search
Last synced: 14 May 2025
https://github.com/codhek/pysoc.js
:sunglasses: Simple gsoc data scraper, search for any keyword and instantly get data about the orgs that match your search criteria, use filters to sort and analyse the data, uses fuzzy-searching to improve user-search experience
analytics bs4 firebase fusejs fuzzy-search gsoc-data-scraper reactjs scrapper thread-pool
Last synced: 19 Jun 2025
https://github.com/jatin-8898/complaint-system
Complaint System App using MongoDB, Express and Node ⌛
admin authentication body-parser bootswatch bs4 complaint-management-system complaints db error-handling flash handlebars junior-engineer login mongo mongodb mongoose node nodejs passport routes
Last synced: 29 Oct 2025
https://github.com/jatin-8898/coursera
My Assignment Submission made during the course 📙
applied-data-science-with-python applied-machine-learning big-data bs4 cloud-engineering coursera coursera-machine-learning coursera-specialization data-science-specialization dialogflow front-end-development frontend google-cloud hong-kong-university kotlin neural-network python-crash-course-coursera specialization sql-for-data-science standford
Last synced: 21 Mar 2025
https://github.com/aydinnyunus/scrapin
LinkedIn Scraper
beautifulsoup beautifulsoup4 bs4 bs4-requests linkedin linkedin-profile linkedin-scraper python python-3 python3 scraper scraping scraping-websites scrapper scrapping scrapy scrapy-crawler selenium selenium-python selenium-webdriver
Last synced: 30 Jul 2025
https://github.com/dvamishkilapa/vkarchivedownloader
Утилита для скачивания всех вложений, которые можно получить из архива аккаунта VKontakte
aiohttp archive asyncio beautifulsoup4 browser bs4 cookie json multithreading parse parser processpoolexecutor python vk vkontakte
Last synced: 23 Oct 2025
https://github.com/akimosi/moneycontrolpy
A python API to Money Control forum.
api bs4 datacollection python python3 selenium-python webscraping
Last synced: 14 Dec 2025
https://github.com/gamemann/how-to-use-selenium-and-beautifulsoup
A full lab and how-to guide on how to use Selenium paired with Beautiful Soup to parse and extract data from a website using Python.
beautifulsoup beautifulsoup4 bs4 firefox geckodriver node nodejs python react selenium selenium-python selenium-webdriver webscraper webscraping
Last synced: 25 Sep 2025
https://github.com/dinhanhx/PerceiveYourMeme
A Python library to scrap from https://knowyourmeme.com/ . Written in Python 3.7!
beginner-friendly bs4 dataset hacktoberfest knowyourmeme library meme memes memes-api python-3 python3 scrap urllib3
Last synced: 09 May 2025
https://github.com/mratanusarkar/dataset-indian-companies
Web Scraping "List of companies in India" from AmbitionBox Website using Python and Beautiful Soup
beautiful-soup beautifulsoup beautifulsoup-library beautifulsoup4 bs4 company-dataset csv data dataset indian-companies jupyter jupyter-notebook jupyter-notebooks python scraper scraping scraping-websites scrapper scrapping
Last synced: 22 Sep 2025
https://github.com/gamemann/How-To-Use-Selenium-And-BeautifulSoup
A full lab and how-to guide on how to use Selenium paired with Beautiful Soup to parse and extract data from a website using Python.
beautifulsoup beautifulsoup4 bs4 firefox geckodriver node nodejs python react selenium selenium-python selenium-webdriver webscraper webscraping
Last synced: 24 Oct 2025
https://github.com/mehmetkahya0/web-resource-downloader
This is a Python script that downloads all resources (images, scripts, stylesheets, etc.) from a given website.
algorithms beautifulsoup4 bs4 bs4-requests data-analysis data-science datascience python python3 requests scraper scraping
Last synced: 12 Oct 2025
https://github.com/parvvaresh/web-scraper-projects
This repository contains projects that include web scraping
bs4 python requests webscraping
Last synced: 22 Apr 2025
https://github.com/dinhanhx/perceiveyourmeme
A Python library to scrap from https://knowyourmeme.com/ . Written in Python 3.7!
beginner-friendly bs4 dataset hacktoberfest knowyourmeme library meme memes memes-api python-3 python3 scrap urllib3
Last synced: 12 Apr 2025
https://github.com/mindpatch/startpage-parser
parsing search results from startpage search engine (based on google.com results)
bs4 google-parser googlesearch parsing python startpage
Last synced: 07 Jul 2025
https://github.com/gunh0/whs-utils
🛠️ Essential utilities for conducting classes in the WhiteHat School education program.
bs4 crwaler elk elk-stack kind kubernetes parser python3 siem
Last synced: 29 Aug 2025
https://github.com/gamemann/web-extract-with-chatgpt
A Python project that extracts data from websites with the option to process the data through @openai's ChatGPT API. The results are either printed to stdout or sent via a POST request.
ai api beautifulsoup beautifulsoup4 bs4 chatgpt chatgpt-api discourse extract gpt gpt-3 gpt-4 openai openai-api post python selenium web-extractor web-scrape web-scraper
Last synced: 28 Oct 2025
https://github.com/spidy20/instagram_scrapper
It is a Python GUI in which you need to enter your Instagram ID, it will fetch all the details with DP for that Insta ID.
bs4 fetch-instagram-info insta-scrape instagram instagram-api instagram-api-python instagram-bot instagram-scraper webscraping
Last synced: 12 Apr 2025
https://github.com/santhoshse7en/imdby
imdby is a Python package useful to retrieve and manage the data of the IMDb movie database about movies, people, characters and companies.
beautifulsoup4 bs4 characters imdb imdb-api imdb-movie imdb-movie-database imdb-movies imdb-webscrapping movies pandas pypi python scrapers sys time without-api
Last synced: 15 Jun 2025
https://github.com/dimitryzub/webscraping-py
Web Scraping scripts for all Google, other search engines, and other websites (currently outdated, something may not be working).
api bs4 data google-maps-api googleapi googlescraping googlesearchapi lxml parsel playwright python requests scraper scraping scrapy selenium webscraper webscraping webscraping-data webscraping-search
Last synced: 12 Aug 2025
https://github.com/nimiology/persian_subdl
this script can download all Persian subtitles from worldsubtitle.info
Last synced: 22 Jul 2025
https://github.com/oxylabs/aiohttp-proxy-integration
Python tutorial for implementing Residential Proxies with AIOHTTP
aiohttp asyncio beautifulsoup bs4 github-python proxy-generator proxy-list proxy-list-github proxy-rotator proxy-site python python3 requests residential-proxy rotating-proxy scraping webproxy
Last synced: 23 Apr 2025
https://github.com/andreytretiachenko/ozon_parser
Парсер товаров Ozon в excel таблицу
Last synced: 01 Sep 2025
https://github.com/buckthorndev/the-football-data
Home of football data
bs4 football football-api football-data football-data-csv footballdata laliga league-table premier-league seriea soccer-data soccer-league
Last synced: 28 Feb 2025
https://github.com/ferhatgec/weathere
Fegeya WeaTHERE, CLI-based, colorized 'today's weather' client.
beautifulsoup bs4 cli client python3 unix weather without-api
Last synced: 14 Apr 2025
https://github.com/querateam/dataanalysis_bootcamp_crawler
Web scraper implementations for a variety of websites.
beautifulsoup beautifulsoup4 bootcamp bs4 data-analysis python quera scrapy selenium webcrawling webscraping
Last synced: 08 Jul 2025
https://github.com/pydevcasts/resumerover
This project is a tool for extracting emails from websites. Users can input a list of links, and the tool will gather emails from those pages. It also allows users to send resumes to the extracted emails
asyncio beautifulsoup bs4 cache email-sender email-template job-scheduler jobsearch oop python request resume-template selenium threading unittest
Last synced: 29 Apr 2025
https://github.com/amey-thakur/covid19-web-scraper
Scraping and Visualising India's Real-Time COVID-19 Data From The MOHFW Dataset.
amey ameythakur bar-plots beautifulsoup beautifulsoup4 bs4 covid-19 covid19-data covid19-graph covid19-live-tracker covid19india donut-chart prettytable project python python3 web-scraper web-scraping
Last synced: 17 Jul 2025
https://github.com/mldictionary/mldictionary
MLDictionary is word's dictionary for several language. Available in pypi
beautifulsoup4 bs4 dictionary-learning pypi-package pypi-source python webscraping
Last synced: 19 Oct 2025
https://github.com/0memo07/web-crawler
Web Crawler with Python
beautifulsoup4 bs4 crawler crawlers crawling crawling-python web-crawler web-crawler-python web-crawling webcrawler
Last synced: 24 Apr 2025
https://github.com/prashantpandey9/covid19-india-tracker
This app is developed to aware people about the global pandemic CORONA-VIRUS(COVID-19) cases all around the world.22-april-2020
bs4 coronavirus coronavirus-info coronavirus-real-time coronavirus-tracking django hacktoberfest2021 india opensource python requests-module tracker
Last synced: 11 Apr 2025
https://github.com/innovatorved/webscraping-extract-sanfoundry-mcq
Web Scrapping Project extract MCQ of any topic from sanfoundry.com in PDF form
beautifulsoup4 bs4 extract-sanfoundry-mcq python sanfoundry sanfoundry-mcq-saver webscraping
Last synced: 18 Mar 2025
https://github.com/lostdir/askthesite
A web application that lets you chat with any website's content! Enter a URL and engage in a conversation by asking questions to receive insightful answers powered by a language model (LLM).
ai bs4 chatbot flask llm python rag streamlit webscraping
Last synced: 05 Apr 2025
https://github.com/javinator9889/pygle
A tool for searching the entire web with the Google technology
api beautifulsoup4 browser bs4 dictionary extreme fast google google-api pip powerful python-3 python3 requests scraping-websites search search-engine search-google urllib web
Last synced: 17 Jul 2025
https://github.com/aybruhm/animesukurepa-backend
An application that scrapes anime based on the user's search query. Built with Python and Django Rest Framework.
aggregator bs4 django gogoanime gogoanimeio python3 scraper webscraping
Last synced: 06 Sep 2025
https://github.com/othreecodes/poorly-drawn-api
Unofficial API for poorlydrawnlines.com
api backend background-tasks bs4 celery poorlydrawnlines web-crawler
Last synced: 15 May 2025
https://github.com/ddayguerrero/spimi-indexer
Boolean retrieval search engine with SPIMI indexing and BM25 ranking
bm25 bs4 inverted-index okapi python3 reuters-corpus search spimi
Last synced: 16 Mar 2025
https://github.com/radkick/vs-bootstrap-snippets
Visual Studio - Bootstrap V3 and V4 Snippets
bootstrap bootstrap-snippets bootstrap4-snippets bs4 html-snippets twitter-bootstrap visual-studio-extension
Last synced: 29 Jul 2025
https://github.com/adityamangal1/scrap-it-python
Project
bs4 codeforcause pandas project python3
Last synced: 21 Feb 2025
https://github.com/adityamangal1/python-scraping
Python Scraping
beautifulsoup bs4 csv-parser files modules os pandas requests termcolor try-except with-open
Last synced: 21 Feb 2025
https://github.com/akshatnitd/imdb_portal
A utility tool to check out movies information and ratings!
bs4 cli imdb imdb-rating movie-information python terminal
Last synced: 26 Mar 2025
https://github.com/iamfoysal/one-hundred-python
One hundred Python scripts with source code
automatically-screen-sapture bs4 bypass-recaptcha-v2 django google-recaptcha pandas python selenium selenium-webdriver spotify tkinter
Last synced: 29 Apr 2025
https://github.com/santhoshse7en/utuby
simple script for downloading Youtube comments without using the Youtube API
ajax bs4 css-selector lexicon lxml scraper sentimental-analysis textblob textblob-sentiment-analysis vandersentiment web without-api youtube youtube-api youtube-channel youtube-comments youtube-comments-downloader youtube-downloader youtube-scraper
Last synced: 28 Oct 2025
https://github.com/parthjohri/dtu-seatalertbot
Welcome to the DTU Seat Bot! Stay updated with the latest seat availability information for various courses at Delhi Technological University. Get real-time information on available seats for different subject codes and their respective groups. Whether you're a student planning your course selection.
aws bs4 delhi-technological-university mongodb python3 selenium
Last synced: 15 Apr 2025
https://github.com/jwest951227/socialscraper
scraper that scrapes all the social accounts using DDG/Google search engine
aiohttp asyncio bs4 concurrent instaloader pandas selenium threading zenrows
Last synced: 26 Feb 2025
https://github.com/shilongdai/apartment_scraper
Python webscraper for apartments.com
bs4 csv json python selenium webscraping
Last synced: 13 Oct 2025
https://github.com/oxylabs/mechanicalsoup-proxy-integration
Python tutorial for integrating Oxylabs' Residential Proxies with MechanicalSoup library
beautifulsoup bs4 github-python mechanicalsoup proxy-list proxy-list-github proxy-rotator proxy-site python requests rotating-proxy
Last synced: 14 May 2025
https://github.com/tristan296/best-price-detector
A web scraping application that retrieves product information from numerous websites without the hardcoding.
Last synced: 12 Apr 2025
https://github.com/allendema/archeddit
Save posts from Reddit locally. Forever.
archive archiver beautifulsoup4 bs4 downloader json python reddit requests subreddits teddit
Last synced: 08 Oct 2025
https://github.com/justkappaman/habr-parser
Parse info about the top articles on Habr.com for a given period of time
beautifulsoup4 bs4 cli command-line grabber habr habrahabr parser python python3 scraper terminal
Last synced: 31 Oct 2025
https://github.com/devlocalhost/ampy
ampy - an apkmirror scraper in python
apk apk-download apk-downloader apkmirror apks beautifulsoup beautifulsoup4 bs4 cloudscraper python-3 python3 scraper scraping scraping-websites
Last synced: 14 Apr 2025
https://github.com/rahulmoundekar/webscraping-in-python
webscraping in python
beautifulsoup4 bs4 html5lib python-3 requests-module webscraper-website
Last synced: 08 Nov 2025
https://github.com/jwest951227/socialScraper
scraper that scrapes all the social accounts using DDG/Google search engine
aiohttp asyncio bs4 concurrent instaloader pandas selenium threading zenrows
Last synced: 21 Sep 2025
https://github.com/osintt/xvideos.py
powerful Python library for xvideos.com
bs4 porn python scraper xvideos xvideos-api xvideos-downloader xvideos-lib
Last synced: 29 Apr 2025
https://github.com/hanan-nawaz/daraz-price-comparator
Daraz Price Comparator 🌐🛍️ Compare prices effortlessly! 🔄 Find deals in 🇵🇰 Pakistan, 🇧🇩 Bangladesh, 🇳🇵 Nepal, and 🇱🇰 Sri Lanka. Python, Requests, BeautifulSoup, and Airflow power smart shopping. Save time, shop savvy! 🛒💡
beautifulsoup bs4 darazpk python requests-python telemart webscraping
Last synced: 21 Mar 2025
https://github.com/ehsanshahbazii/castbox_downloader
دانلود فایل صوتی و کاور اپیزود ها در اپلیکیشن کست باکس (بدون نیاز به ای پی آی)
bs4 castbox downloader playwright python python3 requests
Last synced: 12 Apr 2025
https://github.com/dchan3/thoughtfulsoup
dchan3's thoughtful extension of BS4
beautifulsoup beautifulsoup4 bs4 extension python web-scraping
Last synced: 21 Nov 2025
https://github.com/kayx23/indeed-scraper
Scrape job posts off Indeed Canada (ca.indeed.com)
bs4 scrapy selenium webscraping
Last synced: 27 Mar 2025
https://github.com/prithivsakthiur/wiki-scrap
Wikipedia Scrapping
bs4 content gradio gradio-interface image requests scrapper scrapping sdk-python text wikipedia
Last synced: 09 Sep 2025
https://github.com/friskes/loadnsi
Utility for downloading and updating NSI fixtures
aiofiles asyncio bs4 cli code-generator database django-models fixtures gzip httpx json nsi postgres python rich rosminzdrav sql tenacity
Last synced: 08 May 2025
https://github.com/koushikphy/tele-price-tracker
A Telegram bot that keeps track of prices for your products.
asyncio bs4 flipkart online-shop pytelegrambotapi python python-telegram python-telegram-bot python3 telegram telegram-bot webscraper webscraping
Last synced: 03 Mar 2025
https://github.com/santhoshse7en/alcoholics-anonymous
Research Project to analyse the knowledge about Alcoholics Anonymous in public
aa-meetings alcoholics alcoholics-anonymous anonymous bs4 crawler data-extraction-and-pre-processing google-search-using-python news-crawler newspaper3k python the-hindu web-scraping without-api
Last synced: 03 Mar 2025
https://github.com/jatin-8898/mizuxe
A beautiful template theme named Mizuxe made using Bootstrap 4 beta :blue_book:
bootstrap bs4 collapse-button mizuxe navigation node-module theme
Last synced: 25 Aug 2025
https://github.com/benjetson/dining-alexa
Find out what's on the menu at Clemson's dining halls.
alexa aws-lambda beautifulsoup4 bs4 clemson cuhackit python python-3
Last synced: 30 Aug 2025
https://github.com/imvickykumar999/dark-web-scraping
https://www.upgrad.com/learn/
Last synced: 01 Mar 2025
https://github.com/justin-pyne/dota-liquipedia-web-scraper
Scraping information off Liquipedia from DOTA leagues with BeautifulSoup/Pandas for statistical analysis/EDA.
bs4 csv data-analysis pandas python scraper
Last synced: 13 Jul 2025
https://github.com/bieanimaton/github-scraping
This is project of a Github Scraping made with Python.
beautifulsoup4 bs4 github python python27 python3 requests scraping-websites
Last synced: 25 Dec 2025
https://github.com/hansputera/nekopoi-scrape
Scraping nekopoi menggunakan bahasa Python
beautifulsoup4 bs4 python3 scraping
Last synced: 08 Oct 2025
https://github.com/deeeelin/ig_message_analyzer
IG Message Analyzer , analyzes IG inbox messages , evaluates relationship : )
analytics analyze analyzer analyzers beutifulsoup bs4 entertainment html5 instagram object-oriented parse parser side-project tabulate
Last synced: 24 Jul 2025
https://github.com/frolov-andrey2405/multisite-parser-bot-for-telegram
This repository contains a Telegram bot that is able to parse multiple websites for information and send it to the user through private messages. The websites currently supported are www.vfxmed.com and https://blendermarket.com/.
aiomysql asyncio bs4 httpx json-api lxml mysql-database python3 re requests
Last synced: 22 Sep 2025
https://github.com/tanzeelgcuf/idealo-product-scraping-
This code is develop to scrape data from idealo.de website and compare the top products with good prices.
bs4 flask-application python scraping-python
Last synced: 11 Jun 2025
https://github.com/architrixs/getfunds
ISIN code to Price
boursorama bs4 financial-data isin requests scraper webscraping
Last synced: 24 Feb 2025
https://github.com/kenvilar/data-analysis-using-python
Transforming a description of a location from an analyzed CSV file data using Pandas with Python 3
bs4 data-analysis jupyter pandas python python3 requests xlrd
Last synced: 04 Oct 2025
https://github.com/mistersoandso/minecraft-packet-protocol-database
Parsed Protocol Numbers from https://wiki.vg/Protocol_version_numbers
bs4 json minecraft packet requests webscraping
Last synced: 28 Mar 2025
https://github.com/vishalshenoy2002/github-tool
A Github Tool which helps people extract repositories and see issues in github
beauti beautifulsoup beautifulsoup4 bs4 github github-automation github-issues issues python python-3 python3 repository request-module requests web-scraping
Last synced: 03 Mar 2025
https://github.com/renanstn/safety-vulnerabilities-detailed-info
Script utilizado para extrair detalhes de uma vulnerabilidade de uma página do pyup.io.
beautifulsoup beautifulsoup4 bs4 python
Last synced: 23 Mar 2025
https://github.com/rezamobaraki/goodreads-peewee-python
simple project on command-line | goodreads.com
beautifulsoup4 bs4 crawling orm peewee python requests scraping
Last synced: 04 Nov 2025