An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with bs4

A curated list of projects in awesome lists tagged with bs4 .

https://github.com/lb2281075105/python-wechat-itchat

微信机器人,基于Python itchat接口功能实例展示:01-itchat获取微信好友或者微信群分享文章、02-itchat获取微信公众号文章、03-itchat监听微信公众号发送的文章、04 itchat监听微信群或好友撤回的消息、05 itchat获得微信好友信息以及表图对比、06 python打印出微信被删除好友、07 itchat自动回复好友、08 itchat微信好友个性签名词云图、09 itchat微信好友性别比例、10 微信群或微信好友撤回消息拦截、11 itchat微信群或好友之间转发消息

beautifulsoup4 bs4 echarts itchat jieba matplotlib matplotlib-live numpy os pandas pillow system time uuid wechat

Last synced: 28 Oct 2025

https://github.com/jimywork/djangohunter

Tool designed to help identify incorrectly configured Django applications that are exposing sensitive information.

beautifulsoup bs4 django hacking python python3 shodan tool

Last synced: 08 May 2025

https://github.com/pingcheng/bootstrap4-datetimepicker

datetimepicker for bootstrap 4

bootstrap4 bs4 datetimepicker

Last synced: 05 Oct 2025

https://github.com/berrysauce/ingredients

🥗 Determine the "ingredients" (or technologies) behind a website

bs4 fastapi httpx ingredients python sveltekit tool vercel web website-scanner

Last synced: 25 Oct 2025

https://github.com/tomoncle/python-notes

Python related technologies used in work: crawler, data analysis, timing tasks, RPC, page parsing, decorator, built-in functions, Python objects, multi-threading, multi-process, asynchronous, redis, mongodb, mysql, openstack, etc.

bs4 matplotlib mlab mongodb mysql numpy openstack python27 redis rpc scheduler xpath

Last synced: 09 Apr 2025

https://github.com/tomoncle/Python-notes

Python related technologies used in work: crawler, data analysis, timing tasks, RPC, page parsing, decorator, built-in functions, Python objects, multi-threading, multi-process, asynchronous, redis, mongodb, mysql, openstack, etc.

bs4 matplotlib mlab mongodb mysql numpy openstack python27 redis rpc scheduler xpath

Last synced: 19 Jul 2025

https://github.com/wazzabeee/copy-spotter

Make plagiarism detection easier. This script will find similar sentences between given files and highlight them in a side by side comparison.

beautifulsoup bs4 docx odt pdf plagiarism plagiarism-check plagiarism-checker plagiarism-detection plagiarism-detector python side-by-sidediff similarity similarity-detection similarity-score txt

Last synced: 15 Aug 2025

https://github.com/rg089/newsemble

API for fetching data from news websites.

api bs4 flask heroku mongodb news newsapi newsemble python scraper webscraping

Last synced: 27 Dec 2025

https://github.com/thearrival/IsmailScript

Is a tool written by using python programming language. Enable a penetration testers to save a time and performing a full Reconnaissance and Information Gathering on any Web Application Server.

bs4 cybersecurity dns dns-server information-gathering ip ismailscript nmap penetration-testers portscanner python python3 reconnaissance reconnaissance-framework requests serverbanner serverscan socket webapplication whois

Last synced: 01 May 2025

https://github.com/sungyeonu/animal-crossing-scraper

Web scraper for Animal Crossing - New Horizons data using bs4

animal-crossing-new-horizons bs4

Last synced: 17 Jul 2025

https://github.com/amanoteam/duckpy

A simple Python library for searching on DuckDuckGo.

asynchronous asyncio bs4 duckduckgo duckduckgo-search framework httpx module pypi python python3 search

Last synced: 14 May 2025

https://github.com/codhek/pysoc.js

:sunglasses: Simple gsoc data scraper, search for any keyword and instantly get data about the orgs that match your search criteria, use filters to sort and analyse the data, uses fuzzy-searching to improve user-search experience

analytics bs4 firebase fusejs fuzzy-search gsoc-data-scraper reactjs scrapper thread-pool

Last synced: 19 Jun 2025

https://github.com/kadnan/scrapegen

A simple python tool that generates a requests/bs4 based web scraper

beautiful bs4 python requests scraper

Last synced: 15 May 2025

https://github.com/dvamishkilapa/vkarchivedownloader

Утилита для скачивания всех вложений, которые можно получить из архива аккаунта VKontakte

aiohttp archive asyncio beautifulsoup4 browser bs4 cookie json multithreading parse parser processpoolexecutor python vk vkontakte

Last synced: 23 Oct 2025

https://github.com/akimosi/moneycontrolpy

A python API to Money Control forum.

api bs4 datacollection python python3 selenium-python webscraping

Last synced: 14 Dec 2025

https://github.com/gamemann/how-to-use-selenium-and-beautifulsoup

A full lab and how-to guide on how to use Selenium paired with Beautiful Soup to parse and extract data from a website using Python.

beautifulsoup beautifulsoup4 bs4 firefox geckodriver node nodejs python react selenium selenium-python selenium-webdriver webscraper webscraping

Last synced: 25 Sep 2025

https://github.com/dinhanhx/PerceiveYourMeme

A Python library to scrap from https://knowyourmeme.com/ . Written in Python 3.7!

beginner-friendly bs4 dataset hacktoberfest knowyourmeme library meme memes memes-api python-3 python3 scrap urllib3

Last synced: 09 May 2025

https://github.com/gamemann/How-To-Use-Selenium-And-BeautifulSoup

A full lab and how-to guide on how to use Selenium paired with Beautiful Soup to parse and extract data from a website using Python.

beautifulsoup beautifulsoup4 bs4 firefox geckodriver node nodejs python react selenium selenium-python selenium-webdriver webscraper webscraping

Last synced: 24 Oct 2025

https://github.com/mehmetkahya0/web-resource-downloader

This is a Python script that downloads all resources (images, scripts, stylesheets, etc.) from a given website.

algorithms beautifulsoup4 bs4 bs4-requests data-analysis data-science datascience python python3 requests scraper scraping

Last synced: 12 Oct 2025

https://github.com/parvvaresh/web-scraper-projects

This repository contains projects that include web scraping

bs4 python requests webscraping

Last synced: 22 Apr 2025

https://github.com/dinhanhx/perceiveyourmeme

A Python library to scrap from https://knowyourmeme.com/ . Written in Python 3.7!

beginner-friendly bs4 dataset hacktoberfest knowyourmeme library meme memes memes-api python-3 python3 scrap urllib3

Last synced: 12 Apr 2025

https://github.com/mindpatch/startpage-parser

parsing search results from startpage search engine (based on google.com results)

bs4 google-parser googlesearch parsing python startpage

Last synced: 07 Jul 2025

https://github.com/gunh0/whs-utils

🛠️ Essential utilities for conducting classes in the WhiteHat School education program.

bs4 crwaler elk elk-stack kind kubernetes parser python3 siem

Last synced: 29 Aug 2025

https://github.com/gamemann/web-extract-with-chatgpt

A Python project that extracts data from websites with the option to process the data through @openai's ChatGPT API. The results are either printed to stdout or sent via a POST request.

ai api beautifulsoup beautifulsoup4 bs4 chatgpt chatgpt-api discourse extract gpt gpt-3 gpt-4 openai openai-api post python selenium web-extractor web-scrape web-scraper

Last synced: 28 Oct 2025

https://github.com/spidy20/instagram_scrapper

It is a Python GUI in which you need to enter your Instagram ID, it will fetch all the details with DP for that Insta ID.

bs4 fetch-instagram-info insta-scrape instagram instagram-api instagram-api-python instagram-bot instagram-scraper webscraping

Last synced: 12 Apr 2025

https://github.com/santhoshse7en/imdby

imdby is a Python package useful to retrieve and manage the data of the IMDb movie database about movies, people, characters and companies.

beautifulsoup4 bs4 characters imdb imdb-api imdb-movie imdb-movie-database imdb-movies imdb-webscrapping movies pandas pypi python scrapers sys time without-api

Last synced: 15 Jun 2025

https://github.com/dimitryzub/webscraping-py

Web Scraping scripts for all Google, other search engines, and other websites (currently outdated, something may not be working).

api bs4 data google-maps-api googleapi googlescraping googlesearchapi lxml parsel playwright python requests scraper scraping scrapy selenium webscraper webscraping webscraping-data webscraping-search

Last synced: 12 Aug 2025

https://github.com/nimiology/persian_subdl

this script can download all Persian subtitles from worldsubtitle.info

bs4 python requests subtitle

Last synced: 22 Jul 2025

https://github.com/andreytretiachenko/ozon_parser

Парсер товаров Ozon в excel таблицу

bs4 ozon-marketplace selenium

Last synced: 01 Sep 2025

https://github.com/ferhatgec/weathere

Fegeya WeaTHERE, CLI-based, colorized 'today's weather' client.

beautifulsoup bs4 cli client python3 unix weather without-api

Last synced: 14 Apr 2025

https://github.com/pydevcasts/resumerover

This project is a tool for extracting emails from websites. Users can input a list of links, and the tool will gather emails from those pages. It also allows users to send resumes to the extracted emails

asyncio beautifulsoup bs4 cache email-sender email-template job-scheduler jobsearch oop python request resume-template selenium threading unittest

Last synced: 29 Apr 2025

https://github.com/mldictionary/mldictionary

MLDictionary is word's dictionary for several language. Available in pypi

beautifulsoup4 bs4 dictionary-learning pypi-package pypi-source python webscraping

Last synced: 19 Oct 2025

https://github.com/81vm3/yinghua_spider

樱花动漫爬虫

bs4 python python3 qt qt5 spider video

Last synced: 29 Oct 2025

https://github.com/prashantpandey9/covid19-india-tracker

This app is developed to aware people about the global pandemic CORONA-VIRUS(COVID-19) cases all around the world.22-april-2020

bs4 coronavirus coronavirus-info coronavirus-real-time coronavirus-tracking django hacktoberfest2021 india opensource python requests-module tracker

Last synced: 11 Apr 2025

https://github.com/innovatorved/webscraping-extract-sanfoundry-mcq

Web Scrapping Project extract MCQ of any topic from sanfoundry.com in PDF form

beautifulsoup4 bs4 extract-sanfoundry-mcq python sanfoundry sanfoundry-mcq-saver webscraping

Last synced: 18 Mar 2025

https://github.com/lostdir/askthesite

A web application that lets you chat with any website's content! Enter a URL and engage in a conversation by asking questions to receive insightful answers powered by a language model (LLM).

ai bs4 chatbot flask llm python rag streamlit webscraping

Last synced: 05 Apr 2025

https://github.com/wasi0013/gqtr

fetches quotes from https://goodreads.com/ according to tags & post random quotes on twitter

bs4 goodreads lxml python requests tweepy twitter

Last synced: 19 Jul 2025

https://github.com/aybruhm/animesukurepa-backend

An application that scrapes anime based on the user's search query. Built with Python and Django Rest Framework.

aggregator bs4 django gogoanime gogoanimeio python3 scraper webscraping

Last synced: 06 Sep 2025

https://github.com/oleksandryanchar/vacancies-parser-bot

Aiogram-based Telegram bot for scraping websites for junior/trainee Python/backend developer vacancies

aiogram backend bs4 celery junior python requests scraper telegram trainee vacancies

Last synced: 11 Sep 2025

https://github.com/ddayguerrero/spimi-indexer

Boolean retrieval search engine with SPIMI indexing and BM25 ranking

bm25 bs4 inverted-index okapi python3 reuters-corpus search spimi

Last synced: 16 Mar 2025

https://github.com/akshatnitd/imdb_portal

A utility tool to check out movies information and ratings!

bs4 cli imdb imdb-rating movie-information python terminal

Last synced: 26 Mar 2025

https://github.com/parthjohri/dtu-seatalertbot

Welcome to the DTU Seat Bot! Stay updated with the latest seat availability information for various courses at Delhi Technological University. Get real-time information on available seats for different subject codes and their respective groups. Whether you're a student planning your course selection.

aws bs4 delhi-technological-university mongodb python3 selenium

Last synced: 15 Apr 2025

https://github.com/jwest951227/socialscraper

scraper that scrapes all the social accounts using DDG/Google search engine

aiohttp asyncio bs4 concurrent instaloader pandas selenium threading zenrows

Last synced: 26 Feb 2025

https://github.com/shilongdai/apartment_scraper

Python webscraper for apartments.com

bs4 csv json python selenium webscraping

Last synced: 13 Oct 2025

https://github.com/oxylabs/mechanicalsoup-proxy-integration

Python tutorial for integrating Oxylabs' Residential Proxies with MechanicalSoup library

beautifulsoup bs4 github-python mechanicalsoup proxy-list proxy-list-github proxy-rotator proxy-site python requests rotating-proxy

Last synced: 14 May 2025

https://github.com/tristan296/best-price-detector

A web scraping application that retrieves product information from numerous websites without the hardcoding.

bs4 python

Last synced: 12 Apr 2025

https://github.com/justkappaman/habr-parser

Parse info about the top articles on Habr.com for a given period of time

beautifulsoup4 bs4 cli command-line grabber habr habrahabr parser python python3 scraper terminal

Last synced: 31 Oct 2025

https://github.com/jwest951227/socialScraper

scraper that scrapes all the social accounts using DDG/Google search engine

aiohttp asyncio bs4 concurrent instaloader pandas selenium threading zenrows

Last synced: 21 Sep 2025

https://github.com/osintt/xvideos.py

powerful Python library for xvideos.com

bs4 porn python scraper xvideos xvideos-api xvideos-downloader xvideos-lib

Last synced: 29 Apr 2025

https://github.com/hanan-nawaz/daraz-price-comparator

Daraz Price Comparator 🌐🛍️ Compare prices effortlessly! 🔄 Find deals in 🇵🇰 Pakistan, 🇧🇩 Bangladesh, 🇳🇵 Nepal, and 🇱🇰 Sri Lanka. Python, Requests, BeautifulSoup, and Airflow power smart shopping. Save time, shop savvy! 🛒💡

beautifulsoup bs4 darazpk python requests-python telemart webscraping

Last synced: 21 Mar 2025

https://github.com/ehsanshahbazii/castbox_downloader

دانلود فایل صوتی و کاور اپیزود ها در اپلیکیشن کست باکس (بدون نیاز به ای پی آی)

bs4 castbox downloader playwright python python3 requests

Last synced: 12 Apr 2025

https://github.com/dchan3/thoughtfulsoup

dchan3's thoughtful extension of BS4

beautifulsoup beautifulsoup4 bs4 extension python web-scraping

Last synced: 21 Nov 2025

https://github.com/kayx23/indeed-scraper

Scrape job posts off Indeed Canada (ca.indeed.com)

bs4 scrapy selenium webscraping

Last synced: 27 Mar 2025

https://github.com/jatin-8898/mizuxe

A beautiful template theme named Mizuxe made using Bootstrap 4 beta :blue_book:

bootstrap bs4 collapse-button mizuxe navigation node-module theme

Last synced: 25 Aug 2025

https://github.com/benjetson/dining-alexa

Find out what's on the menu at Clemson's dining halls.

alexa aws-lambda beautifulsoup4 bs4 clemson cuhackit python python-3

Last synced: 30 Aug 2025

https://github.com/markolofsen/unrealon-parser-amazon

Enterprise-grade Amazon scraper with AI-powered automation, anti-detection, and real-time monitoring capabilities.

amazon bs4 parser parsing proxy scraper

Last synced: 13 Sep 2025

https://github.com/imvickykumar999/dark-web-scraping

https://www.upgrad.com/learn/

beautifulsoup bs4 webscraping

Last synced: 01 Mar 2025

https://github.com/justin-pyne/dota-liquipedia-web-scraper

Scraping information off Liquipedia from DOTA leagues with BeautifulSoup/Pandas for statistical analysis/EDA.

bs4 csv data-analysis pandas python scraper

Last synced: 13 Jul 2025

https://github.com/bieanimaton/github-scraping

This is project of a Github Scraping made with Python.

beautifulsoup4 bs4 github python python27 python3 requests scraping-websites

Last synced: 25 Dec 2025

https://github.com/hansputera/nekopoi-scrape

Scraping nekopoi menggunakan bahasa Python

beautifulsoup4 bs4 python3 scraping

Last synced: 08 Oct 2025

https://github.com/deeeelin/ig_message_analyzer

IG Message Analyzer , analyzes IG inbox messages , evaluates relationship : )

analytics analyze analyzer analyzers beutifulsoup bs4 entertainment html5 instagram object-oriented parse parser side-project tabulate

Last synced: 24 Jul 2025

https://github.com/frolov-andrey2405/multisite-parser-bot-for-telegram

This repository contains a Telegram bot that is able to parse multiple websites for information and send it to the user through private messages. The websites currently supported are www.vfxmed.com and https://blendermarket.com/.

aiomysql asyncio bs4 httpx json-api lxml mysql-database python3 re requests

Last synced: 22 Sep 2025

https://github.com/tanzeelgcuf/idealo-product-scraping-

This code is develop to scrape data from idealo.de website and compare the top products with good prices.

bs4 flask-application python scraping-python

Last synced: 11 Jun 2025

https://github.com/blackhatinside/btech_pydosc

Python Reconnaisance using Google Dorks and Web Scraping

bs4 dorking dorks hacking osint python regex scraping security

Last synced: 06 Jul 2025

https://github.com/rdil/crawley

My attempt at a web crawler.

bs4 crawler python python3 web

Last synced: 11 Jun 2025

https://github.com/kenvilar/data-analysis-using-python

Transforming a description of a location from an analyzed CSV file data using Pandas with Python 3

bs4 data-analysis jupyter pandas python python3 requests xlrd

Last synced: 04 Oct 2025

https://github.com/mistersoandso/minecraft-packet-protocol-database

Parsed Protocol Numbers from https://wiki.vg/Protocol_version_numbers

bs4 json minecraft packet requests webscraping

Last synced: 28 Mar 2025

https://github.com/karthikmprakash/google-airports-wifi-scraper

Scrape for WiFi APs and passwords in airports across the world.

airports bs4 chrome python regex scraper selenium web wifi

Last synced: 26 Feb 2025

https://github.com/renanstn/safety-vulnerabilities-detailed-info

Script utilizado para extrair detalhes de uma vulnerabilidade de uma página do pyup.io.

beautifulsoup beautifulsoup4 bs4 python

Last synced: 23 Mar 2025

https://github.com/rezamobaraki/goodreads-peewee-python

simple project on command-line | goodreads.com

beautifulsoup4 bs4 crawling orm peewee python requests scraping

Last synced: 04 Nov 2025

https://github.com/naveennk045/ai-webscraper

AI-powered web scraper designed to extract, process, and analyze data from websites efficiently.

bs4 groq-api llm python3 selinium

Last synced: 15 Jul 2025