Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

https://github.com/chiraag-kakar/PyAutomation

Simple and Useful Automation Tools built with the help of modules available with Python published at PyPI.

beautifulsoup4 pdf2text pypi-packages python-automation python3 regex regexp-search requests truth-table-generator ttg worldometer-scraping worldometers

Last synced: 04 Jul 2024

https://github.com/ruichongliu/Crawler_pubg.op.gg

This is a web crawler for pubg.op.gg, written by Ruichong Liu. 绝地求生游戏数据抓取

beautifulsoup4 crawler pubg python3 scrape selenium

Last synced: 02 Jul 2024

https://github.com/lb2281075105/Python-Spider

豆瓣电影top250、斗鱼爬取json数据以及爬取美女图片、淘宝、有缘、CrawlSpider爬取红娘网相亲人的部分基本信息以及红娘网分布式爬取和存储redis、爬虫小demo、Selenium、爬取多点、django开发接口、爬取有缘网信息、模拟知乎登录、模拟github登录、模拟图虫网登录、爬取多点商城整站数据、爬取微信公众号历史文章、爬取微信群或者微信好友分享的文章、itchat监听指定微信公众号分享的文章

beautifulsoup4 crawlspider django itchat mongodb mysql pymysql python redis scrapy selenium spider weichat xpath

Last synced: 30 Jun 2024

https://github.com/roniemartinez/dude

dude uncomplicated data extraction: A simple framework for writing web scrapers using Python decorators

async beautifulsoup4 crawler css framework lxml parsel playwright python scraper scraping selenium sync web-scraping webscraping xpath

Last synced: 27 Jun 2024

https://github.com/mohdsaqibhbi/easy_images

Download hundreds of images from Google. Do image post processing later.

beautifulsoup4 image-duplicate-detection opencv python python-magic requests selenium tabulate tqdm

Last synced: 27 Jun 2024

https://github.com/BufferingIO/film2subtitle

A REST API for film2subtitle.com website written in Python and FastAPI ⚡

beautifulsoup4 fastapi film2subtitle python rest-api web-scraping

Last synced: 27 Jun 2024

https://github.com/monkeyapple/TOC-Generator-for-Bear-Note

Generating TOC(Table-of-contents) for Bear Notes users.

bear-notes beautifulsoup4 markdown pyqt5 python3 toc-generator

Last synced: 27 Jun 2024

https://github.com/pcomputo/Whole-Foods-Delivery-Slot

Automated script for Whole Foods and Amazon Fresh delivery slot

amazon automation beautifulsoup4 cart chrome firefox-script python selenium-python webdriver wholefoods

Last synced: 24 Jun 2024

https://github.com/nigeld3v/Tumblr_Image_scrape

Download ALL the images (JPEG/GIF/PNG) from any Tumblr website! This project employs Python3 and BeautifulSoup4 to scrape a Tumblr site (with the url provided by the user) to download, page by page, all the images from the Tumblr site's posts. Ideal for archiving other peoples' Tumblrs <3

archive art beautifulsoup beautifulsoup4 blog blogging comics design fashion gif gifs graphics graphics-library image images scraper tumblr tumblr-image-scrape webcomics website-scraper

Last synced: 17 Jun 2024

https://github.com/PaulMcInnis/JobFunnel

Scrape job websites into a single spreadsheet with no duplicates.

automated beautifulsoup beautifulsoup4 csv glassdoor indeed international job jobs monster python scraper search tfidf waterloo yaml

Last synced: 08 Jun 2024

https://github.com/bdibon/books-to-scrape

A python script that leverages asyncio to achieve a blazingly fast webscraping (with beautifulsoup).

asyncio beautifulsoup4 python python3

Last synced: 07 Jun 2024

https://github.com/shashanoid/Gen2Kindle

Search, download, convert and send files directly to your kindle from Libgen in one place.

beautifulsoup4 books genlibrus kindle kindlegen libgen python

Last synced: 26 May 2024

https://github.com/Jimut123/jimutmap

API to get enormous amount of high resolution satellite images from satellites.pro quickly through multi-threading! create map your own map dataset. Bringing data to Humans.

api areal-image beautifulsoup4 dataset deep-learning-dataset enormous fake-header geo high image images jimutmap ml multithreading resolution satellite satellite-data scrape scraping segmentation-mask

Last synced: 21 May 2024

https://github.com/bomquote/transistor

Transistor, a Python web scraping framework for intelligent use cases.

beautifulsoup4 framework headless-browsers lxml mechanicalsoup python-3 requests scraping

Last synced: 19 May 2024

https://github.com/sushil79g/Facebook-message-to-CSV-exporter

This script helps in producing the chat dataset(in csv format) of your facebook chat

beautifulsoup4 chat chatbot csv facebook message scraper

Last synced: 16 May 2024

https://github.com/Joel-hanson/Iceberg-locations

Current Antarctic large iceberg positions derived from ASCAT and OSCAT-2

beautifulsoup4 climate-change git-scraping iceberg python scraping

Last synced: 08 May 2024

https://github.com/paulosabayomi/web-links-analysis-reports

The repository containing the log files of the links analysis reports of web pages, basically broken links report

beautifulsoup4 broken-link-checker broken-link-finder broken-links python requests-module seo

Last synced: 04 May 2024

https://github.com/codingforentrepreneurs/Web-Scraping

Learn how to leverage Python's amazing tools to scrape data from other websites. The end goal of this course is to scrape blogs to analyze trending keywords and phrases. We'll be using Python 3.6, Requests, BeautifulSoup, Asyncio, Pandas, Numpy, and more!

aysncio beautifulsoup beautifulsoup4 joincfe numpy pandas python python-requests python3 requests scraper sraping tutorial web-scraping

Last synced: 13 Apr 2024

https://github.com/cwjokaka/ok_ip_proxy_pool

🍿爬虫代理IP池(proxy pool) python🍟一个还ok的IP代理池

aiohttp async beautifulsoup4 crawler flask http ip pool proxy proxypool py python python3 spider sqlite

Last synced: 01 Apr 2024

https://github.com/Trinkle23897/learn2018-autodown

清华大学新版网络学堂课程自动下载脚本 / A python script to clone all files from learn.tsinghua.edu.cn

beautifulsoup4 python3 requests tqdm

Last synced: 19 Mar 2024

https://github.com/Igglybuff/mreg

Generates a "Match releases" expression for your autodl-irssi filter based on dvdsreleasedates.com's "Most Requested DVD Release Dates" section.

autodl-irssi bash beautifulsoup4 flask python3 requests

Last synced: 17 Mar 2024

https://github.com/shihabshahrier/advising_PlusPlus_bracU

Web Scraping Advising Info (BRACU)

beautifulsoup4 django-framework python3

Last synced: 16 Mar 2024