Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

https://github.com/kyng-cytro/Screen-Time-Bot

Telegram bot to get Movies and TV-Shows updates

bot mongodb python3 telegram telegram-bot web-scraper web-scraping

Last synced: 27 Jun 2024

https://github.com/tholian-network/stealth

:rocket: Stealth - Secure, Peer-to-Peer, Private and Automateable Web Browser/Scraper/Proxy

anonymity browser-automation privacy-protection web-browser web-filter web-proxy web-scraper

Last synced: 09 Jun 2024

https://github.com/gildas-lormeau/single-file-cli

CLI tool for saving a faithful copy of a complete web page in a single HTML file

cli deno nodejs single-file web-archiving web-scraper web-scraping

Last synced: 01 Jun 2024

https://github.com/k0rnh0li0/onlyfans-dl

OnlyFans content downloader

media-downloader onlyfans python web-scraper

Last synced: 31 May 2024

https://github.com/anthonygharvey/developer_news_cli_app

A Ruby CLI app that scrapes article information from FreeCodeCamp, HackerNoon, Codeburst and A List Apart.

cli-app nokogiri ruby ruby-gem web-scraper

Last synced: 28 May 2024

https://github.com/cassidoo/scrapers

A list of scrapers from around the web.

list scrape-websites scraper web-scraper

Last synced: 28 May 2024

https://github.com/khuyentran1401/top-github-scraper

Scrape top GitHub repositories and users based on keywords

github github-api python scraping web-scraper web-scraping

Last synced: 27 May 2024

https://github.com/aaryanrr/DownDetector-CLI

CLI Client for DownDetector.com

cli downdetector python3 web-scraper web-scraping

Last synced: 26 May 2024

https://github.com/PhantomInsights/tweet-transcriber

A Reddit bot that transcribes tweets from comment and submission links, mirrors their images and replies with a formatted Markdown message.

beautifulsoup imgur praw python3 reddit-bot web-scraper

Last synced: 21 May 2024

https://github.com/PhantomInsights/summarizer

A Reddit bot that summarizes news articles written in Spanish or English. It uses a custom-built algorithm to rank words and sentences.

nlp praw python3 reddit-bot spacy web-scraper wordcloud

Last synced: 21 May 2024

https://github.com/anaskhan96/soup

Web Scraper in Go, similar to BeautifulSoup

beautifulsoup go golang html-node web-scraper webscraper webscraping

Last synced: 11 May 2024

https://github.com/postmodern/spidr

A versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.

crawler ruby scraper spider spider-links web web-crawler web-scraper web-scraping web-spider

Last synced: 01 May 2024

https://github.com/boring-dragon/scraper

scraper for avas.mv

python scraper web-scraper

Last synced: 29 Apr 2024

https://github.com/austinoboyle/scrape-linkedin-selenium

`scrape_linkedin` is a Python package that allows you to scrape personal LinkedIn profiles and company pages, turning the data into structured JSON.

linkedin python scrape scraper scraping selenium selenium-webdriver web-scraper web-scraping

Last synced: 26 Apr 2024

https://github.com/BruceDone/awesome-crawler

A collection of awesome web crawlers and spiders in different languages

awesome crawler node-crawler scraper spider web-crawler web-scraper

Last synced: 19 Apr 2024

https://github.com/paulpierre/markdown-crawler

A multithreaded 🕸️ web crawler that recursively crawls a website and creates a 🔽 markdown file for each page, designed for LLM RAG

html-to-markdown html-to-markdown-converter html2md llm llmops markdown markdown-crawler markdown-parser markdown-scraper md-crawler rag web-scraper

Last synced: 11 Apr 2024

https://github.com/scrapehero/yellowpages-scraper

Yellowpages.com Web Scraper written in Python and LXML to extract business details available based on a particular category and location.

business-directory extract html lxml parsing python scraper web-scraper yellow-pages yellow-pages-scraper

Last synced: 08 Apr 2024

https://github.com/epiqueras/getsy

A simple browser/client-side web scraper.

browser client-side scraper web-scraper

Last synced: 25 Mar 2024

https://github.com/gosom/google-maps-scraper

Scrape data from Google Maps. Extracts data such as the name, address, phone number, website URL, rating, number of reviews, latitude and longitude, reviews, email and more for each place.

distributed-scraper distributed-scraping golang google-maps google-maps-scraping web-scraper web-scraping

Last synced: 17 Mar 2024

https://github.com/cobalt-uoft/uoft-scrapers

Public web scraping scripts for the University of Toronto.

open-data toronto uoft web-scraper

Last synced: 16 Mar 2024