Projects in Awesome Lists tagged with cheerio
A curated list of projects in awesome lists tagged with cheerio .
https://github.com/cheeriojs/cheerio
The fast, flexible, and elegant library for parsing and manipulating HTML and XML.
cheerio dom hacktoberfest html htmlparser htmlparser2 jquery parser scraper selector
Last synced: 12 Dec 2025
https://github.com/bda-research/node-crawler
Web Crawler/Spider for NodeJS + server-side jQuery ;-)
cheerio crawler extract-data javascript jquery nodejs spider
Last synced: 13 May 2025
https://github.com/ozanmakes/wring
Extract content from webpages using CSS Selectors, XPath, and JS expressions
Last synced: 02 Mar 2025
https://github.com/xiyuan-fengyu/ppspider
web spider built by puppeteer, support task-queue and task-scheduling by decorators,support nedb / mongodb, support data visualization; 基于puppeteer的web爬虫框架,提供灵活的任务队列管理调度方案,提供便捷的数据保存方案(nedb/mongodb),提供数据可视化和用户交互的实现方案
angular cheerio crawler headless mongodb nedb node node-spider nodejs nodejs-spider proxy puppeteer spider task-queue task-scheduling typescript
Last synced: 05 Apr 2025
https://github.com/chenelias/cambridge-dictionary-api
API for Cambridge Dictionary, written in Node.js.
api cambridge-dictionary cambridge-dictionary-api cheerio express free-dictionary-api nodejs
Last synced: 24 Apr 2025
https://github.com/LuXDAmore/nuxt-prune-html
🔌⚡ Nuxt module to prune html before sending it to the browser (it removes elements matching CSS selector(s)), useful for boosting performance showing a different HTML for bots/audits by removing all the scripts with dynamic rendering
audit bot cheerio dynamic-rendering html lighthouse measure modules nuxt nuxt-module nuxtjs optimization optimization-algorithms optimize pagespeed-insights performance prune pruning vuejs web-vitals
Last synced: 30 Mar 2025
https://github.com/luxdamore/nuxt-prune-html
🔌⚡ Nuxt module to prune html before sending it to the browser (it removes elements matching CSS selector(s)), useful for boosting performance showing a different HTML for bots/audits by removing all the scripts with dynamic rendering
audit bot cheerio dynamic-rendering html lighthouse measure modules nuxt nuxt-module nuxtjs optimization optimization-algorithms optimize pagespeed-insights performance prune pruning vuejs web-vitals
Last synced: 07 May 2025
https://github.com/kiesun/vue-studymaps
使用 Vue.js 开发的聚合应用。通过爬虫抓取平时浏览的网站,省去逐个点开网页的时间。
cheerio express superagent vue vue-router vuex
Last synced: 25 Jun 2025
https://github.com/tazeg/sample-web-scraping-with-electron
Sample project for web scraping with Electron
cheerio electron javascript scraping scraping-websites
Last synced: 14 Jun 2025
https://github.com/huangxizhou/dytt-reptitle
🐜 Dytt crawler
cheerio mongodb node-reptitle nodejs npm
Last synced: 21 Mar 2025
https://github.com/transitive-bullshit/scrape-github-trending
Tutorial for web scraping / crawling with Node.js.
cheerio crawling scraping tutorial
Last synced: 30 Apr 2025
https://github.com/lifailon/torapi
Unofficial API (backend) and RSS for RuTracker, Kinozal, RuTor and NoNameClub for receiving torrent files and detailed information about distribution.
api api-server axios cheerio express expressjs github-actions javascript js nodejs postman postman-test rest-api rss rutorrent rutracker swagger torrent tracker web
Last synced: 27 Apr 2025
https://github.com/bupt-hjm/buptclass
A nodejs-spider that gets the infomation of empty classrooms in BUPT
cheerio node-tesseract superagent tesseract-ocr
Last synced: 14 Oct 2025
https://github.com/sid24rane/Personal-Chef
An Self learning AI Chatbot who doesnt let you waste food by recommending awesome Recipies
android api api-ai app chatbot cheerio glide material-design nodejs recipe recipe-details
Last synced: 01 May 2025
https://github.com/asimpson/nodejs-web-scraper-cookbook
📝 Resources for web scraping with node.js
cheerio nodejs scrapers web-scrapers
Last synced: 23 Mar 2025
https://github.com/viperadnan-git/pasting
Publishing tool made in nodejs using deta.
cheerio deta deta-base markedjs nodejs pastebin pastebin-service publishing-service telegraph
Last synced: 19 Jul 2025
https://github.com/apify/super-scraper
Generic REST API for scraping websites. Drop-in replacement for ScrapingBee, ScrapingAnt, and ScraperAPI services. And it is open-source!
api apify cheerio javascript nodejs playwright scraping typescript web-scraping
Last synced: 03 Nov 2025
https://github.com/robert-z/simple-pokemon-json-api
🐸 A simple Pokémon API used in APIs introduction lessons at Skylab Coders Academy.
cheerio es6 express javascript
Last synced: 22 Jun 2025
https://github.com/hansputera/otakudesu-scrape
A module that retrieves data from otakudesu.vip
anime-scraper cheerio javascript otakudesu typescript
Last synced: 02 Sep 2025
https://github.com/capturr/scraper
All In One API to easily scrape data from any website, without worrying about captchas and bot detection mecanisms.
captcha cheerio crawler crawling data declarative extract growth-hacking html javascript json jsonld nodejs recaptcha scraper scraping spider typescript web web-scraping
Last synced: 02 Aug 2025
https://github.com/cheeriojs/cheerio-select
CSS selector engine supporting jQuery selectors, based on css-select
cheerio jquery jquery-positional-selectors
Last synced: 04 Apr 2025
https://github.com/popcorn-official/pop-api-scraper
The base modules for the popcorn-api scraper
cheerio http popcorn popcorn-api popcorn-time
Last synced: 08 Aug 2025
https://github.com/wahengchang/node-dcard-scraper
it is an example of implementing cheerio scraper of extracting images in dcard
cheerio crawler dcard example javascript nodejs npm scraper tutorial
Last synced: 11 Apr 2025
https://github.com/yashkathe/download-comicbooks-api
Unoffical api to get comic books from various publishers. Published on NPM !
axios cheerio comic-books comic-downloader comics comics-scraper dccomics marvel marvel-comics nodejs npm web-scrapping
Last synced: 14 Oct 2025
https://github.com/jonschlinkert/html-toc
Generate a HTML table of contents using cheerio.
cheerio html table-of-contents toc
Last synced: 23 Apr 2025
https://github.com/yashkathe/f1-api-json
F1-API is a TypeScript-based web scraping API designed to extract information about Formula 1 races, drivers, cars, standings, and race schedules. This powerful web scraper automates the process of gathering data and aggregating it into a structured format for easy analysis and consumption.
axios cheerio f1 f1-api formula1 formula1-analysis formula1-api nodejs npm typescript web-scraping
Last synced: 09 Apr 2025
https://github.com/emoji-gen/browser-extension
:earth_asia: Ultimate Browser Extension for Emoji Generator
cheerio chrome emoji firefox typescript webextensions
Last synced: 24 Oct 2025
https://github.com/risyasin/arachnod
High performance crawler for Nodejs
cheerio crawler javascript nodejs redis scraper spider
Last synced: 05 Apr 2025
https://github.com/ekamid/cricbuzz-live
Unofficial API for data fetching from Cricbuzz.com
cheerio cricbuzz cricket cricket-data cricket-score scrapper
Last synced: 26 Oct 2025
https://github.com/hoangsonww/ai-gov-content-curator
💡An end-to-end solution for aggregating, summarizing, and displaying news articles using an AI-powered backend, an automated CRON crawler, and a responsive Next.js frontend. It integrates technologies like Express.js, MongoDB, Puppeteer, and GenAI/LLMs to deliver up-to-date, curated content to government staff and other users.
artificial-intelligence axios cheerio crawler cron cronjob docker express expressjs google-generative-ai mongodb mongoose nextjs nodejs puppeteer react shadcn-ui tailwindcss typescript vercel
Last synced: 09 Apr 2025
https://github.com/mohammad-mghn/dev-tab
WEB TAB makes it easy for you to stay up-to-date with the latest developer news, tools, jobs and events.
axios cheerio fs nextjs nextjs-13 react tailwindcss typescript web-scraping
Last synced: 10 Oct 2025
https://github.com/gal-dahan/crawling-jobs
Scans jobs in Israel By scraping public websites using a Node.js, every six hours.
cheerio crawling-jobs israel nodejs
Last synced: 16 Apr 2025
https://github.com/4awpawz/trio
Fast, simple yet powerful JavaScript-driven static site generation.
blog-engine categories category cheerio css html javascript markdown sass static-site-generator static-website-generator tag tags
Last synced: 15 May 2025
https://github.com/whjin/node-tbuys
用户购物电商网站:Node.js+Express+MongoDB+Mongoose
bootstrap5 cheerio express mongodb mongoose nodejs
Last synced: 11 Apr 2025
https://github.com/diosamuel/zippydamn
Unofficial Zippyshare CLI tools (download and search)
anime cheerio cli downloader hacktoberfest nodejs scraper zippyshare
Last synced: 15 Apr 2025
https://github.com/mortezasabihi/mrtehran.com-scraper
mrtehran.com node js scrapper with api
cheerio cheerio-js cheeriojs nodejs-scraping scraper scraper-api
Last synced: 21 Jul 2025
https://github.com/xusd320/niddle
A super fast nodejs addon for html parsing and manipulation written in rust.
cheerio htmlparser jquery napi node-addon rust
Last synced: 24 Dec 2025
https://github.com/qiufeihong2018/navigation-server
:chocolate_bar: A website collection and navigation platform backend project
cheerio commitizen db eslint express express-session istanbul mocha mongoose navigation passport-local-mongoose should supertest winston
Last synced: 10 Jul 2025
https://github.com/mrelvin/github-profile-chart
👨💻 Visualize your github profile.
cheerio github-api-v4 graphql koa2 vue2
Last synced: 26 Feb 2025
https://github.com/viclafouch/fetch-crawler
📌 A Node.JS Web crawler using the API Fetch to scrap static websites
cheerio crawler crawling-sites fetch-api nodejs promises scrapping
Last synced: 01 Aug 2025
https://github.com/mitica/ascrape-js
Extracts article content from a web page.
Last synced: 05 Aug 2025
https://github.com/suspiciouslookingowl/scrape-yt
Simple lib to scrape information from youtube such as search results, video information, related videos, playlist information and up next video
Last synced: 02 Oct 2025
https://github.com/dsc8x/node-scraper
Scraping websites made easy! A minimalistic yet powerful tool for collecting data from websites.
axios cheerio javascript node scraper scraping website-scraper
Last synced: 01 Apr 2025
https://github.com/lukel97/room-booking-tcd
A web app for booking group study rooms within TCD
bootstrap cheerio graphql javascript nokogiri react ruby
Last synced: 06 May 2025
https://github.com/osintt/xvideos.js
🔞 xvideos.com wrapper based on typescript
cheerio porn puppeteer scraper scraping spider typescript xvideos xvideos-api
Last synced: 18 Aug 2025
https://github.com/saiyamdubey/erp_automation
Here I am using different diffrent unique library to handle the Links and files and to Download that staff smoothlly ...🤑
automation backed cheerio downloder es6-javascript express-js javascript nodejs puppeteer server
Last synced: 15 Jul 2025
https://github.com/capturr/jsonld-extract
A damn simple tool to extract json-ld metadata from webpage using jquery like api (jQuery, Cheerio, CashDom ...).
cashdom cheerio crawler crawling data extract extractor javascript jquery json jsonld metadata nodejs parser scraper scraping spider typescript
Last synced: 24 Mar 2025
https://github.com/mariazevedo88/airbnb-scraper
Web scraper/crawler of Airbnb Brazilians page
airbnb cheerio cheerio-js cheerio-node node node-js nodejs puppeteer scraper scraperjs scrapers scraping scraping-websites scrapy-crawler yarn
Last synced: 19 Apr 2025
https://github.com/beautifulmoon211/onthemarket-scraping
Web scraping tool used to extract real estate information from OnTheMarket.com, a leading property portal in the United Kingdom.
cheerio data-extraction onthemarket onthemarket-scraper real-estate requests typescript web-scraper
Last synced: 13 Jun 2025
https://github.com/cornayy/dofus-scraper
An open-source Dofus encyclopedia scraper.
cheerio dofus dofus-scraper encyclopedia node nodejs scraping ts typescript
Last synced: 03 Oct 2025
https://github.com/doyourbest96/ai-powered-profile-scraper-node
Next.js project that serves as a user scraping tool. It utilizes Puppeteer, a Node.js library for automating web browsers, to scrape user data from various sources.
cheerio mongo-db nextjs playwright puppeteer tailwindcss
Last synced: 10 Sep 2025
https://github.com/davekeehl/f1-insight
A visual analytics web application to gather insight into the current Formula 1 season.
analytics cheerio ergast-api f1 insight nextjs nivo results tailwind typescript
Last synced: 19 Mar 2025
https://github.com/ariear/otakudesu-api
UnOfficial Otakudesu API 👀
anime cheerio honojs otakudesu-api scraping
Last synced: 19 Jul 2025
https://github.com/rafaelpermec/live-broker-api
Um estudo sobre raspagem de dados em back-end, simulando uma corretora que realiza ações de compra e venda de ativos e fluxo de caixa de clientes em tempo real.
authentication authorization backend-api cheerio data-science express helmet jwt-authentication mysql nodejs typescript web-scraping
Last synced: 19 Apr 2025
https://github.com/mariazevedo88/imdb-scraper
IMDB webscraper with Request-Promise, Cheerio and Nightmare.js
cheerio cheerio-js cheeriojs nightmarejs node node-js nodejs request request-promise scraper scrapers scraping-websites scrapy-crawler
Last synced: 19 Apr 2025
https://github.com/jonschlinkert/gulp-html-toc
Gulp plugin for html-toc, for generating a HTML table of contents.
cheerio gulp gulpplugin html html-toc table-of-contents toc
Last synced: 12 May 2025
https://github.com/paulo9mv/presidente-discord-bot
Bot do Discord que envia citações aleatórias dos presidentes do Brasil. :speech_balloon:
cheerio discord discord-bot discordjs hacktoberfest javascript js json nodejs request
Last synced: 07 Apr 2025
https://github.com/michaelworm/grunt-inky
A grunt plugin for ZURB Inky. https://github.com/zurb/inky
cheerio grunt-plugins inky zurb
Last synced: 13 Apr 2025
https://github.com/breakdance/breakdance-cli
CLI for breakdance, the HTML to markdown converter for node.js.
breakdance cheerio cli command-line convert html markdown snapdragon
Last synced: 24 Jul 2025
https://github.com/leftmove/paul-graham
RSS feed for Paul Graham's essays.
cheerio paul-graham rss typescript
Last synced: 29 Jun 2025
https://github.com/joseluria/linkedin-scrapping
Un simple web scraper que busca trabajos de React por medio de LinkedIn.
cheerio eslint github-actions nodejs prettier ts-standard typescript zod
Last synced: 11 Apr 2025
https://github.com/jbris/react-typescript-graphql
A simple search tool to retrieve git repo information from GitHub, GitLab, and Bitbucket. Uses TypeScript and GraphQL for server-side API searches, and React.js for client-side rendering.
apollo apollo-server bitbucket cheerio cheeriojs docker docker-compose docker-image github gitlab graphql inversify inversifyjs makefile node nodejs react typegraphql typescript
Last synced: 04 May 2025
https://github.com/mariazevedo88/reddit-scraper
Web scraper/crawler of Reddit page
cheerio cheerio-js cheeriojs mongodb mongodb-atlas mongoose reddit reddit-bot request request-promise scraper scraperjs scrapers scraping-websites scrapy-crawler
Last synced: 19 Apr 2025
https://github.com/mglagola/chartlandia
React Native & React Native Web App that pulls crypto currency ticker info from coinmarketcap.com
cheerio coinmarketcap cryptocurrency express nextjs nodejs react react-native react-native-web redux
Last synced: 12 May 2025
https://github.com/mirsahib/dpdc-bill-manager
:zap: An app to manage multiple electricity bill for Dhaka Power Distribution Company :zap:
cheerio dpdc expressjs nodejs reactjs scraper utility-manager
Last synced: 12 Apr 2025
https://github.com/somritdasgupta/aide
aiDe is a browser extension that combines a sideKick bar and web UI for local LLMs provided by Ollama library, with online web search capabilities.
cheerio llm next-js ollama vector-database
Last synced: 24 Apr 2025
https://github.com/bes-js/burc-yorum
Türkçe Günlük/Haftalık/Aylık Burç Yorumları/Burç Özelliklerini Gösteren NPM Paketi.
axios burc burc-yorum burclar burcyorum cheerio commonjs esmodule npm-package zodiac zodiac-sign
Last synced: 23 Mar 2025
https://github.com/skorotkiewicz/fakeid-api
FakeID scraping API
cheerio fakeid fakenamegenerator nodejs restful-api scraper
Last synced: 25 Oct 2025
https://github.com/dreamjet31/linkedinscraping-using-zenrow
Scraping linkedin company profile using puppeteer and zenrow
axios cheerio googlesheetapi javascript linkedin linkedin-scraper parallel puppeteer zenrows
Last synced: 18 Aug 2025
https://github.com/fahimfba/web-scraper
Extract data from websites using the web-scrapper. Made with nodejs, ExpressJS, axios & cheerio.
axios cheerio cheeriojs javascript js npm npm-package webscrape webscraping webscraping-data webscraping-search webscrapper
Last synced: 14 Apr 2025
https://github.com/yatharth1706/imdb_scraper
Web Scraper to extract meta-data of movies 🎬 from Official IMDb WebSite
cheerio javascript nodejs request
Last synced: 01 Sep 2025
https://github.com/fahimfba/simple-web-scrapper
Extract data from websites using the web-scrapper. Made with nodejs, ExpressJS, axios & cheerio.
axios cheerio cheeriojs javascript js npm npm-package webscrape webscraping webscraping-data webscraping-search webscrapper
Last synced: 10 Oct 2025
https://github.com/ariear/tikchan
the minimal tik-tok downloader
cheerio tiktok tiktok-api tiktok-downloader web-scraping
Last synced: 07 May 2025
https://github.com/gkhan205/link-previewer
Web Scrapping app with NodeJS and ReactJS which generates Previews of the Link user input like in Social Media Apps
axios cheerio full-stack-web-development fullstack-development link-preview nodejs reactjs web-scrapping
Last synced: 30 Apr 2025
https://github.com/varunon9/github-scraper
A nodejs script (using cheerio module) to extract github users information and save to json file.
cheerio github-scraping nodejs-scraping web-scraping
Last synced: 11 Apr 2025
https://github.com/juliandavidmr/htmltabletolatex
Converting html tables to latex tables
cheerio generator-latex latex tables typescript
Last synced: 19 Oct 2025
https://github.com/alenvelocity/mywaifulist-scraper
Unofficial MyWaifuList Scraper
anime cheerio mywaifulist scraper waifu
Last synced: 19 Jul 2025
https://github.com/pinkpixel-dev/web-scout-mcp
A powerful MCP server extension providing web search and content extraction capabilities. Integrates DuckDuckGo search functionality and URL content extraction into your MCP environment, enabling AI assistants to search the web and extract webpage content programmatically.
ai-assistant ai-tools cheerio content-extraction crawler duckduckgo duckduckgo-search google-search mcp mcp-server web-content web-crawler web-scraper web-scraping web-search web-search-agent
Last synced: 23 Jun 2025
https://github.com/guilhermebkel/ah-negao-discord-bot
:robot: A WIP discord bot that retrieves posts from "Ah Negão! - https://www.ahnegao.com.br" to a discord channel.
ahnegao bot cheerio discord discordjs webscraping
Last synced: 23 Sep 2025
https://github.com/dikaardnt/scrape-primbon
Scrape From primbon.com
axios cheerio primbon scrape-primbon
Last synced: 22 Jul 2025
https://github.com/nomansiddiqui0000/rozee.pk-jobs-scrapper
This scraper, built in Node.js using Puppeteer and Cheerio, is designed to extract job listings from the Rozee.pk website. It can scrape multiple pages and gather detailed information, including job titles, company names, skills, and more. The output is saved in structured CSV files, with sample datasets for cities like Lahore, Karachi, etc.
automation cheerio javascript jobs jobs-scraping jobs-search jobscraper nodejs puppetter scra scraped-data scraper-api scraping scraping-websites scrapy-crawler
Last synced: 27 Oct 2025
https://github.com/deveshsangwan/cricketscoreapi
Welcome to the Cricket Score API! This project is designed to provide real-time cricket scores using TypeScript and npm. It uses technologies like Cheerio for web scraping, Prisma for accessing MongoDB, Express-jwt for authentication, and Chai and Mocha for testing.
api cheerio cricket cricket-api cricket-data cricket-score cricket-stats docker express jwt-authentication livescore mocha-chai mongodb nodejs prisma realtime-data rest-api sports-data typescript webscraping
Last synced: 08 Oct 2025
https://github.com/felipetodev/apartment-scraper
home seeker for homeless devs 🏚️
cheerio cloudflare-workers hono nodejs scraper vitest
Last synced: 02 Aug 2025
https://github.com/konjoinfinity/songscraper
A song chart scraper for converting ultimate guitar tabs and charts into google docs 🎶🎵🎸🎹📄
chart cheerio googledocs googledrive googledriveapi music pupeteer scraper
Last synced: 02 May 2025
https://github.com/moos/quget
Get web snippets from the command-line.
cheerio cli css-selector jquery snippet web-snippets
Last synced: 12 Apr 2025
https://github.com/imgss/greendot
node小项目,抓取github的小绿点,用chart.js生成图表,显示在浏览器上
Last synced: 26 Feb 2025
https://github.com/ignema/movnots
Enjoy Motivational Quotes From The Comfort of Your Homescreen
axios cheerio cronjob express nodejs notification-api socket-io
Last synced: 16 Sep 2025
https://github.com/sirwanafifi/tesco-clubcards-of-the-day
Tesco ClubCards of the Day is a script developed using Bun to fetch the latest prices of products available with Tesco ClubCards. The script saves the data in a daily JSON file, making it easy to track price changes and deals over time.
bun cheerio clubcard scraping supermarket tesco
Last synced: 31 Mar 2025
https://github.com/cyan33/front-end-data-visualization
:eyes: Web Data Visualization for the graduate thesis in 2017.
cheerio data-visualization echarts expressjs frontend javascript
Last synced: 06 Nov 2025
https://github.com/sarthak-0-sach/amazon_webscraper_application
A Next.js and Bright Data-powered e-commerce product scraping site. Get notified on price drops and stock status. Automate with cron jobs.
bright-data cheerio headless-ui mongodb nextjs nodemailer responsive tailwind-css web-scraper
Last synced: 11 Apr 2025