An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with headless-browser

A curated list of projects in awesome lists tagged with headless-browser.

https://github.com/ariya/phantomjs

Scriptable Headless Browser

automation headless headless-browser phantomjs

Last synced: 22 Oct 2025

https://github.com/archivebox/archivebox

🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...

archivebox backups bookmark-archiver browser-bookmarks chromium digipres firefox headless-browser internet-archiving pinboard pocket python rss self-hosted singlefile warc wayback-machine web-archiving wget youtube-dl

Last synced: 09 Sep 2025

https://github.com/pirate/ArchiveBox

🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...

archivebox backups bookmark-archiver browser-bookmarks chromium digipres firefox headless-browser internet-archiving pinboard pocket python rss self-hosted singlefile warc wayback-machine web-archiving wget youtube-dl

Last synced: 01 Apr 2025

https://github.com/ArchiveBox/ArchiveBox

🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...

archivebox backups bookmark-archiver browser-bookmarks chromium digipres firefox headless-browser internet-archiving pinboard pocket python rss self-hosted singlefile warc wayback-machine web-archiving wget youtube-dl

Last synced: 13 Mar 2025

https://github.com/vgalin/html2image

A package acting as a wrapper around the headless mode of existing web browsers to generate images from URLs and from HTML+CSS strings or files.

chrome chromium chromium-browser css headless-browser html html2image python python3

Last synced: 28 Mar 2025

https://github.com/watzon/marionette

Selenium alternative for Crystal. Browser manipulation without the Java overhead.

api browser crystal crystal-lang devtools firefox headless headless-browser marionette puppeteer selenium selenium-webdriver shards testing webdriver

Last synced: 26 Aug 2025

https://github.com/scrapfly/python-scrapfly

Scrapfly Python SDK for headless browsers and proxy rotation

crawler headless-browser python scraper scraping scraping-api sdk web-scraper web-scraping

Last synced: 14 Apr 2025

https://github.com/TarekJor/bookmark-archiver

🗄 Save an archived copy of websites from Pocket/Pinboard/Bookmarks/RSS. Outputs HTML, PDFs, and more...

archive backup bookmarks browser chromium firefox google-chrome headless-browser headless-chrome html-export pinboard pocket preservation python rss safari web-archive web-archiving web-browser wget

Last synced: 27 Mar 2025

https://github.com/tarekjor/bookmark-archiver

🗄 Save an archived copy of websites from Pocket/Pinboard/Bookmarks/RSS. Outputs HTML, PDFs, and more...

archive backup bookmarks browser chromium firefox google-chrome headless-browser headless-chrome html-export pinboard pocket preservation python rss safari web-archive web-archiving web-browser wget

Last synced: 05 Oct 2025

https://github.com/hi-tech-ai/scraping-headless_mode

Simple web scraping from public website using headless mode.

headless headless-browser python selenium selenium-webdriver web-scraping

Last synced: 28 Apr 2025

https://github.com/alexmarqs/alm-invoices-cli

🧾 My personal CLI app to manage my invoices via Web Scraping.

automation chalk cli executable headless-browser inquirerjs meow nodejs pkg puppeteer typescript webscraping

Last synced: 15 Mar 2025

https://github.com/ciffelia/fast-speed-test

Unofficial CLI client for Fast.com Internet Speed Test

chromium firefox headless-browser nodejs playwright webkit

Last synced: 31 Dec 2025

https://github.com/oxylabs/puppeteer-tutorial

Use this tutorial and learn how to perform web scraping using a headless browser.

headless-browser puppeteer web-scraping

Last synced: 22 Jun 2025

https://github.com/avindra/casd-json-schema

Schema discovery tool for the CA technologies (now Broadcom) ServiceDesk / ServiceCatalog web services

broadcom ca-technologies data-engineering doselect headless-browser javascript json parsing scraper service-management servicedesk

Last synced: 05 Dec 2025

https://github.com/sofiane-abou-abderrahim/javascript-introduction-to-testing-synchronous-code

In this little JavaScript demo, I used the 3 main core types of testing: Unit Test, Integration Test and End-to-End Test (or User Interface Test). I tested my application with relatively simple synchronous code.

assertion-library end-to-end-test headless-browser intergration-test javascript jest nodejs test-runner unit-test webpack

Last synced: 30 Dec 2025

https://github.com/karlicoss/bt-wifi-reconnect

Make BT Wifi great again

automation headless-browser

Last synced: 07 Apr 2025

https://github.com/kihdev/playwright-stealth-4j

Playwright-Stealth for JVM – A Kotlin-based library to enhance Playwright's stealth capabilities for Java, Kotlin, and Groovy.

headless-browser java jvm kotlin playwright stealth

Last synced: 26 Mar 2025

https://github.com/pinkpixel-dev/prysm

Prysm is a blazing-smart Puppeteer-based web scraper that doesn't just extract - it understands structure. Capable of scraping virtually any website with intelligent content detection and 14 specialized scroll strategies that adapt to different page layouts, Prysm excels at extracting content that other scrapers miss.

api cloudflare-bypass content-extraction data-extraction headless-browser headless-browsers javascript nodejs pagination puppeteer web-automation web-scraper web-scraping

Last synced: 15 Oct 2025

https://github.com/codeterrayt/dare2024.com-solver

Dare2024.com Solver is a Python automation script for seamlessly solving Dare2024.com quizzes. Impress your friends with correct answers effortlessly. Compatible with all dare2024.com versions and future updates.

automation-script automation-scripts dare2024 headless-browser headless-browsers opensource python python3 quiz-solver selenium selenium-python selenium-webdriver web-automation web-automation-with-selenium web-scraping web-scraping-project web-scraping-python web-scraping-software webdriver-manager

Last synced: 23 Mar 2025

https://github.com/zmwangx/docker-selenium-python

Python 3, selenium, Chromium/chromedriver or Firefox/geckodriver.

docker dockerfile headless-browser python selenium

Last synced: 22 Mar 2025

https://github.com/bhattjayd/passbreachfinder

A Python script that checks whether a password has been compromised using the Have I Been Pwned service. The script automates the process of querying the website and retrieving the results for the given password, leveraging Selenium and a headless Firefox browser. It’s a simple tool for testing password security and checking for data breaches.

automation command-line-tool cybersecurity data-breach data-breach-checker geckodriver haveibeenpwned headless-browser password-leak password-security python security-tool selenium web-scraping web-scraping-python

Last synced: 07 Apr 2025

https://github.com/tamdilip/localstorage-grabber

A minimal web app to grab localStorage of a web page with headless browser using puppeteer on server side.

headless headless-browser headless-chrome javascript localstorage nodejs puppeteer rest-api

Last synced: 27 Oct 2025

https://github.com/matteuzzz/correos-cl-postal-code-scraper

Python-based scraper that automates the postal code lookup on the official Correos de Chile website. It simulates the public form with autocomplete validation and returns clean JSON responses. Fully API-ready for integration with Django or Flask backends.

api-ready chile codigo-postal correos-de-chile form-autocomplete headless-browser json-output playwright postal-code python web-scraping

Last synced: 15 Jun 2025

https://github.com/luminati-io/manage-failed-python-requests

Handle failed HTTP requests in Python using retry strategies with HTTPAdapter, Tenacity, and custom logic to improve web scraping reliability.

headless-browser http python requests scraping-browser status-codes tenacity web-scraping web-unblocker

Last synced: 28 Oct 2025

https://github.com/chandankhamitkar/billbot

This is BillBot 🤖, which acts as a simple invoice generator: it takes input from the user, converts it into an invoice, and sends the invoice image to the user. CURRENT STAGE: Enhancing 🚀

bot chromium-browser express gemini-ai headless-browser nextjs postgresql prisma puppeteer redis server telegram-bot-api typescript webhook

Last synced: 02 Mar 2025

https://github.com/llllllllllooedf/web-image-uploader-bot

Web Image Uploader Bot. Project Overview: This project automates real image uploads to websites using stealth browser automation. Unlike typical bot traffic that simulates page views, this system performs actual image uploads via form submission, mimicking real human behavior. Built for scalability, it’s ideal for boosti

automation bot-network botnetworks distributed-automation fingerprint-spoofing headless-browser human-like-interaction image-uploader proxy-rotation puppeteer selenium stealth-browsing web-automation web-image

Last synced: 16 Jun 2025

https://github.com/valpere/datascrapexter

Universal web scraper built with Go featuring advanced anti-detection, ethical compliance, and configuration-driven operation for any website.

anti-detection captcha-solver chromedp cobra-cli colly configuration-driven data-extraction data-scraping ethical-scraping go golang goquery headless-browser high-performance legal-compliance proxy-rotation viper web-crawling web-scraping

Last synced: 11 Jul 2025

https://github.com/jcloh98/rental-property-finder

A web scraper that helps users find rental properties by automatically gathering and organizing listings from various websites to discover available homes and apartments.

data headless-browser node scraper scraping web

Last synced: 23 Feb 2025

https://github.com/johnnylearnscs/nekofetch

NekoFetch – A smart, cat-powered CLI scraper for downloading videos, files, and subtitles with optional headless browser support.

automation cli ffmpeg file-downloader headless-browser media-scraper nekofetch playwright python scraper subtitles terminal-app video-downloader

Last synced: 20 Jul 2025

https://github.com/ali-jaan-butt/job-posting-scraping

Job posting data scraped from Indeed.com. This data is used in a Django web app for testing purposes.

headless-browser job job-posting job-postings jobs python scraping scraping-websites selenium selenium-python selenium-webdriver

Last synced: 23 Jul 2025

https://github.com/sultannaufal/puppeteer-mcp-server

Self-hosted Puppeteer MCP server with remote SSE access, API key authentication, and Docker deployment. Complete tool suite for browser automation via Model Context Protocol.

api authentication browser-automation docker headless-browser mcp model-context-protocol nodejs puppeteer remote-access server-sent-events sse typescript web-scraping

Last synced: 30 Dec 2025

https://github.com/thesethrose/fetch-browser

A powerful headless browser MCP server that enables AI agents to fetch web content and perform Google searches without requiring any API keys.

automation headless-browser mcp mcp-server

Last synced: 15 May 2025

https://github.com/macrat/ayd-web-scenario-scheme

A headless browser controller for Ayd status monitoring tool.

alerting ayd headless-browser monitoring

Last synced: 15 Dec 2025

https://github.com/gideon-k-addai/x-dm-followers

Python script that sends a DM to all the users that follow your X (formerly Twitter) account. With headless browser option and detailed debugging logs.

direct-message dm dm-tool headless headless-browser messager messaging selenium twitter twitter-dm twitter-dm-tool twitter-dmer twitter-dmer-tool x

Last synced: 23 Apr 2025

https://github.com/luminati-io/scraping-browser

Scraping Browser is an automated headless browser for effortless web scraping with Puppeteer, Selenium, and Playwright.

captcha-solving headless-browser headless-browsers javascript nodejs playwright proxy-server puppeteer python scraping-browser selenium web-scraping

Last synced: 02 Apr 2025

https://github.com/heyhaiden/mcp-ag-grid

Headless AG Grid server for advanced data visualization, manipulation, and export, seamlessly integrated with Claude Desktop.

ag-grid claude-desktop data-grid data-visualization headless-browser mcp open-source puppeteer

Last synced: 07 Sep 2025

https://github.com/inqwise/inqwise-mcp-site2ts

Convert existing websites into TypeScript Next.js apps via an MCP server (Rust) with a Node/Playwright helper. ARM-first, Tailwind-first, sandboxed under `.site2ts/`

arm64 headless-browser mcp nextjs playwright rust tailwindcss typescript web-crawler

Last synced: 07 Oct 2025

https://github.com/instill-network/serp

Google SERP via Playwright: minimal scraper and benchmarking CLI. CLI & Docker; headless/headful (VNC); focuses on organic results.

benchmarking cli docker google-search headless-browser nodejs playwright proxies residential-proxies serp typescript vnc web-scraping

Last synced: 30 Dec 2025

https://github.com/just-rich/x-dm-followers

Python script that sends a DM to all the users that follow the account. With headless browser option and detailed debugging logs.

direct-message direct-messaging dm headless headless-browser messager messaging python selenium selenium-webdriver twitter twitter-dm twitter-dm-tool twitter-dmer twitter-dmer-tool x x-dm x-dm-tool x-dmer x-dmer-tool

Last synced: 10 Oct 2025