Projects in Awesome Lists tagged with extractor
A curated list of projects in awesome lists tagged with extractor .
https://github.com/peazip/PeaZip
Free Zip / Unzip software and Rar file extractor. Cross-platform file and archive manager. Features volume spanning, compression, authenticated encryption. Supports 7Z, 7-Zip sfx, ACE, ARJ, Brotli, BZ2, CAB, CHM, CPIO, DEB, GZ, ISO, JAR, LHA/LZH, NSIS, OOo, PAQ/LPAQ, PEA, QUAD, RAR, RPM, split, TAR, Z, ZIP, ZIPX, Zstandard.
archiver brotli compression encryption extractor paq peazip rar zip zstandard
Last synced: 30 Mar 2025
https://github.com/peazip/peazip
Free Zip / Unzip software and Rar file extractor. Cross-platform file and archive manager. Features volume spanning, compression, authenticated encryption. Supports 7Z, 7-Zip sfx, ACE, ARJ, Brotli, BZ2, CAB, CHM, CPIO, DEB, GZ, ISO, JAR, LHA/LZH, NSIS, OOo, PAQ/LPAQ, PEA, QUAD, RAR, RPM, split, TAR, Z, ZIP, ZIPX, Zstandard.
archiver brotli compression encryption extractor paq peazip rar zip zstandard
Last synced: 06 Apr 2026
https://github.com/Bioruebe/UniExtract2
Universal Extractor 2 is a tool to extract files from any type of archive or installer.
decompression extractor file-analysis unpacker
Last synced: 28 Mar 2025
https://github.com/bioruebe/uniextract2
Universal Extractor 2 is a tool to extract files from any type of archive or installer.
decompression extractor file-analysis unpacker
Last synced: 30 Jan 2026
https://github.com/fhamborg/news-please
news-please - an integrated web crawler and information extractor for news that just works
cc-news ccnews commoncrawl crawler data-gathering elasticsearch extract-articles extract-information extractor json news news-archive news-articles news-crawler news-extractor news-scraper news-websites nlp python roberta
Last synced: 13 May 2025
https://github.com/teamnewpipe/newpipeextractor
NewPipe's core library for extracting data from streaming sites
bandcamp crawler extractor mediaccc newpipe peertube scraper soundcloud youtube
Last synced: 05 Feb 2026
https://github.com/TeamNewPipe/NewPipeExtractor
NewPipe's core library for extracting data from streaming sites
bandcamp crawler extractor mediaccc newpipe peertube scraper soundcloud youtube
Last synced: 19 Jul 2025
https://github.com/zelon88/hrconvert2
A self-hosted, drag-and-drop & nosql file conversion server & share tool that supports 445 file formats in 13 languages.
archiver conversion converter document-conversion extractor file-converter file-sharing format image multilingual ocr ocr-recognition pdf-converter php server virustotal
Last synced: 15 May 2026
https://github.com/zelon88/HRConvert2
A self-hosted, drag-and-drop & nosql file conversion server & share tool that supports 445 file formats in 13 languages.
archiver conversion converter document-conversion extractor file-converter file-sharing format image multilingual ocr ocr-recognition pdf-converter php server virustotal
Last synced: 28 Mar 2025
https://github.com/tatuylonen/wiktextract
Wiktionary dump file parser and multilingual data extractor
dictionary extractor lua multilingual parser scribunto templates wikitext wiktionary wiktionary-parser
Last synced: 14 May 2025
https://github.com/stangirard/seo-audits-toolkit
SEO & Security Audit for Websites. Lighthouse & Security Headers crawler, Sitemap/Keywords/Images Extractor, Summarizer, etc ...
analysis audits crawler dashboard extractor headers internal-links lighthouse link-extractor python securityheader seo seo-tools serp summarizer
Last synced: 04 Apr 2025
https://github.com/StanGirard/seo-audits-toolkit
SEO & Security Audit for Websites. Lighthouse & Security Headers crawler, Sitemap/Keywords/Images Extractor, Summarizer, etc ...
analysis audits crawler dashboard extractor headers internal-links lighthouse link-extractor python securityheader seo seo-tools serp summarizer
Last synced: 26 Mar 2025
https://github.com/AlexMathew/scrapple
A framework for creating semi-automatic web content extractors
beautifulsoup crawler css-selector extractor lxml python scrapers scraping scrapy selector selector-expression tutorial web-scraper web-scraping xpath-expression
Last synced: 29 Mar 2025
https://github.com/alexeichhorn/youtubekit
YouTube video and audio extractor for iOS, watchOS, visionOS, tvOS and macOS
audio downloader extractor ios native swift video youtube youtube-dl yt-dlp
Last synced: 01 Feb 2026
https://github.com/adoconnection/sevenzipextractor
C# wrapper for 7z.dll
7zip dotnet dotnet-core extractor
Last synced: 14 May 2025
https://github.com/alexeichhorn/YouTubeKit
YouTube video and audio extractor for iOS, watchOS, visionOS, tvOS and macOS
audio downloader extractor native swift video youtube youtube-dl
Last synced: 18 Jul 2025
https://github.com/opensemanticsearch/open-semantic-etl
Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & Named Entity Recognition) & data enrichment (annotation) pipelines & ingestor to Solr or Elastic search index & linked data graph database
annotation documents elasticsearch enrichment etl extract extract-information extract-text extractor ingest ingestion-pipeline ingests-documents named-entity-recognition nlp ocr pdf python rdf solr solr-dataimporter
Last synced: 06 Apr 2025
https://github.com/zhupingqi/RuiJi.Net
crawler framework, distributed crawler extractor
crawler extractor headless-chrome netcore owin scraper scrapy
Last synced: 04 May 2025
https://github.com/dwisiswant0/galer
A fast tool to fetch URLs from HTML attributes by crawl-in.
crawler devtool extractor galer go golang spider url-extractor url-parser waybackurls
Last synced: 12 Apr 2025
https://github.com/lipoja/URLExtract
URLExtract is python class for collecting (extracting) URLs from given text based on locating TLD.
extract extractor hacktoberfest urls
Last synced: 22 Mar 2025
https://github.com/lifenjoiner/ISx
ISx is an InstallShield installer extractor
extractor installshield unpacker windows
Last synced: 17 Apr 2026
https://github.com/microsoft/recursiveextractor
RecursiveExtractor is a .NET Standard 2.0 archive extraction Library, and Command Line Tool which can process 7zip, ar, bzip2, deb, gzip, iso, rar, tar, vhd, vhdx, vmdk, wim, xzip, and zip archives and any nested combination of the supported formats.
archive disc-image extractor nuget recursion
Last synced: 15 May 2025
https://github.com/Bioruebe/godotdec
An unpacker for Godot Engine package files (.pck)
extractor godot-engine unpacker
Last synced: 12 Mar 2026
https://github.com/bioruebe/godotdec
An unpacker for Godot Engine package files (.pck)
extractor godot-engine unpacker
Last synced: 11 May 2025
https://github.com/UE-Explorer/UE-Explorer
UnrealScript decompiler and explorer tool for Unreal Engine packages.
app decompiler exporter extractor help-wanted ue-explorer unreal unreal-engine unrealscript upk
Last synced: 11 Mar 2025
https://github.com/hxz393/brutalityextractor
适用于高性能系统的多进程解压缩软件(A multiprocess decompression software for high-performance system)
brute-force brute-force-attack brute-force-decompression brute-force-techniques computational-efficiency data-processing decompression efficient-compression-tool extractor high-performance high-speed-decompression optimization parallel-computing parallel-decompression parallel-optimization parallel-processing performance-enhancement performance-optimization performance-testing scalable
Last synced: 20 Aug 2025
https://github.com/gilbsgilbs/babel-plugin-i18next-extract
Babel plugin that statically extracts i18next and react-i18next translation keys.
babel-plugin extractor i18n i18next internationalization react react-i18next static-analysis
Last synced: 27 Jan 2026
https://github.com/gengteng/axum-valid
axum-valid is a library that provides data validation extractors for the Axum web framework. It integrates validator, garde and validify, three popular validation crates in the Rust ecosystem, to offer convenient validation and data handling extractors for Axum applications.
axum extractor garde validator validify
Last synced: 16 Dec 2025
https://github.com/morungos/node-word-extractor
Read data from a Word document using node.js
Last synced: 20 Jun 2025
https://github.com/lifenjoiner/isx
ISx is an InstallShield installer extractor
extractor installshield unpacker windows
Last synced: 21 Aug 2025
https://github.com/mefistotelis/pylabview
Python reader of LabVIEW RSRC files (VI, CTL, LLB). File format description on the Wiki.
extractor fileformat labview python3 reverse-engineering
Last synced: 08 Jun 2026
https://github.com/schollz/ingredients
Extract recipe ingredients from any recipe website on the internet.
cooking extractor ingredients recipes tagging
Last synced: 16 Apr 2026
https://github.com/astahmer/box-extractor
Static code extraction. Zero-runtime CSS-in-TS `<Box />` -> became a part of Panda CSS
ast box extractor sprinkles static typescript vanilla-extract vite
Last synced: 07 May 2025
https://github.com/cdimascio/essence
Automatically extract the main text content (and more) from an HTML document
extractor hacktoberfest html-extractor scraper web-content-extractor webpage-extractor website-extractor
Last synced: 14 Apr 2025
https://github.com/antonyhaman/youtube-jextractor
Android based library that allows you to download or play audio and video from Youtube, in other words - youtube-dl for android
android android-library audio extract-video extractor java muxed-streams stream streams videodetails youtube youtube-dl youtube-dl-android youtube-downloader youtube-extractor youtube-video youtube-videos youtubeextractor
Last synced: 28 Oct 2025
https://github.com/theLSA/burp-sensitive-param-extractor
burpsuite extension for check and extract sensitive request parameter
burp-plugin burpsuite checker extractor parameters sensitive
Last synced: 09 Aug 2025
https://github.com/extremecoders-re/nuitka-extractor
Tool to extract nuitka compiled executables
extractor nuitka python unpacker
Last synced: 17 Jan 2026
https://github.com/xypwn/filediver
Extract Helldivers 2's 3D models, audio, video, textures and more.
extractor game-files go golang helldivers-2 helldivers2 stingray stingray-engine
Last synced: 21 Apr 2026
https://github.com/lukasgeiter/gettext-extractor
A flexible and powerful Gettext message extractor with support for JavaScript, TypeScript, JSX and HTML.
extractor gettext i18n l10n po-files translation typescript
Last synced: 12 Apr 2025
https://github.com/ropensci-archive/monkeylearn
:no_entry: ARCHIVED :no_entry: Accesses the Monkeylearn API for Text Classifiers and Extractors
classifier extractor monkeylearn nlp nlp-machine-learning peer-reviewed r r-package rstats
Last synced: 14 Mar 2025
https://github.com/theLSA/burp-info-extractor
burpsuite extension for extract information from data
burp-plugin burpsuite extractor information
Last synced: 11 Jul 2025
https://github.com/maxton/GameArchives
A C# library for reading several video game archive formats, and a sample file explorer.
extractor guitar-hero iso pfs ps4 rock-band rock-band-vr stfs wii xbox
Last synced: 20 Feb 2026
https://github.com/WirelessAlien/ZipXtract
It can extract zip, tar, bz2, gz, 7z, xz, jar and z etc (encrypted .zip & .7z supported)
7z 7zip android apache-commons-compress bzip2 extractor jar kotlin tar unzipper xz z zip zip-extraction zip4j zipextractor
Last synced: 21 Apr 2025
https://github.com/little-data/gametoolkit
搜集游戏解包工具|Collection of game unpacking tools
3ds collections extractor game psp tools unpacking
Last synced: 09 Mar 2026
https://github.com/wuqiang1985/i18n-helper
国际化自动包裹 & 提取 & 翻译词条工具
extractor i18n translation wrapper
Last synced: 21 Feb 2026
https://github.com/OpenByteDev/SourceScraper
Simple library which helps you to retrieve the source of various video streaming sites.
extractor nodejs npm-package scraper scrapers scraping scraping-tool source-extraction
Last synced: 22 Mar 2025
https://github.com/openbytedev/sourcescraper
Simple library which helps you to retrieve the source of various video streaming sites.
extractor nodejs npm-package scraper scrapers scraping scraping-tool source-extraction
Last synced: 03 Oct 2025
https://github.com/zenghongtu/general-news-extractor-js
🤔一个新闻网页正文通用抽取器,包括标题、作者和日期。
Last synced: 10 Jul 2025
https://github.com/j0k3r/f43.me
A more readable & cleaner feed
extractor feed hacktoberfest php rss symfony symfony-application
Last synced: 01 Sep 2025
https://github.com/fab2s/yaetl
Yet Another ETL in PHP
etl extract-transform-load extractor flow joiner loader php php-etl qualifier transformer workflow
Last synced: 02 Aug 2025
https://github.com/akosbalasko/onyx-boox-annotation-highlight-extractor
This tool extracts the highlights and the annotations from OnyxBoox Reading Notes txt files, and converts them to linked zettelkasten literature and permanent notes
boox extractor obsidian-md onyx onyx-boox second-brain zettelkasten
Last synced: 25 Jun 2025
https://github.com/lguilhermee/EAC-Extractor-Utility
This tool Decrypt and Extract the files from the EAC
decrypt eac easy-anti-cheat extractor
Last synced: 12 Apr 2025
https://github.com/lucasayres/url-feature-extractor
Extracting features from URLs to build a data set for machine learning. The purpose is to find a machine learning model to predict phishing URLs, which are targeted to the Brazilian population.
benign blacklist dataset extractor host lexical machine-learning phishing phishtank python safebrowsing wot
Last synced: 16 May 2025
https://github.com/augustoproiete/exceldna-unpack
Command-line utility to extract the contents of Excel-DNA add-ins packed with ExcelDnaPack
decompile excel-dna exceldna extractor hacktoberfest packing resource unpacking
Last synced: 05 Apr 2025
https://github.com/inaridiy/webforai
The best HTML to Markdown library, A esm-native & Useful Utilities with simple, lightweight and epic quality.
article-extractor extractor html-to-markdown html2markdown html2md html2text readability scraping text-mining
Last synced: 06 Apr 2025
https://github.com/osh/gr-eventstream
gr-eventstream is a set of GNU Radio blocks for creating precisely timed events and either inserting them into, or extracting them from normal data-streams precisely. It allows for the definition of high speed time-synchronous c++ burst event handlers, as well as bridging to standard GNU Radio Async PDU messages with precise timing easily.
burst c-plus-plus event-handling extract-data extractor gnu-radio injection message-passing python radio signal-processing signaling-pathways synchronization synchronization-service synchronous timing-simulator
Last synced: 12 Apr 2025
https://github.com/n0samu/director-files-extract
This Python script extracts Macromedia / Adobe Director movies and casts from Windows and Mac executables.
extractor macromedia macromedia-director
Last synced: 11 Jul 2025
https://github.com/jingshing/ai-image-tag-extractor
A tool to help you get image info.
ai ai-image extractor imageai novelai python stable-diffusion tag tags tkinter tkinter-gui
Last synced: 21 Aug 2025
https://github.com/lifegpc/msg-tool
Tools for export and import scripts
bgi circus ethornell extract-text extractor galgame kirikiri
Last synced: 09 May 2026
https://github.com/JingShing/AI-image-tag-extractor
A tool to help you get image info.
ai ai-image extractor imageai novelai python stable-diffusion tag tags tkinter tkinter-gui
Last synced: 05 Apr 2025
https://github.com/bioruebe/cicdec
An unpacker for Clickteam Install Creator installers
Last synced: 10 Aug 2025
https://github.com/smartive/proc-that
proc(ess)-that - easy extendable ETL tool for Node.js. Written in TypeScript.
etl extractor loader node observables rxjs transformer typescript
Last synced: 06 Apr 2025
https://github.com/aboutcode-org/extractcode
A mostly universal file extraction library and CLI tool to extract almost any archive in a reasonably safe way on Linux, macOS and Windows.
7zip archive bzip2 cab cpio decompression extract extractor gzip iso9660 libarchive lzma tar xz zip zstd
Last synced: 27 Oct 2025
https://github.com/proxzima/darkspider
Anatomy and Visualization of the Network structure of the Dark web using multi-threaded crawler
collaborate crawler dark-web extractor github github-pages hacktoberfest networkx onion osint python scraper tor
Last synced: 14 Mar 2026
https://github.com/mermade/openapi-extract
Extract single paths/operations from OpenAPI definitions
extract extractor metadata microservice microservices openapi openapi3 swagger
Last synced: 18 Jul 2025
https://github.com/sec-ant/hard-sub-extractor
hardcoded subtitles extraction tool 硬字幕提取工具(非作者,备份用)
extractor harcoded hardsub ocr subtitles videosubfinder
Last synced: 11 Oct 2025
https://github.com/knownsec/minitools-bin_extractor
A simple script for quickly mining sensitive information in binary files.
Last synced: 11 Mar 2026
https://github.com/knownsec/Minitools-bin_extractor
A simple script for quickly mining sensitive information in binary files.
Last synced: 11 Jul 2025
https://github.com/gabrielelanzafamee/viral-clips-extractor
Extract the most viral clips from a random youtube video.
automation chatgpt clips extractor face-detection moviepy opencv python speech-to-text text-to-text
Last synced: 18 Jun 2025
https://github.com/forlos/akaibu
Visual novel archive extractor and resource converter
converter extractor gui reverse-engineering rust visual-novel visual-novels
Last synced: 26 Apr 2025
https://github.com/akascape/sprite-extractor-on-the-go
This is a GUI (Python based) program that can extract sprite sheet (which are in a bundle form) to separate images
extractor extractors gui gui-application plist plist-extract plist-files sprite sprite-animation sprite-editor sprite-extract sprite-extractor sprite-extractor-otg sprite-extractor-program sprite-plist-extractor spritesheet spritesheet-cutter spritesheet-generator spritesheet-packer spritesheet-unpacker
Last synced: 12 Apr 2025
https://github.com/dfaust/mkv-bash-tools
A collection of BASH scripts for processing mkv files
command-line-app dts extractor matroska mkv shell-script subtitles tools transcoder
Last synced: 08 Apr 2025
https://github.com/Mermade/openapi-extract
Extract single paths/operations from OpenAPI definitions
extract extractor metadata microservice microservices openapi openapi3 swagger
Last synced: 03 Apr 2025
https://github.com/p0dalirius/hashes-harvester
Automatically extracts NT and LM hashes from Windows memory dumps based on volatility.
extractor forensics hive lm nt volatility windows
Last synced: 03 Sep 2025
https://github.com/jooapa/Angry_Birds_Decompilation
decomp instructions/script for all 6 Angry Birds Games'
angry-bird-game angry-birds angrybirds decompilation decompiler decrypt decryption decryptor extractor fusion hack ka3d klinoff lua lua-script pig python python3 reverse-engineering
Last synced: 11 Mar 2026
https://github.com/thoughtworksinc/extractor.scala
Make PartialFunction and extractors composable
extractor extractor-objects partial-functions scala
Last synced: 30 Apr 2025
https://github.com/killo445/extract-xiso-gui
GUI version of extract-xiso
extract extract-xiso extractor gui windows xbox xiso
Last synced: 28 Apr 2025
https://github.com/jooapa/angry_birds_decompilation
decomp instructions/script for all 6 Angry Birds Games'
angry-bird-game angry-birds angrybirds decompilation decompiler decrypt decryption decryptor extractor fusion hack ka3d klinoff lua lua-script pig python python3 reverse-engineering
Last synced: 07 Jul 2025
https://github.com/simplrjs/ts-extractor
TypeScript AST extractor to useful JSON structure.
abstract-syntax-tree extractor json-structure typescript typescript-ast-extractor
Last synced: 14 Apr 2025
https://github.com/devmehq/extract-text
node.js module for extracting text from html, pdf, doc, docx, xls, xlsx, csv, pptx, png, jpg, gif, rtf and more!
extract-text extractor ocr pdf tessaract tesseract-ocr
Last synced: 17 Jan 2026
https://github.com/woojubb/html-article-extractor
A web page content extractor
article-extracting article-extractor crawler crawling extraction extractor
Last synced: 24 Dec 2025
https://github.com/ravbug/mcsoundsextractor
Extracts the audio files out of Minecraft Java Edition. Written in C++
audio boost cpp cross-platform extractor minecraft native wxwidgets
Last synced: 06 May 2025
https://github.com/helias/wowhead-php-extractors
Compare your database with WoWHead and find missing data
azerothcore extractor parser trinitycore world-of-warcraft wow wowhead
Last synced: 06 May 2025