An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with extractor

A curated list of projects in awesome lists tagged with extractor.

https://github.com/peazip/PeaZip

Free Zip / Unzip software and Rar file extractor. Cross-platform file and archive manager. Features volume spanning, compression, authenticated encryption. Supports 7Z, 7-Zip sfx, ACE, ARJ, Brotli, BZ2, CAB, CHM, CPIO, DEB, GZ, ISO, JAR, LHA/LZH, NSIS, OOo, PAQ/LPAQ, PEA, QUAD, RAR, RPM, split, TAR, Z, ZIP, ZIPX, Zstandard.

archiver brotli compression encryption extractor paq peazip rar zip zstandard

Last synced: 30 Mar 2025

https://github.com/peazip/peazip

Free Zip / Unzip software and Rar file extractor. Cross-platform file and archive manager. Features volume spanning, compression, authenticated encryption. Supports 7Z, 7-Zip sfx, ACE, ARJ, Brotli, BZ2, CAB, CHM, CPIO, DEB, GZ, ISO, JAR, LHA/LZH, NSIS, OOo, PAQ/LPAQ, PEA, QUAD, RAR, RPM, split, TAR, Z, ZIP, ZIPX, Zstandard.

archiver brotli compression encryption extractor paq peazip rar zip zstandard

Last synced: 06 Apr 2026

https://github.com/Bioruebe/UniExtract2

Universal Extractor 2 is a tool to extract files from any type of archive or installer.

decompression extractor file-analysis unpacker

Last synced: 28 Mar 2025

https://github.com/bioruebe/uniextract2

Universal Extractor 2 is a tool to extract files from any type of archive or installer.

decompression extractor file-analysis unpacker

Last synced: 30 Jan 2026

https://github.com/teamnewpipe/newpipeextractor

NewPipe's core library for extracting data from streaming sites

bandcamp crawler extractor mediaccc newpipe peertube scraper soundcloud youtube

Last synced: 05 Feb 2026

https://github.com/TeamNewPipe/NewPipeExtractor

NewPipe's core library for extracting data from streaming sites

bandcamp crawler extractor mediaccc newpipe peertube scraper soundcloud youtube

Last synced: 19 Jul 2025

https://github.com/zelon88/hrconvert2

A self-hosted, drag-and-drop & nosql file conversion server & share tool that supports 445 file formats in 13 languages.

archiver conversion converter document-conversion extractor file-converter file-sharing format image multilingual ocr ocr-recognition pdf-converter php server virustotal

Last synced: 15 May 2026

https://github.com/zelon88/HRConvert2

A self-hosted, drag-and-drop & nosql file conversion server & share tool that supports 445 file formats in 13 languages.

archiver conversion converter document-conversion extractor file-converter file-sharing format image multilingual ocr ocr-recognition pdf-converter php server virustotal

Last synced: 28 Mar 2025

https://github.com/tatuylonen/wiktextract

Wiktionary dump file parser and multilingual data extractor

dictionary extractor lua multilingual parser scribunto templates wikitext wiktionary wiktionary-parser

Last synced: 14 May 2025

https://github.com/stangirard/seo-audits-toolkit

SEO & Security Audit for Websites. Lighthouse & Security Headers crawler, Sitemap/Keywords/Images Extractor, Summarizer, etc.

analysis audits crawler dashboard extractor headers internal-links lighthouse link-extractor python securityheader seo seo-tools serp summarizer

Last synced: 04 Apr 2025

https://github.com/StanGirard/seo-audits-toolkit

SEO & Security Audit for Websites. Lighthouse & Security Headers crawler, Sitemap/Keywords/Images Extractor, Summarizer, etc.

analysis audits crawler dashboard extractor headers internal-links lighthouse link-extractor python securityheader seo seo-tools serp summarizer

Last synced: 26 Mar 2025

https://github.com/pcjbird/assetsextractor

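"Assets Extractor" is a developer tool for OSX that re-extracts PNG images, PDFs, and other resources packed into Assets.car or xxx.app. Assets.car is commonly found in resource packaging for iOS/Mac/Unity development.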

app assets car extractor ios mac osx resource unity

Last synced: 05 Apr 2025

https://github.com/alexcmgit/kanade

Android app to extract apks from installed apps.

android apk app extractor flutter kanade

Last synced: 12 Apr 2025

https://github.com/mikemeliz/torcrawl.py

Crawl and extract (regular or onion) webpages through TOR network

crawler extractor onion osint python tor

Last synced: 14 Jan 2026

https://github.com/tobyxdd/android-ota-payload-extractor

A fast & natively cross-platform Android OTA payload extractor written in Go

android extractor magisk ota payload root

Last synced: 07 Apr 2025

https://github.com/alexeichhorn/youtubekit

YouTube video and audio extractor for iOS, watchOS, visionOS, tvOS and macOS

audio downloader extractor ios native swift video youtube youtube-dl yt-dlp

Last synced: 01 Feb 2026

https://github.com/MikeMeliz/TorCrawl.py

Crawl and extract (regular or onion) webpages through TOR network

crawler extractor onion osint python tor

Last synced: 07 Apr 2025

https://github.com/alexeichhorn/YouTubeKit

YouTube video and audio extractor for iOS, watchOS, visionOS, tvOS and macOS

audio downloader extractor native swift video youtube youtube-dl

Last synced: 18 Jul 2025

https://github.com/opensemanticsearch/open-semantic-etl

Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & Named Entity Recognition) & data enrichment (annotation) pipelines & ingestor to Solr or Elastic search index & linked data graph database

annotation documents elasticsearch enrichment etl extract extract-information extract-text extractor ingest ingestion-pipeline ingests-documents named-entity-recognition nlp ocr pdf python rdf solr solr-dataimporter

Last synced: 06 Apr 2025

https://github.com/zhupingqi/RuiJi.Net

Crawler framework and distributed crawler extractor

crawler extractor headless-chrome netcore owin scraper scrapy

Last synced: 04 May 2025

https://github.com/dwisiswant0/galer

A fast tool to fetch URLs from HTML attributes by crawl-in.

crawler devtool extractor galer go golang spider url-extractor url-parser waybackurls

Last synced: 12 Apr 2025

https://github.com/lipoja/URLExtract

URLExtract is a Python class for collecting (extracting) URLs from given text based on locating TLDs.

extract extractor hacktoberfest urls

Last synced: 22 Mar 2025

https://github.com/lifenjoiner/ISx

ISx is an InstallShield installer extractor

extractor installshield unpacker windows

Last synced: 17 Apr 2026

https://github.com/microsoft/recursiveextractor

RecursiveExtractor is a .NET Standard 2.0 archive extraction library and command-line tool that can process 7zip, ar, bzip2, deb, gzip, iso, rar, tar, vhd, vhdx, vmdk, wim, xzip, and zip archives, and any nested combination of the supported formats.

archive disc-image extractor nuget recursion

Last synced: 15 May 2025

https://github.com/Bioruebe/godotdec

An unpacker for Godot Engine package files (.pck)

extractor godot-engine unpacker

Last synced: 12 Mar 2026

https://github.com/bioruebe/godotdec

An unpacker for Godot Engine package files (.pck)

extractor godot-engine unpacker

Last synced: 11 May 2025

https://github.com/UE-Explorer/UE-Explorer

UnrealScript decompiler and explorer tool for Unreal Engine packages.

app decompiler exporter extractor help-wanted ue-explorer unreal unreal-engine unrealscript upk

Last synced: 11 Mar 2025

https://github.com/crazy-max/undock

Extract contents of a container image in a local folder

cli container docker extractor oci

Last synced: 03 Apr 2025

https://github.com/gilbsgilbs/babel-plugin-i18next-extract

Babel plugin that statically extracts i18next and react-i18next translation keys.

babel-plugin extractor i18n i18next internationalization react react-i18next static-analysis

Last synced: 27 Jan 2026

https://github.com/vgmoose/OpenBackupExtractor

A free program for extracting data (like voicemails) from iPhone and iPad backups.

backup extractor ios iphone voicemail

Last synced: 04 May 2025

https://github.com/gengteng/axum-valid

axum-valid is a library that provides data validation extractors for the Axum web framework. It integrates validator, garde and validify, three popular validation crates in the Rust ecosystem, to offer convenient validation and data handling extractors for Axum applications.

axum extractor garde validator validify

Last synced: 16 Dec 2025

https://github.com/vgmoose/openbackupextractor

A free program for extracting data (like voicemails) from iPhone and iPad backups.

backup extractor ios iphone voicemail

Last synced: 15 Apr 2025

https://github.com/morungos/node-word-extractor

Read data from a Word document using node.js

extractor msword

Last synced: 20 Jun 2025

https://github.com/lifenjoiner/isx

ISx is an InstallShield installer extractor

extractor installshield unpacker windows

Last synced: 21 Aug 2025

https://github.com/mefistotelis/pylabview

Python reader of LabVIEW RSRC files (VI, CTL, LLB). File format description on the Wiki.

extractor fileformat labview python3 reverse-engineering

Last synced: 08 Jun 2026

https://github.com/schollz/ingredients

Extract recipe ingredients from any recipe website on the internet.

cooking extractor ingredients recipes tagging

Last synced: 16 Apr 2026

https://github.com/astahmer/box-extractor

Static code extraction. Zero-runtime CSS-in-TS `<Box />`. It has since become part of Panda CSS.

ast box extractor sprinkles static typescript vanilla-extract vite

Last synced: 07 May 2025

https://github.com/bpolaszek/bentools-etl

PHP ETL (Extract / Transform / Load) library with SOLID principles + almost no dependency.

callable etl export extract extractor import input invoke load loader loop output pattern php transform transformer

Last synced: 06 Apr 2025

https://github.com/cdimascio/essence

Automatically extract the main text content (and more) from an HTML document

extractor hacktoberfest html-extractor scraper web-content-extractor webpage-extractor website-extractor

Last synced: 14 Apr 2025

https://github.com/antonyhaman/youtube-jextractor

Android-based library that lets you download or play audio and video from YouTube; in other words, youtube-dl for Android

android android-library audio extract-video extractor java muxed-streams stream streams videodetails youtube youtube-dl youtube-dl-android youtube-downloader youtube-extractor youtube-video youtube-videos youtubeextractor

Last synced: 28 Oct 2025

https://github.com/theLSA/burp-sensitive-param-extractor

Burp Suite extension for checking and extracting sensitive request parameters

burp-plugin burpsuite checker extractor parameters sensitive

Last synced: 09 Aug 2025

https://github.com/extremecoders-re/nuitka-extractor

Tool to extract Nuitka-compiled executables

extractor nuitka python unpacker

Last synced: 17 Jan 2026

https://github.com/xypwn/filediver

Extract Helldivers 2's 3D models, audio, video, textures and more.

extractor game-files go golang helldivers-2 helldivers2 stingray stingray-engine

Last synced: 21 Apr 2026

https://github.com/lukasgeiter/gettext-extractor

A flexible and powerful Gettext message extractor with support for JavaScript, TypeScript, JSX and HTML.

extractor gettext i18n l10n po-files translation typescript

Last synced: 12 Apr 2025

https://github.com/ropensci-archive/monkeylearn

ARCHIVED: Accesses the Monkeylearn API for Text Classifiers and Extractors

classifier extractor monkeylearn nlp nlp-machine-learning peer-reviewed r r-package rstats

Last synced: 14 Mar 2025

https://github.com/theLSA/burp-info-extractor

Burp Suite extension for extracting information from data

burp-plugin burpsuite extractor information

Last synced: 11 Jul 2025

https://github.com/vegidio/umd

A desktop app to easily download media files hosted on popular websites.

bunkr coomer cyberdrop downloader erome extractor fapello go imaglr jpgfish kemono reddit redgifs saint scraper simpcity

Last synced: 25 Apr 2026

https://github.com/maxton/GameArchives

A C# library for reading several video game archive formats, and a sample file explorer.

extractor guitar-hero iso pfs ps4 rock-band rock-band-vr stfs wii xbox

Last synced: 20 Feb 2026

https://github.com/WirelessAlien/ZipXtract

Extracts zip, tar, bz2, gz, 7z, xz, jar, z, and other archives (encrypted .zip and .7z supported)

7z 7zip android apache-commons-compress bzip2 extractor jar kotlin tar unzipper xz z zip zip-extraction zip4j zipextractor

Last synced: 21 Apr 2025

https://github.com/little-data/gametoolkit

Collection of game unpacking tools

3ds collections extractor game psp tools unpacking

Last synced: 09 Mar 2026

https://github.com/wuqiang1985/i18n-helper

A tool for automatically wrapping, extracting, and translating internationalization (i18n) entries

extractor i18n translation wrapper

Last synced: 21 Feb 2026

https://github.com/OpenByteDev/SourceScraper

Simple library which helps you to retrieve the source of various video streaming sites.

extractor nodejs npm-package scraper scrapers scraping scraping-tool source-extraction

Last synced: 22 Mar 2025

https://github.com/openbytedev/sourcescraper

Simple library which helps you to retrieve the source of various video streaming sites.

extractor nodejs npm-package scraper scrapers scraping scraping-tool source-extraction

Last synced: 03 Oct 2025

https://github.com/zenghongtu/general-news-extractor-js

🤔 A general-purpose extractor for the main text of news web pages, including title, author, and date.

extractor news readability

Last synced: 10 Jul 2025

https://github.com/j0k3r/f43.me

A more readable & cleaner feed

extractor feed hacktoberfest php rss symfony symfony-application

Last synced: 01 Sep 2025

https://github.com/akosbalasko/onyx-boox-annotation-highlight-extractor

This tool extracts the highlights and the annotations from OnyxBoox Reading Notes txt files, and converts them to linked zettelkasten literature and permanent notes

boox extractor obsidian-md onyx onyx-boox second-brain zettelkasten

Last synced: 25 Jun 2025

https://github.com/lguilhermee/EAC-Extractor-Utility

This tool decrypts and extracts the files from the EAC

decrypt eac easy-anti-cheat extractor

Last synced: 12 Apr 2025

https://github.com/lucasayres/url-feature-extractor

Extracts features from URLs to build a data set for machine learning. The purpose is to find a machine learning model that predicts phishing URLs targeted at the Brazilian population.

benign blacklist dataset extractor host lexical machine-learning phishing phishtank python safebrowsing wot

Last synced: 16 May 2025

https://github.com/augustoproiete/exceldna-unpack

Command-line utility to extract the contents of Excel-DNA add-ins packed with ExcelDnaPack

decompile excel-dna exceldna extractor hacktoberfest packing resource unpacking

Last synced: 05 Apr 2025

https://github.com/inaridiy/webforai

The best HTML to Markdown library: ESM-native, with useful utilities, simple, lightweight, and high quality.

article-extractor extractor html-to-markdown html2markdown html2md html2text readability scraping text-mining

Last synced: 06 Apr 2025

https://github.com/osh/gr-eventstream

gr-eventstream is a set of GNU Radio blocks for creating precisely timed events and either inserting them into, or extracting them from, normal data streams. It allows the definition of high-speed, time-synchronous C++ burst event handlers, and bridges to standard GNU Radio Async PDU messages with precise timing.

burst c-plus-plus event-handling extract-data extractor gnu-radio injection message-passing python radio signal-processing signaling-pathways synchronization synchronization-service synchronous timing-simulator

Last synced: 12 Apr 2025

https://github.com/n0samu/director-files-extract

This Python script extracts Macromedia / Adobe Director movies and casts from Windows and Mac executables.

extractor macromedia macromedia-director

Last synced: 11 Jul 2025

https://github.com/lifegpc/msg-tool

Tools for exporting and importing scripts

bgi circus ethornell extract-text extractor galgame kirikiri

Last synced: 09 May 2026

https://github.com/bioruebe/cicdec

An unpacker for Clickteam Install Creator installers

extractor unpacker

Last synced: 10 Aug 2025

https://github.com/smartive/proc-that

proc(ess)-that - easily extensible ETL tool for Node.js. Written in TypeScript.

etl extractor loader node observables rxjs transformer typescript

Last synced: 06 Apr 2025

https://github.com/tatokis/qresextract

Qt binary resource (qres) extractor

extract extractor qres qt5 resource

Last synced: 14 Apr 2025

https://github.com/brosck/apihookingdetector

「⚙️」Detect which native Windows APIs (NtAPI) are being hooked

api detect detector extract extractor hooking native native-api nt ntapi windows

Last synced: 06 May 2025

https://github.com/velocityzen/meta-extractor

Super simple and fast HTML page metadata extractor with a low memory footprint

atom extractor feed html meta metadata nodejs opengraph parser rss

Last synced: 22 Mar 2025

https://github.com/aboutcode-org/extractcode

A mostly universal file extraction library and CLI tool to extract almost any archive in a reasonably safe way on Linux, macOS and Windows.

7zip archive bzip2 cab cpio decompression extract extractor gzip iso9660 libarchive lzma tar xz zip zstd

Last synced: 27 Oct 2025

https://github.com/proxzima/darkspider

Anatomy and visualization of the network structure of the dark web using a multi-threaded crawler

collaborate crawler dark-web extractor github github-pages hacktoberfest networkx onion osint python scraper tor

Last synced: 14 Mar 2026

https://github.com/mermade/openapi-extract

Extract single paths/operations from OpenAPI definitions

extract extractor metadata microservice microservices openapi openapi3 swagger

Last synced: 18 Jul 2025

https://github.com/sec-ant/hard-sub-extractor

Hardcoded subtitle extraction tool (not the author; kept as a backup)

extractor harcoded hardsub ocr subtitles videosubfinder

Last synced: 11 Oct 2025

https://github.com/knownsec/minitools-bin_extractor

A simple script for quickly mining sensitive information in binary files.

bin extractor python

Last synced: 11 Mar 2026

https://github.com/knownsec/Minitools-bin_extractor

A simple script for quickly mining sensitive information in binary files.

bin extractor python

Last synced: 11 Jul 2025

https://github.com/forlos/akaibu

Visual novel archive extractor and resource converter

converter extractor gui reverse-engineering rust visual-novel visual-novels

Last synced: 26 Apr 2025

https://github.com/dfaust/mkv-bash-tools

A collection of BASH scripts for processing mkv files

command-line-app dts extractor matroska mkv shell-script subtitles tools transcoder

Last synced: 08 Apr 2025

https://github.com/tonylianlong/wechatstickerextractor

WeChat sticker extraction tool for Mac; extracts animated stickers as GIF images.

dynamic extractor gif sticker wechat

Last synced: 15 Sep 2025

https://github.com/Mermade/openapi-extract

Extract single paths/operations from OpenAPI definitions

extract extractor metadata microservice microservices openapi openapi3 swagger

Last synced: 03 Apr 2025

https://github.com/p0dalirius/hashes-harvester

Automatically extracts NT and LM hashes from Windows memory dumps based on volatility.

extractor forensics hive lm nt volatility windows

Last synced: 03 Sep 2025

https://github.com/thoughtworksinc/extractor.scala

Make PartialFunction and extractors composable

extractor extractor-objects partial-functions scala

Last synced: 30 Apr 2025

https://github.com/killo445/extract-xiso-gui

GUI version of extract-xiso

extract extract-xiso extractor gui windows xbox xiso

Last synced: 28 Apr 2025

https://github.com/simplrjs/ts-extractor

Extracts the TypeScript AST into a useful JSON structure.

abstract-syntax-tree extractor json-structure typescript typescript-ast-extractor

Last synced: 14 Apr 2025

https://github.com/devmehq/extract-text

node.js module for extracting text from html, pdf, doc, docx, xls, xlsx, csv, pptx, png, jpg, gif, rtf and more!

extract-text extractor ocr pdf tessaract tesseract-ocr

Last synced: 17 Jan 2026

https://github.com/ravbug/mcsoundsextractor

Extracts the audio files out of Minecraft Java Edition. Written in C++

audio boost cpp cross-platform extractor minecraft native wxwidgets

Last synced: 06 May 2025

https://github.com/helias/wowhead-php-extractors

Compare your database with WoWHead and find missing data

azerothcore extractor parser trinitycore world-of-warcraft wow wowhead

Last synced: 06 May 2025