Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with metadata-extraction

A curated list of projects in awesome lists tagged with metadata-extraction.

https://github.com/tern-tools/tern

Tern is a software composition analysis tool and Python library that generates a Software Bill of Materials for container images and Dockerfiles. The SBOM that Tern generates will give you a layer-by-layer view of what's inside your container in a variety of formats including human-readable, JSON, HTML, SPDX and more.

compliance containers dependencies docker metadata-extraction open-source oss-compliance python risk-management sbom software-composition-analysis spdx supply-chain-security tool

Last synced: 27 Dec 2024

https://github.com/deepjyoti30/ytmdl-web-v2

Web version of ytmdl. Allows downloading songs with embedded metadata from various sources such as iTunes, Gaana, LastFM, etc.

audio audio-extraction download free-music free-music-download freemusic freesoftware high-quality-music itunes metadata metadata-extraction music-download no-ads online-music songs spotify webapp youtube ytmdl

Last synced: 23 Dec 2024

https://github.com/m8sec/pymeta

Utility to download and extract document metadata from an organization. This technique can be used to identify domains, usernames, software/version numbers, and naming conventions.

extract-metadata information-disclosure metadata metadata-extraction pentest pentest-tool python3

Last synced: 27 Dec 2024

https://github.com/aydinnyunus/exiflooter

ExifLooter finds geolocation data in all image URLs and directories, and also integrates with OpenStreetMap.

bug-bounty bugbounty cyber-security exif exif-metadata exiftool golang hack hacking image metadata metadata-extraction osint redteam security

Last synced: 24 Dec 2024

https://github.com/aydinnyunus/exifLooter

ExifLooter finds geolocation data in all image URLs and directories, and also integrates with OpenStreetMap.

bug-bounty bugbounty cyber-security exif exif-metadata exiftool golang hack hacking image metadata metadata-extraction osint redteam security

Last synced: 05 Nov 2024

https://github.com/adultmm/AdultMediaManager

Adult Media Manager is the ultimate media manager for your adult movies and videos. Organize your content for Kodi, Plex, and other media centers.

adult-contents media-manager metadata-extraction

Last synced: 03 Nov 2024

https://github.com/wikimedia/html-metadata

HTML metadata scraper and parser for Node.js (supports Promises and callback style)

javascript metadata-extraction metadata-extractor node-module nodejs web-scraper web-scraping

Last synced: 28 Dec 2024

https://github.com/betahuhn/metadata-scraper

🏷️ A JavaScript library for scraping/parsing metadata from a web page.

html-scraper javascript-library meta-tags metadata metadata-extraction metatags open-graph page parser typescript

Last synced: 29 Dec 2024

https://github.com/BetaHuhn/metadata-scraper

🏷️ A JavaScript library for scraping/parsing metadata from a web page.

html-scraper javascript-library meta-tags metadata metadata-extraction metatags open-graph page parser typescript

Last synced: 30 Oct 2024

https://github.com/outerport/paramit

Parameterize Python scripts/notebooks all from the command line and run on cloud GPUs

ai aiops config experiment-tracking machine-learning metadata metadata-extraction metadata-tracking ml mlops python pytorch

Last synced: 24 Dec 2024

https://github.com/vvzen/parse-exr-header

Pure Python (no additional dependencies) helper module to read metadata from the header of OpenEXR files.

metadata-extraction openexr python python27 python3 vfx

Last synced: 22 Dec 2024

https://github.com/d3x-at/sd-parsers

A Python library to read metadata from images created by Stable Diffusion.

automatic1111 comfyui fooocus invokeai metadata-extraction novelai parse parser stable-diffusion stable-diffusion-library

Last synced: 26 Dec 2024

https://github.com/KGCP/MEL-TNNT

Metadata Extractor & Loader (MEL) ■ The NLP-NER Toolkit (TNNT)

metadata-extraction named-entity-recognition natural-language-processing nlp nlp-ner pipeline

Last synced: 17 Nov 2024

https://github.com/soypat/exif

Dead simple exchangeable image file format tools for Go optimized for large image files using lazy loading.

exif exiftool go golang hyperspectral image jpeg large-files metadata metadata-extraction tiff

Last synced: 28 Oct 2024

https://github.com/hfiguiere/exempi

Exempi: XMP SDK (freedesktop mirror)

metadata metadata-extraction xmp

Last synced: 28 Oct 2024

https://github.com/corentinb/youtube-ma

💾 YouTube video metadata archiver written in Golang

archiver downloader go golang metadata metadata-extraction scraper youtube

Last synced: 17 Nov 2024

https://github.com/CorentinB/YouTube-MA

💾 YouTube video metadata archiver written in Golang

archiver downloader go golang metadata metadata-extraction scraper youtube

Last synced: 16 Nov 2024

https://github.com/multimc/meta

Repository for Minecraft package metadata update scripts

liteloader lwjgl metadata metadata-extraction minecraft minecraft-forge multimc package-json quiltmc

Last synced: 07 Nov 2024

https://github.com/lttkgp/metadata-extractor

Extract song metadata from YouTube links with Spotify API

hacktoberfest metadata-extraction spotify-api youtube

Last synced: 11 Oct 2024

https://github.com/liyasthomas/metadata

🌌 File metadata viewer - Built with Electron

electron electron-app liyasthomas metadata metadata-extraction

Last synced: 02 Dec 2024

https://github.com/darenasc/aeda

Build a data catalog by running a single line of code

data-catalog data-exploration database eda metadata metadata-extraction

Last synced: 27 Oct 2024

https://github.com/mostlygeek/go-exiftool

Go bindings for exiftool

exiftool golang media metadata-extraction

Last synced: 01 Nov 2024

https://github.com/gomoob/php-metadata-extractor

PHP wrapper to easily call the Java metadata-extractor library.

gomoob metadata-extraction metadata-extractor php

Last synced: 06 Nov 2024

https://github.com/Visio-projects/Visio-Shape-Extract

⤵️ This VSTO Add-In extracts the shape attributes' text from a Microsoft Visio file. It is written as VSTO in C#/VB.NET and VBA.

addin entity-attribute extract metadata-extraction ribbon shape vba visio visio-vsto-addin visual-studio-2015 visual-studio-2017 visual-studio-community visualbasic vsto

Last synced: 13 Nov 2024

https://github.com/oduwsdl/aiu

A library for interacting with web archive collections at Archive-It, Trove, Pandora, and more.

archiveit metadata metadata-extraction webarchives

Last synced: 08 Nov 2024

https://github.com/a-gubskiy/x.web.metaextractor

Library that allows extracting meta information from a page

dncuug extract-meta-information metadata metadata-extraction net-core open-graph web

Last synced: 08 Nov 2024

https://github.com/truethari/infomedia

Python application that can be used to retrieve media file information such as duration, frame rate, bit rate, etc.

audio-processing ffmpeg ffprobe metadata-extraction music video video-processing

Last synced: 10 Nov 2024

https://github.com/eellak/gsoc2018-gg-extraction

NER & Metadata Extraction of the Greek Government Gazette

machine-learning metadata-extraction ner-annotation nlp python

Last synced: 08 Nov 2024

https://github.com/transitive-bullshit/p-exiftool

Wrapper around exiftool for reading metadata from many different file types.

exiftool metadata metadata-extraction

Last synced: 24 Oct 2024

https://github.com/jpastorm/gotube

A YouTube library for retrieving metadata, obtaining direct links to video-only/audio-only/mixed versions of videos, and downloading videos.

download go golang library metadata-extraction url video youtube youtube-downloader

Last synced: 28 Dec 2024

https://github.com/dubyadude/metamaster

View metadata of files using Windows Context Menu.

camera-raw dotnet exif icc iptc jfif metadata metadata-extraction metadata-extractor png webp windows xmpp

Last synced: 10 Oct 2024

https://github.com/audiomuze/tagminder

Import, maintain and export tag metadata to/from audio files and a dynamically created SQLite table. Automates incremental tag cleanup, enrichment and standardisation for your digital audio library at scale using pre-scripted SQL queries, achieving quality and consistency throughout your music collection in a manner not possible with a tagger.

audio-metadata data-enrichment data-wrangling flac metadata-editing metadata-extraction music-library music-metadata music-tagging musicbrainz rym-capitalisation sqlite3

Last synced: 22 Nov 2024

https://github.com/mesmacosta/hive-custom-hook

Example of how to implement a Hive hook

hadoop hive hive-hook java metadata-extraction

Last synced: 11 Nov 2024

https://github.com/sdsc-ordes/gimie

Extract linked metadata from repositories

cli fair-data git library linked-open-data metadata-extraction scientific-software

Last synced: 07 Nov 2024

https://github.com/bagaturchess/visualbytecode

GUI which visualizes the metadata information encoded into the Java bytecode for a given set of Java classes. Especially designed to search for Java Annotations, for example in your Spring Boot project.

annotation-processing annotation-processor annotation-tool annotations bytecode java javaee javase metadata-extraction spring spring-boot

Last synced: 22 Nov 2024

https://github.com/leroyanders/acrticle-scrapper

This Python-based repository hosts a sophisticated service designed for scraping web articles and converting them into Markdown format. The core functionality of this service includes extracting the main content of articles, such as headlines, key paragraphs, and associated images, and then seamlessly transforming this content into well-structured…

article-parser content-creation-tools content-extraction data-archiving html-to-markdown-converter image-downloading markdown-conversion metadata-extraction python web-scraping

Last synced: 27 Oct 2024

https://github.com/mariocesar/htmlmetadata

Extract metadata from html pages using Open Graph metadata, HTML metadata, and a series of fallbacks

metadata-extraction metatags python-module python3 schema-org

Last synced: 15 Oct 2024

https://github.com/hfiguiere/exempi-rs

Rust bindings for Exempi

metadata metadata-extraction rust rust-crate xmp

Last synced: 12 Oct 2024

https://github.com/delucis/whorl

✏️❔ Find out who the author(s) is/are from an input URL

author-identification metadata-extraction scraper

Last synced: 20 Dec 2024

https://github.com/qwazr/extractor

A WEB API for text and meta-data extraction

extract-data extractor metadata-extraction parse parser-auto-detection

Last synced: 04 Nov 2024

https://github.com/jadedctrl/songattr

Fill Haiku's Audio attributes with ffmpeg. (🪞)

extended-attributes haiku metadata-extraction music-library music-metadata xattr xattrs

Last synced: 19 Nov 2024

https://github.com/lucksi/mycroft

Mycroft is an Offline File Analyzer and Metadata Scraper

file-analyzer hash hashing metadata metadata-extraction python zip-extraction zip-files

Last synced: 10 Nov 2024

https://github.com/cybercentrecanada/assemblyline-service-characterize

Assemblyline 4 metadata extraction and entropy calculation plugin

assemblyline entropy lnk malware-analysis metadata-extraction shortcuts

Last synced: 11 Nov 2024

https://github.com/jeankassio/sioner-metadata-extractor

Sioner Metadata Extractor uses Chromedriver to extract metadata from websites with JavaScript using Symfony/Panther.

extractor extractors metadata metadata-extraction metadata-extractor panther panther-web scraper scraping scraping-data scraping-web scraping-websites synfony

Last synced: 17 Nov 2024

https://github.com/justingosses/nasa-repos-visual-explorer

Exploration of NASA code across multiple orgs on github.com using the awesome-list-visual-explorer-template

github-api metadata metadata-extraction nasa nasa-code software-portal visual-explorer

Last synced: 17 Nov 2024

https://github.com/kareem-emad/youtube_metadata_scraper

An expansion of the YouTube-8M dataset that gathers more data about the videos, such as likes/views and channel info, by scraping YouTube

dataset-generation metadata-extraction python scraping scraping-tool tensorflow2 youtube youtube8m

Last synced: 18 Nov 2024

https://github.com/tiegertropfen119-0001/youtube-video-downloader--yourtube

🎥📥🎵 YourTube is the ultimate video downloader, supporting MP3 and MP4 downloads from YouTube playlists. With download support for resolutions from 144p to 4k, YourTube also saves video titles and IDs, metadata, and thumbnails. Plus, the geo-bypass feature ensures you can download videos from any location. Download YourTube and start building you

4k-video-downloader download-videos downloader easy-to-use fast fast-download-speed geobypass gui metadata-extraction mp3-converter mp4-downloader playlist playlist-downloader thumbnail youtube youtube-api youtube-dl youtube-downloader

Last synced: 14 Nov 2024

https://github.com/ellman12/datetakenextractor

Small, fast, simple library for reading and writing Date Taken metadata for photos and videos.

csharp date-taken datetaken datetakenextractor datetime datetime-library dotnet metadata metadata-extraction metadata-extractor nuget

Last synced: 16 Nov 2024

https://github.com/jmousqueton/metalookup

Extract metadata from various file formats including PDFs, images (PNG, JPEG, TIFF, BMP, GIF), and Office documents (DOCX, XLSX, PPTX).

image metadata metadata-extraction office365 osint osint-python osint-tool pdf python video

Last synced: 10 Nov 2024

https://github.com/leocode/image-service

Containerized media processor that can manipulate images/videos or extract metadata

docker-image image-processing metadata-extraction video-processing

Last synced: 22 Dec 2024

https://github.com/maastrichtu-ids/sparql-profiler

✨ A package to profile SPARQL endpoints to extract the nodes and relations represented in the knowledge graph

metadata-extraction sparql sparql-endpoints sparql-query

Last synced: 21 Dec 2024

https://github.com/gyk4j/wreck

Java Swing GUI tool to correct Windows NTFS file system timestamp attributes from embedded metadata or naming convention

creation-time file-time metadata metadata-extraction metadata-management

Last synced: 02 Dec 2024

https://github.com/gyk4j/wreck-net

A C#.NET WinForms GUI and CLI console tool to correct Windows NTFS file system timestamp attributes from embedded metadata or naming convention

7-zip creation-time exif exif-metadata exiftool file-time last-modified mediainfo metadata metadata-extraction metadata-management

Last synced: 02 Dec 2024

https://github.com/sammarks/cloudformation-audio-metadata

Generates metadata (currently just duration) information for an audio file uploaded to an S3 bucket.

audio aws aws-lambda cloudformation cloudformation-template metadata-extraction sam serverless serverless-application-model

Last synced: 16 Dec 2024

https://github.com/sandrain/hpssix

A metadata indexing framework for large-scale storage systems, such as HPSS archival storage system.

hpss indexing metadata-extraction metadata-management

Last synced: 11 Nov 2024

https://github.com/taha-parsayan/xml-metadata-extractor

Extracting metadata related to MRI and PET images from XML files

brain metadata metadata-extraction mri pet xml xmlfile

Last synced: 02 Dec 2024

https://github.com/shintales/zid3

Simple ID3v2 metadata reader

id3 id3-reader id3v2 metadata metadata-extraction mp3-tags zig

Last synced: 24 Oct 2024

https://github.com/metalcorebear/arxiv-parser

A set of scripts for parsing scientific articles from arXiv.

arxiv-api metadata-extraction scientific-papers scientific-publications scientific-research

Last synced: 26 Dec 2024

https://github.com/sdsc-ordes/gimie-api

A containerized service providing a REST API for the gimie package.

metadata-extraction rdf rest-api service

Last synced: 26 Dec 2024

https://github.com/mohammadreza-mohammadi94/metadata_extractor

Easily grab and check out metadata from your images, audio files, and documents with our user-friendly app. Get detailed insights and keep your data super organized!

functional-programming metadata metadata-extraction modularization python sqllite3 streamlit

Last synced: 26 Dec 2024