Projects in Awesome Lists tagged with file-processing
A curated list of projects in awesome lists tagged with file-processing .
https://github.com/mayankpratap/samchika
A fast and light-weight multithreaded file processing library for Java.
concurrency file-processing java kotlin multithreading open-source parallel-processing performance scala
Last synced: 26 Jun 2025
https://github.com/transloadit/terraform-provider-transloadit
Terraform integration for Transloadit
api file-processing image-processing image-recognition media-processing-api terraform terraform-provider video-encoding
Last synced: 07 May 2025
https://github.com/znicholls/netcdf-scm
Simple wrappers for processing netcdf files for use in simple climate models
climate climate-analysis climate-models climate-science file-processing netcdf netcdf4
Last synced: 06 Jan 2026
https://github.com/ilyashusterman/doc-to-readable
Universal document-to-markdown and section splitter for HTML, URLs, and PDFs.
docs document-conversion documents file-processing html javascript json markdown nodejs npm rag splitter
Last synced: 28 Jan 2026
https://github.com/vcfvct/fixed-width-ts-decorator
Fixed width file handler parser with TypeScript Decorator
decorator file-processing fixed-width javascript reflection typescript
Last synced: 13 Apr 2025
https://github.com/mhmelshaaer/file-structures-organization-and-processing
Implementing basic file manipulations concepts and algorithms in c++
c-plus-plus file-organization file-processing hashing indexed-search
Last synced: 05 May 2025
https://github.com/shramkoweb/bookbot
A Python-based text analyzer that counts words and character frequencies in any .txt file, providing a detailed, sorted report. Perfect for quick text insights and learning text processing basics!
automation beginner-friendly character-frequency data-analysis file-processing open-source python text-analysis text-parser text-processing word-count
Last synced: 02 Feb 2026
https://github.com/sthagen/puhdistusalue
Puhdistusalue (Finnish for clean area here meaning purge range) - Purge monotonically named files in folders keeping range endpoints
compression developer-tools file-processing
Last synced: 19 Aug 2025
https://github.com/kaliv0/pyrio
Functional-style Streams API
file-processing fluent-api functional-programming python-functional streams-api
Last synced: 10 Apr 2025
https://github.com/fairfield-programming/libiii
🎆 An embeddable library for the Interpolated Image Interchange format.
c clang embedded file file-processing image image-processing images library standard standards
Last synced: 22 Aug 2025
https://github.com/jabedude/acct
Rust crate for processing acct(5) files
acct accton crates file-processing log logging parsing rust
Last synced: 14 May 2025
https://github.com/abitofhelp/adaptive_pipeline
Adaptive Rust pipeline for high-throughput file processing—dynamic chunking, parallelism, AES/ChaCha encryption, backpressure, and Prometheus/tracing.
adaptive-concurrency backpressure chunking concurrency data-pipeline encryption file-processing metrics observability opentelemetry parallelism prometheus rust stream-processing tracing
Last synced: 17 May 2026
https://github.com/comsavvy/perimeter
File processing
bash-script distance-calculation file-processing perimeter python
Last synced: 18 Apr 2026
https://github.com/dylan-stewart/capstone
Cloud-Based Analytics Application: CSV Check ~ Data Visualization for Inexperienced Users
automation cloud css csv data-visualization exploratory-data-analysis file-processing flask gcp gcs html javascript machine-learning python visualization web-app
Last synced: 09 Apr 2026
https://github.com/maxinexiong/geocoding-web-service
This repository houses a geocoding web application built in Python with Flask that transforms address data within a file into precise latitude and longitude coordinates. Upon uploading a file, users can preview the output table on the website, download the converted file, and visualise the exact location of each address on a map.
css3 file-download file-processing file-upload flask flask-application flask-webapp folium folium-maps geocoder geocoding html5 pandas pandas-dataframe python web-application
Last synced: 29 Apr 2026
https://github.com/mrzslr/simple-file-processing-pipeline
A distributed system for processing image files using microservices and message queues using RabbitMQ.
docker docker-compose file-processing message-queue microservice python python3 rabbitmq
Last synced: 15 Apr 2026
https://github.com/hitesh22rana/sourcecollector
A simple tool to consolidate multiple files into a single .txt file. Perfect for feeding your files to AI tools without any fuss.
ai-tools data-preparation file-processing text-processing utility
Last synced: 04 Nov 2025
https://github.com/dawidrylko/mergerocket
A CLI tool for recursively merging text file contents into a single output file, featuring customizable parameters and formatting optimized for LLMs.
concatenation file-processing merge text-merge
Last synced: 22 Feb 2026
https://github.com/ablomer/mediaconvert
⚙️ A modern web-based media converter that processes files entirely in your browser. Convert videos, images, and audio files between formats using FFmpeg and ImageMagick WebAssembly.
audio-converter browser-based client-side drag-and-drop dropzone ffmpeg file-converter file-processing firebase image-converter imagemagick mantine-ui media-converter react typescript video-converter vite web-application webassembly
Last synced: 07 Apr 2026
https://github.com/zejiran/transactions-email-processor
📬 Process a file containing debit and credit transactions on an account and generate a summary email with relevant information
email file-processing golang transactions
Last synced: 15 Mar 2025
https://github.com/abitofhelp/optimized_adaptive_pipeline_rs
Adaptive Rust pipeline for high-throughput file processing—dynamic chunking, parallelism, AES/ChaCha encryption, backpressure, and Prometheus/tracing.
adaptive-concurrency backpressure chunking concurrency data-pipeline encryption file-processing metrics observability opentelemetry parallelism prometheus rust stream-processing tracing
Last synced: 05 Oct 2025
https://github.com/barannmeisterr/zoomdurationanalyzer
Java program that reads, processes, and displays attendance records in a Zoom meeting report provided as a txt file.
duplicate-detection file-processing hashing hashtable java linear-probing node oop string-manipulation txt-files
Last synced: 13 May 2026
https://github.com/diluv/hilo
Processing and file management service for Diluv.
file-management file-processing
Last synced: 16 Jun 2025
https://github.com/victornpb/file-open-resume
A substitute for open() that lets you resume from where you left off. Very useful for consuming large files, or running a ETL script.
etl file-processing file-reader python python3 script
Last synced: 03 Apr 2025
https://github.com/montasim/terser-minify-tool
This script processes files within a project directory by minifying JavaScript files and copying other file types to a specified output directory. It leverages the Terser library for minification and tracks the size reduction achieved, reporting in kilobytes.
build-tool compression file-processing minification resource-optimization terser web-optimization
Last synced: 04 Apr 2025
https://github.com/zachacious/presto
CLI tool to run AI on files and directories of files
ai anthropic automation cli code-generation code-transformation developer-tools documentation file-processing golang llm openai productivity refactoring terminal
Last synced: 16 Apr 2026
https://github.com/oagoulart/luawalk
A file format reading and writing tool in Lua
file-format file-parser file-processing filesystem
Last synced: 21 Jun 2025
https://github.com/romelium/dircat
DirCat is a high-performance C++ utility that acts like the Unix cat command, but for entire directories. It efficiently concatenates and displays file contents, supporting multi-threading, recursion, and filtering. Note: This project heavily utilized AI tools during its development.
ai-tool ai-tools cmake console cpp cpp20 cross-platform dictionary directory-traversal file-processing filesystem multithreading utility
Last synced: 22 Mar 2025
https://github.com/krumyakimov/integration-file-processor-async
Asynchronous JSON file processor using public APIs and scheduled task execution
api-integration async asyncio automation data-processing file-processing integration json public-api python scheduler webservice
Last synced: 16 Aug 2025
https://github.com/artemzarubin/hamming-code
Implementation of Hamming code for error detection and correction. This Python application encodes/decodes text files using the 'ASCII -> binary -> Hamming code' scheme, allowing variable information block size (m). Developed for Work 1 in "Program and Data Security".
binary-data data-security encoder-decoder error-control-coding error-correction error-detection file-processing hamming-code lab-assignment python python3 university-project
Last synced: 15 Apr 2025
https://github.com/satheesh-meadi/medical-text-analysis-system
🚀 Medical Text Analysis System– An AI-powered web app that summarizes medical text, translates into 15+ languages, extracts entities (NER), and performs sentiment analysis. Built with Streamlit, PubMedBERT, and Google API to streamline healthcare data analysis.
chatbot file-processing llms machine-learning multilingual named-entity-recognition nlp ollama python streamlit
Last synced: 10 Apr 2026
https://github.com/yuis-ice/videos-to-tomontage-thumbnails
Generate thumbnail montages from video files to quickly identify and browse your video collection
automation cli command-line-tool content-management education ffmpeg file-processing gaming media-processing mkv montage mp4 nodejs streaming surveillance thumbnail-generator typescript video-analysis video-processing video-thumbnails
Last synced: 09 Apr 2026
https://github.com/xza85hrf/excel-comparison-app
Excel Comparison Application is a Python-based tool that compares two Excel files and generates a new Excel file with the differences. It's primarily designed to help in database updating by identifying new clients. The app also has a graphical user interface for easier use and logs operations for potential troubleshooting.
case-sensitive-comparison data-analysis data-difference database-comparison database-updates excel-comparison file-merging file-processing gui-application new-client-detection python
Last synced: 25 Mar 2025
https://github.com/valdotle/mangopeeler
CLI tool to detect and remove images inserted by aggregators and duplicates from your locally stored manga.
Last synced: 18 Mar 2025
https://github.com/jet-logic/alterx
A powerful file processing toolkit for batch transformations of HTML, JSON, TOML, XML, and YAML files
command-line-tool file-processing html-parser json-parsing json-serialization python3 toml-parsing xml-parsing yaml-processor
Last synced: 20 Jan 2026
https://github.com/barannmeisterr/exceldataanalyzeravltree
Student Data Query is a Java project designed to manage and query student data using an AVL tree data structure.
apache-poi avl-tree balanced-search-trees data-structures excel file-processing java node searchquery strings xlsx
Last synced: 29 Apr 2026