Projects in Awesome Lists tagged with file-processing
A curated list of projects in awesome lists tagged with file-processing .
https://github.com/mayankpratap/samchika
A fast and light-weight multithreaded file processing library for Java.
concurrency file-processing java kotlin multithreading open-source parallel-processing performance scala
Last synced: 26 Jun 2025
https://github.com/transloadit/terraform-provider-transloadit
Terraform integration for Transloadit
api file-processing image-processing image-recognition media-processing-api terraform terraform-provider video-encoding
Last synced: 07 May 2025
https://github.com/znicholls/netcdf-scm
Simple wrappers for processing netcdf files for use in simple climate models
climate climate-analysis climate-models climate-science file-processing netcdf netcdf4
Last synced: 25 Mar 2025
https://github.com/yanivhaliwa/chatwithgpt
ai code-generation file-processing gpt gpt-4 gpt-4o linux machine-learning nlp ocr openai python shell-scripting
Last synced: 27 Jul 2025
https://github.com/mhmelshaaer/file-structures-organization-and-processing
Implementing basic file manipulations concepts and algorithms in c++
c-plus-plus file-organization file-processing hashing indexed-search
Last synced: 05 May 2025
https://github.com/vcfvct/fixed-width-ts-decorator
Fixed width file handler parser with TypeScript Decorator
decorator file-processing fixed-width javascript reflection typescript
Last synced: 13 Apr 2025
https://github.com/kaliv0/pyrio
Functional-style Streams API
file-processing fluent-api functional-programming python-functional streams-api
Last synced: 10 Apr 2025
https://github.com/fairfield-programming/libiii
🎆 An embeddable library for the Interpolated Image Interchange format.
c clang embedded file file-processing image image-processing images library standard standards
Last synced: 22 Aug 2025
https://github.com/jabedude/acct
Rust crate for processing acct(5) files
acct accton crates file-processing log logging parsing rust
Last synced: 14 May 2025
https://github.com/sthagen/puhdistusalue
Puhdistusalue (Finnish for clean area here meaning purge range) - Purge monotonically named files in folders keeping range endpoints
compression developer-tools file-processing
Last synced: 19 Aug 2025
https://github.com/zejiran/transactions-email-processor
📬 Process a file containing debit and credit transactions on an account and generate a summary email with relevant information
email file-processing golang transactions
Last synced: 15 Mar 2025
https://github.com/shramkoweb/bookbot
A Python-based text analyzer that counts words and character frequencies in any .txt file, providing a detailed, sorted report. Perfect for quick text insights and learning text processing basics!
automation beginner-friendly character-frequency data-analysis file-processing open-source python text-analysis text-parser text-processing word-count
Last synced: 28 Mar 2025
https://github.com/ablomer/mediaconvert
⚙️ A modern web-based media converter that processes files entirely in your browser. Convert videos, images, and audio files between formats using FFmpeg and ImageMagick WebAssembly.
audio-converter browser-based client-side drag-and-drop dropzone ffmpeg file-converter file-processing firebase image-converter imagemagick mantine-ui media-converter react typescript video-converter vite web-application webassembly
Last synced: 29 Mar 2025
https://github.com/comsavvy/perimeter
File processing
bash-script distance-calculation file-processing perimeter python
Last synced: 30 Oct 2025
https://github.com/hitesh22rana/sourcecollector
A simple tool to consolidate multiple files into a single .txt file. Perfect for feeding your files to AI tools without any fuss.
ai-tools data-preparation file-processing text-processing utility
Last synced: 04 Nov 2025
https://github.com/dawidrylko/mergerocket
A CLI tool for recursively merging text file contents into a single output file, featuring customizable parameters and formatting optimized for LLMs.
concatenation file-processing merge text-merge
Last synced: 06 Jul 2025
https://github.com/dylan-stewart/capstone
Cloud-Based Analytics Application: CSV Check ~ Data Visualization for Inexperienced Users
automation cloud css csv data-visualization exploratory-data-analysis file-processing flask gcp gcs html javascript machine-learning python visualization web-app
Last synced: 14 Jun 2025
https://github.com/abitofhelp/optimized_adaptive_pipeline_rs
Adaptive Rust pipeline for high-throughput file processing—dynamic chunking, parallelism, AES/ChaCha encryption, backpressure, and Prometheus/tracing.
adaptive-concurrency backpressure chunking concurrency data-pipeline encryption file-processing metrics observability opentelemetry parallelism prometheus rust stream-processing tracing
Last synced: 05 Oct 2025
https://github.com/abitofhelp/adaptive_pipeline
Adaptive Rust pipeline for high-throughput file processing—dynamic chunking, parallelism, AES/ChaCha encryption, backpressure, and Prometheus/tracing.
adaptive-concurrency backpressure chunking concurrency data-pipeline encryption file-processing metrics observability opentelemetry parallelism prometheus rust stream-processing tracing
Last synced: 10 Oct 2025
https://github.com/maxinexiong/geocoding-web-service
This repository houses a geocoding web application built in Python with Flask that transforms address data within a file into precise latitude and longitude coordinates. Upon uploading a file, users can preview the output table on the website, download the converted file, and visualise the exact location of each address on a map.
css3 file-download file-processing file-upload flask flask-application flask-webapp folium folium-maps geocoder geocoding html5 pandas pandas-dataframe python web-application
Last synced: 08 Apr 2025
https://github.com/jet-logic/alterx
A powerful file processing toolkit for batch transformations of HTML, JSON, TOML, XML, and YAML files
command-line-tool file-processing html-parser json-parsing json-serialization python3 toml-parsing xml-parsing yaml-processor
Last synced: 11 Oct 2025
https://github.com/germabyte/chatlog-cleaner
chatlog-cleaner.py is a user-friendly program designed to streamline the process of cleaning and organizing markdown files generated from ChatGPT conversations. If you frequently work with markdown files containing dialogue, this tool helps by removing user inputs and retaining only ChatGPT's or Assistant's responses.
automation chatgpt cleaning data-cleaning file-processing gui markdown python text-processing tkinter utility
Last synced: 15 Jun 2025
https://github.com/vpakarinen/image-video-converter
automating image and video conversion.
automation command-line ffmpeg file-processing filesystem-monitoring image-processing open-source productivity python utilities utility video-processing watchdog
Last synced: 22 Feb 2025
https://github.com/diluv/hilo
Processing and file management service for Diluv.
file-management file-processing
Last synced: 16 Jun 2025
https://github.com/victornpb/file-open-resume
A substitute for open() that lets you resume from where you left off. Very useful for consuming large files, or running a ETL script.
etl file-processing file-reader python python3 script
Last synced: 03 Apr 2025
https://github.com/montasim/terser-minify-tool
This script processes files within a project directory by minifying JavaScript files and copying other file types to a specified output directory. It leverages the Terser library for minification and tracks the size reduction achieved, reporting in kilobytes.
build-tool compression file-processing minification resource-optimization terser web-optimization
Last synced: 04 Apr 2025
https://github.com/zachacious/presto
CLI tool to run AI on files and directories of files
ai anthropic automation cli code-generation code-transformation developer-tools documentation file-processing golang llm openai productivity refactoring terminal
Last synced: 19 Jun 2025
https://github.com/oagoulart/luawalk
A file format reading and writing tool in Lua
file-format file-parser file-processing filesystem
Last synced: 21 Jun 2025
https://github.com/romelium/dircat
DirCat is a high-performance C++ utility that acts like the Unix cat command, but for entire directories. It efficiently concatenates and displays file contents, supporting multi-threading, recursion, and filtering. Note: This project heavily utilized AI tools during its development.
ai-tool ai-tools cmake console cpp cpp20 cross-platform dictionary directory-traversal file-processing filesystem multithreading utility
Last synced: 22 Mar 2025
https://github.com/krumyakimov/integration-file-processor-async
Asynchronous JSON file processor using public APIs and scheduled task execution
api-integration async asyncio automation data-processing file-processing integration json public-api python scheduler webservice
Last synced: 16 Aug 2025
https://github.com/mrzslr/simple-file-processing-pipeline
A distributed system for processing image files using microservices and message queues using RabbitMQ.
docker docker-compose file-processing message-queue microservice python python3 rabbitmq
Last synced: 29 Jun 2025
https://github.com/artemzarubin/hamming-code
Implementation of Hamming code for error detection and correction. This Python application encodes/decodes text files using the 'ASCII -> binary -> Hamming code' scheme, allowing variable information block size (m). Developed for Work 1 in "Program and Data Security".
binary-data data-security encoder-decoder error-control-coding error-correction error-detection file-processing hamming-code lab-assignment python python3 university-project
Last synced: 15 Apr 2025
https://github.com/satheesh-meadi/medical-text-analysis-system
🚀 Medical Text Analysis System– An AI-powered web app that summarizes medical text, translates into 15+ languages, extracts entities (NER), and performs sentiment analysis. Built with Streamlit, PubMedBERT, and Google API to streamline healthcare data analysis.
chatbot file-processing llms machine-learning multilingual named-entity-recognition nlp ollama python streamlit
Last synced: 08 Apr 2025
https://github.com/yuis-ice/videos-to-tomontage-thumbnails
Generate thumbnail montages from video files to quickly identify and browse your video collection
automation cli command-line-tool content-management education ffmpeg file-processing gaming media-processing mkv montage mp4 nodejs streaming surveillance thumbnail-generator typescript video-analysis video-processing video-thumbnails
Last synced: 04 Sep 2025
https://github.com/barannmeisterr/exceldataanalyzeravltree
Student Data Query is a Java project designed to manage and query student data using an AVL tree data structure.
apache-poi avl-tree balanced-search-trees data-structures excel file-processing java node searchquery strings xlsx
Last synced: 12 Jun 2025
https://github.com/xza85hrf/excel-comparison-app
Excel Comparison Application is a Python-based tool that compares two Excel files and generates a new Excel file with the differences. It's primarily designed to help in database updating by identifying new clients. The app also has a graphical user interface for easier use and logs operations for potential troubleshooting.
case-sensitive-comparison data-analysis data-difference database-comparison database-updates excel-comparison file-merging file-processing gui-application new-client-detection python
Last synced: 25 Mar 2025
https://github.com/valdotle/mangopeeler
CLI tool to detect and remove images inserted by aggregators and duplicates from your locally stored manga.
Last synced: 18 Mar 2025