An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with file-processing

A curated list of projects in awesome lists tagged with file-processing .

https://github.com/mayankpratap/samchika

A fast and light-weight multithreaded file processing library for Java.

concurrency file-processing java kotlin multithreading open-source parallel-processing performance scala

Last synced: 26 Jun 2025

https://github.com/znicholls/netcdf-scm

Simple wrappers for processing netcdf files for use in simple climate models

climate climate-analysis climate-models climate-science file-processing netcdf netcdf4

Last synced: 06 Jan 2026

https://github.com/ilyashusterman/doc-to-readable

Universal document-to-markdown and section splitter for HTML, URLs, and PDFs.

docs document-conversion documents file-processing html javascript json markdown nodejs npm rag splitter

Last synced: 28 Jan 2026

https://github.com/vcfvct/fixed-width-ts-decorator

Fixed width file handler parser with TypeScript Decorator

decorator file-processing fixed-width javascript reflection typescript

Last synced: 13 Apr 2025

https://github.com/mhmelshaaer/file-structures-organization-and-processing

Implementing basic file manipulations concepts and algorithms in c++

c-plus-plus file-organization file-processing hashing indexed-search

Last synced: 05 May 2025

https://github.com/shramkoweb/bookbot

A Python-based text analyzer that counts words and character frequencies in any .txt file, providing a detailed, sorted report. Perfect for quick text insights and learning text processing basics!

automation beginner-friendly character-frequency data-analysis file-processing open-source python text-analysis text-parser text-processing word-count

Last synced: 02 Feb 2026

https://github.com/sthagen/puhdistusalue

Puhdistusalue (Finnish for clean area here meaning purge range) - Purge monotonically named files in folders keeping range endpoints

compression developer-tools file-processing

Last synced: 19 Aug 2025

https://github.com/fairfield-programming/libiii

🎆 An embeddable library for the Interpolated Image Interchange format.

c clang embedded file file-processing image image-processing images library standard standards

Last synced: 22 Aug 2025

https://github.com/jabedude/acct

Rust crate for processing acct(5) files

acct accton crates file-processing log logging parsing rust

Last synced: 14 May 2025

https://github.com/abitofhelp/adaptive_pipeline

Adaptive Rust pipeline for high-throughput file processing—dynamic chunking, parallelism, AES/ChaCha encryption, backpressure, and Prometheus/tracing.

adaptive-concurrency backpressure chunking concurrency data-pipeline encryption file-processing metrics observability opentelemetry parallelism prometheus rust stream-processing tracing

Last synced: 17 May 2026

https://github.com/dylan-stewart/capstone

Cloud-Based Analytics Application: CSV Check ~ Data Visualization for Inexperienced Users

automation cloud css csv data-visualization exploratory-data-analysis file-processing flask gcp gcs html javascript machine-learning python visualization web-app

Last synced: 09 Apr 2026

https://github.com/maxinexiong/geocoding-web-service

This repository houses a geocoding web application built in Python with Flask that transforms address data within a file into precise latitude and longitude coordinates. Upon uploading a file, users can preview the output table on the website, download the converted file, and visualise the exact location of each address on a map.

css3 file-download file-processing file-upload flask flask-application flask-webapp folium folium-maps geocoder geocoding html5 pandas pandas-dataframe python web-application

Last synced: 29 Apr 2026

https://github.com/mrzslr/simple-file-processing-pipeline

A distributed system for processing image files using microservices and message queues using RabbitMQ.

docker docker-compose file-processing message-queue microservice python python3 rabbitmq

Last synced: 15 Apr 2026

https://github.com/hitesh22rana/sourcecollector

A simple tool to consolidate multiple files into a single .txt file. Perfect for feeding your files to AI tools without any fuss.

ai-tools data-preparation file-processing text-processing utility

Last synced: 04 Nov 2025

https://github.com/dawidrylko/mergerocket

A CLI tool for recursively merging text file contents into a single output file, featuring customizable parameters and formatting optimized for LLMs.

concatenation file-processing merge text-merge

Last synced: 22 Feb 2026

https://github.com/ablomer/mediaconvert

⚙️ A modern web-based media converter that processes files entirely in your browser. Convert videos, images, and audio files between formats using FFmpeg and ImageMagick WebAssembly.

audio-converter browser-based client-side drag-and-drop dropzone ffmpeg file-converter file-processing firebase image-converter imagemagick mantine-ui media-converter react typescript video-converter vite web-application webassembly

Last synced: 07 Apr 2026

https://github.com/zejiran/transactions-email-processor

📬 Process a file containing debit and credit transactions on an account and generate a summary email with relevant information

email file-processing golang transactions

Last synced: 15 Mar 2025

https://github.com/abitofhelp/optimized_adaptive_pipeline_rs

Adaptive Rust pipeline for high-throughput file processing—dynamic chunking, parallelism, AES/ChaCha encryption, backpressure, and Prometheus/tracing.

adaptive-concurrency backpressure chunking concurrency data-pipeline encryption file-processing metrics observability opentelemetry parallelism prometheus rust stream-processing tracing

Last synced: 05 Oct 2025

https://github.com/barannmeisterr/zoomdurationanalyzer

Java program that reads, processes, and displays attendance records in a Zoom meeting report provided as a txt file.

duplicate-detection file-processing hashing hashtable java linear-probing node oop string-manipulation txt-files

Last synced: 13 May 2026

https://github.com/diluv/hilo

Processing and file management service for Diluv.

file-management file-processing

Last synced: 16 Jun 2025

https://github.com/victornpb/file-open-resume

A substitute for open() that lets you resume from where you left off. Very useful for consuming large files, or running a ETL script.

etl file-processing file-reader python python3 script

Last synced: 03 Apr 2025

https://github.com/montasim/terser-minify-tool

This script processes files within a project directory by minifying JavaScript files and copying other file types to a specified output directory. It leverages the Terser library for minification and tracks the size reduction achieved, reporting in kilobytes.

build-tool compression file-processing minification resource-optimization terser web-optimization

Last synced: 04 Apr 2025

https://github.com/oagoulart/luawalk

A file format reading and writing tool in Lua

file-format file-parser file-processing filesystem

Last synced: 21 Jun 2025

https://github.com/jigolka/file-to-image

Transform your files into images

file-processing image rust

Last synced: 03 Dec 2025

https://github.com/romelium/dircat

DirCat is a high-performance C++ utility that acts like the Unix cat command, but for entire directories. It efficiently concatenates and displays file contents, supporting multi-threading, recursion, and filtering. Note: This project heavily utilized AI tools during its development.

ai-tool ai-tools cmake console cpp cpp20 cross-platform dictionary directory-traversal file-processing filesystem multithreading utility

Last synced: 22 Mar 2025

https://github.com/artemzarubin/hamming-code

Implementation of Hamming code for error detection and correction. This Python application encodes/decodes text files using the 'ASCII -> binary -> Hamming code' scheme, allowing variable information block size (m). Developed for Work 1 in "Program and Data Security".

binary-data data-security encoder-decoder error-control-coding error-correction error-detection file-processing hamming-code lab-assignment python python3 university-project

Last synced: 15 Apr 2025

https://github.com/satheesh-meadi/medical-text-analysis-system

🚀 Medical Text Analysis System– An AI-powered web app that summarizes medical text, translates into 15+ languages, extracts entities (NER), and performs sentiment analysis. Built with Streamlit, PubMedBERT, and Google API to streamline healthcare data analysis.

chatbot file-processing llms machine-learning multilingual named-entity-recognition nlp ollama python streamlit

Last synced: 10 Apr 2026

https://github.com/xza85hrf/excel-comparison-app

Excel Comparison Application is a Python-based tool that compares two Excel files and generates a new Excel file with the differences. It's primarily designed to help in database updating by identifying new clients. The app also has a graphical user interface for easier use and logs operations for potential troubleshooting.

case-sensitive-comparison data-analysis data-difference database-comparison database-updates excel-comparison file-merging file-processing gui-application new-client-detection python

Last synced: 25 Mar 2025

https://github.com/valdotle/mangopeeler

CLI tool to detect and remove images inserted by aggregators and duplicates from your locally stored manga.

cli file-processing manga

Last synced: 18 Mar 2025

https://github.com/jet-logic/alterx

A powerful file processing toolkit for batch transformations of HTML, JSON, TOML, XML, and YAML files

command-line-tool file-processing html-parser json-parsing json-serialization python3 toml-parsing xml-parsing yaml-processor

Last synced: 20 Jan 2026

https://github.com/barannmeisterr/exceldataanalyzeravltree

Student Data Query is a Java project designed to manage and query student data using an AVL tree data structure.

apache-poi avl-tree balanced-search-trees data-structures excel file-processing java node searchquery strings xlsx

Last synced: 29 Apr 2026