An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with file-processing

A curated list of projects in awesome lists tagged with file-processing.

https://github.com/mayankpratap/samchika

A fast and lightweight multithreaded file processing library for Java.

concurrency file-processing java kotlin multithreading open-source parallel-processing performance scala

Last synced: 26 Jun 2025

https://github.com/znicholls/netcdf-scm

Simple wrappers for processing netCDF files for use in simple climate models

climate climate-analysis climate-models climate-science file-processing netcdf netcdf4

Last synced: 25 Mar 2025

https://github.com/mhmelshaaer/file-structures-organization-and-processing

Implementations of basic file manipulation concepts and algorithms in C++

c-plus-plus file-organization file-processing hashing indexed-search

Last synced: 05 May 2025

https://github.com/vcfvct/fixed-width-ts-decorator

Fixed-width file parser and handler built with TypeScript decorators

decorator file-processing fixed-width javascript reflection typescript

Last synced: 13 Apr 2025

https://github.com/fairfield-programming/libiii

🎆 An embeddable library for the Interpolated Image Interchange format.

c clang embedded file file-processing image image-processing images library standard standards

Last synced: 22 Aug 2025

https://github.com/jabedude/acct

Rust crate for processing acct(5) files

acct accton crates file-processing log logging parsing rust

Last synced: 14 May 2025

https://github.com/sthagen/puhdistusalue

Puhdistusalue (Finnish for "clean area", here meaning "purge range"): purge monotonically named files in folders, keeping the range endpoints

compression developer-tools file-processing

Last synced: 19 Aug 2025

https://github.com/zejiran/transactions-email-processor

📬 Process a file containing debit and credit transactions on an account and generate a summary email with relevant information

email file-processing golang transactions

Last synced: 15 Mar 2025

https://github.com/shramkoweb/bookbot

A Python-based text analyzer that counts words and character frequencies in any .txt file, providing a detailed, sorted report. Perfect for quick text insights and learning text processing basics!

automation beginner-friendly character-frequency data-analysis file-processing open-source python text-analysis text-parser text-processing word-count

Last synced: 28 Mar 2025

https://github.com/ablomer/mediaconvert

⚙️ A modern web-based media converter that processes files entirely in your browser. Convert videos, images, and audio files between formats using FFmpeg and ImageMagick WebAssembly.

audio-converter browser-based client-side drag-and-drop dropzone ffmpeg file-converter file-processing firebase image-converter imagemagick mantine-ui media-converter react typescript video-converter vite web-application webassembly

Last synced: 29 Mar 2025

https://github.com/hitesh22rana/sourcecollector

A simple tool to consolidate multiple files into a single .txt file. Perfect for feeding your files to AI tools without any fuss.

ai-tools data-preparation file-processing text-processing utility

Last synced: 04 Nov 2025

https://github.com/dawidrylko/mergerocket

A CLI tool for recursively merging text file contents into a single output file, featuring customizable parameters and formatting optimized for LLMs.

concatenation file-processing merge text-merge

Last synced: 06 Jul 2025

https://github.com/dylan-stewart/capstone

Cloud-Based Analytics Application: CSV Check ~ Data Visualization for Inexperienced Users

automation cloud css csv data-visualization exploratory-data-analysis file-processing flask gcp gcs html javascript machine-learning python visualization web-app

Last synced: 14 Jun 2025

https://github.com/abitofhelp/optimized_adaptive_pipeline_rs

Adaptive Rust pipeline for high-throughput file processing—dynamic chunking, parallelism, AES/ChaCha encryption, backpressure, and Prometheus/tracing.

adaptive-concurrency backpressure chunking concurrency data-pipeline encryption file-processing metrics observability opentelemetry parallelism prometheus rust stream-processing tracing

Last synced: 05 Oct 2025

https://github.com/abitofhelp/adaptive_pipeline

Adaptive Rust pipeline for high-throughput file processing—dynamic chunking, parallelism, AES/ChaCha encryption, backpressure, and Prometheus/tracing.

adaptive-concurrency backpressure chunking concurrency data-pipeline encryption file-processing metrics observability opentelemetry parallelism prometheus rust stream-processing tracing

Last synced: 10 Oct 2025

https://github.com/maxinexiong/geocoding-web-service

This repository houses a geocoding web application built in Python with Flask that transforms address data within a file into precise latitude and longitude coordinates. Upon uploading a file, users can preview the output table on the website, download the converted file, and visualise the exact location of each address on a map.

css3 file-download file-processing file-upload flask flask-application flask-webapp folium folium-maps geocoder geocoding html5 pandas pandas-dataframe python web-application

Last synced: 08 Apr 2025

https://github.com/jet-logic/alterx

A powerful file processing toolkit for batch transformations of HTML, JSON, TOML, XML, and YAML files

command-line-tool file-processing html-parser json-parsing json-serialization python3 toml-parsing xml-parsing yaml-processor

Last synced: 11 Oct 2025

https://github.com/germabyte/chatlog-cleaner

chatlog-cleaner.py is a user-friendly program designed to streamline the process of cleaning and organizing markdown files generated from ChatGPT conversations. If you frequently work with markdown files containing dialogue, this tool helps by removing user inputs and retaining only ChatGPT's or Assistant's responses.

automation chatgpt cleaning data-cleaning file-processing gui markdown python text-processing tkinter utility

Last synced: 15 Jun 2025

https://github.com/diluv/hilo

Processing and file management service for Diluv.

file-management file-processing

Last synced: 16 Jun 2025

https://github.com/victornpb/file-open-resume

A substitute for open() that lets you resume from where you left off. Very useful for consuming large files or running an ETL script.

etl file-processing file-reader python python3 script

Last synced: 03 Apr 2025

https://github.com/montasim/terser-minify-tool

This script processes files within a project directory by minifying JavaScript files and copying other file types to a specified output directory. It leverages the Terser library for minification and tracks the size reduction achieved, reporting in kilobytes.

build-tool compression file-processing minification resource-optimization terser web-optimization

Last synced: 04 Apr 2025

https://github.com/oagoulart/luawalk

A file format reading and writing tool in Lua

file-format file-parser file-processing filesystem

Last synced: 21 Jun 2025

https://github.com/jigolka/file-to-image

Transform your files into images

file-processing image rust

Last synced: 03 Dec 2025

https://github.com/romelium/dircat

DirCat is a high-performance C++ utility that acts like the Unix cat command, but for entire directories. It efficiently concatenates and displays file contents, supporting multi-threading, recursion, and filtering. Note: This project heavily utilized AI tools during its development.

ai-tool ai-tools cmake console cpp cpp20 cross-platform dictionary directory-traversal file-processing filesystem multithreading utility

Last synced: 22 Mar 2025

https://github.com/mrzslr/simple-file-processing-pipeline

A distributed system for processing image files using microservices and RabbitMQ message queues.

docker docker-compose file-processing message-queue microservice python python3 rabbitmq

Last synced: 29 Jun 2025

https://github.com/artemzarubin/hamming-code

Implementation of Hamming code for error detection and correction. This Python application encodes/decodes text files using the 'ASCII -> binary -> Hamming code' scheme, allowing variable information block size (m). Developed for Work 1 in "Program and Data Security".

binary-data data-security encoder-decoder error-control-coding error-correction error-detection file-processing hamming-code lab-assignment python python3 university-project

Last synced: 15 Apr 2025

https://github.com/satheesh-meadi/medical-text-analysis-system

🚀 Medical Text Analysis System: an AI-powered web app that summarizes medical text, translates it into 15+ languages, extracts entities (NER), and performs sentiment analysis. Built with Streamlit, PubMedBERT, and Google API to streamline healthcare data analysis.

chatbot file-processing llms machine-learning multilingual named-entity-recognition nlp ollama python streamlit

Last synced: 08 Apr 2025

https://github.com/barannmeisterr/exceldataanalyzeravltree

Student Data Query is a Java project designed to manage and query student data using an AVL tree data structure.

apache-poi avl-tree balanced-search-trees data-structures excel file-processing java node searchquery strings xlsx

Last synced: 12 Jun 2025

https://github.com/xza85hrf/excel-comparison-app

Excel Comparison Application is a Python-based tool that compares two Excel files and generates a new Excel file with the differences. It's primarily designed to help in database updating by identifying new clients. The app also has a graphical user interface for easier use and logs operations for potential troubleshooting.

case-sensitive-comparison data-analysis data-difference database-comparison database-updates excel-comparison file-merging file-processing gui-application new-client-detection python

Last synced: 25 Mar 2025

https://github.com/valdotle/mangopeeler

CLI tool to detect and remove images inserted by aggregators and duplicates from your locally stored manga.

cli file-processing manga

Last synced: 18 Mar 2025