Projects in Awesome Lists tagged with jsonl
A curated list of projects in awesome lists tagged with jsonl.
https://github.com/textualize/toolong
A terminal application to view, tail, merge, and search log files (plus JSONL).
jsonl rich terminal terminal-based textual tui
Last synced: 14 May 2025
https://github.com/Textualize/toolong
A terminal application to view, tail, merge, and search log files (plus JSONL).
jsonl rich terminal terminal-based textual tui
Last synced: 26 Mar 2025
https://github.com/Ruya-AI/cozempic
Context cleaning for Claude Code — prune bloated sessions, protect Agent Teams from context loss, auto-guard with tiered pruning
agent-teams claude-code claude-skills cli context context-management jsonl llm-tools pruning python session-management
Last synced: 20 Apr 2026
https://github.com/delexw/claude-code-trace
Real-time Claude Code session log viewer and monitor. Tail, search, and visualize JSONL session files with a native desktop GUI, a web interface, and a TUI. Built with Tauri 2, React, and TypeScript.
claude-code desktop-gui jsonl monitor react search session-log tail tauri-2 typescript viewer visualize
Last synced: 07 Apr 2026
https://github.com/tpeczek/json-streaming-dotnet
JsonStreaming.NET is a solution that provides a set of libraries (`Ndjson.AsyncStreams.*`) for working with asynchronous streaming data sources over HTTP using JSONL (JSON Lines) and NDJSON (Newline Delimited JSON).
asp-net-core asp-net-core-mvc async-streams dotnet jsonl jsonlines ndjson
Last synced: 26 Oct 2025
https://github.com/umarbutler/orjsonl
A lightweight, high-performance Python library for parsing jsonl files.
bzip2 deserialization gzip json json-lines jsonl jsonlines ndjson parser parsing python serialization xz zstandard
Last synced: 11 Oct 2025
https://github.com/seart-group/dl4se
Building Training Datasets for Deep Learning Models in Software Engineering and Empirical Software Engineering Research
dataset-generation deep-learning docker-compose jsonl liquibase mining-software-repositories mining-source-code msr postgresql software-engineering source-code-analysis spring-boot spring-boot-application spring-boot-server
Last synced: 10 Apr 2025
https://github.com/markusressel/openhasp-config-manager
A tool to manage all of your openHASP device configs in a centralized place.
cli configuration-management deployment deployment-automation docker esp32 helper jsonl mqtt openhasp python validation
Last synced: 12 Feb 2026
https://github.com/vearutop/flatjsonl
A tool to flatten JSONL into CSV or SQL
csv hacktoberfest json jsonl sqlite3
Last synced: 17 Oct 2025
https://github.com/ffdev-info/jsonid
Identification of JSON (JSONL, YAML, and TOML) objects: JSONID
archives code4lib digipres digital-preservation file-formats format-identification glam json jsonl toml yaml
Last synced: 04 Jan 2026
https://github.com/ryhkml/fine-tune-forge
JSONL generator designed for models like Google PaLM 2 and OpenAI GPT-3.5
ai fine-tuning gpt-3 image-ocr jsonl openai tool
Last synced: 11 Apr 2025
https://github.com/aiengineeringforgrandmas/gemini-prompt-engineer-toolkit
⚡ Powered by Google TPUs and the latest (Aug 27, 2024) Gemini 1.5 Pro and Flash models to generate high-quality engineered prompts, analyze text and images, and create datasets for fine-tuning AI models, helping you become a prompt engineering pro
dataset-generation fine-tuning gemini-ai gemini-api gemini-flash gemini-pro html jsonl prompt-engineering python streamlit streamlit-webapp
Last synced: 18 Aug 2025
https://github.com/rmoralespp/jsonl
A lightweight Python library for handling jsonlines files
bzip2 deserialization files gzip json jsonl jsonlines ndjson python serialization tar utils xz zip
Last synced: 11 Mar 2026
https://github.com/kazu/vfs-index
No process; an indexer like a DB on a VFS (virtual filesystem)
csv flatbuffers go golang json jsonl lz4 search-engine
Last synced: 23 Apr 2025
https://github.com/petlack/rollup-plugin-jsonlines
🍣 A Rollup plugin which imports .jsonl (JSON Lines) files as JSON arrays.
javascript jsonl rollup rollup-plugin
Last synced: 11 May 2025
https://github.com/jimsmart/peanut
peanut is a Go package to write tagged data structs to disk in a variety of formats, simply and without ceremony.
csv excel go jsonl log-file sqlite struct-writer tsv
Last synced: 06 Jul 2025
https://github.com/nya1/bananareporter
An easy to use CLI to generate custom reports in JSON, JSONL, CSV from multiple sources
cli command-line-tool github gitlab jsonl report report-generator reporting reporting-tool
Last synced: 26 Oct 2025
https://github.com/arinbalyan/scrappy
Bulk job-board scraper with 100+ sites, email enrichment, deterministic quality scoring, and multi-format exports (CSV/JSONL/XLSX/Parquet). Designed for scheduled bulk-first operations with per-site rate limiting, proxy pools, and resume support.
ats csv data-engineering email-verification go golang job-board jsonl parquet proxy quality-scoring recruitment scraping socks5 web-scraping xlsx
Last synced: 27 Jun 2026
https://github.com/unfoldml/jsonl
JSON Lines
json jsonl jsonlines jsonlines-data streaming-data
Last synced: 23 Apr 2025
https://github.com/2nd1st/api-log
Transparent LLM gateway trace recorder · LLM gateway traffic recorder for sub2api / CLIProxyAPI / new-api
anthropic api-gateway claude claude-code codex gemini jsonl llmops new-api observability openai-api openai-compatible reverse-proxy sub2api
Last synced: 07 Jun 2026
https://github.com/marvin-j97/x11-pasteboard
A micro-CLI to watch your X11 pasteboard and emit the contents as jsonl
clipboard jsonl listener pasteboard small-tools x11
Last synced: 12 Feb 2026
https://github.com/jaybird1291/anki-llm-review-stats-exporter
Export your Anki review history (revlog) as JSONL so you can analyze it with an LLM (ChatGPT, Claude, local models, etc.) without using any API.
anki anki-addon chatgpt data-analysis data-export jsonl llm review-stats revlog statistics
Last synced: 24 Dec 2025
https://github.com/zaneh/dataset-tools
Small collection of scripts to build datasets for LLMs.
csv dataset fine-tuning jsonl llm system-prompt training
Last synced: 01 Jun 2026
https://github.com/aminnairi/cristaline
An immutable database engine built on the Event Sourcing pattern.
event event-sourcing javascript json jsonl jsonlines jsonstream react sourcing store typescript
Last synced: 04 Mar 2026
https://github.com/mirpo/datamatic
Build multi-step AI workflows with schema-guided reasoning. Supports Ollama, LMStudio, OpenAI, OpenRouter, Gemini, and all latest models for structured generation, chaining, and data processing.
agentic-ai ai-workflow dataset deepseek-r1 jsonl llama3 llm lmstudio localllm ollama phi4 synthetic-data synthetic-dataset-generation
Last synced: 28 Apr 2026
https://github.com/yuan-cloud/vifei-suite-public
Deterministic, local-first CLI/TUI for AI Agent rerun evidence, incidental forensics, and fail-closed share-safe exports.
ai-agents auditability cli deterministic eventlog forensics indient-response jsonl local-first observability redaction redactionsecurity replay rust tui
Last synced: 10 Apr 2026
https://github.com/oada/cli
Pipeable OADA CLI client
cli client concatenated-json json jsonl jsonlines ldjson ndjson oada
Last synced: 17 Apr 2026
https://github.com/richie-rich90454/training-generator
Training Generator is a cross-platform desktop app built with Electron and Node.js that converts documents (PDF, DOCX, DOC, RTF, TXT, MD, HTML) into structured AI training data. Using local Ollama models, it extracts instructions, Q&A pairs, and conversation data for machine learning, AI fine-tuning, and NLP workflows, while keeping all processing.
ai ai-data-analysis ai-training-data cpp desktop-app document-conversion electron html-css-javascript jsonl local-ai ml ollama ollama-api training-materials
Last synced: 26 Feb 2026
https://github.com/kevinbenabdelhak/wp-fine-tuning
WP Fine-tuning is a plugin that exports your posts from one or more content types of your choice into a .jsonl file ready to be imported into OpenAI. Boost your custom AI from your WordPress site
author content export fine-tuning jsonl openai pattern plugin wordpress
Last synced: 22 Aug 2025
https://github.com/dominiek/promptcellar-format
Open spec for logging AI coding prompts to a JSONL file in your repo. Vendor-neutral. JSON Schema published. plf-1.
ai-coding developer-tools json-schema jsonl llm-tools mcp open-standard prompt-logging
Last synced: 29 May 2026
https://github.com/lcweden/jsontext
A state machine for incremental JSON processing.
json json-parser json-path json-pointer jsonl jsontext parser state-machine stream streaming web-streams
Last synced: 31 May 2026
https://github.com/forjd/runtrail
Portable event trails for agentic dev workflows: commands, repo diffs, browser QA, CI failures, and repair prompts.
agentic-workflows browser-automation ci cli debugging developer-tools event-log github-actions jsonl observability qa repair-prompts rust
Last synced: 31 May 2026
https://github.com/hitmman55/claude-code-chat-viewer
Offline HTML viewer for Claude Code JSONL session transcripts. Six UI languages, light/dark theme, no backend, no CDN.
anthropic chat-log claude claude-code html-viewer i18n jsonl jsonl-viewer log-viewer offline-first public-domain single-file transcript-viewer zero-dependencies
Last synced: 21 Apr 2026
https://github.com/mrpudn/maltrends
(mirror) MyAnimeList.net manga and anime trend data.
anime data json jsonl jsonlines manga myanimelist
Last synced: 20 Apr 2026
https://github.com/banyc/dfsql
SQL REPL/lib for Data Frames
cli csv data-analysis jsonl ndjson repl sql
Last synced: 31 Jul 2025
https://github.com/leodeveloper/fine-tune-with-google-cloud
Generative AI custom data fine-tuning with Google Cloud Vertex AI
aiplatform custom-data custom-dataset customdataset datasets datasets-csv fine-tuning gcp generative-ai google-cloud google-cloud-aiplatform jsonl jsonlines machine-learning python vertex-ai
Last synced: 24 Apr 2026
https://github.com/impossibleforge/pfc-duckdb
DuckDB extension to read PFC-JSONL compressed log files with block-level timestamp filtering
analytics compression duckdb duckdb-extension jsonl log-compression sql structured-logs
Last synced: 25 Apr 2026
https://github.com/impossibleforge/pfc-jsonl
High-ratio JSONL compressor with block-level random access. ~9% ratio on log data - 25% smaller than gzip, 37% smaller than zstd. Free for personal and open-source use.
cli compression data-compression duckdb fluent-bit jsonl log-compression structured-logs
Last synced: 25 Apr 2026
https://github.com/amitmishrg/agenticlens
Visual debugging, tracing, and replay for agent workflows.
agent-debugging agent-workflows agentic-ai ai ai-agents ai-observability debugging-tools developer-tools devtools execution-tracing jsonl llm log-visualization nodejs observability reactjs tracing visualizations workflow-visualization
Last synced: 08 Jun 2026
https://github.com/pearxteam/bashim-scraper
A utility written in Kotlin that scrapes all the quotes from the Russian bash.im website and writes them into a single JSONL file.
bashim gradle gradle-kotlin-dsl json jsonl jsoup kotlin kotlinx-serialization scraper scraping scraping-websites
Last synced: 01 May 2026
https://github.com/fumito-ito/swiftyjsonlines
The better way to deal with JSONLines data in Swift.
Last synced: 27 Feb 2026
https://github.com/sija/jsonl.cr
Crystal shard for handling JSONL (JSON Lines) parsing
Last synced: 05 May 2026
https://github.com/ipanalytics/bogonforge
BogonForge is a reproducible compiler for IANA/RFC special-use IP and ASN policy.
asn bogon cidr iana ip-intelligence ipset ipv4 jsonl network-security nftables nginx rfc siem special-use threat-intelligence
Last synced: 25 May 2026
https://github.com/neverendingqs/pprint-ndjson-ui
For pretty-printing newline delimited JSON.
jsonl jsonlines ndjson netlify react-bootstrap reactjs
Last synced: 08 May 2026
https://github.com/sile/jsonlrpc
A JSON-RPC 2.0 library that streams JSON objects in JSON Lines format.
Last synced: 03 Feb 2026
https://github.com/escoffier-labs/sourceharvest
SourceHarvest exports local source-system records to portable miseledger.adapter.v1 JSONL for archive, search, and evidence workflows.
ai-agents brigade cli evidence exporter go jsonl local-first miseledger sourceharvest
Last synced: 11 Jun 2026
https://github.com/escoffier-labs/stationtrail
StationTrail exports local station and agent-session logs to portable miseledger.adapter.v1 JSONL. Local-only, no network calls.
ai-agents brigade claude-code cli codex go jsonl local-first miseledger openclaw session-logs stationtrail
Last synced: 11 Jun 2026
https://github.com/annaleighsmith/bt
A minimal, local-first issue tracker for AI agents. Issues live in plain JSONL files alongside your code.
ai ai-agents beads claude claude-code cli developer-tools go issue-tracker jsonl
Last synced: 13 Jun 2026
https://github.com/dwalleck/rivets
A high-performance, Rust-based issue tracking system using JSONL storage
async cli issue-tracker jsonl project-management rust rust-lang
Last synced: 20 Jun 2026
https://github.com/zaibacu/json2jsonl
A tool to take large JSON and convert it to JSONL
Last synced: 11 Apr 2026
https://github.com/effet/cxusage
Codex usage analytics CLI that scans ~/.codex/sessions, aggregates tokens by day or model, and estimates cost using OpenRouter pricing.
anthropic claude cli codex command-line-tool cost-estimation daily-reports data-aggregation developer-tools jsonl log-analysis metrics nodejs observability openai openrouter reporting token-usage typescript usage-analytics
Last synced: 18 Apr 2026
https://github.com/sile/jlot
Command-line tool for JSON-RPC 2.0 over JSON Lines over TCP.
command-line-tool json-rpc jsonl rust
Last synced: 17 Feb 2026
https://github.com/mbe24/99problems
AI-native CLI tool for issue/PR context retrieval across Jira, GitLab, GitHub, and Bitbucket, with structured output for humans and AI agents plus a scaffold aligned with the Agent Skills standard.
agent-skills agentic-engineering ai-agents ai-native bitbucket-cloud bitbucket-datacenter cli developer-tools github-api gitlab-api issue-tracking jira-api json jsonl llm ndjson pull-requests rag rust yaml
Last synced: 13 Apr 2026
https://github.com/plaited/agent-eval-harness
Evaluate AI agents with Unix-style pipeline commands. Schema-driven adapters for any CLI agent, trajectory capture, pass@k metrics, and multi-run comparison.
agent-client-protocol agent-comparison agent-evaluation ai-agents bun cli eval-harness grader headless-adapter jsonl llm-evaluation pass-at-k trajectory-capture typescript unix-pipeline
Last synced: 20 Feb 2026
https://github.com/ondata/appaltipop
ETL scripts and issue tracking for AppaltiPOP project.
appaltipop csv elasticsearch json json-schema jsonl jupiter python
Last synced: 17 Mar 2026
https://github.com/ddjain/jsonl-visualizer
A beautiful web tool for visualizing JSONL files with syntax highlighting and multiple view modes
data-analysis json jsonl viusal
Last synced: 01 Jul 2026
https://github.com/effortlessmetrics/tokmd
Code intelligence for humans, machines, and LLMs: receipts, metrics, and insights from your codebase.
ai-tools ci cli code-metrics code-statistics csv devex github-actions jsonl llm llmops loc markdown monorepo repository-analysis rust sloc tokei tsv
Last synced: 15 Apr 2026
https://github.com/itsluketwist/jldc
Easily read/write JSONLines files that include dataclasses.
dataclasses dataclasses-json jsonl jsonlines
Last synced: 05 Apr 2025
https://github.com/abetomo/dump_to_jsonl
Generate JSONL from a `mysqldump` dump file.
Last synced: 29 Jun 2026
https://github.com/viggomeesters/go-project-template
Minimal .go template repository for repo-local agent workflows.
Last synced: 03 Jul 2026
https://github.com/viggomeesters/minimal-ai-vault-starter
Public-safe starter for a minimal AI-enabled Obsidian/JSONL vault
ai-vault jsonl obsidian personal-knowledge-management starter-kit
Last synced: 03 Jul 2026
https://github.com/viggomeesters/go-workflow-stack
Reusable CLI and schemas for repo-local .go agent workflow state.
agent-workflow developer-tools jsonl
Last synced: 03 Jul 2026
https://github.com/viggomeesters/jsonl-vault-spike
Synthetic JSONL-first vault/context proof of concept for agent-readable personal knowledge systems
agents context-engineering jsonl personal-knowledge-management synthetic-data
Last synced: 03 Jul 2026
https://github.com/iagolirapasssos/jsonl-file-generator
This is a simple web application that allows users to create JSONL files for fine-tuning OpenAI's GPT models. The application includes fields for entering "System", "User", and "Assistant" messages, stores data in the browser's local storage to prevent data loss on refresh, and supports multiple languages (English, Portuguese, Spanish, and Hindi).
Last synced: 18 May 2026
https://github.com/nightmarewalker/d-safelogger
Zero-dependency, stdlib-compatible Python logger with append-only routing, integrity sidecars, and multiprocess delivery-state observability.
append-only audit-logging configuration free-threading integrity jsonl log-rotation logging multi-processing observability pep-703 python python-free-threaded python-logging sha256 structured-logging windows zero-dependency
Last synced: 23 May 2026
https://github.com/adeladool/ner-news-articles
Named Entity Recognition (NER) on news articles using rule-based and spaCy models. Includes entity extraction, visualizations with displaCy, and comparison of small vs large spaCy models for analysis and insights.
csv data-science displacy jsonl machine-learning named-entity-recognition natural-language-processing natural-language-processing-nlp ner news-analysis python spacy text-mining text-processing visualization
Last synced: 16 May 2026
https://github.com/allendema/rewe_dl
Call store APIs. Parse responses and save the data to SQL, JSON or any custom format.
apprise discounts httpx json jsonl market price-comparison price-tracker products python3 ruff sqlite3 webhooks
Last synced: 21 Aug 2025
https://github.com/nikopastore/agent-memory-site
Turn Markdown/Obsidian AI-agent memory vaults into semantic HTML dashboards, search indexes, and retrieval-ready JSONL chunks.
agent-memory ai-agent claude codex developer-tools jsonl knowledge-base llm local-first markdown obsidian privacy rag retrieval semantic-html static-site static-site-generator typescript
Last synced: 24 May 2026
https://github.com/takanoriyanagitani/psplit-py
Programmable text file splitter
aggregate jsonl ldjson lines2files lines2line ndjson programmable python3 split text
Last synced: 19 Jun 2025
https://github.com/kadubon/bottleneck-audit-toolkit
Offline, fail-closed verifier for JSONL telemetry event logs. Emits deterministic audit certificates + human summaries with explicit claims/non-claims for bottleneck and integrity review.
ai audit bottleneck checkpointing distributed-training event-logs jsonl mlops offline-verification performance-monitoring silent-data-corruption tail-latency telemetry
Last synced: 13 Jan 2026
https://github.com/kadubon/oasg
Local-first, model-agnostic workflow optimizer for long-running AI agents: observable JSONL ledgers, deterministic reducers, no-meta gates, and receipt-backed self-improvement without LLM judges or model-weight updates.
agent-evaluation agent-memory agent-workflows ai-agents autonomous-agents deterministic-replay jsonl long-running-agents model-agnostic no-meta ollama python self-improving-agents verification workflow-automation workflow-optimization
Last synced: 30 May 2026
https://github.com/nlink-jp/lookup
A powerful Go-based CLI tool to enrich JSON/JSONL data streams by looking up values in CSV or JSON files. Inspired by Splunk's lookup command.
cli csv data-enrichment data-pipeline devpos-tools go golang json jsonl lookup splunk
Last synced: 14 Apr 2026
https://github.com/plaited/acp-harness
CLI for agent evaluation. Capture trajectories, run trials with pass@k metrics, and score with polyglot graders (TypeScript, Python, any language).
acp agent-client-protocol agent-evaluation ai-agents bun cli eval-harness grader jsonl llm-evaluation pass-at-k trajectory-capture typescript
Last synced: 21 Jan 2026
https://github.com/cahlen/conversation-dataset-generator
Craft conversational datasets (JSONL format with rich metadata) using LLMs. Specify parameters manually or use a creative brief for LLM-generated arguments with automatic topic/scenario variation. Optional web search improves persona grounding. Ideal for LoRA tuning, persona training, and creative writing. Includes Hugging Face Hub upload.
dataset-generation dialogue-generation fine-tuning huggingface jsonl llm lora nlp peft persona python synthentic-data transformers
Last synced: 27 Oct 2025
https://github.com/sdkks/nesdit
nesdit, aka NestedEdit: edit and convert your JSON(L), YAML, and TOML files in place or on the fly through streaming
ansible argo argocd configuration jq json jsonl kubernetes ndjson serialization spinnaker toml yaml yml yq
Last synced: 01 Jun 2026
https://github.com/muka/tap-jsonl
A tap to process jsonl files
jsonl meltano singer tap-jsonl
Last synced: 03 Apr 2026
https://github.com/zurd46/zurdsynthdatagen
This Electron project uses the OpenAI ChatCompletion API to generate synthetic datasets in either German (DE) or English (EN).
data data-structures dataset electron json jsonl nodejs openai synthetic
Last synced: 04 Apr 2026
https://github.com/mrizaln/octave-ndjson
Newline Delimited JSON (ndjson) or JSON Lines (jsonl) parser for Octave
json jsonl multithreading ndjson octave parser
Last synced: 20 Apr 2026
https://github.com/kuaner/cc-reader
A macOS app for reading and managing Claude Code session history.
claude-code jsonl macos swift swiftui xcodegen
Last synced: 21 Apr 2026
https://github.com/dbunk903/codex-oss-lens
Local-first usage dashboard for OpenAI Codex session logs.
codex jsonl openai oss-maintenance usage-dashboard
Last synced: 05 Jun 2026
https://github.com/siddhesh-agarwal/openai-ft-validate
A Go-based CLI tool that verifies the structure of JSONL files for OpenAI fine-tuning.
cli file-validation golang jsonl openai
Last synced: 21 Apr 2026
https://github.com/puukis/ccparse
TypeScript SDK for discovering, parsing, normalizing, and querying Claude Code local session data. npm: ccparse-sdk
claude-code jsonl parser sdk typescript
Last synced: 25 Apr 2026
https://github.com/impossibleforge/pfc-py
Python interface for PFC-JSONL
compression duckdb fluent-bit jsonl log-compression pypi python structured-logs
Last synced: 25 Apr 2026
https://github.com/codersacademy006/data-sanitizer
AI-powered data sanitizer with schema detection, dedupe, outlier detection, and LLM enrichment.
csv-cleaning data-cleaning data-engineering data-enrichment data-pipeline data-quality etl jsonl outlier-detection sqlite streaming-pipeline
Last synced: 06 Jun 2026
https://github.com/takanoriyanagitani/jsonl2jsons
convert jsonl(ldjson, ndjson, json-stream, ...) to json(s)
c json json-stream jsonl jsons ldjson ndjson
Last synced: 02 May 2026