An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with jsonl

A curated list of projects in awesome lists tagged with jsonl .

https://github.com/textualize/toolong

A terminal application to view, tail, merge, and search log files (plus JSONL).

jsonl rich terminal terminal-based textual tui

Last synced: 14 May 2025

https://github.com/Textualize/toolong

A terminal application to view, tail, merge, and search log files (plus JSONL).

jsonl rich terminal terminal-based textual tui

Last synced: 26 Mar 2025

https://github.com/noborus/trdsql

CLI tool that can execute SQL queries on CSV, LTSV, JSON, YAML and TBLN. Can output to various formats.

cli csv go golang json jsonl ltsv mysql postgresql sql sqlite3 tbln trdsql yaml

Last synced: 12 May 2026

https://github.com/Ruya-AI/cozempic

Context cleaning for Claude Code — prune bloated sessions, protect Agent Teams from context loss, auto-guard with tiered pruning

agent-teams claude-code claude-skills cli context context-management jsonl llm-tools pruning python session-management

Last synced: 20 Apr 2026

https://github.com/delexw/claude-code-trace

Real-time Claude Code session log viewer and monitor. Tail, search, and visualize JSONL session files with a native desktop GUI, a web and a tui. Built with Tauri 2, React, and TypeScript.

claude-code desktop-gui jsonl monitor react search session-log tail tauri-2 typescript viewer visualize

Last synced: 07 Apr 2026

https://github.com/datacoon/undatum

undatum: a command-line tool for data processing. Brings CSV simplicity to NDJSON, BSON, XML and other data files

bson cli command-line csv data dataset json jsonl jsonlines parquet

Last synced: 18 Jan 2026

https://github.com/tpeczek/json-streaming-dotnet

JsonStreaming.NET is a solution that provides a set of libraries (`Ndjson.AsyncStreams.*`) for working with asynchronous streaming data sources over HTTP using JSONL (JSON Lines) and NDJSON (Newline Delimited JSON).

asp-net-core asp-net-core-mvc async-streams dotnet jsonl jsonlines ndjson

Last synced: 26 Oct 2025

https://github.com/umarbutler/orjsonl

A lightweight, high-performance Python library for parsing jsonl files.

bzip2 deserialization gzip json json-lines jsonl jsonlines ndjson parser parsing python serialization xz zstandard

Last synced: 11 Oct 2025

https://github.com/salaah01/json-lineage

Tool to allow parsing large JSON files without laoding into memory. Developed in Rust with adapters in other programming langauges for easy adoption

json jsonl jsonlines python python3 rust rust-lang

Last synced: 21 Mar 2025

https://github.com/chand1012/sq

Convert and query JSON, JSONL, CSV, and SQLite with ease!

cli csv golang json jsonl scripting sqlite sqlite3

Last synced: 30 Jul 2025

https://github.com/markusressel/openhasp-config-manager

A tool to manage all of your openHASP device configs in a centralized place.

cli configuration-management deployment deployment-automation docker esp32 helper jsonl mqtt openhasp python validation

Last synced: 12 Feb 2026

https://github.com/vearutop/flatjsonl

A tool to flatten JSONL into CSV or SQL

csv hacktoberfest json jsonl sqlite3

Last synced: 17 Oct 2025

https://github.com/joeyism/jsonl-to-conll

A simple tool to convert JSONL to CONLL

conll convert converter data etl jsonl learning machine train training

Last synced: 11 Oct 2025

https://github.com/ffdev-info/jsonid

Identification of JSON (JSONL, YAML, and TOML) objects: JSONID

archives code4lib digipres digital-preservation file-formats format-identification glam json jsonl toml yaml

Last synced: 04 Jan 2026

https://github.com/ryhkml/fine-tune-forge

JSONL generator designed for models like Google PaLM 2 and OpenAI GPT-3.5

ai fine-tuning gpt-3 image-ocr jsonl openai tool

Last synced: 11 Apr 2025

https://github.com/aiengineeringforgrandmas/gemini-prompt-engineer-toolkit

⚡Powered by Goggle TPUs and the latest (Aug 27, 2024) Gemini 1.5 Pro and Flash Models to generate high-quality engineered prompts, analyze text and images, and create datasets for fine-tuning AI models, helping you to become a prompt engineering pro

dataset-generation fine-tuning gemini-ai gemini-api gemini-flash gemini-pro html jsonl prompt-engineering python streamlit streamlit-webapp

Last synced: 18 Aug 2025

https://github.com/rmoralespp/jsonl

A lightweight Python library for handling jsonlines files

bzip2 deserialization files gzip json jsonl jsonlines ndjson python serialization tar utils xz zip

Last synced: 11 Mar 2026

https://github.com/kazu/vfs-index

no process, indexer like a DB on VFS(virtual filesystem)

csv flatbuffers go golang json jsonl lz4 search-engine

Last synced: 23 Apr 2025

https://github.com/petlack/rollup-plugin-jsonlines

🍣 A Rollup plugin which imports .jsonl (JSON Lines) files as JSON arrays.

javascript jsonl rollup rollup-plugin

Last synced: 11 May 2025

https://github.com/jimsmart/peanut

peanut is a Go package to write tagged data structs to disk in a variety of formats, simply and without ceremony.

csv excel go jsonl log-file sqlite struct-writer tsv

Last synced: 06 Jul 2025

https://github.com/nya1/bananareporter

An easy to use CLI to generate custom reports in JSON, JSONL, CSV from multiple sources

cli command-line-tool github gitlab jsonl report report-generator reporting reporting-tool

Last synced: 26 Oct 2025

https://github.com/arinbalyan/scrappy

Bulk job-board scraper with 100+ sites, email enrichment, deterministic quality scoring, and multi-format exports (CSV/JSONL/XLSX/Parquet). Designed for scheduled bulk-first operations with per-site rate limiting, proxy pools, and resume support.

ats csv data-engineering email-verification go golang job-board jsonl parquet proxy quality-scoring recruitment scraping socks5 web-scraping xlsx

Last synced: 27 Jun 2026

https://github.com/2nd1st/api-log

Transparent LLM gateway trace recorder · sub2api / CLIProxyAPI / new-api 的 LLM 网关流量录制器

anthropic api-gateway claude claude-code codex gemini jsonl llmops new-api observability openai-api openai-compatible reverse-proxy sub2api

Last synced: 07 Jun 2026

https://github.com/marvin-j97/x11-pasteboard

A micro-CLI to watch your X11 pasteboard and emit the contents as jsonl

clipboard jsonl listener pasteboard small-tools x11

Last synced: 12 Feb 2026

https://github.com/mosuka/wikipedia-jsonl

wikipedia-jsonl is a CLI that converts Wikipedia dump XML to JSON Lines format.

cli go golang jsonl mediawiki ndjson wikipedia xml

Last synced: 17 May 2026

https://github.com/jaybird1291/anki-llm-review-stats-exporter

Export your Anki review history (revlog) as JSONL so you can analyze it with an LLM (ChatGPT, Claude, local models, etc.) without using any API.

anki anki-addon chatgpt data-analysis data-export jsonl llm review-stats revlog statistics

Last synced: 24 Dec 2025

https://github.com/zaneh/dataset-tools

Small collection of scripts to build datasets for LLMs.

csv dataset fine-tuning jsonl llm system-prompt training

Last synced: 01 Jun 2026

https://github.com/aminnairi/cristaline

An immutable database engine built on the Event Sourcing pattern.

event event-sourcing javascript json jsonl jsonlines jsonstream react sourcing store typescript

Last synced: 04 Mar 2026

https://github.com/mirpo/datamatic

Build multi-step AI workflows with schema-guided reasoning. Supports Ollama, LMStudio, OpenAI, OpenRouter, Gemini, and all latest models for structured generation, chaining, and data processing.

agentic-ai ai-workflow dataset deepseek-r1 jsonl llama3 llm lmstudio localllm ollama phi4 synthetic-data synthetic-dataset-generation

Last synced: 28 Apr 2026

https://github.com/yuan-cloud/vifei-suite-public

Deterministic, local-first CLI/TUI for AI Agent rerun evidence, incidental forensics, and fail-closed share-safe exports.

ai-agents auditability cli deterministic eventlog forensics indient-response jsonl local-first observability redaction redactionsecurity replay rust tui

Last synced: 10 Apr 2026

https://github.com/s3rgeym/sqldump2json

Converts SQL dump to a JSON stream.

converter jq json jsonl parser python python3 sql

Last synced: 05 Apr 2025

https://github.com/oada/cli

Pipeable OADA CLI client

cli client concatenated-json json jsonl jsonlines ldjson ndjson oada

Last synced: 17 Apr 2026

https://github.com/richie-rich90454/training-generator

Training Generator is a cross-platform desktop app built with Electron and Node.js that converts documents (PDF, DOCX, DOC, RTF, TXT, MD, HTML) into structured AI training data. Using local Ollama models, it extracts instructions, Q&A pairs, and conversation data for machine learning, AI fine-tuning, and NLP workflows, while keeping all processing.

ai ai-data-analysis ai-training-data cpp desktop-app document-conversion electron html-css-javascript jsonl local-ai ml ollama ollama-api training-materials

Last synced: 26 Feb 2026

https://github.com/kevinbenabdelhak/wp-fine-tuning

WP Fine-tuning est un plugin qui exporte vos publications de ou des types de contenus de votre choix, dans un fichier .jsonl prêt à être importé sur OpenAI. Boostez votre IA personnalisée à partir de votre site WordPress

author content export fine-tuning jsonl openai pattern plugin wordpress

Last synced: 22 Aug 2025

https://github.com/dominiek/promptcellar-format

Open spec for logging AI coding prompts to a JSONL file in your repo. Vendor-neutral. JSON Schema published. plf-1.

ai-coding developer-tools json-schema jsonl llm-tools mcp open-standard prompt-logging

Last synced: 29 May 2026

https://github.com/maxlath/ndjson-2-json

minimal CLI converter from newline-delimited JSON to a JSON array

cli converter json jsonl ndjson

Last synced: 02 Mar 2025

https://github.com/forjd/runtrail

Portable event trails for agentic dev workflows: commands, repo diffs, browser QA, CI failures, and repair prompts.

agentic-workflows browser-automation ci cli debugging developer-tools event-log github-actions jsonl observability qa repair-prompts rust

Last synced: 31 May 2026

https://github.com/hitmman55/claude-code-chat-viewer

Offline HTML viewer for Claude Code JSONL session transcripts. Six UI languages, light/dark theme, no backend, no CDN.

anthropic chat-log claude claude-code html-viewer i18n jsonl jsonl-viewer log-viewer offline-first public-domain single-file transcript-viewer zero-dependencies

Last synced: 21 Apr 2026

https://github.com/mrpudn/maltrends

(mirror) MyAnimeList.net manga and anime trend data.

anime data json jsonl jsonlines manga myanimelist

Last synced: 20 Apr 2026

https://github.com/banyc/dfsql

SQL REPL/lib for Data Frames

cli csv data-analysis jsonl ndjson repl sql

Last synced: 31 Jul 2025

https://github.com/impossibleforge/pfc-duckdb

DuckDB extension to read PFC-JSONL compressed log files with block-level timestamp filtering

analytics compression duckdb duckdb-extension jsonl log-compression sql structured-logs

Last synced: 25 Apr 2026

https://github.com/impossibleforge/pfc-jsonl

High-ratio JSONL compressor with block-level random access. ~9% ratio on log data - 25% smaller than gzip, 37% smaller than zstd. Free for personal and open-source use.

cli compression data-compression duckdb fluent-bit jsonl log-compression structured-logs

Last synced: 25 Apr 2026

https://github.com/pearxteam/bashim-scraper

A utility written in Kotlin that scraps all the quotes from Russian bash.im website and writes them into a single JSONL file.

bashim gradle gradle-kotlin-dsl json jsonl jsoup kotlin kotlinx-serialization scraper scraping scraping-websites

Last synced: 01 May 2026

https://github.com/fumito-ito/swiftyjsonlines

The better way to deal with JSONLines data in Swift.

jsonl jsonlines swift swift6

Last synced: 27 Feb 2026

https://github.com/sija/jsonl.cr

Crystal shard for handling JSONL (JSON Lines) parsing

crystal json jsonl

Last synced: 05 May 2026

https://github.com/ipanalytics/bogonforge

BogonForge is a reproducible compiler for IANA/RFC special-use IP and ASN policy.

asn bogon cidr iana ip-intelligence ipset ipv4 jsonl network-security nftables nginx rfc siem special-use threat-intelligence

Last synced: 25 May 2026

https://github.com/neverendingqs/pprint-ndjson-ui

For pretty-printing newline delimited JSON.

jsonl jsonlines ndjson netlify react-bootstrap reactjs

Last synced: 08 May 2026

https://github.com/kyleking/tail-jsonl

Tail JSONL Logs

cli jsonl logging

Last synced: 10 Apr 2025

https://github.com/sile/jsonlrpc

A JSON-RPC 2.0 library that streams JSON objects in JSON Lines format.

jsonl jsonrpc rust

Last synced: 03 Feb 2026

https://github.com/escoffier-labs/sourceharvest

SourceHarvest exports local source-system records to portable miseledger.adapter.v1 JSONL for archive, search, and evidence workflows.

ai-agents brigade cli evidence exporter go jsonl local-first miseledger sourceharvest

Last synced: 11 Jun 2026

https://github.com/escoffier-labs/stationtrail

StationTrail exports local station and agent-session logs to portable miseledger.adapter.v1 JSONL. Local-only, no network calls.

ai-agents brigade claude-code cli codex go jsonl local-first miseledger openclaw session-logs stationtrail

Last synced: 11 Jun 2026

https://github.com/annaleighsmith/bt

A minimal, local-first issue tracker for AI agents. Issues live in plain JSONL files alongside your code.

ai ai-agents beads claude claude-code cli developer-tools go issue-tracker jsonl

Last synced: 13 Jun 2026

https://github.com/dwalleck/rivets

A high-performance, Rust-based issue tracking system using JSONL storage

async cli issue-tracker jsonl project-management rust rust-lang

Last synced: 20 Jun 2026

https://github.com/zaibacu/json2jsonl

A tool to take large JSON and convert it to JSONL

json jsonl transform

Last synced: 11 Apr 2026

https://github.com/effet/cxusage

Codex usage analytics CLI that scans ~/.codex/sessions, aggregates tokens by day or model, and estimates cost using OpenRouter pricing.

anthropic claude cli codex command-line-tool cost-estimation daily-reports data-aggregation developer-tools jsonl log-analysis metrics nodejs observability openai openrouter reporting token-usage typescript usage-analytics

Last synced: 18 Apr 2026

https://github.com/sile/jlot

Command-line tool for JSON-RPC 2.0 over JSON Lines over TCP.

command-line-tool json-rpc jsonl rust

Last synced: 17 Feb 2026

https://github.com/mbe24/99problems

AI-native CLI tool for issue/PR context retrieval across Jira, GitLab, GitHub, and Bitbucket, with structured output for humans and AI agents plus a scaffold aligned with the Agent Skills standard.

agent-skills agentic-engineering ai-agents ai-native bitbucket-cloud bitbucket-datacenter cli developer-tools github-api gitlab-api issue-tracking jira-api json jsonl llm ndjson pull-requests rag rust yaml

Last synced: 13 Apr 2026

https://github.com/plaited/agent-eval-harness

Evaluate AI agents with Unix-style pipeline commands. Schema-driven adapters for any CLI agent, trajectory capture, pass@k metrics, and multi-run comparison.

agent-client-protocol agent-comparison agent-evaluation ai-agents bun cli eval-harness grader headless-adapter jsonl llm-evaluation pass-at-k trajectory-capture typescript unix-pipeline

Last synced: 20 Feb 2026

https://github.com/ondata/appaltipop

ETL scripts and issue tracking for AppaltiPOP project.

appaltipop csv elasticsearch json json-schema jsonl jupiter python

Last synced: 17 Mar 2026

https://github.com/ddjain/jsonl-visualizer

A beautiful web tool for visualizing JSONL files with syntax highlighting and multiple view modes

data-analysis json jsonl viusal

Last synced: 01 Jul 2026

https://github.com/effortlessmetrics/tokmd

Code intelligence for humans, machines, and LLMs: receipts, metrics, and insights from your codebase.

ai-tools ci cli code-metrics code-statistics csv devex github-actions jsonl llm llmops loc markdown monorepo repository-analysis rust sloc tokei tsv

Last synced: 15 Apr 2026

https://github.com/itsluketwist/jldc

Easily read/write JSONLines files that include dataclasses.

dataclasses dataclasses-json jsonl jsonlines

Last synced: 05 Apr 2025

https://github.com/abetomo/dump_to_jsonl

Generate JSONL from a `mysqldump` dump file.

jsonl mysqldump

Last synced: 29 Jun 2026

https://github.com/viggomeesters/go-project-template

Minimal .go template repository for repo-local agent workflows.

agent-workflow jsonl template

Last synced: 03 Jul 2026

https://github.com/viggomeesters/minimal-ai-vault-starter

Public-safe starter for a minimal AI-enabled Obsidian/JSONL vault

ai-vault jsonl obsidian personal-knowledge-management starter-kit

Last synced: 03 Jul 2026

https://github.com/viggomeesters/go-workflow-stack

Reusable CLI and schemas for repo-local .go agent workflow state.

agent-workflow developer-tools jsonl

Last synced: 03 Jul 2026

https://github.com/viggomeesters/jsonl-vault-spike

Synthetic JSONL-first vault/context proof of concept for agent-readable personal knowledge systems

agents context-engineering jsonl personal-knowledge-management synthetic-data

Last synced: 03 Jul 2026

https://github.com/iagolirapasssos/jsonl-file-generator

This is a simple web application that allows users to create JSONL files for fine-tuning OpenAI's GPT models. The application includes fields for entering "System", "User", and "Assistant" messages, stores data in the browser's local storage to prevent data loss on refresh, and supports multiple languages (English, Portuguese, Spanish, and Hindi).

finetuning gpt jsonl openai

Last synced: 18 May 2026

https://github.com/nightmarewalker/d-safelogger

Zero-dependency, stdlib-compatible Python logger with append-only routing, integrity sidecars, and multiprocess delivery-state observability.

append-only audit-logging configuration free-threading integrity jsonl log-rotation logging multi-processing observability pep-703 python python-free-threaded python-logging sha256 structured-logging windows zero-dependency

Last synced: 23 May 2026

https://github.com/adeladool/ner-news-articles

Named Entity Recognition (NER) on news articles using rule-based and spaCy models. Includes entity extraction, visualizations with displaCy, and comparison of small vs large spaCy models for analysis and insights.

csv data-science displacy jsonl machine-learning named-entity-recognition natural-language-processing natural-language-processing-nlp ner news-analysis python spacy text-mining text-processing visualization

Last synced: 16 May 2026

https://github.com/allendema/rewe_dl

Call store APIs. Parse responses and save the data to SQL, JSON or any custom format.

apprise discounts httpx json jsonl market price-comparison price-tracker products python3 ruff sqlite3 webhooks

Last synced: 21 Aug 2025

https://github.com/nikopastore/agent-memory-site

Turn Markdown/Obsidian AI-agent memory vaults into semantic HTML dashboards, search indexes, and retrieval-ready JSONL chunks.

agent-memory ai-agent claude codex developer-tools jsonl knowledge-base llm local-first markdown obsidian privacy rag retrieval semantic-html static-site static-site-generator typescript

Last synced: 24 May 2026

https://github.com/kadubon/bottleneck-audit-toolkit

Offline, fail-closed verifier for JSONL telemetry event logs. Emits deterministic audit certificates + human summaries with explicit claims/non-claims for bottleneck and integrity review.

ai audit bottleneck checkpointing distributed-training event-logs jsonl mlops offline-verification performance-monitoring silent-data-corruption tail-latency telemetry

Last synced: 13 Jan 2026

https://github.com/kadubon/oasg

Local-first, model-agnostic workflow optimizer for long-running AI agents: observable JSONL ledgers, deterministic reducers, no-meta gates, and receipt-backed self-improvement without LLM judges or model-weight updates.

agent-evaluation agent-memory agent-workflows ai-agents autonomous-agents deterministic-replay jsonl long-running-agents model-agnostic no-meta ollama python self-improving-agents verification workflow-automation workflow-optimization

Last synced: 30 May 2026

https://github.com/nlink-jp/lookup

A powerful Go-based CLI tool to enrich JSON/JSONL data streams by looking up values in CSV or JSON files. Inspired by Splunk's lookup command.

cli csv data-enrichment data-pipeline devpos-tools go golang json jsonl lookup splunk

Last synced: 14 Apr 2026

https://github.com/plaited/acp-harness

CLI for agent evaluation. Capture trajectories, run trials with pass@k metrics, and score with polyglot graders (TypeScript, Python, any language).

acp agent-client-protocol agent-evaluation ai-agents bun cli eval-harness grader jsonl llm-evaluation pass-at-k trajectory-capture typescript

Last synced: 21 Jan 2026

https://github.com/cahlen/conversation-dataset-generator

Craft conversational datasets (JSONL format with rich metadata) using LLMs. Specify parameters manually or use a creative brief for LLM-generated arguments with automatic topic/scenario variation. Optional web search improves persona grounding. Ideal for LoRA tuning, persona training, and creative writing. Includes Hugging Face Hub upload.

dataset-generation dialogue-generation fine-tuning huggingface jsonl llm lora nlp peft persona python synthentic-data transformers

Last synced: 27 Oct 2025

https://github.com/sdkks/nesdit

nesdit aka NestedEdit: Edit and convert your json(l),yaml,toml files in place or on the fly through streaming

ansible argo argocd configuration jq json jsonl kubernetes ndjson serialization spinnaker toml yaml yml yq

Last synced: 01 Jun 2026

https://github.com/arthur87/asa_json

asa_json is a library for conveniently JSON

csv gem json jsonl ruby

Last synced: 12 Feb 2026

https://github.com/muka/tap-jsonl

A tap to process jsonl files

jsonl meltano singer tap-jsonl

Last synced: 03 Apr 2026

https://github.com/zurd46/zurdsynthdatagen

This Electron project uses the OpenAI ChatCompletion API to generate synthetic datasets in either German (DE) or English (EN).

data data-structures dataset electron json jsonl nodejs openai synthetic

Last synced: 04 Apr 2026

https://github.com/mrizaln/octave-ndjson

Newline Delimited JSON (ndjson) or JSON Lines (jsonl) parser for Octave

json jsonl multithreading ndjson octave parser

Last synced: 20 Apr 2026

https://github.com/kuaner/cc-reader

A macOS app for reading and managing Claude Code session history.

claude-code jsonl macos swift swiftui xcodegen

Last synced: 21 Apr 2026

https://github.com/dbunk903/codex-oss-lens

Local-first usage dashboard for OpenAI Codex session logs.

codex jsonl openai oss-maintenance usage-dashboard

Last synced: 05 Jun 2026

https://github.com/siddhesh-agarwal/openai-ft-validate

A Go-based CLI tool that verifies the structure of JSONL files for OpenAI fine-tuning.

cli file-validation golang jsonl openai

Last synced: 21 Apr 2026

https://github.com/puukis/ccparse

TypeScript SDK for discovering, parsing, normalizing, and querying Claude Code local session data. npm: ccparse-sdk

claude-code jsonl parser sdk typescript

Last synced: 25 Apr 2026

https://github.com/codersacademy006/data-sanitizer

AI-powered data sanitizer with schema detection, dedupe, outlier detection, and LLM enrichment.

csv-cleaning data-cleaning data-engineering data-enrichment data-pipeline data-quality etl jsonl outlier-detection sqlite streaming-pipeline

Last synced: 06 Jun 2026

https://github.com/takanoriyanagitani/json2strings

convert json to string array(json -> json string array)

convert flat flatten json jsonl rust wasi

Last synced: 29 Apr 2026

https://github.com/takanoriyanagitani/jsonl2jsons

convert jsonl(ldjson, ndjson, json-stream, ...) to json(s)

c json json-stream jsonl jsons ldjson ndjson

Last synced: 02 May 2026