{"id":50546366,"url":"https://github.com/alerque/acceptarium","last_synced_at":"2026-06-03T23:04:05.854Z","repository":{"id":344901412,"uuid":"1181230573","full_name":"alerque/acceptarium","owner":"alerque","description":"Tools to facilitate scanning receipts, extracting useful data, archiving the assets, and importing the results into plain text accounting systems.","archived":false,"fork":false,"pushed_at":"2026-04-25T11:31:48.000Z","size":783,"stargazers_count":3,"open_issues_count":0,"forks_count":0,"subscribers_count":0,"default_branch":"master","last_synced_at":"2026-04-29T22:45:50.194Z","etag":null,"topics":["beancount","cli","git-annex","hledger","ledger-cli","llm","ocr","plaintext-accounting"],"latest_commit_sha":null,"homepage":"https://codeberg.org/plaintextaccounting/acceptarium","language":"Rust","has_issues":false,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/alerque.png","metadata":{"files":{"readme":"README.md","changelog":"CHANGELOG.md","contributing":null,"funding":null,"license":"LICENSES/0BSD.txt","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null,"notice":null,"maintainers":null,"copyright":null,"agents":null,"dco":null,"cla":null}},"created_at":"2026-03-13T22:21:54.000Z","updated_at":"2026-04-25T11:31:52.000Z","dependencies_parsed_at":null,"dependency_job_id":null,"html_url":"https://github.com/alerque/acceptarium","commit_stats":null,"previous_names":["alerque/acceptarium"],"tags_count":1,"template":false,"template_full_name":null,"purl":"pkg:github/alerque/acceptarium","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/alerque%2Facceptarium","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/alerque%2Facceptarium/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/alerque%2Facceptarium/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/alerque%2Facceptarium/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/alerque","download_url":"https://codeload.github.com/alerque/acceptarium/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/alerque%2Facceptarium/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":33883158,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-05-26T15:22:16.424Z","status":"online","status_checked_at":"2026-06-03T02:00:06.370Z","response_time":59,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["beancount","cli","git-annex","hledger","ledger-cli","llm","ocr","plaintext-accounting"],"created_at":"2026-06-03T23:04:04.751Z","updated_at":"2026-06-03T23:04:05.841Z","avatar_url":"https://github.com/alerque.png","language":"Rust","funding_links":[],"categories":[],"sub_categories":[],"readme":"# acceptarium\n\nA collection of tooling to facilitate scanning receipts, extracting useful data, archiving the assets, and importing the results into [Plain Text Accounting][pta] systems.\n\naccipiō\n:   (*Classical Latin*) [akˈkɪ.pi.oː] to receive, accept\n\nacceptarius\n:   (*Latin*) allotment-holding\n:   (*Medieval*) receipt book\n\n----\n\n# Overview\n\n```mermaid\n---\nconfig:\n  layout: elk\n  look: handDrawn\n  theme: redux-dark\n---\nflowchart LR\n  A[\"Ingest/Scan\"]\n  B[\"ID (Store)\"]\n  C[\"Traditional OCR\"]\n  D[\"Regex Extract\"]\n  E[\"Rules\"]\n  F[\"Review/Edit\"]\n  G[\"Export\"]\n  L1[\"LLM Vision\"]\n  L2[\"LLM Extract\"]\n  L3[\"Retrain\"]\n  A --\u003e B --\u003e C \u0026 L1 --\u003e D \u0026 L2 --\u003e F --\u003e G\n  F --\u003e E \u0026 L3\n  E --\u003e D\n  L3 --\u003e L2\n  style L1 stroke-dasharray: 5\n  style L2 stroke-dasharray: 5\n  style L3 stroke-dasharray: 5\n```\n\n1. Scan or import scanned receipts, individually or in bulk.\n1. Store identifiable scanned assets using [Git Annex][gitannex] or pluggable backends (LFS? WebDAV?).\n1. **Optionally** extract data via OCR using local LLM tooling ([Ollama][ollama] or pluggable remote tooling).\n1. **Optionally** automatically process data into structured transaction info (via local LLM tooling or pattern matching).\n1. Facilitate either manual data entry or automatic data extraction with review and a chance to chance to edit.\n1. **Optionally** use final data to update regex rules or train the LLM model to improve future extractions.\n1. Export extracted data as transaction(s) via CVS? JSON? (or possibly directly to journal for [HLedger][hledger], [Ledger CLI][ledgercli], [Beancount][beancount], etc.).\n\n## Goals\n\n* Automate as many steps as possible to make it easy to handle receipts (and possibly invoices, etc.) in bulk.\n* Disable all LLM related features by default and remain functional without them requiring explicit opt-in for use.\n* Use only local-first privacy-preserving tooling by default — even where LLMs may be involved.\n* Facilitate human review/approval and fully featured editing for any non-deterministic steps like LLM or OCR based meta-data extraction.\n* Allow re-processing data from initial assets in the event of improved tooling (better OCR, more journal import rules, etc.).\n\n## Non-goals\n\n* Avoid lock-in to any particular PTA solution (pair with [HLedger][hledger], [Ledger CLI][ledgercli], [Beancount][beancount], or similar journal tools)\n* Avoid dictating the entire accounting workflow; people have their own data handling already, we just want to mix in digitized assets.\n\n[beancount]: https://beancount.io/\n[gitannex]: https://git-annex.branchable.com/\n[hledger]: https://hledger.org/\n[ledgercli]: https://ledger-cli.org/\n[ollama]: https://ollama.com/\n[pta]: https://plaintextaccounting.org/\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Falerque%2Facceptarium","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Falerque%2Facceptarium","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Falerque%2Facceptarium/lists"}