An open API service indexing awesome lists of open source software.

https://github.com/node0/timbermill

OCR-powered chat session renderer that slices long conversations into paginated, searchable PDFs
https://github.com/node0/timbermill

chat-archive chatgpt cv2 document-processing llm-tools ocr pdf-generation python

Last synced: about 1 month ago
JSON representation

OCR-powered chat session renderer that slices long conversations into paginated, searchable PDFs

Awesome Lists containing this project

README

        

# Project Timbermill

**OCR-powered chat session renderer that slices long conversations into paginated, searchable PDFs.**

Timbermill processes full-length ChatGPT-style conversation screenshots or PDFs by cutting them into equal-length pages at logical whitespace breaks using OpenCV. Once segmented, each page is either OCR’d individually before assembly or assembled into a unified PDF with embedded searchable text, depending on pipeline strategy. This project aims to streamline the archival and export of mobile LLM chat logs, especially from iOS where traditional exports are cumbersome. It is designed for short-lived, high-utility deployment.