https://github.com/ebowwa/chatgpt-export-processor
π€ Extract, analyze & search your ChatGPT conversations locally | Privacy-first tool for OpenAI ChatGPT data export processing | Python CLI with embeddings support
https://github.com/ebowwa/chatgpt-export-processor
ai-tools chatgpt chatgpt-export chatgpt-tools cli conversation-analysis data-analysis data-extraction embeddings local-first nlp openai openai-api privacy python
Last synced: about 1 month ago
JSON representation
π€ Extract, analyze & search your ChatGPT conversations locally | Privacy-first tool for OpenAI ChatGPT data export processing | Python CLI with embeddings support
- Host: GitHub
- URL: https://github.com/ebowwa/chatgpt-export-processor
- Owner: ebowwa
- License: mit
- Created: 2025-07-20T19:18:59.000Z (11 months ago)
- Default Branch: main
- Last Pushed: 2025-07-20T21:41:28.000Z (11 months ago)
- Last Synced: 2025-09-19T15:36:06.069Z (9 months ago)
- Topics: ai-tools, chatgpt, chatgpt-export, chatgpt-tools, cli, conversation-analysis, data-analysis, data-extraction, embeddings, local-first, nlp, openai, openai-api, privacy, python
- Language: Python
- Size: 47.9 KB
- Stars: 1
- Watchers: 0
- Forks: 0
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# ChatGPT Export Processor π€
**Extract, analyze, and search your ChatGPT conversations locally with complete privacy**
[](https://www.python.org/downloads/)
[](LICENSE)
[](https://github.com/ebowwa/chatgpt-export-processor)
A powerful Python CLI tool for processing ChatGPT data exports from OpenAI. Extract your conversations, analyze metadata, generate embeddings, and search through your AI chat history - all while keeping your data 100% private and local.
## π Key Features
- **ποΈ ChatGPT Export Processing**: Seamlessly extract and organize your ChatGPT conversation exports
- **π Metadata Analysis**: Analyze conversation statistics, message counts, and file sizes
- **π Local Search** (coming soon): Search through your conversations with embeddings
- **π 100% Private**: All processing happens on your machine - no data ever leaves your device
- **β‘ Fast CLI**: Efficient command-line interface for batch processing
- **π§© Extensible**: Modular architecture ready for custom analysis plugins
- **π€ OpenAI Compatible**: Works with official ChatGPT data exports
## π¦ Installation
```bash
# Clone the repository
git clone https://github.com/ebowwa/chatgpt-export-processor.git
cd chatgpt-export-processor
# Install dependencies (optional, for future features)
pip install -r requirements.txt # Coming soon
```
## π― Quick Start
### 1οΈβ£ Export your ChatGPT data
Go to [ChatGPT Settings](https://chat.openai.com/settings) β Data Controls β Export data
### 2οΈβ£ Process your export
```bash
python -m interfaces.cli process your-chatgpt-export.zip
```
### 3οΈβ£ Explore your data
```bash
# List all extracted conversations
python -m interfaces.cli list
# Analyze metadata for specific dataset
python -m interfaces.cli metadata ./user-data/2025-07-20_Sunday_12-04-32
# Get help
python -m interfaces.cli --help
```
## π Project Structure
```
chatgpt-export-processor/
βββ interfaces/ # User interfaces (CLI, API, etc.)
β βββ cli/ # Command-line interface
βββ src/ # Core functionality
β βββ uploading/ # Extraction and metadata utilities
βββ main.py # Main processing engine
βββ .gitignore # Protects your personal data
βββ README.md # Documentation
```
## π Privacy & Security
**Your conversations never leave your machine:**
- β
100% local processing - no cloud, no external APIs
- β
Your data stays in `user-data/` (automatically gitignored)
- β
No telemetry, no tracking, no data collection
- β
Open source - inspect every line of code
## πΊοΈ Roadmap
- [ ] **Embeddings Generation** - Semantic search through conversations
- [ ] **Vector Database** - Efficient similarity search with FAISS/ChromaDB
- [ ] **Advanced Analytics** - Conversation insights and patterns
- [ ] **Export Formats** - JSON, CSV, Markdown exports
- [ ] **Web UI** - Browser-based interface
- [ ] **API Server** - REST API for integrations
- [ ] **LLM Fine-tuning** - Prepare data for model training
## π€ Contributing
We welcome contributions! Please see [CONTRIBUTING.md](CONTRIBUTING.md) for guidelines.
**Important**: Never commit personal conversation data. Check `.gitignore` before pushing.
## π License
MIT License - see [LICENSE](LICENSE) file for details.
## π Related Projects
- [ChatGPT](https://chat.openai.com) - OpenAI's conversational AI
- [OpenAI API](https://platform.openai.com) - Build with GPT models
- [LangChain](https://github.com/langchain-ai/langchain) - LLM application framework
## π Support
- π [Report bugs](https://github.com/ebowwa/chatgpt-export-processor/issues)
- π‘ [Request features](https://github.com/ebowwa/chatgpt-export-processor/issues)
- π [Documentation](https://github.com/ebowwa/chatgpt-export-processor/wiki)
---
**Keywords**: ChatGPT export, OpenAI data export, conversation analysis, ChatGPT backup, AI chat history, local ChatGPT search, privacy-first AI tools, ChatGPT data processing, conversation embeddings, ChatGPT analytics