https://github.com/flamehaven01/flamehaven-filesearch

Self-hosted RAG search engine — 34 formats, BM25+hybrid search, multi-LLM (Gemini/OpenAI/Claude/Ollama), FastAPI + Docker, production-ready in 3 min
Host: GitHub
URL: https://github.com/flamehaven01/flamehaven-filesearch
Owner: flamehaven01
License: mit
Created: 2025-11-11T16:58:05.000Z (9 months ago)
Default Branch: main
Last Pushed: 2026-04-19T17:11:23.000Z (3 months ago)
Last Synced: 2026-04-19T19:28:13.415Z (3 months ago)
Topics: bm25, crewai, docker, document-parsing, document-search, fastapi, haystack, hybrid-search, knowledge-base, langchain, llamaindex, llm, ollama, open-source, python, rag, self-hosted, semantic-search, vector-search
Language: Python
Homepage: https://flamehaven.space/work/flamehaven-filesearch/
Size: 5.47 MB
Stars: 96
Watchers: 2
Forks: 12
Open Issues: 0
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- Contributing: CONTRIBUTING.md
- License: LICENSE
- Security: SECURITY.md
- Roadmap: ROADMAP.md

# FLAMEHAVEN FileSearch

### Self-hosted RAG search engine. Production-ready in 3 minutes.

[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](LICENSE)

[![Version](https://img.shields.io/badge/version-1.6.1-blue.svg)](CHANGELOG.md)

[![Python](https://img.shields.io/badge/python-3.8+-blue.svg)](https://www.python.org/)

[![Docker](https://img.shields.io/badge/docker-ready-brightgreen.svg)](https://hub.docker.com/r/flamehaven/filesearch)

[Quick Start](#quick-start-) • [Features](#features-) • [Documentation](#documentation-) • [API Reference](http://localhost:8000/docs) • [Contributing](#contributing-)



---

## 🎯 Why FLAMEHAVEN FileSearch?

Stop sending your sensitive documents to third-party services. FLAMEHAVEN FileSearch is a production-grade RAG search engine — BM25+hybrid retrieval, 34 file formats, multi-LLM (Gemini, OpenAI, Claude, Ollama) — running self-hosted in minutes, not days.

```bash
# One command. Three minutes. Done.
docker run -d -p 8000:8000 -e GEMINI_API_KEY="your_key" flamehaven-filesearch:1.6.1
```

| 🚀 Fast | 🔒 Private | 💰 Cost-Effective |
|---|---|---|
| Production deployment in 3 minutes | 100% self-hosted | Free tier: 1,500 queries/month |
| Vector generation in <1ms | Your data never leaves your infrastructure | No infrastructure costs |
| Zero ML dependencies | Enterprise-grade security | Open source & MIT licensed |

---

## Features ✨

### Core Capabilities

| Capability | Detail |
|---|---|
| **Search Modes** | Keyword, semantic, and hybrid (BM25+RRF) with automatic typo correction |
| **34 File Formats** | PDF, DOCX/DOC, XLSX, PPTX, RTF, HTML, CSV, LaTeX, WebVTT, images + plain text; see [Document Parsing](docs/wiki/Document_Parsing.md) |
| **RAG Pipeline** | Structure-aware chunking, KnowledgeAtom 2-level indexing, sliding-window context enrichment, mtime parse cache |
| **Ultra-Fast Vectors** | DSP v2.0 generates embeddings in <1ms with no ML frameworks required |
| **Source Attribution** | Every answer links back to the originating document and chunk |
| **Framework SDKs** | LangChain, LlamaIndex, Haystack, CrewAI adapters out of the box |
| **Enterprise Auth** | API key hashing (SHA256+salt), OAuth2/OIDC, fine-grained permissions |
| **Admin Dashboard** | Real-time metrics, quota management, batch processing (1–100 queries) |
| **Flexible Storage** | SQLite (default) · PostgreSQL + pgvector · Redis cache (optional) |

> **What changed in each release?** See [CHANGELOG.md](CHANGELOG.md) for the full version history.
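
Hybrid mode combines a BM25 keyword ranking with a semantic ranking using Reciprocal Rank Fusion (RRF). The sketch below illustrates the general RRF technique only. The function name, the `k=60` constant (a commonly used default) and the sample document IDs are assumptions for illustration, not FLAMEHAVEN's internal code.

```python
from collections import defaultdict
from typing import Dict, List, Sequence


def reciprocal_rank_fusion(rankings: Sequence[Sequence[str]], k: int = 60) -> List[str]:
    """Fuse several ranked lists of document IDs into one list (illustrative)."""
    scores: Dict[str, float] = defaultdict(float)
    for ranking in rankings:
        # Each list contributes 1 / (k + rank) for every document it contains.
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties are broken alphabetically for stability.
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


# Hypothetical results from a keyword pass and a semantic pass.
bm25_hits = ["handbook.pdf", "policy.docx", "faq.html"]
semantic_hits = ["policy.docx", "faq.html", "notes.txt"]

fused = reciprocal_rank_fusion([bm25_hits, semantic_hits])
print(fused)
```

Documents that appear in both lists accumulate two contributions, so they tend to outrank a document that only one retriever placed first.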

---

## Quick Start 🚀

### Option 1: Docker (Recommended)

The fastest path to production:

```bash
docker run -d \
  -p 8000:8000 \
  -e GEMINI_API_KEY="your_gemini_api_key" \
  -e FLAMEHAVEN_ADMIN_KEY="secure_admin_password" \
  -v "$(pwd)/data:/app/data" \
  flamehaven-filesearch:1.6.1
```

✅ Server running at `http://localhost:8000`

### Option 2: Python SDK

Perfect for integrating into existing applications:

```python
from flamehaven_filesearch import FlamehavenFileSearch, FileSearchConfig

# Initialize
config = FileSearchConfig(google_api_key="your_gemini_key")
fs = FlamehavenFileSearch(config)

# Upload and search
fs.upload_file("company_handbook.pdf", store="docs")
result = fs.search("What is our remote work policy?", store="docs")

print(result['answer'])
# Output: "Employees can work remotely up to 3 days per week..."
```

### Option 3: REST API

For language-agnostic integration:

```bash
# 1. Generate API key
curl -X POST http://localhost:8000/api/admin/keys \
  -H "X-Admin-Key: your_admin_key" \
  -H "Content-Type: application/json" \
  -d '{"name":"production","permissions":["upload","search"]}'

# 2. Upload document
curl -X POST http://localhost:8000/api/upload/single \
  -H "Authorization: Bearer sk_live_abc123..." \
  -F "file=@document.pdf" \
  -F "store=my_docs"

# 3. Search
curl -X POST http://localhost:8000/api/search \
  -H "Authorization: Bearer sk_live_abc123..." \
  -H "Content-Type: application/json" \
  -d '{
    "query": "What are the main findings?",
    "store": "my_docs",
    "search_mode": "hybrid"
  }'
```
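
The search call can also be issued from Python using only the standard library. This is a minimal sketch that reuses the endpoint, headers and payload fields from the curl example; the helper name `build_search_request` is hypothetical, and the shape of the JSON response is not assumed, so it is printed as-is.

```python
import json
import urllib.request

BASE_URL = "http://localhost:8000"   # server address from the Quick Start
API_KEY = "sk_live_abc123..."        # placeholder; use the key returned in step 1


def build_search_request(query: str, store: str, mode: str = "hybrid") -> urllib.request.Request:
    """Build (but do not send) a POST request for /api/search."""
    payload = {"query": query, "store": store, "search_mode": mode}
    return urllib.request.Request(
        url=f"{BASE_URL}/api/search",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {API_KEY}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


request = build_search_request("What are the main findings?", "my_docs")

# Sending needs a running server, so the call is left commented out:
# with urllib.request.urlopen(request, timeout=30) as response:
#     print(json.loads(response.read()))
```

Separating request construction from sending keeps the payload easy to inspect or unit-test without a live server.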

---

## 📦 Installation

```bash
# Core package (HTML, CSV, LaTeX, WebVTT, plain-text parsing included; zero extra deps)
pip install flamehaven-filesearch

# Extras are quoted so shells such as zsh do not expand the square brackets.

# + Document parsers: PDF (pymupdf/pypdf), DOCX, XLSX, PPTX, RTF
pip install "flamehaven-filesearch[parsers]"

# + Image OCR (Pillow + pytesseract; requires Tesseract system binary)
pip install "flamehaven-filesearch[vision]"

# + Google Gemini API
pip install "flamehaven-filesearch[google]"

# + REST API server (FastAPI + uvicorn)
pip install "flamehaven-filesearch[api]"

# + HNSW vector index
pip install "flamehaven-filesearch[vector]"

# + PostgreSQL backend
pip install "flamehaven-filesearch[postgres]"

# Everything
pip install "flamehaven-filesearch[all]"

# Build from source
git clone https://github.com/flamehaven01/Flamehaven-Filesearch.git
cd Flamehaven-Filesearch
docker build -t flamehaven-filesearch:1.6.1 .
```

### Framework Integrations

Framework SDKs (LangChain, LlamaIndex, etc.) are imported lazily, so install only what you need:

```python
# LangChain  (pip install langchain-core)
from flamehaven_filesearch.integrations import FlamehavenLangChainLoader
docs = FlamehavenLangChainLoader("report.pdf", chunk=True).load()

# LlamaIndex  (pip install llama-index-core)
from flamehaven_filesearch.integrations import FlamehavenLlamaIndexReader
nodes = FlamehavenLlamaIndexReader(chunk=True).load_data(["report.pdf", "slides.pptx"])

# Haystack  (pip install haystack-ai)
from flamehaven_filesearch.integrations import FlamehavenHaystackConverter
result = FlamehavenHaystackConverter().run(sources=["report.pdf"])

# CrewAI  (pip install crewai)
from flamehaven_filesearch.integrations import FlamehavenCrewAITool
tool = FlamehavenCrewAITool()           # pass to your agent's tools list
```

---

## Configuration ⚙️

### Required Environment Variables

```bash
export GEMINI_API_KEY="your_google_gemini_api_key"
export FLAMEHAVEN_ADMIN_KEY="your_secure_admin_password"
```

### Optional Configuration

```bash
export HOST="0.0.0.0"              # Bind address
export PORT="8000"                  # Server port
export REDIS_HOST="localhost"       # Distributed caching
export REDIS_PORT="6379"            # Redis port
```

### Advanced Configuration

Create a `config.yaml` for fine-tuned control:

```yaml
vector_store:
  quantization: int8
  compression: gravitas_pack

search:
  default_mode: hybrid
  typo_correction: true
  max_results: 10

security:
  rate_limit: 100  # requests per minute
  max_file_size: 52428800  # 50MB
```

---

## 📊 Performance

| Metric | Value | Notes |
|---|---|---|
| Vector Generation | <1ms | DSP v2.0, zero ML dependencies |
| Memory Footprint | 75% reduced | Int8 quantization vs float32 |
| Metadata Size | 90% smaller | Gravitas-Pack compression |
| Test Suite | 476 tests | All passing (pytest) |
| Cold Start | 3 seconds | Docker container ready |
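
The 75% memory figure follows from storage width: a float32 component takes 4 bytes and an int8 component takes 1. The sketch below shows one simple symmetric quantization scheme to make the arithmetic concrete. It is an assumption for illustration, not the project's actual quantizer, and the function names are hypothetical.

```python
from array import array
from typing import List, Sequence, Tuple


def quantize_int8(vector: Sequence[float]) -> Tuple[array, float]:
    """Map floats onto signed 8-bit integers using one shared step size."""
    peak = max(abs(v) for v in vector) or 1.0   # avoid dividing by zero
    step = peak / 127
    return array("b", (round(v / step) for v in vector)), step


def dequantize(codes: array, step: float) -> List[float]:
    """Recover approximate floats from the int8 codes."""
    return [code * step for code in codes]


vector = [0.5, -1.0, 0.25, 0.0]
codes, step = quantize_int8(vector)

float32_bytes = array("f", vector).itemsize * len(vector)   # 4 bytes per value
int8_bytes = codes.itemsize * len(codes)                    # 1 byte per value
saving = 1 - int8_bytes / float32_bytes
print(f"{float32_bytes} bytes -> {int8_bytes} bytes ({saving:.0%} smaller)")
```

The price of the smaller footprint is a rounding error of at most half a step per component, which `dequantize` makes visible.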

### Real-World Benchmarks

```
Environment: Docker on Apple M1 Mac, 16GB RAM
Document Set: 500 PDFs, ~2GB total

Health Check:           8ms
Search (cache hit):     9ms
Search (cache miss):    1,250ms  (includes Gemini API call)
Batch Search (10):      2,500ms  (parallel processing)
Upload (50MB file):     3,200ms  (with indexing)
```
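
Because a cache miss is far slower than a hit, average search latency depends mostly on the cache hit rate. The short calculation below uses only the two search figures from the benchmark above and assumes a simple weighted average; real traffic will vary with query mix and Gemini API latency.

```python
CACHE_HIT_MS = 9        # "Search (cache hit)" from the benchmark
CACHE_MISS_MS = 1250    # "Search (cache miss)", includes the Gemini API call


def expected_latency_ms(hit_rate: float) -> float:
    """Weighted average of hit and miss latency for a given cache hit rate."""
    if not 0.0 <= hit_rate <= 1.0:
        raise ValueError("hit_rate must be between 0 and 1")
    return hit_rate * CACHE_HIT_MS + (1.0 - hit_rate) * CACHE_MISS_MS


for rate in (0.0, 0.5, 0.9):
    print(f"hit rate {rate:.0%}: ~{expected_latency_ms(rate):.0f} ms")
```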

---

## Architecture 🏗️

```mermaid
flowchart TD
    Client(["Client\n(HTTP / SDK)"])

    subgraph API["REST API Layer (FastAPI)"]
        Upload["/api/upload"]
        Search["/api/search"]
        Admin["/api/admin"]
    end

    subgraph Engine["Engine Layer"]
        FP["FileParser\n+ BackendRegistry\n(34 formats)"]
        Cache["ParseCache\n(mtime-based)"]
        Chunker["TextChunker\n+ KnowledgeAtom\n(chunk atoms)"]
        DSP["DSP v2.0\nEmbedding Generator\n(<1ms, zero-ML)"]
        BM25["BM25 + RRF\nHybrid Search\n(v1.6.0)"]
        Scorer["SemanticScorer\n+ TypoCorrector"]
    end

    subgraph Storage["Storage Layer"]
        SQLite[("SQLite\nMetadata Store")]
        Vec[("Vector Store\n(local / pgvector)")]
        Redis[("Redis Cache\n(optional)")]
    end

    Gemini["Google Gemini API\n(reasoning)"]
    Metrics["Metrics Logger"]

    Client --> Upload & Search & Admin
    Upload --> FP
    FP <-->|"cache hit/miss"| Cache
    FP --> Chunker
    Chunker --> DSP
    DSP --> Vec
    FP --> SQLite
    Search --> Scorer
    Scorer --> DSP
    DSP --> Vec
    Scorer --> Gemini
    Gemini --> Client
    Admin --> Metrics
    Admin --> SQLite
    Storage <-->|"read / write"| Redis
```

> Full layer detail: [Architecture.md](docs/wiki/Architecture.md)
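
The `ParseCache` node in the diagram is labelled mtime-based: a file is reparsed only when its modification time changes. A minimal sketch of that idea follows; the class name `MtimeParseCache` and the helper `read_plain_text` are hypothetical and are not the library's actual classes.

```python
import os
from typing import Callable, Dict, Tuple


class MtimeParseCache:
    """Cache parsed text per path; reparse only when the file's mtime changes."""

    def __init__(self, parser: Callable[[str], str]) -> None:
        self._parser = parser
        self._entries: Dict[str, Tuple[int, str]] = {}
        self.misses = 0   # counts how often the parser actually ran

    def extract_text(self, path: str) -> str:
        mtime = os.stat(path).st_mtime_ns
        cached = self._entries.get(path)
        if cached is not None and cached[0] == mtime:
            return cached[1]              # cache hit: file unchanged
        self.misses += 1
        text = self._parser(path)         # cache miss: parse and remember
        self._entries[path] = (mtime, text)
        return text


def read_plain_text(path: str) -> str:
    """Stand-in parser for the sketch: read a UTF-8 text file."""
    with open(path, encoding="utf-8") as handle:
        return handle.read()
```

Keying on modification time avoids hashing file contents on every lookup, at the cost of missing edits that preserve the timestamp.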

---

## Security 🔒

FLAMEHAVEN takes security seriously:

- ✅ **API Key Hashing** - SHA256 with salt

- ✅ **Rate Limiting** - Per-key quotas (default: 100/min)

- ✅ **Permission System** - Granular access control

- ✅ **Audit Logging** - Complete request history

- ✅ **OWASP Headers** - Security headers enabled by default

- ✅ **Input Validation** - Strict file type and size checks
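
To illustrate the salted SHA256 hashing listed above, here is a minimal sketch using Python's standard library. The function names and the 16-byte salt length are assumptions; the `sk_live_` prefix is borrowed from the API examples. How FLAMEHAVEN actually stores keys and salts is not shown in this README.

```python
import hashlib
import hmac
import secrets
from typing import Tuple


def hash_api_key(api_key: str, salt: bytes) -> str:
    """Return the hex SHA256 digest of salt + key."""
    return hashlib.sha256(salt + api_key.encode("utf-8")).hexdigest()


def issue_api_key() -> Tuple[str, bytes, str]:
    """Create a key for the caller plus the (salt, digest) pair to store."""
    api_key = "sk_live_" + secrets.token_urlsafe(32)
    salt = secrets.token_bytes(16)
    return api_key, salt, hash_api_key(api_key, salt)


def verify_api_key(candidate: str, salt: bytes, stored_digest: str) -> bool:
    """Compare digests in constant time to avoid timing side channels."""
    return hmac.compare_digest(hash_api_key(candidate, salt), stored_digest)
```

Only the salt and digest need to be persisted, so a leaked database does not directly reveal usable keys.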

### Security Best Practices

```bash
# Use strong admin keys
export FLAMEHAVEN_ADMIN_KEY=$(openssl rand -base64 32)

# Enable HTTPS in production
# (use nginx/traefik as reverse proxy)

# Rotate API keys regularly
curl -X DELETE http://localhost:8000/api/admin/keys/old_key_id \
  -H "X-Admin-Key: $FLAMEHAVEN_ADMIN_KEY"
```
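
The per-key quota (default: 100/min) can be pictured as a sliding-window counter kept per API key. The sketch below is one common way to implement such a limit and is an assumption, not the project's implementation; the class name and the injectable `clock` parameter are hypothetical.

```python
import time
from collections import defaultdict, deque
from typing import Callable, Deque, Dict


class PerKeyRateLimiter:
    """Allow at most `limit` requests per key within a sliding time window."""

    def __init__(self, limit: int = 100, window_seconds: float = 60.0,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self.limit = limit
        self.window = window_seconds
        self._clock = clock
        self._hits: Dict[str, Deque[float]] = defaultdict(deque)

    def allow(self, api_key_id: str) -> bool:
        now = self._clock()
        hits = self._hits[api_key_id]
        # Drop timestamps that have aged out of the window.
        while hits and now - hits[0] >= self.window:
            hits.popleft()
        if len(hits) >= self.limit:
            return False                  # quota exhausted for this key
        hits.append(now)
        return True
```

A rejected request would typically be answered with HTTP 429. Note that this in-memory version only works for a single process; multi-instance deployments need shared state such as Redis.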

---

## Roadmap 🗺️

Full roadmap: [ROADMAP.md](ROADMAP.md)

### v1.4.x (Completed)

- [x] Multimodal search (image + text)

- [x] HNSW vector indexing for faster search

- [x] OAuth2/OIDC integration

- [x] PostgreSQL backend (metadata + pgvector)

- [x] Usage-budget controls and reporting

- [x] pgvector tuning and reliability hardening

- [x] CI/CD — ruff replaces flake8; pipelines fully green

### v1.5.x (Completed)

- [x] Universal Document Parser — 34 formats, zero doc-AI dependency (v1.5.0)

- [x] Internal text chunker — structure-aware + token-aware, zero ML deps (v1.5.0)

- [x] Framework integrations — LangChain, LlamaIndex, Haystack, CrewAI (v1.5.0)

- [x] Backend Plugin Architecture — `AbstractFormatBackend` + `BackendRegistry` (v1.5.2)

- [x] Parse cache — mtime-based, `extract_text(use_cache=True)` (v1.5.2)

- [x] ContextExtractor — sliding-window RAG chunk enrichment (v1.5.2)

- [x] Multi-provider LLM support — OpenAI, Claude, Ollama, Gemini (v1.5.3)

### v1.6.0 (Completed)

- [x] BM25 + RRF hybrid search — Korean+English tokenizer, lazy per-store index

- [x] KnowledgeAtom 2-level indexing — chunk atoms with fragment URIs

- [x] Stable URI scheme — `local:///`, collision-free

- [x] core.py mixin segmentation — 1258 → 221 lines, 3 focused modules

- [x] Fix: `search_stream` double intent-refine bug

### v2.0.0 (Q3 2026)

- [ ] Multi-language support (15+ languages) — multilingual stopwords + jieba

- [ ] Kubernetes Helm charts

- [ ] Distributed indexing

---

## Troubleshooting 🐛

### ❌ 401 Unauthorized Error

**Problem:** API returns 401 when making requests.

**Solutions:**

1. Verify `FLAMEHAVEN_ADMIN_KEY` environment variable is set

2. Check `Authorization: Bearer sk_live_...` header format

3. Ensure API key hasn't expired (check admin dashboard)

```bash
# Debug: Check if admin key is set
echo $FLAMEHAVEN_ADMIN_KEY

# Regenerate API key
curl -X POST http://localhost:8000/api/admin/keys \
  -H "X-Admin-Key: $FLAMEHAVEN_ADMIN_KEY" \
  -H "Content-Type: application/json" \
  -d '{"name":"debug","permissions":["search"]}'
```

### 🐌 Slow Search Performance

**Problem:** Searches taking >5 seconds.

**Solutions:**

1. Check the cache hit rate: start the server with `FLAMEHAVEN_METRICS_ENABLED=1`, then run `curl http://localhost:8000/metrics`

2. Enable Redis for distributed caching

3. Verify Gemini API latency (should be <1.5s)

```bash
# Enable Redis caching (publish the port so REDIS_HOST=localhost can reach it)
docker run -d --name redis -p 6379:6379 redis:7-alpine
export REDIS_HOST=localhost
```

### 💾 High Memory Usage

**Problem:** Container using >2GB RAM.

**Solutions:**

1. Enable Redis with LRU eviction policy

2. Reduce max file size in config

3. Monitor with Prometheus endpoint

```bash
# Configure Redis memory limit
docker run -d \
  -p 6379:6379 \
  redis:7-alpine \
  --maxmemory 512mb \
  --maxmemory-policy allkeys-lru
```

More solutions in our [Wiki Troubleshooting Guide](docs/wiki/Troubleshooting.md).

---

## Documentation 📚

### Documentation Hub

Use the links below to jump to the most relevant guide.

| Topic | Description |
|-------|-------------|
| [Document Parsing](docs/wiki/Document_Parsing.md) | Supported formats, internal parsers, RAG chunking |
| [Hybrid Search](docs/wiki/Hybrid_Search.md) | BM25+RRF, KnowledgeAtom indexing, stable URI scheme (v1.6.0) |
| [Framework Integrations](docs/wiki/Framework_Integrations.md) | LangChain, LlamaIndex, Haystack, CrewAI adapters |
| [API Reference](docs/wiki/API_Reference.md) | REST endpoints, payloads, rate limits |
| [Architecture](docs/wiki/Architecture.md) | How all layers fit together (v1.6.0) |
| [Configuration Reference](docs/wiki/Configuration.md) | Full list of environment variables and config fields |
| [Production Deployment](docs/wiki/Production_Deployment.md) | Docker, systemd, reverse proxy, scaling tips |
| [Troubleshooting](docs/wiki/Troubleshooting.md) | Step-by-step debugging playbook |
| [Benchmarks](docs/wiki/Benchmarks.md) | Performance measurements and methodology |

These Markdown files live inside the repository so they stay versioned alongside the code. Feel free to contribute improvements via pull requests.

### Additional Resources

- **[Interactive API Docs](http://localhost:8000/docs)** - OpenAPI/Swagger interface (when server is running)

- **[CHANGELOG](CHANGELOG.md)** - Version history and breaking changes

- **[CONTRIBUTING](CONTRIBUTING.md)** - How to contribute code

- **[Examples](examples/)** - Sample integrations and use cases

---

## Contributing 🤝

We love contributions! FLAMEHAVEN is better because of developers like you.

### Good First Issues

- 🟢 **[Easy]** Add dark mode to admin dashboard (1-2 hours)

- 🟡 **[Medium]** PostgreSQL backend for usage tracker (multi-instance deployments)

- 🔴 **[Advanced]** Kubernetes Helm charts for production deployment

See [CONTRIBUTING.md](CONTRIBUTING.md) for development setup and guidelines.


---

## Community & Support 💬

- **💬 Discussions:** [GitHub Discussions](https://github.com/flamehaven01/Flamehaven-Filesearch/discussions)

- **🐛 Bug Reports:** [GitHub Issues](https://github.com/flamehaven01/Flamehaven-Filesearch/issues)

- **🔒 Security:** security@flamehaven.space

- **📧 General:** info@flamehaven.space

---

## License 📄

Distributed under the MIT License. See [LICENSE](LICENSE) for more information.

---

## 🙏 Acknowledgments

Built with amazing open source tools:

- [FastAPI](https://fastapi.tiangolo.com/) - Modern Python web framework

- [Google Gemini](https://ai.google.dev/) - Semantic understanding and reasoning

- [SQLite](https://www.sqlite.org/) - Lightweight, embedded database

- [Redis](https://redis.io/) - In-memory caching (optional)

---



**[⭐ Star us on GitHub](https://github.com/flamehaven01/Flamehaven-Filesearch)** • **[📖 Read the Docs](docs/wiki/README.md)** • **[🚀 Deploy Now](#quick-start-)**

Built with 🔥 by the Flamehaven Core Team

*Last updated: April 19, 2026 • Version 1.6.1*