An open API service indexing awesome lists of open source software.

https://github.com/haz3141/ai-dev-lab

AI-Enhanced Dev Lab v3.0 — lab/app split, MCP, DSPy, RAG
https://github.com/haz3141/ai-dev-lab

ai dev-lab dspy github-actions mcp rag

Last synced: 2 months ago
JSON representation

AI-Enhanced Dev Lab v3.0 — lab/app split, MCP, DSPy, RAG

Awesome Lists containing this project

README

          

# AI-Enhanced Dev Lab v0.6.4

AI Development Lab with MCP Server for secure, auditable AI tool interactions and RAG evaluation gates.

## Quick Start

1. Install dependencies:
```bash
pip install -r requirements.txt
```

2. Start the MCP server:
```bash
.venv/bin/python -m mcp_server.simple_server
```

3. Run tests:
```bash
pytest
```

## RAG Evaluation Gates

Run evaluation locally:
```bash
# Run full evaluation
python eval/run.py --dataset eval/data/lab/lab_dev.jsonl --output eval/runs/$(date +%Y%m%d-%H%M%S)

# Check gates
python scripts/ci/parse_metrics.py eval/runs/*/metrics.json

# Start MCP server
.venv/bin/python -m mcp_server.simple_server
```

### MCP Tools Available

The MCP server provides the following tools:

#### Terminal Operations
- **`run_command`**: Execute terminal commands safely with timeout
- **`check_file`**: Check if files exist and get metadata
- **`read_file`**: Safely read files with line limits
- **`list_directory`**: List directory contents with limits

#### Evaluation Operations
- **`run_eval`**: Run RAG evaluation safely
- **`check_gates`**: Check if evaluation gates pass

#### Usage Examples
```bash
# Test MCP server
curl -X POST http://localhost:8000/tools/run_command \
-H "Content-Type: application/json" \
-d '{"command": "ls -la", "timeout": 10}'

# Check file existence
curl -X POST http://localhost:8000/tools/check_file \
-H "Content-Type: application/json" \
-d '{"filepath": "eval/run.py"}'

# Run evaluation
curl -X POST http://localhost:8000/tools/run_eval \
-H "Content-Type: application/json" \
-d '{"dataset": "eval/data/lab/lab_dev.jsonl", "output_dir": "eval/runs/test"}'
```

## Architecture

- **MCP Server**: FastAPI-based server providing AI tools via MCP protocol
- **Security**: Guardian-based access control and PII redaction
- **Audit**: Comprehensive logging of all tool interactions
- **Evaluation**: Automated testing and metrics for AI models
- **RAG Gates**: Comprehensive evaluation framework with automated CI integration

## Project Structure
- `lab/` - Research and development experiments
- `eval/` - Evaluation framework and gates
- `mcp_server/` - MCP server implementation
- `evidence/` - Evaluation evidence and reports

## Development

See [docs/cursor-usage.md](docs/cursor-usage.md) for Cursor IDE setup and usage.