An open API service indexing awesome lists of open source software.

https://github.com/semcod/prefact


https://github.com/semcod/prefact

Last synced: 8 days ago
JSON representation

Awesome Lists containing this project

README

          

# prefact

[![PyPI version](https://badge.fury.io/py/prefact.svg)](https://badge.fury.io/py/prefact)
[![Python 3.10+](https://img.shields.io/badge/python-3.10+-blue.svg)](https://www.python.org/downloads/)
[![License: Apache-2.0](https://img.shields.io/badge/License-Apache%202.0-blue.svg)](https://opensource.org/licenses/Apache-2.0)
[![Code style: black](https://img.shields.io/badge/code%20style-black-000000.svg)](https://github.com/psf/black)

## AI Cost Tracking

![PyPI](https://img.shields.io/badge/pypi-costs-blue) ![Version](https://img.shields.io/badge/version-0.1.58-blue) ![Python](https://img.shields.io/badge/python-3.9+-blue) ![License](https://img.shields.io/badge/license-Apache--2.0-green)
![AI Cost](https://img.shields.io/badge/AI%20Cost-$16.10-orange) ![Human Time](https://img.shields.io/badge/Human%20Time-24.3h-blue) ![Model](https://img.shields.io/badge/Model-openrouter%2Fqwen%2Fqwen3--coder--next-lightgrey)

- πŸ€– **LLM usage:** $16.1026 (63 commits)
- πŸ‘€ **Human dev:** ~$2434 (24.3h @ $100/h, 30min dedup)

Generated on 2026-05-24 using [openrouter/qwen/qwen3-coder-next](https://openrouter.ai/qwen/qwen3-coder-next)

---

Automatic Python prefactoring toolkit β€” detect, fix, and validate common code issues introduced by LLMs and humans alike.

## The Problem

![img.png](img.png)

When using LLMs for code generation, they often silently change import paths from absolute to deep relative:

```python
# ❌ LLM introduces this
from ....llm.generator import generate_strategy
from ....loaders.yaml_loader import save_strategy_yaml

# βœ… You wanted this
from planfile.llm.generator import generate_strategy
from planfile.loaders.yaml_loader import save_strategy_yaml
```

**prefact** automatically **detects**, **fixes**, and **validates** such issues in a three-phase pipeline.

## Features

| Rule | ID | Auto-fix | Description |
|---|---|---|---|
| Relative β†’ Absolute imports | `relative-imports` | βœ… | Converts `from ....x import y` to `from pkg.x import y` |
| Unused imports | `unused-imports` | βœ… | Removes imports never referenced in the module |
| Duplicate imports | `duplicate-imports` | βœ… | Removes the same name imported twice |
| Wildcard imports | `wildcard-imports` | πŸ” | Flags `from x import *` |
| Unsorted imports | `sorted-imports` | πŸ” | Flags import blocks not ordered stdlibβ†’3rd-partyβ†’local |
| String concatenation | `string-concat` | πŸ” | Flags `"Hello " + name` β†’ suggests f-strings |
| Missing return types | `missing-return-type` | πŸ” | Flags public functions without return type hints |

βœ… = auto-fix Β· πŸ” = scan-only (report)

## Performance Improvements

- **Parallel Processing**: Scans files in parallel when enabled
- **Smart Filtering**: Automatically skips large files (>100KB) and empty files
- **Optimized Scanning**: Excludes test directories and examples by default
- **Deduplication**: Prevents duplicate tickets and TODO entries

## Examples

The `examples/` directory contains comprehensive examples for different use cases:

| Example | Description |
|---|---|
| [sample-project](examples/sample-project/) | Realistic project with all issues demonstrated |
| [01-individual-rules](examples/01-individual-rules/) | Each rule explained with before/after code |
| [02-multiple-rules](examples/02-multiple-rules/) | Combining multiple rules for comprehensive cleanup |
| [03-output-formats](examples/03-output-formats/) | Console vs JSON output examples |
| [04-custom-rules](examples/04-custom-rules/) | Writing your own prefactoring rules |
| [05-ci-cd](examples/05-ci-cd/) | GitHub Actions, GitLab CI, Azure DevOps configs |
| [06-api-usage](examples/06-api-usage/) | Using prefact programmatically from Python |

### Quick Example

```bash
# Try the sample project
cd examples/sample-project
prefact scan --path . --config prefact.yaml
prefact fix --path . --config prefact.yaml
```

See [examples/README.md](examples/README.md) for a detailed guide to all examples.

## Installation

```bash
pip install -e .

# with dev dependencies (pytest)
pip install -e ".[dev]"
```

## Quick Start

```bash
# Generate config file
prefact init

# List all available rules
prefact rules

# Scan only (no changes)
prefact scan --path ./my_project --package mypackage

# Fix + validate (with backups)
prefact fix --path ./my_project --package mypackage

# Dry-run (show what would change)
prefact fix --path ./my_project --package mypackage --dry-run

# Check a single file
prefact check ./my_project/src/mypackage/core/service.py --package mypackage

# JSON output for CI
prefact fix --path . --format json -o report.json
```

πŸ“š **Want to see prefact in action?** Check out our [comprehensive examples](examples/) with real-world scenarios!

## Pipeline Architecture

```
β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β” β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β” β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
β”‚ SCAN β”‚ ──→ β”‚ FIX β”‚ ──→ β”‚ VALIDATE β”‚
β”‚ β”‚ β”‚ β”‚ β”‚ β”‚
β”‚ Detect β”‚ β”‚ Apply β”‚ β”‚ Syntax OK? β”‚
β”‚ issues β”‚ β”‚ fixes β”‚ β”‚ Regressionsβ”‚
β”‚ per ruleβ”‚ β”‚ + backupβ”‚ β”‚ preserved? β”‚
β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜ β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜ β””β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”˜
```

1. **Scan** β€” each rule walks the AST / CST and emits `Issue` objects
2. **Fix** β€” rules with auto-fix transform the source (via `libcst` for formatting-safe changes)
3. **Validate** β€” post-fix checks: syntax valid, no regressions, import counts preserved

## Configuration

Create `prefact.yaml` (auto-generated via `prefact init`):

```yaml
package_name: planfile

include:
- "**/*.py"

exclude:
- "**/venv/**"
- "**/build/**"
- "**/tests/**"
- "**/test*/**"
- "**/examples/**"

tools:
parallel: true
cache: true
performance:
max_workers: 4

rules:
relative-imports:
enabled: true
severity: warning
unused-imports:
enabled: true
severity: info
duplicate-imports:
enabled: true
wildcard-imports:
enabled: true
severity: error
sorted-imports:
enabled: false
string-concat:
enabled: true
missing-return-type:
enabled: false
```

## Autonomous Mode

Prefact includes an autonomous mode that automatically:
- Scans your project for issues
- Generates TODO.md with all found issues
- Creates tickets in planfile.yaml for tracking
- Updates CHANGELOG.md with fixes
- Optionally runs TestQL scenarios and bridges failures into tickets

```bash
# Run full autonomous workflow
prefact -a

# Or skip tests/examples for faster runs
prefact -a --skip-tests --skip-examples

# Include TestQL validation as the final step
prefact -a --with-testql

# Use a custom directory for *.testql.toon.yaml scenarios
prefact -a --with-testql --testql-dir ./testql-scenarios
```

## TestQL Integration

Prefact can run TestQL DSL validation scenarios and bridge failing checks directly into planfile tickets, TODO.md, and configured backends (GitHub, GitLab, Jira).

### `prefact testql` β€” Run a Single Scenario

```bash
# Validate a scenario and create/sync tickets
prefact testql testql-scenarios/smoke.testql.toon.yaml

# Dry-run: validate without creating tickets
prefact testql testql-scenarios/smoke.testql.toon.yaml --dry-run

# Custom project root and strategy
prefact testql scenarios/api.testql.toon.yaml -p ./my-api -s my-api/planfile.yaml

# Limit ticket generation and disable sync
prefact testql scenarios/api.testql.toon.yaml --max-tickets 10 --no-sync
```

### Options

| Option | Default | Description |
|--------|---------|-------------|
| `-p`, `--path` | `.` | Project root directory |
| `--url` | `http://localhost:8101` | TestQL service base URL |
| `--dry-run` | `False` | Parse/validate only |
| `-s`, `--strategy` | `/planfile.yaml` | Target planfile YAML |
| `--create-tickets` / `--no-create-tickets` | `True` | Create tickets for failures |
| `--sync` / `--no-sync` | `True` | Sync to TODO.md and integrations |
| `--max-tickets` | `25` | Max tickets per run |
| `--testql-bin` | `testql` | TestQL CLI executable |
| `--testql-repo-path` | `/home/tom/github/oqlos/testql` | Fallback local repo path |

### Identity-Aware Deduplication

When creating tickets, prefact uses identity-aware deduplication based on:
- Ticket `id` / `ticket_id`
- Integration-specific IDs (`github_id`, `gitlab_id`, `jira_id`)
- Keys (`github_key`, `gitlab_key`, `jira_key`)
- URLs (`github_url`, `gitlab_url`, `jira_url`, `external_url`)
- `source` and `external_refs` metadata

If a ticket already exists with any matching identity key, it is skipped to avoid duplicates.

## Performance Improvements

Recent updates have significantly improved performance:
- **Parallel Processing**: Scans files using multiple workers (configurable)
- **Smart Filtering**: Skips large files (>100KB) and files with minimal content
- **Optimized Exclusions**: Automatically excludes test directories and examples
- **Deduplication**: Prevents duplicate tickets and TODO entries across runs

## Python API

```python
from pathlib import Path
from prefact.config import Config
from prefact.engine import RefactoringEngine

config = Config(
project_root=Path("./my_project"),
package_name="planfile",
dry_run=False,
backup=True,
)

engine = RefactoringEngine(config)
result = engine.run()

print(f"Found {result.total_issues} issues")
print(f"Fixed {result.total_fixed}")
print(f"All valid: {result.all_valid}")
```

## Writing Custom Rules

Extend `BaseRule` and use the `@register` decorator:

```python
from prefact.rules import BaseRule, register
from prefact.models import Issue, Fix, ValidationResult

@register
class MyCustomRule(BaseRule):
rule_id = "my-custom-rule"
description = "Does something useful."

def scan_file(self, path, source):
# Return list[Issue]
...

def fix(self, path, source, issues):
# Return (fixed_source, list[Fix])
...

def validate(self, path, original, fixed):
# Return ValidationResult
...
```

## CI/CD Integration

```yaml
# GitHub Actions
- name: prefact check
run: |
pip install ./prefact
prefact scan --path . --format json -o prefact-report.json
prefact fix --path . --dry-run
```

## Running Tests

```bash
pip install -e ".[dev]"
pytest -v
```

## License

Licensed under Apache-2.0.
## Author

Tom Sapletta
## Status

_Last updated by [taskill](https://github.com/oqlos/taskill) at 2026-04-25 13:43 UTC_

| Metric | Value |
|---|---|
| HEAD | `0aac827` |
| Coverage | β€” |
| Failing tests | β€” |
| Commits in last cycle | 50 |

> Primarily documentation and refactoring work: the docs and README were updated, the code-analysis engine and configuration/CLI were refactored and improved, and Markdown output and example modules were added. Minor fixes include suppressing mypy errors with type: ignore and auto-fixing ruff formatting and imports.