https://github.com/semcod/prefact

Last synced: about 2 months ago
JSON representation
Host: GitHub
URL: https://github.com/semcod/prefact
Owner: semcod
License: apache-2.0
Created: 2026-03-27T05:19:45.000Z (4 months ago)
Default Branch: main
Last Pushed: 2026-05-13T16:49:19.000Z (2 months ago)
Last Synced: 2026-05-13T18:31:26.190Z (2 months ago)
Language: Python
Size: 62.4 MB
Stars: 0
Watchers: 0
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- License: LICENSE
Awesome Lists containing this project

README

          # prefact

[![PyPI version](https://badge.fury.io/py/prefact.svg)](https://badge.fury.io/py/prefact)

[![Python 3.10+](https://img.shields.io/badge/python-3.10+-blue.svg)](https://www.python.org/downloads/)

[![License: Apache-2.0](https://img.shields.io/badge/License-Apache%202.0-blue.svg)](https://opensource.org/licenses/Apache-2.0)

[![Code style: black](https://img.shields.io/badge/code%20style-black-000000.svg)](https://github.com/psf/black)

## AI Cost Tracking

![PyPI](https://img.shields.io/badge/pypi-costs-blue) ![Version](https://img.shields.io/badge/version-0.1.58-blue) ![Python](https://img.shields.io/badge/python-3.9+-blue) ![License](https://img.shields.io/badge/license-Apache--2.0-green)

![AI Cost](https://img.shields.io/badge/AI%20Cost-$16.10-orange) ![Human Time](https://img.shields.io/badge/Human%20Time-24.3h-blue) ![Model](https://img.shields.io/badge/Model-openrouter%2Fqwen%2Fqwen3--coder--next-lightgrey)

- 🤖 **LLM usage:** $16.1026 (63 commits)

- 👤 **Human dev:** ~$2434 (24.3h @ $100/h, 30min dedup)

Generated on 2026-05-24 using [openrouter/qwen/qwen3-coder-next](https://openrouter.ai/qwen/qwen3-coder-next)

---

Automatic Python prefactoring toolkit — detect, fix, and validate common code issues introduced by LLMs and humans alike.

## The Problem

![img.png](img.png)

When using LLMs for code generation, they often silently change import paths from absolute to deep relative:

```python

# ❌ LLM introduces this

from ....llm.generator import generate_strategy

from ....loaders.yaml_loader import save_strategy_yaml

# ✅ You wanted this

from planfile.llm.generator import generate_strategy

from planfile.loaders.yaml_loader import save_strategy_yaml

```

**prefact** automatically **detects**, **fixes**, and **validates** such issues in a three-phase pipeline.

## Features

| Rule | ID | Auto-fix | Description |

|---|---|---|---|

| Relative → Absolute imports | `relative-imports` | ✅ | Converts `from ....x import y` to `from pkg.x import y` |

| Unused imports | `unused-imports` | ✅ | Removes imports never referenced in the module |

| Duplicate imports | `duplicate-imports` | ✅ | Removes the same name imported twice |

| Wildcard imports | `wildcard-imports` | 🔍 | Flags `from x import *` |

| Unsorted imports | `sorted-imports` | 🔍 | Flags import blocks not ordered stdlib→3rd-party→local |

| String concatenation | `string-concat` | 🔍 | Flags `"Hello " + name` → suggests f-strings |

| Missing return types | `missing-return-type` | 🔍 | Flags public functions without return type hints |

✅ = auto-fix  ·  🔍 = scan-only (report)

## Performance Improvements

- **Parallel Processing**: Scans files in parallel when enabled

- **Smart Filtering**: Automatically skips large files (>100KB) and empty files

- **Optimized Scanning**: Excludes test directories and examples by default

- **Deduplication**: Prevents duplicate tickets and TODO entries

## Examples

The `examples/` directory contains comprehensive examples for different use cases:

| Example | Description |

|---|---|

| [sample-project](examples/sample-project/) | Realistic project with all issues demonstrated |

| [01-individual-rules](examples/01-individual-rules/) | Each rule explained with before/after code |

| [02-multiple-rules](examples/02-multiple-rules/) | Combining multiple rules for comprehensive cleanup |

| [03-output-formats](examples/03-output-formats/) | Console vs JSON output examples |

| [04-custom-rules](examples/04-custom-rules/) | Writing your own prefactoring rules |

| [05-ci-cd](examples/05-ci-cd/) | GitHub Actions, GitLab CI, Azure DevOps configs |

| [06-api-usage](examples/06-api-usage/) | Using prefact programmatically from Python |

### Quick Example

```bash

# Try the sample project

cd examples/sample-project

prefact scan --path . --config prefact.yaml

prefact fix --path . --config prefact.yaml

```

See [examples/README.md](examples/README.md) for a detailed guide to all examples.

## Installation

```bash

pip install -e .

# with dev dependencies (pytest)

pip install -e ".[dev]"

```

## Quick Start

```bash

# Generate config file

prefact init

# List all available rules

prefact rules

# Scan only (no changes)

prefact scan --path ./my_project --package mypackage

# Fix + validate (with backups)

prefact fix --path ./my_project --package mypackage

# Dry-run (show what would change)

prefact fix --path ./my_project --package mypackage --dry-run

# Check a single file

prefact check ./my_project/src/mypackage/core/service.py --package mypackage

# JSON output for CI

prefact fix --path . --format json -o report.json

```

📚 **Want to see prefact in action?** Check out our [comprehensive examples](examples/) with real-world scenarios!

## Pipeline Architecture

```

┌─────────┐      ┌─────────┐      ┌────────────┐

│  SCAN   │ ──→  │   FIX   │ ──→  │  VALIDATE  │

│         │      │         │      │            │

│ Detect  │      │ Apply   │      │ Syntax OK? │

│ issues  │      │ fixes   │      │ Regressions│

│ per rule│      │ + backup│      │ preserved? │

└─────────┘      └─────────┘      └────────────┘

```

1. **Scan** — each rule walks the AST / CST and emits `Issue` objects

2. **Fix** — rules with auto-fix transform the source (via `libcst` for formatting-safe changes)

3. **Validate** — post-fix checks: syntax valid, no regressions, import counts preserved

## Configuration

Create `prefact.yaml` (auto-generated via `prefact init`):

```yaml

package_name: planfile

include:

  - "**/*.py"

exclude:

  - "**/venv/**"

  - "**/build/**"

  - "**/tests/**"

  - "**/test*/**"

  - "**/examples/**"

tools:

  parallel: true

  cache: true

  performance:

    max_workers: 4

rules:

  relative-imports:

    enabled: true

    severity: warning

  unused-imports:

    enabled: true

    severity: info

  duplicate-imports:

    enabled: true

  wildcard-imports:

    enabled: true

    severity: error

  sorted-imports:

    enabled: false

  string-concat:

    enabled: true

  missing-return-type:

    enabled: false

```

## Autonomous Mode

Prefact includes an autonomous mode that automatically:

- Scans your project for issues

- Generates TODO.md with all found issues

- Creates tickets in planfile.yaml for tracking

- Updates CHANGELOG.md with fixes

- Optionally runs TestQL scenarios and bridges failures into tickets

```bash

# Run full autonomous workflow

prefact -a

# Or skip tests/examples for faster runs

prefact -a --skip-tests --skip-examples

# Include TestQL validation as the final step

prefact -a --with-testql

# Use a custom directory for *.testql.toon.yaml scenarios

prefact -a --with-testql --testql-dir ./testql-scenarios

```

## TestQL Integration

Prefact can run TestQL DSL validation scenarios and bridge failing checks directly into planfile tickets, TODO.md, and configured backends (GitHub, GitLab, Jira).

### `prefact testql` — Run a Single Scenario

```bash

# Validate a scenario and create/sync tickets

prefact testql testql-scenarios/smoke.testql.toon.yaml

# Dry-run: validate without creating tickets

prefact testql testql-scenarios/smoke.testql.toon.yaml --dry-run

# Custom project root and strategy

prefact testql scenarios/api.testql.toon.yaml -p ./my-api -s my-api/planfile.yaml

# Limit ticket generation and disable sync

prefact testql scenarios/api.testql.toon.yaml --max-tickets 10 --no-sync

```

### Options

| Option | Default | Description |

|--------|---------|-------------|

| `-p`, `--path` | `.` | Project root directory |

| `--url` | `http://localhost:8101` | TestQL service base URL |

| `--dry-run` | `False` | Parse/validate only |

| `-s`, `--strategy` | `/planfile.yaml` | Target planfile YAML |

| `--create-tickets` / `--no-create-tickets` | `True` | Create tickets for failures |

| `--sync` / `--no-sync` | `True` | Sync to TODO.md and integrations |

| `--max-tickets` | `25` | Max tickets per run |

| `--testql-bin` | `testql` | TestQL CLI executable |

| `--testql-repo-path` | `/home/tom/github/oqlos/testql` | Fallback local repo path |

### Identity-Aware Deduplication

When creating tickets, prefact uses identity-aware deduplication based on:

- Ticket `id` / `ticket_id`

- Integration-specific IDs (`github_id`, `gitlab_id`, `jira_id`)

- Keys (`github_key`, `gitlab_key`, `jira_key`)

- URLs (`github_url`, `gitlab_url`, `jira_url`, `external_url`)

- `source` and `external_refs` metadata

If a ticket already exists with any matching identity key, it is skipped to avoid duplicates.

## Performance Improvements

Recent updates have significantly improved performance:

- **Parallel Processing**: Scans files using multiple workers (configurable)

- **Smart Filtering**: Skips large files (>100KB) and files with minimal content

- **Optimized Exclusions**: Automatically excludes test directories and examples

- **Deduplication**: Prevents duplicate tickets and TODO entries across runs

## Python API

```python

from pathlib import Path

from prefact.config import Config

from prefact.engine import RefactoringEngine

config = Config(

    project_root=Path("./my_project"),

    package_name="planfile",

    dry_run=False,

    backup=True,

)

engine = RefactoringEngine(config)

result = engine.run()

print(f"Found {result.total_issues} issues")

print(f"Fixed {result.total_fixed}")

print(f"All valid: {result.all_valid}")

```

## Writing Custom Rules

Extend `BaseRule` and use the `@register` decorator:

```python

from prefact.rules import BaseRule, register

from prefact.models import Issue, Fix, ValidationResult

@register

class MyCustomRule(BaseRule):

    rule_id = "my-custom-rule"

    description = "Does something useful."

    def scan_file(self, path, source):

        # Return list[Issue]

        ...

    def fix(self, path, source, issues):

        # Return (fixed_source, list[Fix])

        ...

    def validate(self, path, original, fixed):

        # Return ValidationResult

        ...

```

## CI/CD Integration

```yaml

# GitHub Actions

- name: prefact check

  run: |

    pip install ./prefact

    prefact scan --path . --format json -o prefact-report.json

    prefact fix --path . --dry-run

```

## Running Tests

```bash

pip install -e ".[dev]"

pytest -v

```

## License

Licensed under Apache-2.0.

## Author

Tom Sapletta

## Status

_Last updated by [taskill](https://github.com/oqlos/taskill) at 2026-04-25 13:43 UTC_

| Metric | Value |

|---|---|

| HEAD | `0aac827` |

| Coverage | — |

| Failing tests | — |

| Commits in last cycle | 50 |

> Primarily documentation and refactoring work: the docs and README were updated, the code-analysis engine and configuration/CLI were refactored and improved, and Markdown output and example modules were added. Minor fixes include suppressing mypy errors with type: ignore and auto-fixing ruff formatting and imports.
ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/semcod/prefact

Awesome Lists containing this project

README