Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/charliermarsh/autobot

GitHub Copilot, for your existing codebase.
https://github.com/charliermarsh/autobot

Last synced: 17 days ago
JSON representation

GitHub Copilot, for your existing codebase.

Host: GitHub
URL: https://github.com/charliermarsh/autobot
Owner: charliermarsh
License: mit
Created: 2022-09-13T21:10:15.000Z (about 2 years ago)
Default Branch: main
Last Pushed: 2022-09-29T23:06:22.000Z (about 2 years ago)
Last Synced: 2024-04-26T01:21:41.144Z (7 months ago)
Language: Python
Homepage:
Size: 127 KB
Stars: 69
Watchers: 4
Forks: 7
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

        # autobot

[![Actions status](https://github.com/charliermarsh/autobot/workflows/CI/badge.svg)](https://github.com/charliermarsh/autobot/actions)

[![PyPI version](https://badge.fury.io/py/autobot-ml.svg)](https://badge.fury.io/py/autobot-ml)

An automated code refactoring tool powered by GPT-3. Like GitHub Copilot, for your existing

codebase.

Autobot takes an example change as input and generates patches for you to review by scanning your

codebase for similar code blocks and "applying" that change to the existing source code.



  



See more examples on 

Twitter, or read the 

blog post.

_N.B. Autobot is a prototype and isn't recommended for use on large codebases. See: ["Limitations"](#Limitations)._

## Getting started

Autobot is available as [`autobot-ml`](https://pypi.org/project/autobot-ml/) on PyPI:

```shell

pip install autobot-ml

```

Autobot depends on the [OpenAI API](https://openai.com/api/) and, in particular, expects your OpenAI

organization ID and API key to be exposed as the `OPENAI_ORGANIZATION` and `OPENAI_API_KEY`

environment variables, respectively.

Autobot can also read from a `.env` file:

```

OPENAI_ORGANIZATION=${YOUR_OPENAI_ORGANIZATION}

OPENAI_API_KEY=${YOUR_OPENAI_API_KEY}

```

From there, you can run any of Autobot's built-in refactors (called "schematics"):

```shell

autobot run useless_object_inheritance /path/to/file.py

```

## Example usage

_TL;DR: Autobot is a command-line tool. To generate patches, use `autobot run`; to review the

generated patches, use `autobot review`._

Autobot is designed around a two-step workflow.

In the first step (`autobot run {schematic} {files_to_analyze}`), we point Autobot to (1) the

"schematic" that defines our desired change and (2) the files to which the change should be

applied.

In the second step (`autobot review`), we review the patches that Autobot generated and, for each

suggested change, either apply it to the codebase or reject the patch entirely.

Autobot ships with several schematics that you can use out-of-the-box:

- `assert_equals`

- `convert_to_dataclass`

- `numpy_builtin_aliases`

- `print_statement`

- `sorted_attributes`

- `standard_library_generics`

- `unnecessary_f_strings`

- `use_generator`

- `useless_object_inheritance`

For example: to remove any usages of NumPy's deprecated `np.int` and associated aliases, we'd first

run `autobot run numpy_builtin_aliases /path/to/file.py`, followed by `autobot review`.

The `schematic` argument to `autobot run` can either reference a directory within `schematics` (like

`numpy_builtin_aliases`, above) or a path to a user-defined schematic directory on-disk.

### Implementing a new refactor ("schematic")

Every refactor facilitated by Autobot requires a "schematic". Autobot ships with a few schematics

in the `schematics` directory, but it's intended to be used with user-provided schematics.

A schematic is a directory containing two files:

1. `before.py`: A code snippet demonstrating the "before" state of the refactor.

2. `after.py`: A code snippet demonstrating the "after" state of the refactor.

Each file is expected to consist of a brief top-level docstring describing the "before" or "after"

state, followed by a single function or class.

For example: in Python 3, `class Foo(object)` is equivalent to `class Foo`. To automatically remove

those useless object inheritances from our codebase, we'd create a `useless_object_inheritance`

directory, and add the following two files:

```python

# before.py

"""...with object inheritance."""

class Foo(Bar, object):

    def __init__(self, x: int) -> None:

        self.x = x

```

```python

# after.py

"""...without object inheritance."""

class Foo(Bar):

    def __init__(self, x: int) -> None:

        self.x = x

```

We'd then run `autobot run ./useless_object_inheritance /path/to/file/or/directory` to generate

patches, followed by `autobot review` to apply or reject the suggested changes.

## Limitations

1. Running Autobot consumes OpenAI credits and thus could cost you money. Be careful!

2. By default, Autobot uses OpenAI's `text-davinci-002` model, though `autobot run` accepts a

   `--model` parameter, allowing you to select an alternative OpenAI model. Note, though, that

   OpenAI's Codex models are currently in a private beta, so `code-davinci-002` and friends may

   error for you.

4. To speed up execution, Autobot calls out to the OpenAI API in parallel. If you haven't upgraded

   to a paid account, you may hit rate-limit errors. You can pass `--nthreads 1` to `autobot run`

   to disable multi-threading. Running Autobot over large codebases is not recommended (yet).

5. Depending on the transform type, Autobot will attempt to generate a patch for every function or

   every

   class. Any function or class that's "too long" for GPT-3's maximum prompt size will be skipped.

6. Autobot isn't smart enough to handle nested functions (or nested classes), so nested functions

   will likely be processed and appear twice.

7. Autobot only supports Python code for now. (Autobot relies on parsing the AST to extract relevant

   code snippets, so additional languages require extending AST support.)

## Roadmap

1. **Multi-language support.** Autobot only supports Python code for now. Extending to

   multi-language support, at least with the current algorithm, will require supporting additional

   AST parsers. The most likely outcome here will either be to leverage [`tree-sitter`](https://github.com/tree-sitter/tree-sitter).

2. **Supporting large codebases.** What would it take to run Autobot over hundreds of thousands of

   lines of code?

## License

MIT