https://github.com/royerlab/napari-chatgpt
A napari plugin to process and analyse images with chatGPT!
https://github.com/royerlab/napari-chatgpt
Last synced: 4 months ago
JSON representation
A napari plugin to process and analyse images with chatGPT!
- Host: GitHub
- URL: https://github.com/royerlab/napari-chatgpt
- Owner: royerlab
- License: bsd-3-clause
- Created: 2023-04-06T03:19:27.000Z (about 3 years ago)
- Default Branch: main
- Last Pushed: 2024-10-19T05:48:47.000Z (over 1 year ago)
- Last Synced: 2025-06-25T02:06:22.489Z (12 months ago)
- Language: CSS
- Size: 61.5 MB
- Stars: 275
- Watchers: 12
- Forks: 29
- Open Issues: 16
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- awesome-ChatGPT-repositories - napari-chatgpt - A napari plugin to process and analyse images with chatGPT! (Browser-extensions)
- awesome-scientific-image-analysis - Omega - Napari with ChatGPT. (🏝️ Napari / Plugins)
README
## Home of _Omega_, a napari-aware autonomous LLM-based agent specialized in image processing and analysis.
[](https://github.com/royerlab/napari-chatgpt/raw/main/LICENSE)
[](https://pypi.org/project/napari-chatgpt)
[](https://python.org)
[](https://pepy.tech/project/napari-chatgpt)
[](https://pepy.tech/project/napari-chatgpt)
[](https://napari-hub.org/plugins/napari-chatgpt)
[](https://doi.org/10.5281/zenodo.10828225)
[](https://github.com/royerlab/napari-chatgpt/)
[](https://git:hub.com/royerlab/napari-chatgpt/)

A [napari](napari.org) plugin that leverages OpenAI's Large Language Model
ChatGPT to implement _Omega_
a napari-aware agent capable of performing image processing and analysis tasks
in a conversational manner.
This repository started as a 'week-end project'
by [Loic A. Royer](https://twitter.com/loicaroyer)
who leads a [research group](https://royerlab.org) at
the [Chan Zuckerberg Biohub](https://royerlab.org). It
leverages [OpenAI](https://openai.com)'s ChatGPT API via
the [LangChain](https://python.langchain.com/en/latest/index.html) Python
library, as well as [napari](https://napari.org), a fast, interactive,
multi-dimensional
image viewer for
Python, [another](https://ilovesymposia.com/2019/10/24/introducing-napari-a-fast-n-dimensional-image-viewer-in-python/)
week-end project, initially started by Loic and [Juan Nunez-Iglesias](https://github.com/jni).
# What is Omega?
Omega is an LLM-based and tool-armed autonomous agent that demonstrates the
potential for Large Language Models (LLMs) to be applied to image processing,
analysis and visualization.
Can LLM-based agents write image processing code and napari widgets, correct its
coding mistakes, performing follow-up analysis, and controlling the napari viewer?
The answer appears to be yes.
The publication is available here: [10.1038/s41592-024-02310-w](https://doi.org/10.1038/s41592-024-02310-w).
The preprint can be downloaded here: [10.5281/zenodo.10828225](https://doi.org/10.5281/zenodo.10828225).
#### In this video, I ask Omega to segment an image using the [SLIC](https://www.iro.umontreal.ca/~mignotte/IFT6150/Articles/SLIC_Superpixels.pdf) algorithm. It makes a first attempt using the implementation in scikit-image but fails because of an inexistent 'multichannel' parameter. Realizing that, Omega tries again, and this time succeeds:
https://user-images.githubusercontent.com/1870994/235768559-ca8bfa84-21f5-47b6-b2bd-7fcc07cedd92.mp4
#### After loading a sample 3D image of cell nuclei in napari, I asked Omega to segment the nuclei using the Otsu method. My first request was vague, so it just segmented foreground versus background. I then ask to segment the foreground into distinct segments for each connected component. Omega does a rookie mistake by forgetting to 'import np'. No problem; it notices, tries again, and succeeds:
https://user-images.githubusercontent.com/1870994/235769990-a281a118-1369-47aa-834a-b491f706bd48.mp4
#### In this video, one of my favorites, I ask Omega to make a 'Max color projection widget.' It is not a trivial task, but it manages!
https://github.com/royerlab/napari-chatgpt/assets/1870994/bb9b35a4-d0aa-4f82-9e7c-696ef5859a2f
As LLMs improve, Omega will become even more adept at handling complex
image processing and analysis tasks. GPT 4.0 has been a significant upgrade
compared to GPT 3.5, and many of the videos (see below and here) are highly reproducible,
with a typically 90% success rate (see preprint for a reproducibility analysis).
While open-source models are promising and rapidly improving, they must get better to run Omega reliably.
More recent models by OpenAI's competitors, such as Google and Anthropic, are great news,
but Omega still needs to support these newer models fully -- it seems every week comes with a new batch of models.
Omega could eventually help non-experts process and analyze images, especially
in the bioimage domain.
It is also potentially valuable for educative purposes as it could
assist in teaching image processing and analysis, making it more accessible.
Although ChatGPT, which powers Omega, may still need to be on par with an expert image
analyst or computer vision expert, it is just a matter of time...
Omega holds a conversation with the user and uses different tools to answer questions,
download and operate on images, write widgets for napari, and more.
## Omega's Built-in AI-Augmented Code Editor
The Omega AI-Augmented Code Editor is a new feature within Omega, designed to enhance the Omega's user experience. This
editor is not just a text editor; it's a powerful interface that interacts with the Omega dialogue agent to generate,
optimize, and manage code for advanced image analysis tasks.

#### Key Features
- **Code Highlighting and Completion**: For ease of reading and writing, the code editor comes with built-in syntax
highlighting and intelligent code completion features.
- **LLM-Augmented Tools**: The editor is equipped with AI tools that assist in commenting, cleaning up, fixing,
modifying, and performing safety checks on the code.
- **Persistent Code Snippets**: Users can save and manage snippets of code, preserving their work across multiple Napari
sessions.
- **Network Code Sharing (Code-Drop)**: Facilitates the sharing of code snippets across the local network, empowering
collaborative work and knowledge sharing.
#### Usage Scenarios
- **Widget Creation**: Expert users can create widgets using the Omega dialogue agent and retain them for future use.
- **Collaboration**: Share custom widgets with colleagues or the community, even if they don't have access to an API
key.
- **Learning**: New users can learn from the AI-augmented suggestions, improving their coding skills in Python and image
analysis workflows.
You can find more information in the
corresponding [wiki page](https://github.com/royerlab/napari-chatgpt/wiki/OmegaCodeEditor).
----------------------------------
## Omega's Installation instructions:
Assuming you have a Python environment with a working napari installation, you can simply:
pip install napari-chatgpt
Or install the plugin from napari's plugin installer.
For detailed instructions and variations, check [this page](http://github.com/royerlab/napari-chatgpt/wiki/InstallOmega)
of our wiki.
## Requirements:
You need an OpenAI key; there is no way around this, I have been experimenting with
other models, including open-source models, but right now, the best results, by far, are obtained with ChatGPT 4 (and to
a lesser extent 3.5). Check [here](https://github.com/royerlab/napari-chatgpt/wiki/OpenAIKey) for details on how to get
your OpenAI key. In particular, check [this](https://github.com/royerlab/napari-chatgpt/wiki/AccessToGPT4) for how to
gain access to GPT-4 models.
## Usage:
Check this [page](https://github.com/royerlab/napari-chatgpt/wiki/HowToStartOmega) of
our [wiki](https://github.com/royerlab/napari-chatgpt/wiki) for details on how to start Omega.
## Tips, Tricks, and Example prompts:
Check our guide on how to prompt Omega and some
examples [here](https://github.com/royerlab/napari-chatgpt/wiki/Tips&Tricks).
## Video Demos:
You can check the original release videos [here](https://github.com/royerlab/napari-chatgpt/wiki/VideoDemos).
You can also find the latest preprint videos on [Vimeo](https://vimeo.com/showcase/10983382).
## How does Omega work?
The publication is available here: [10.1038/s41592-024-02310-w](https://doi.org/10.1038/s41592-024-02310-w).
Check our preprint here: [10.5281/zenodo.10828225](https://doi.org/10.5281/zenodo.10828225).
and our [wiki page](https://github.com/royerlab/napari-chatgpt/wiki/OmegaDesign) on Omega's design and architecture.
## Cost:
Developing the initial version of Omega cost me $13.97, hardly a fortune.
OpenAI [pricing](https://openai.com/pricing) on ChatGPT 4 is very reasonable at 0.01 dollars per 1K tokens, which means $
1 per 750000 words.
Note: you can limit the burn rate to a certain amount of dollars per month, just
in case you let Omega think over the weekend and forget to stop it (don't worry,
this is actually **not** possible).
## Disclaimer:
Do not use this software lightly; it will download libraries of its own volition
and write any code it deems necessary; it might do what you ask, even
if it is a very bad idea. Also, beware that it might _misunderstand_ what you ask and
then do something bad in ways that elude you. For example, it is unwise to use Omega to delete
'some' files from your system; it might end up deleting more than that if you are unclear in
your request.
Omega is generally safe as long as you do not make dangerous requests. To be 100% safe, and
if your experiments with Omega could be potentially problematic, I recommend using this
software from within a sandboxed virtual machine.
API keys are only as safe as the overall machine is, see the section below on API key hygiene.
THE SOFTWARE IS PROVIDED “AS IS”, WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A
PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS
BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT,
TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR
THE USE OR OTHER DEALINGS IN THE SOFTWARE.
## API key hygiene:
Best Practices for Managing Your API Keys:
- **Host Computer Hygiene:** Ensure that the machine you’re installing napari-chagot/Onega on is secure, free of malware
and viruses, and otherwise not compromised. Make sure to install antivirus software on Windows.
- **Security:** Treat your API key like a password. Do not share it with others or expose it in public repositories or
forums.
- **Cost Control:** Set spending limits on your OpenAI account (see [here](https://platform.openai.com/account/limits)).
- **Regenerate Keys:** If you believe your API key has been compromised, cancel and regenerate it from the OpenAI API
dashboard immediately.
- **Key Storage:** Omega has a built-in 'API Key Vault' that encrypts keys using a password, this is the preferred
approach. You can also store the key in an environment variable, but that is not encrypted and could compromise the
key.
## Contributing
Contributions are extremely welcome. The project uses a Makefile for development:
```bash
make setup # Install with dev dependencies + pre-commit hooks
make test # Run all tests
make test-cov # Run tests with coverage
make format # Format code with black and isort
make check # Run all code checks
```
Please ensure the coverage stays the same before you submit a pull request.
## License
Distributed under the terms of the [BSD-3] license,
"napari-chatgpt" is free and open-source software
## Issues
If you encounter any problems, please [file an issue] along with a detailed
description.
[napari]: https://github.com/napari/napari
[Cookiecutter]: https://github.com/audreyr/cookiecutter
[@napari]: https://github.com/napari
[MIT]: http://opensource.org/licenses/MIT
[BSD-3]: http://opensource.org/licenses/BSD-3-Clause
[GNU GPL v3.0]: http://www.gnu.org/licenses/gpl-3.0.txt
[GNU LGPL v3.0]: http://www.gnu.org/licenses/lgpl-3.0.txt
[Apache Software License 2.0]: http://www.apache.org/licenses/LICENSE-2.0
[Mozilla Public License 2.0]: https://www.mozilla.org/media/MPL/2.0/index.txt
[cookiecutter-napari-plugin]: https://github.com/napari/cookiecutter-napari-plugin
[file an issue]: https://github.com/royerlab/napari-chatgpt/issues
[napari]: https://github.com/napari/napari
[tox]: https://tox.readthedocs.io/en/latest/
[pip]: https://pypi.org/project/pip/
[PyPI]: https://pypi.org/