https://github.com/tychenjiajun/exif-ai

A Node.js CLI and library that uses OpenAI, Ollama, ZhipuAI, Google Gemini or Coze to write AI-generated image descriptions and/or tags to EXIF metadata by its content.
https://github.com/tychenjiajun/exif-ai

ai cli cli-tool coze exif gemini image jpeg jpg llm metadata mllm ollama openai openai-api photo zhipu

Last synced: 5 months ago
JSON representation

A Node.js CLI and library that uses OpenAI, Ollama, ZhipuAI, Google Gemini or Coze to write AI-generated image descriptions and/or tags to EXIF metadata by its content.

Host: GitHub
URL: https://github.com/tychenjiajun/exif-ai
Owner: tychenjiajun
License: gpl-2.0
Created: 2024-08-31T07:18:29.000Z (10 months ago)
Default Branch: main
Last Pushed: 2025-02-09T12:39:42.000Z (5 months ago)
Last Synced: 2025-02-09T13:33:33.305Z (5 months ago)
Topics: ai, cli, cli-tool, coze, exif, gemini, image, jpeg, jpg, llm, metadata, mllm, ollama, openai, openai-api, photo, zhipu
Language: TypeScript
Homepage:
Size: 14.8 MB
Stars: 12
Watchers: 1
Forks: 3
Open Issues: 8
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

# Exif AI

[![NPM Downloads](https://img.shields.io/npm/dw/exif-ai)](https://www.npmjs.com/package/exif-ai)

https://github.com/user-attachments/assets/a445d46a-0d3c-44a2-a42e-f98c23e9c1b4

_Read this in other languages:_
[_简体中文_](README.zh-CN.md)

## About

_Exif AI_ is a powerful CLI tool designed to write AI-generated image descriptions and/or tags directly into the metadata of image files. This tool leverages advanced AI models to analyze image content and generate descriptive metadata, enhancing the accessibility and searchability of your images.

## Usage Example

### CLI

#### Without Installation

You can run Exif AI directly using npx without installing it globally:

```bash
npx exif-ai -i example.jpeg -a ollama
```

#### With Installation

If you have installed Exif AI globally, you can run it directly from the command line:

```bash
exif-ai -i example.jpeg -a ollama
```

#### Options

Required options:

- `-a, --api-provider `: Name of the AI provider to use (`ollama` for Ollama, `zhipu` for ZhipuAI, `google` for Google Gemini, `coze_bot` for Coze Bot, `openai` for OpenAI).

Optional options:

- `-T, --tasks `: List of tasks to perform ('description', 'tag', 'face').
- `-i, --input ` Path to the input image file.
- `-p, --description-prompt `: Custom prompt for the AI provider to generate description. Defaults to a generic image description prompt.
- `--tag-prompt `: Custom prompt for the AI provider to generate tags. Defaults to a generic image tagging prompt.
- `-m, --model `: Specify the AI model to use, if supported by the provider.
- `-t, --description-tags `: List of EXIF tags to write the description to. Defaults to common description tags.
- `--tag-tags `: List of EXIF tags to write the tags to. Defaults to common tags.
- `-v, --verbose`: Enable verbose output for debugging.
- `-d, --dry-run`: Preview AI-generated content without writing to the image file.
- `--exif-tool-write-args `: Additional ExifTool arguments for writing metadata.
- `--provider-args `: Additional arguments for the AI provider.
- `-w, --watch `: Watch directory for new files to process.
- `--avoid-overwrite`: Avoid overwriting if EXIF tags already exist in the file.
- `--ext `: File extensions to watch. Only files with this extensions will be processed.
- `--concurrency `: The numbers of files to process concurrently in watch mode.
- `--face-group-ids `: List of face group IDs to use for face recognition.
- `--repeat`: The number of times to repeat the task if the AI-generated result is deemed unacceptable. This parameter helps ensure the quality of the output by allowing multiple attempts. Default value is 0. An AI-generated description is considered acceptable if it has more than 10 characters and is not in markdown format. AI-generated tags are considered acceptable if there are more than 1 tag and they are not in markdown format. Using this parameter will consume more tokens, which may incur additional costs. Use it at your own risk.

Example usage:

```bash
exif-ai -i example.jpg -a ollama -p "Describe this landscape photo."
```

### Library

To use Exif AI as a library in your project, import it and use the provided functions:

```typescript
import { execute } from "exif-ai";

const options = {
tasks: ["description"], // List of tasks to perform
path: "example.jpg", // Path to the input image file
provider: "ollama", // Name of the AI provider to use
descriptionTags: [
"XPComment",
"Description",
"ImageDescription",
"Caption-Abstract",
], // List of EXIF tags to write the description to
tagTags: ["Subject", "TagsList", "Keywords"], // List EXIF tags to write the tags to
descriptionPrompt: "Describe this landscape photo.", // Custom prompt for the AI provider to generate description
tagPrompt: "Tag this image based on subject, object, event, place.", // Custom prompt for the AI provider to generate tags
verbose: false, // Enable verbose output for debugging
dry: false, // Preview AI-generated content without writing to the image file
writeArgs: [], // Additional ExifTool arguments for writing metadata
providerArgs: [], // Additional arguments for the AI provider
avoidOverwrite: false, // Avoid overwriting if EXIF tags already exist in the file
doNotEndExifTool: false, // Do not end ExifTool process after writing metadata
faceGroupIds: [], // List of face group IDs to use for face recognition
repeat: 0, // The number of times to repeat the task if the AI-generated result is deemed unacceptable
};

execute(options)
.then(() => {
console.log("Image description has been written to EXIF metadata.");
})
.catch((error) => {
console.error("An error occurred:", error);
});
```

## Installation

To install Exif AI globally, use the following command:

```bash
npm install -g exif-ai
```

## Tasks

### Description

The `description` task generates a description of the image using the AI provider. The description is written to the specified EXIF tags defined in `descriptionTags`.

### Tag

The `tag` task generates tags for the image using the AI provider. The tags are written to the specified EXIF tags defined in `tagTags`.

### Face Recognition

The `face` task performs face recognition on the image using the [Tencent Cloud API](https://cloud.tencent.com/document/api/867/44994). The face recognition results are written to the specified EXIF tags defined in `tagTags`.

Currently, the `face` task requires user to enable face recognition service on Tencent Cloud and set a pair of Tencent Cloud API Secret ID and Tencent CLoud API Secret Key in the environment variable.

```bash
export TENCENTCLOUD_SECRET_ID=your_tencentcloud_secret_id
export TENCENTCLOUD_SECRET_KEY=your_tencentcloud_secret_key
```

### Note

Please ensure that you securely manage your API keys. Do not expose them in public repositories or other public forums.

## API Providers

Exif AI is designed to leverage various API providers for generating image descriptions and tags. We currently support five prominent providers, each offering unique capabilities and integration options. Below is a summary of the supported providers, including details on their requirements and features.

### Supported Providers

- ZhipuAI: A cutting-edge AI service provider known for its advanced algorithms. Access to this service requires an API key.
- Ollama: An innovative local AI service that operates directly on your machine. This option does not require an API key, offering a seamless and private experience.
- Google Gemini: A robust AI service powered by Google, renowned for its high-quality image processing capabilities.
- Coze: Coze is a state-of-the-art AI platform designed for everyone, enabling the development of next-generation applications and chatbots.
- OpenAI: A preeminent AI service provider, recognized for its wide range of AI-powered tools and applications. Like ZhipuAI, it requires an API key for access.

### Custom Providers

For users seeking to extend the functionality of Exif AI or integrate with additional AI services, we offer the flexibility to develop custom providers. By implementing the provider interface, you can create a custom provider to integrate with other AI services or tailor the image description generation process to your specific needs.

## Configuration

### OpenAI

To use [OpenAI](https://openai.com/), you need to set the API key. You can do this by setting an environment variable:

```bash
export OPENAI_API_KEY=your_openai_api_key
```

If you wish to use a custom API service provider that is compatible with the OpenAI API, you can set the `OPEN_API_BASEURL` environment variable to point to the desired endpoint.

```bash
export OPEN_API_BASEURL=https://api.customprovider.com/v1
```

### Zhipu AI

To use [ZhipuAI](https://open.bigmodel.cn/usercenter/apikeys), you need to set the API key. You can do this by setting an environment variable:

```bash
export ZHIPUAI_API_KEY=your_zhipuai_api_key
```

If you don't have a ZhipuAI account, you can sign up for one [here](https://www.bigmodel.cn/invite?icode=INWAHJuWBFUp07JYI6oBveZLO2QH3C0EBTSr%2BArzMw4%3D).

### Google Gemini

To use [Google Gemini](https://ai.google.dev/), you need to set the API key. You can do this by setting an environment variable:

```bash
export API_KEY=your_google_api_key
```

### Coze Bot

To use [Coze](https://www.coze.com/) bot, you need to set the API key. You can do this by setting an environment variable:

```bash
export COZE_API_KEY=your_coze_api_key
```

When utilizing Coze within the Chinese region, configure the endpoint setting as follows:

```bash
export COZE_ENDPOINT=https://api.coze.cn
```

In Coze Bot, the `model` parameter corresponds to the bot's identifier. To interact with the Coze API, use the following command format:

```bash
exif-ai -a coze_bot -i image.jpg -m 7402199305639034921
```

Here, `-a coze_bot` specifies the Coze API, `-i image.jpg` indicates the input image file, and `-m 7402199305639034921` is the unique identifier for the bot model you wish to use.

### Ollama

Ollama runs locally and does not require an API key. Ensure that Ollama is installed and properly configured on your machine. Refer to the [Ollama GitHub repository](https://github.com/ollama/ollama) for installation and setup instructions.

To use remote Ollama service, you can defined the url in providerArgs:

```bash
exif-ai --providerArgs "http://ollama.example.com:8080" -a ollama -i image.jpg
```

```js
providerArgs: ["http://ollama.example.com:8080"],
```

## Develop

### Prerequisites

- Node.js >=16
- pnpm

### Clone the Repository

First, clone the Exif AI repository to your local machine:

```bash
git clone https://github.com/tychenjiajun/exif-ai.git
cd exif-ai
```

### Install Dependencies

Next, install the required dependencies using `pnpm`.

```bash
pnpm install
```

### Build

```bash
pnpm run build
```

### Watch

```bash
pnpm run watch
```

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/tychenjiajun/exif-ai

Awesome Lists containing this project

README