https://github.com/ktsu-dev/imagedescriber

bulk-processing cli-tool csharp dotnet image-captioning image-describer image-hashing image-search llama-vision llm local-ai ollama vision-model

Host: GitHub
URL: https://github.com/ktsu-dev/imagedescriber
Owner: ktsu-dev
License: mit
Created: 2026-02-09T11:29:32.000Z (6 months ago)
Default Branch: main
Last Pushed: 2026-07-15T03:56:15.000Z (11 days ago)
Last Synced: 2026-07-15T05:29:59.994Z (11 days ago)
Topics: bulk-processing, cli-tool, csharp, dotnet, image-captioning, image-describer, image-hashing, image-search, llama-vision, llm, local-ai, ollama, vision-model
Language: C#
Size: 309 KB
Stars: 1
Watchers: 0
Forks: 1
Open Issues: 1
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- License: LICENSE.md
- Authors: AUTHORS.url
- Copyright: COPYRIGHT.md

# ImageDescriber

A .NET 10 CLI that uses a local Ollama vision model to generate descriptions and suggested filenames for images in bulk.

## What it does

Recursively scans a directory for images (`.jpg`, `.png`, `.gif`, `.bmp`, `.webp`, `.tiff`), computes a content hash for each file, and asks a local Ollama instance to caption each unique image. Descriptions, suggested filenames, and metadata are persisted in a JSON database keyed by the content hash, so identical images at different paths share one record and re-scanning skips already-described content.
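
The hash-keyed deduplication described above can be sketched in a few lines of shell. This is an illustration only, not the tool's implementation: SHA-256 via `sha256sum` is an assumption (the README does not say which hash is used), and the function names `hash_images` and `count_unique` are hypothetical. On macOS, `shasum -a 256` is the usual equivalent.

```bash
#!/usr/bin/env bash
# Sketch only: SHA-256 is an assumed hash; the tool's actual algorithm is not documented here.

# Print "<hash>  <path>" for every image file under the given directory.
hash_images() {
  find "$1" -type f \( -iname '*.jpg' -o -iname '*.png' -o -iname '*.gif' \
      -o -iname '*.bmp' -o -iname '*.webp' -o -iname '*.tiff' \) \
      -exec sha256sum {} +
}

# Count distinct content hashes, i.e. how many images would need a description.
count_unique() {
  hash_images "$1" | awk '{ print $1 }' | sort -u | wc -l | tr -d ' '
}
```

Because the key is the hash rather than the path, two byte-identical copies in different folders contribute a single entry to the count.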

No cloud APIs are called — all inference runs locally against Ollama.
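
For reference, a single caption request to a local Ollama instance looks roughly like the sketch below. It uses Ollama's `/api/generate` endpoint with a base64-encoded image. The prompt text and the `build_payload` and `describe_image` helper names are illustrative assumptions, not taken from ImageDescriber's source.

```bash
#!/usr/bin/env bash
# Sketch only: the prompt and helper names are assumptions, not ImageDescriber's code.

# Build the JSON body for Ollama's /api/generate endpoint.
# No JSON escaping is done, so the prompt must not contain double quotes or backslashes.
build_payload() {
  local model="$1" prompt="$2" image="$3"
  local b64
  b64="$(base64 < "$image" | tr -d '\n')"
  printf '{"model":"%s","prompt":"%s","images":["%s"],"stream":false}' \
    "$model" "$prompt" "$b64"
}

# Send one image to Ollama and print the raw JSON reply.
# $1 = image path, $2 = endpoint (optional, defaults to the local instance).
describe_image() {
  local endpoint="${2:-http://localhost:11434}"
  build_payload "llama3.2-vision" "Describe this image in one sentence." "$1" |
    curl -s "$endpoint/api/generate" -d @-
}
```

With `"stream": false`, Ollama returns one JSON object whose `response` field holds the generated text. The request never leaves the configured endpoint.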

## Prerequisites

- [Ollama](https://ollama.com) running locally or on the network (defaults to `http://localhost:11434`).

- A vision-capable model installed in Ollama (default `llama3.2-vision`):

  ```bash
  ollama pull llama3.2-vision
  ollama serve
  ```

- .NET 10 SDK.
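
Before the first scan it can help to confirm that Ollama is reachable and that the model is installed. The sketch below queries Ollama's `/api/tags` endpoint, which lists installed models. The `has_model` and `check_ollama` helpers and the grep-based matching are illustrative assumptions; `ollama list` shows the same information interactively.

```bash
#!/usr/bin/env bash
# Sketch only: has_model and check_ollama are hypothetical helpers, not part of ImageDescriber.

# Read an /api/tags JSON reply on stdin; succeed if the named model is listed,
# with or without a tag suffix such as ":latest".
has_model() {
  grep -Eq "\"name\"[[:space:]]*:[[:space:]]*\"$1(:[^\"]*)?\""
}

# Query a running Ollama instance.
# $1 = model name, $2 = endpoint (optional, defaults to the local instance).
check_ollama() {
  local endpoint="${2:-http://localhost:11434}"
  curl -sf "$endpoint/api/tags" | has_model "$1"
}
```

For example, `check_ollama llama3.2-vision && echo ready` prints `ready` only when the server answers and the model appears in its list.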

## Installation

```bash
git clone https://github.com/ktsu-dev/imagedescriber ImageDescriber
cd ImageDescriber
dotnet build
```

## Usage

Without arguments the tool opens an interactive menu. All verbs can also be invoked directly.

```bash
# Interactive menu
ImageDescriber

# Scan a directory
ImageDescriber Scan -p "C:\photos"

# Scan with a custom model and remote endpoint
ImageDescriber Scan -p "C:\photos" -m llava -e http://192.168.1.100:11434

# Search stored descriptions
ImageDescriber Search -q "dog"

# Export / import the database
ImageDescriber Export -o descriptions.csv      # or .json
ImageDescriber Import -i backup.json            # or .csv

# Print database statistics
ImageDescriber Stats
```

### Verbs

| Verb | Purpose |
|---|---|
| `Menu` *(default)* | Interactive console menu. |
| `Scan` | Hash images in a directory and describe each unique one. |
| `Search` | Keyword search across stored descriptions and paths. |
| `Configure` | Edit endpoint, model, concurrency, and prompt templates. |
| `Export` | Dump the database to JSON or CSV. |
| `Import` | Merge a JSON or CSV export back into the database. |
| `Stats` | Print database statistics: total descriptions, total file size, models used, date range, duplicate count, and average description length. |

### Common options

| Option | Long form | Effect |
|---|---|---|
| `-p` | `--path` | Directory to scan (`Scan`) or default path. |
| `-e` | `--endpoint` | Ollama URL. Defaults to `http://localhost:11434`. |
| `-m` | `--model` | Vision model name. Defaults to `llama3.2-vision`. |
| `-q` | `--query` | Search query (`Search`). |
| `-o` | `--output` | Export file path. The extension picks the format. |
| `-i` | `--input` | Import file path. |
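
The `--output` rule (the extension picks the format) amounts to a small dispatch on the file name. The sketch below shows one way to express it. `format_for` is a hypothetical helper, and both the case-insensitive matching and the error branch are assumptions, since the README does not say how the tool treats unusual or unknown extensions.

```bash
#!/usr/bin/env bash
# Sketch only: format_for is hypothetical; case handling and the error branch are assumptions.

# Map an export path to a format name based on its extension.
format_for() {
  local ext="${1##*.}"
  case "$(printf '%s' "$ext" | tr '[:upper:]' '[:lower:]')" in
    json) echo "json" ;;
    csv)  echo "csv" ;;
    *)    echo "unsupported extension: $1" >&2; return 1 ;;
  esac
}
```

Under these assumptions, `format_for descriptions.csv` selects CSV and `format_for backup.json` selects JSON, mirroring the `Export` and `Import` examples in the usage section.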

## Storage

Settings and the description database are stored via `ktsu.AppDataStorage` (typically `%APPDATA%\ktsu\ImageDescriber` on Windows).

## License

MIT — see `LICENSE.md`.
