https://github.com/inokinoki/ai-no-jimaku-gumi

AI no jimaku gumi (AIの字幕組), a subtitle maker for video using AI.
https://github.com/inokinoki/ai-no-jimaku-gumi

ai ffmpeg llm ml multilingual subtitle subtitle-conversion subtitle-generator whisper whisper-cpp

Last synced: about 1 month ago
JSON representation

AI no jimaku gumi (AIの字幕組), a subtitle maker for video using AI.

Host: GitHub
URL: https://github.com/inokinoki/ai-no-jimaku-gumi
Owner: Inokinoki
License: mit
Created: 2024-12-10T22:36:43.000Z (11 months ago)
Default Branch: master
Last Pushed: 2025-01-14T17:59:28.000Z (10 months ago)
Last Synced: 2025-03-28T22:11:28.443Z (8 months ago)
Topics: ai, ffmpeg, llm, ml, multilingual, subtitle, subtitle-conversion, subtitle-generator, whisper, whisper-cpp
Language: Rust
Homepage:
Size: 64.5 KB
Stars: 187
Watchers: 2
Forks: 13
Open Issues: 7
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

          # AI no jimaku gumi (AIの字幕組)

[![Build and test on Linux](https://github.com/Inokinoki/ai-no-jimaku-gumi/actions/workflows/linux-rust.yml/badge.svg)](https://github.com/Inokinoki/ai-no-jimaku-gumi/actions/workflows/linux-rust.yml)

AI no jimaku gumi is a cli utility to facilitate the translation and subtitle making of video.

## Setup

To get started with AI no jimaku gumi, follow these steps:

1. Clone the repository:

    ```bash

    git clone https://github.com/Inokinoki/ai-no-jimaku-gumi.git

    ```

2. Navigate to the project directory:

    ```bash

    cd ai-no-jimaku-gumi

    ```

3. Install build dependencies:

### macOS

Using Homebrew:

```bash

brew install cmake ffmpeg

```

### Linux

Ubuntu:

```bash

apt-get install -y clang cmake make pkg-config \

    libavcodec-dev libavdevice-dev libavfilter-dev libavformat-dev \

    libavutil-dev libpostproc-dev libswresample-dev libswscale-dev 

```

Fedora:

```bash

dnf install clang cmake ffmpeg-free-devel make pkgconf-pkg-config

```

Arch Linux:

```bash

pacman -S clang cmake ffmpeg make pkgconf

```

Please look for `clang`, `cmake`, `make`, `pkgconfig` and `ffmpeg` packages in your distribution, if it's not one of above.

You might need to install some other packages to enable GPU/NPU acceleration.

### Windows

TODO

## Build

Build with cargo:

```bash

cargo build

```

Download whisper model(you can also download other models refer: https://huggingface.co/ggerganov/whisper.cpp):

```bash

wget https://huggingface.co/ggerganov/whisper.cpp/resolve/main/ggml-tiny.bin

```

Run it with your video path after `--input-video-path` and target language after `-t`.

## Usage

To use AI no jimaku gumi, refer this help:

```

aI NO jimaKu gumI, a subtitle maker using AI.

Usage: ainojimakugumi [OPTIONS] --input-video-path 

Options:

  -i, --input-video-path 

          Path to the input video

      --source-language 

          Which language to translate from (default: "ja") (possible values: "en", "es", "fr", "de", "it", "ja", "ko", "pt", "ru", "zh") (example: "ja") [default: ja]

      --target-language 

          Which language to translate to (default: "en") (possible values: "en", "es", "fr", "de", "it", "ja", "ko", "pt", "ru", "zh") (example: "en") [default: en]

      --start-time 

          Video start time (not used yet) [default: 0]

      --end-time 

          Video end time (not used yet) [default: 0]

      --subtitle-source 

          Subtitle source (default: "audio") (possible values: "audio", "container", "ocr") (example: "audio") (long_about: "Subtitle source to use") [default: audio]

      --ggml-model-path 

          ggml model path (default: "ggml-tiny.bin") (example: "ggml-tiny.bin", ggml-small.bin") (long_about: "Path to the ggml model") [default: ggml-tiny.bin]

      --only-extract-audio

          Only extract the audio (default: false) (long_about: "Only extract the audio, if subtitle source is audio, but do not transcribe (Debug purpose)") (example: true)

      --only-transcript

          Only save the transcripted subtitle (default: false) (long_about: "Only save the transcripted subtitle but do not translate (Debug purpose)") (example: true)

      --original-subtitle-path 

          Original subtitle SRT file path (default: "") (example: "origin.srt") (long_about: "Original subtitle path to save the transcripted subtitle as SRT") [default: ]

      --only-translate

          Only translate the subtitle (default: false) (long_about: "Only translate the subtitle but do not export (Debug purpose)")

  -s, --subtitle-backend 

          Subtitle backend (default: "srt") (possible values: "srt", "container", "embedded") (example: "srt") (long_about: "Subtitle backend to use") [default: srt]

      --subtitle-output-path 

          Subtitle output path (default: "None") (example: "output.srt") (long_about: "Subtitle output path (if srt) or video output path (if container or embedded)")

  -t, --translator-backend 

          Translator backend (default: "deepl") (possible values: "deepl", "google", "llm", "whisper") (example: "google") (long_about: "Translator backend to use") [default: deepl]

      --llm-model-name 

          Model name (if llm) (default: "gpt-4o") (example: "gpt-4o") (long_about: "Model name (if using llm for translation)") [default: gpt-4o]

      --llm-api-base 

          API base (if llm) (default: "https://api.openai.com") (example: "https://api.openai.com") (long_about: "API base used in `genai` crate (if using llm for translation)") [default: https://api.openai.com]

      --llm-prompt 

          Prompt (if llm) (default: "") (example: "Translate the following text to English") (long_about: "Prompt (if using llm for translation)") [default: ]

  -h, --help

          Print help

  -V, --version

          Print version

```

We are currently supporting only `deepl`, `llm`, `whisper` translation and `srt` export.

### Translator backend

You might need to follow the specific instructions to use a translator backend:

- `deepl` (default): please provide your own DeepL API key in `DEEPL_API_KEY` env, and `DEEPL_API_URL=https://api.deepl.com` if you are using the paid API version.

- `llm`: if you are using llm translate, please refer the repo [rust-genai](https://github.com/jeremychone/rust-genai) for more detail. An example here:

```cli

export CUSTOM_API_KEY=sk-xxxxxxxxxxxxxxxxxxxxxxx

./target/debug/ainojimakugumi --input-video-path one.webm \

    --translator-backend llm \

    --llm-api-base https://sssss.com/v1/ \

    --llm-prompt 'translate this to English' \

    --llm-model-name 'gpt-4o-mini'

    --ggml-model-path ggml-small.bin

```

- `whisper` (experimental): use Whisper.cpp to directly output translated subtitles from audio **(audio only, English only)**.

## Star History

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/inokinoki/ai-no-jimaku-gumi

Awesome Lists containing this project

README