https://github.com/jacoblincool/whisper-cli

A CLI speech recognition tool, using OpenAI Whisper, supports audio file transcription and near-realtime microphone input.
https://github.com/jacoblincool/whisper-cli

Last synced: about 1 year ago
JSON representation

A CLI speech recognition tool, using OpenAI Whisper, supports audio file transcription and near-realtime microphone input.

Host: GitHub
URL: https://github.com/jacoblincool/whisper-cli
Owner: JacobLinCool
License: mit
Created: 2023-03-05T21:14:06.000Z (over 3 years ago)
Default Branch: main
Last Pushed: 2025-05-07T00:19:21.000Z (about 1 year ago)
Last Synced: 2025-05-07T01:27:24.442Z (about 1 year ago)
Language: TypeScript
Size: 373 KB
Stars: 16
Watchers: 2
Forks: 2
Open Issues: 13
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

# Whisper CLI

A CLI speech recognition tool, using OpenAI Whisper, supports audio file transcription and near-realtime microphone input.

It supports running [Smart Whisper](https://github.com/JacobLinCool/smart-whisper) locally with `whisper smart` subcommand.

```sh
❯ whisper help
Usage: whisper [options] [command]

A CLI speech recognition tool, using OpenAI Whisper, supports audio file transcription and near-realtime microphone input.

Options:
-V, --version output the version number
-h, --help display help for command

Commands:
recognize|rec [options] Recognize text from an audio file
microphone|mic [options] Recognize text from microphone
help [command] display help for command
```

## Installation

```sh
npm install -g whisper-cli
```

## Usage

You need to set the `OPENAI_API_KEY` environment variable first.

> You can also put it in a `.env` file in the current directory.

```sh
whisper help
```

### Smart Whisper

[Smart Whisper](https://github.com/JacobLinCool/smart-whisper) allows you to run whisper locally with native performance.

```sh
whisper smart help # show help
whisper smart model download base # download base model
whisper smart transcribe --gpu --model base # transcribe audio file with base model on GPU
whisper smart server --gpu --port 3000 --model large-v3 # run server on port 3000 with large-v3 model on GPU
```

`whisper smart server` runs a transcribe server, which manages the model memory automatically, it will offload the model when idling, and load it back when needed.

The OpenAPI spec is available at `http://localhost:/openapi.json`. You can use API-Spec to browse it:

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/jacoblincool/whisper-cli

Awesome Lists containing this project

README