https://github.com/Cap-go/capacitor-speech-recognition

Capacitor plugin for speech recognition.
https://github.com/Cap-go/capacitor-speech-recognition
Last synced: 4 months ago
JSON representation
Capacitor plugin for speech recognition.
Host: GitHub
URL: https://github.com/Cap-go/capacitor-speech-recognition
Owner: Cap-go
License: mpl-2.0
Created: 2025-11-09T01:19:57.000Z (7 months ago)
Default Branch: main
Last Pushed: 2026-02-04T02:22:19.000Z (4 months ago)
Last Synced: 2026-02-04T14:26:30.070Z (4 months ago)
Language: Java
Size: 452 KB
Stars: 8
Watchers: 0
Forks: 2
Open Issues: 3
Metadata Files:
- Readme: README.md
- Changelog: CHANGELOG.md
- Contributing: CONTRIBUTING.md
- Funding: .github/FUNDING.yml
- License: LICENSE
- Agents: AGENTS.md
Awesome Lists containing this project

awesome-ionic - capacitor-speech-recognition - Capacitor plugin for comprehensive on-device speech recognition with live partial results. (Capgo Capacitor Plugins)
awesome-capacitor - Speech Recognition - Comprehensive on-device speech recognition with live partial results. ([Capgo plugins](https://capgo.app/) / Communication & Messaging)
README

          # @capgo/capacitor-speech-recognition

 



   ➡️ Get Instant updates for your App with Capgo

   Missing a feature? We’ll build the plugin for you 💪



Natural, low-latency speech recognition for Capacitor apps with parity across iOS and Android, streaming partial results, and permission helpers baked in.

## Why this plugin?

This package starts from the excellent [`capacitor-community/speech-recognition`](https://github.com/capacitor-community/speech-recognition) plugin, but folds in the most requested pull requests from that repo (punctuation support, segmented sessions, crash fixes) and keeps them maintained under the Capgo umbrella. You get the familiar API plus:

- ✅ **Merged community PRs** – punctuation toggles on iOS (PR #74), segmented results & silence handling on Android (PR #104), and the `recognitionRequest` safety fix (PR #105) ship out-of-the-box.

- 🚀 **New Capgo features** – configurable silence windows, streaming segment listeners, consistent permission helpers, and a refreshed example app.

- 🛠️ **Active maintenance** – same conventions as all Capgo plugins (SPM, Podspec, workflows, example app) so it tracks Capacitor major versions without bit-rot.

- 📦 **Drop-in migration** – TypeScript definitions remain compatible with the community plugin while exposing the extra options (`addPunctuation`, `allowForSilence`, `segmentResults`, etc.).

## Documentation

The most complete doc is available here: https://capgo.app/docs/plugins/speech-recognition/

## Install

```bash

npm install @capgo/capacitor-speech-recognition

npx cap sync

```

## Usage

```ts

import { SpeechRecognition } from '@capgo/capacitor-speech-recognition';

await SpeechRecognition.requestPermissions();

const { available } = await SpeechRecognition.available();

if (!available) {

  console.warn('Speech recognition is not supported on this device.');

}

const partialListener = await SpeechRecognition.addListener('partialResults', (event) => {

  console.log('Partial:', event.matches?.[0]);

});

await SpeechRecognition.start({

  language: 'en-US',

  maxResults: 3,

  partialResults: true,

});

// Later, when you want to stop listening

await SpeechRecognition.stop();

await partialListener.remove();

```

### iOS usage descriptions

Add the following keys to your app `Info.plist`:

- `NSSpeechRecognitionUsageDescription`

- `NSMicrophoneUsageDescription`

## API

* [`available()`](#available)

* [`start(...)`](#start)

* [`stop()`](#stop)

* [`getSupportedLanguages()`](#getsupportedlanguages)

* [`isListening()`](#islistening)

* [`checkPermissions()`](#checkpermissions)

* [`requestPermissions()`](#requestpermissions)

* [`getPluginVersion()`](#getpluginversion)

* [`addListener('endOfSegmentedSession', ...)`](#addlistenerendofsegmentedsession-)

* [`addListener('segmentResults', ...)`](#addlistenersegmentresults-)

* [`addListener('partialResults', ...)`](#addlistenerpartialresults-)

* [`addListener('listeningState', ...)`](#addlistenerlisteningstate-)

* [`removeAllListeners()`](#removealllisteners)

* [Interfaces](#interfaces)

* [Type Aliases](#type-aliases)

### available()

```typescript

available() => Promise

```

Checks whether the native speech recognition service is usable on the current device.

**Returns:** Promise<SpeechRecognitionAvailability>

--------------------

### start(...)

```typescript

start(options?: SpeechRecognitionStartOptions | undefined) => Promise

```

Begins capturing audio and transcribing speech.

When `partialResults` is `true`, the returned promise resolves immediately and updates are

streamed through the `partialResults` listener until {@link stop} is called.

| Param         | Type                                                                                    |

| ------------- | --------------------------------------------------------------------------------------- |

| **`options`** | SpeechRecognitionStartOptions |

**Returns:** Promise<SpeechRecognitionMatches>

--------------------

### stop()

```typescript

stop() => Promise

```

Stops listening and tears down native resources.

--------------------

### getSupportedLanguages()

```typescript

getSupportedLanguages() => Promise

```

Gets the locales supported by the underlying recognizer.

Android 13+ devices no longer expose this list; in that case `languages` is empty.

**Returns:** Promise<SpeechRecognitionLanguages>

--------------------

### isListening()

```typescript

isListening() => Promise

```

Returns whether the plugin is actively listening for speech.

**Returns:** Promise<SpeechRecognitionListening>

--------------------

### checkPermissions()

```typescript

checkPermissions() => Promise

```

Gets the current permission state.

**Returns:** Promise<SpeechRecognitionPermissionStatus>

--------------------

### requestPermissions()

```typescript

requestPermissions() => Promise

```

Requests the microphone + speech recognition permissions.

**Returns:** Promise<SpeechRecognitionPermissionStatus>

--------------------

### getPluginVersion()

```typescript

getPluginVersion() => Promise<{ version: string; }>

```

Returns the native plugin version bundled with this package.

Useful when reporting issues to confirm that native and JS versions match.

**Returns:** Promise<{ version: string; }>

--------------------

### addListener('endOfSegmentedSession', ...)

```typescript

addListener(eventName: 'endOfSegmentedSession', listenerFunc: () => void) => Promise

```

Listen for segmented session completion events (Android only).

| Param              | Type                                 |

| ------------------ | ------------------------------------ |

| **`eventName`**    | 'endOfSegmentedSession' |

| **`listenerFunc`** | () => void           |

**Returns:** Promise<PluginListenerHandle>

--------------------

### addListener('segmentResults', ...)

```typescript

addListener(eventName: 'segmentResults', listenerFunc: (event: SpeechRecognitionSegmentResultEvent) => void) => Promise

```

Listen for segmented recognition results (Android only).

| Param              | Type                                                                                                                    |

| ------------------ | ----------------------------------------------------------------------------------------------------------------------- |

| **`eventName`**    | 'segmentResults'                                                                                           |

| **`listenerFunc`** | (event: SpeechRecognitionSegmentResultEvent) => void |

**Returns:** Promise<PluginListenerHandle>

--------------------

### addListener('partialResults', ...)

```typescript

addListener(eventName: 'partialResults', listenerFunc: (event: SpeechRecognitionPartialResultEvent) => void) => Promise

```

Listen for partial transcription updates emitted while `partialResults` is enabled.

| Param              | Type                                                                                                                    |

| ------------------ | ----------------------------------------------------------------------------------------------------------------------- |

| **`eventName`**    | 'partialResults'                                                                                           |

| **`listenerFunc`** | (event: SpeechRecognitionPartialResultEvent) => void |

**Returns:** Promise<PluginListenerHandle>

--------------------

### addListener('listeningState', ...)

```typescript

addListener(eventName: 'listeningState', listenerFunc: (event: SpeechRecognitionListeningEvent) => void) => Promise

```

Listen for changes to the native listening state.

| Param              | Type                                                                                                            |

| ------------------ | --------------------------------------------------------------------------------------------------------------- |

| **`eventName`**    | 'listeningState'                                                                                   |

| **`listenerFunc`** | (event: SpeechRecognitionListeningEvent) => void |

**Returns:** Promise<PluginListenerHandle>

--------------------

### removeAllListeners()

```typescript

removeAllListeners() => Promise

```

Removes every registered listener.

--------------------

### Interfaces

#### SpeechRecognitionAvailability

| Prop            | Type                 |

| --------------- | -------------------- |

| **`available`** | boolean |

#### SpeechRecognitionMatches

| Prop          | Type                  |

| ------------- | --------------------- |

| **`matches`** | string[] |

#### SpeechRecognitionStartOptions

Configure how the recognizer behaves when calling {@link SpeechRecognitionPlugin.start}.

| Prop                  | Type                 | Description                                                                                                                                                                 |

| --------------------- | -------------------- | --------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |

| **`language`**        | string  | Locale identifier such as `en-US`. When omitted the device language is used.                                                                                                |

| **`maxResults`**      | number  | Maximum number of final matches returned by native APIs. Defaults to `5`.                                                                                                   |

| **`prompt`**          | string  | Prompt message shown inside the Android system dialog (ignored on iOS).                                                                                                     |

| **`popup`**           | boolean | When `true`, Android shows the OS speech dialog instead of running inline recognition. Defaults to `false`.                                                                 |

| **`partialResults`**  | boolean | Emits partial transcription updates through the `partialResults` listener while audio is captured.                                                                          |

| **`addPunctuation`**  | boolean | Enables native punctuation handling where supported (iOS 16+).                                                                                                              |

| **`allowForSilence`** | number  | Allow a number of milliseconds of silence before splitting the recognition session into segments. Required to be greater than zero and currently supported on Android only. |

#### SpeechRecognitionLanguages

| Prop            | Type                  |

| --------------- | --------------------- |

| **`languages`** | string[] |

#### SpeechRecognitionListening

| Prop            | Type                 |

| --------------- | -------------------- |

| **`listening`** | boolean |

#### SpeechRecognitionPermissionStatus

Permission map returned by `checkPermissions` and `requestPermissions`.

On Android the state maps to the `RECORD_AUDIO` permission.

On iOS it combines speech recognition plus microphone permission.

| Prop                    | Type                                                        |

| ----------------------- | ----------------------------------------------------------- |

| **`speechRecognition`** | PermissionState |

#### PluginListenerHandle

| Prop         | Type                                      |

| ------------ | ----------------------------------------- |

| **`remove`** | () => Promise<void> |

#### SpeechRecognitionSegmentResultEvent

Raised whenever a segmented result is produced (Android only).

| Prop          | Type                  |

| ------------- | --------------------- |

| **`matches`** | string[] |

#### SpeechRecognitionPartialResultEvent

Raised whenever a partial transcription is produced.

| Prop          | Type                  |

| ------------- | --------------------- |

| **`matches`** | string[] |

#### SpeechRecognitionListeningEvent

Raised when the listening state changes.

| Prop         | Type                                |

| ------------ | ----------------------------------- |

| **`status`** | 'started' \| 'stopped' |

### Type Aliases

#### PermissionState

'prompt' | 'prompt-with-rationale' | 'granted' | 'denied'
ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/Cap-go/capacitor-speech-recognition

Awesome Lists containing this project

README

➡️ Get Instant updates for your App with Capgo

Missing a feature? We’ll build the plugin for you 💪