https://github.com/picovoice/speech-to-intent-benchmark

benchmark for Speech-to-Intent engines
https://github.com/picovoice/speech-to-intent-benchmark

natural-language-understanding speech-recognition speech-to-intent speech-to-text voice-assistant voice-commands voice-recognition

Last synced: 7 months ago
JSON representation

benchmark for Speech-to-Intent engines

Host: GitHub
URL: https://github.com/picovoice/speech-to-intent-benchmark
Owner: Picovoice
License: apache-2.0
Created: 2019-05-05T00:26:31.000Z (about 6 years ago)
Default Branch: master
Last Pushed: 2024-06-04T19:40:01.000Z (about 1 year ago)
Last Synced: 2024-06-04T21:48:48.812Z (about 1 year ago)
Topics: natural-language-understanding, speech-recognition, speech-to-intent, speech-to-text, voice-assistant, voice-commands, voice-recognition
Language: Python
Homepage: https://picovoice.ai/
Size: 82.8 MB
Stars: 15
Watchers: 9
Forks: 4
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

        # Speech-to-Intent Benchmark

[![License](https://img.shields.io/badge/License-Apache%202.0-blue.svg)](https://github.com/Picovoice/speech-to-intent-benchmark/blob/master/LICENSE)

Made in Vancouver, Canada by [Picovoice](https://picovoice.ai)

[![Twitter URL](https://img.shields.io/twitter/url?label=%40AiPicovoice&style=social&url=https%3A%2F%2Ftwitter.com%2FAiPicovoice)](https://twitter.com/AiPicovoice)

[![YouTube Channel Views](https://img.shields.io/youtube/channel/views/UCAdi9sTCXLosG1XeqDwLx7w?label=YouTube&style=social)](https://www.youtube.com/channel/UCAdi9sTCXLosG1XeqDwLx7w)

This framework benchmarks the accuracy of Picovoice's Speech-to-Intent engine, [Rhino](https://github.com/Picovoice/rhino).

It compares the accuracy of Rhino with:

- [Amazon Lex](https://aws.amazon.com/lex/)

- [Google Dialogflow](https://dialogflow.com/)

- [IBM Watson](https://www.ibm.com/watson)

- [Microsoft LUIS](https://www.luis.ai/)

## Table of Contents

- [Speech-to-Intent Benchmark](#speech-to-intent-benchmark)

  - [Table of Contents](#table-of-contents)

  - [Results](#results)

  - [Data](#data)

  - [How to Reproduce?](#how-to-reproduce)

## Results

Command acceptance rate is the probability of an engine correctly understanding the spoken command. Below is the summary:

![](data/misc/result-summary.svg)

The figure below depicts engines performance at each SNR:

![](data/misc/result.svg)

## Data

The speech data are crowd-sourced from more than 50 unique speakers. Each speaker contributed about ten different utterances.

Collectively there are 619 commands used in this benchmark. We test the engines in noisy conditions to simulate real-world

situations. Noise is from [Freesound](https://freesound.org/).

## How to Reproduce?

Clone the repository:

```console

git clone https://github.com/Picovoice/speech-to-intent-benchmark.git

```

Get the usage message:

```console

python3 src/bench.py --help

```

Then run the script for each engine.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/picovoice/speech-to-intent-benchmark

Awesome Lists containing this project

README