https://github.com/dictation-toolbox/dragonfly

Speech recognition framework allowing powerful Python-based scripting and extension of Dragon NaturallySpeaking (DNS), Windows Speech Recognition (WSR), Kaldi and CMU Pocket Sphinx
https://github.com/dictation-toolbox/dragonfly

python speech-recognition

Last synced: 5 months ago
JSON representation

Speech recognition framework allowing powerful Python-based scripting and extension of Dragon NaturallySpeaking (DNS), Windows Speech Recognition (WSR), Kaldi and CMU Pocket Sphinx

Host: GitHub
URL: https://github.com/dictation-toolbox/dragonfly
Owner: dictation-toolbox
License: lgpl-3.0
Created: 2017-05-26T08:23:01.000Z (about 9 years ago)
Default Branch: master
Last Pushed: 2026-01-22T01:03:47.000Z (6 months ago)
Last Synced: 2026-01-22T14:52:19.849Z (6 months ago)
Topics: python, speech-recognition
Language: Python
Homepage:
Size: 5.58 MB
Stars: 412
Watchers: 20
Forks: 77
Open Issues: 26
Metadata Files:
- Readme: README.rst
- Changelog: CHANGELOG.rst
- Contributing: CONTRIBUTING.rst
- License: LICENSE.txt
- Authors: AUTHORS.txt

Awesome Lists containing this project

README

Dragonfly
=========

|Docs Status|
|Join Matrix/Gitter chat|

.. contents:: Contents

Introduction
----------------------------------------------------------------------------

Dragonfly is a speech recognition framework for Python that makes it
convenient to create custom commands to use with speech recognition
software. It was written to make it very easy for Python macros, scripts,
and applications to interface with speech recognition engines. Its design
allows speech commands and grammar objects to be treated as first-class
Python objects.

This project is a fork of the original
`t4ngo/dragonfly `__ project. It is
known as *dragonfly2* on the PyPI repository, but for purposes of
functionality, it is referred to as Dragonfly below.

Dragonfly can be used for general programming by voice. It is flexible
enough to allow programming in any language, not just Python. It can also be
used for speech-enabling applications, automating computer activities
and dictating prose.

Dragonfly contains its own powerful framework for defining and executing
actions. It includes actions for text input and key-stroke simulation. This
framework is cross-platform, working on Windows, macOS and Linux (X11 only).
See the `actions sub-package documentation
`__
for more information, including code examples.

Dragonfly currently supports the following speech recognition engines:

- *Dragon*, a product of *Nuance*. All versions up to 16 (the latest)
should be supported. *Home*, *Professional Individual* and previous
similar editions of *Dragon* are supported. Other editions may work too.
- *Windows Speech Recognition* (WSR), included with Microsoft Windows
Vista, Windows 7+, and freely available for Windows XP.
- *Kaldi*, open source (AGPL) and multi-platform.
- *CMU Pocket Sphinx*, open source and multi-platform.

Documentation and FAQ
----------------------------------------------------------------------------

Dragonfly's documentation is available online at `Read the
Docs `__. The changes in
each release are listed in the project's `changelog
`__.
Dragonfly's FAQ is available in the documentation `here
`__.
There are also a number of Dragonfly-related questions on `Stackoverflow
`__, although
many of them are related to issues resolved in the latest version of
Dragonfly. Real-time chat and historical discussions about Dragonfly can be
found in the `Gitter chat room
`__ (also `bridged to
Matrix `__).

CompoundRule Usage example
----------------------------------------------------------------------------

A very simple example of Dragonfly usage is to create a static voice
command with a callback that will be called when the command is spoken.
This is done as follows:

.. code-block:: python

from dragonfly import Grammar, CompoundRule

# Voice command rule combining spoken form and recognition processing.
class ExampleRule(CompoundRule):
spec = "do something computer" # Spoken form of command.
def _process_recognition(self, node, extras): # Callback when command is spoken.
print("Voice command spoken.")

# Create a grammar which contains and loads the command rule.
grammar = Grammar("example grammar") # Create a grammar to contain the command rule.
grammar.add_rule(ExampleRule()) # Add the command rule to the grammar.
grammar.load() # Load the grammar.

To use this example, save it in a command module in your module loader
directory or Natlink user directory, load it and then say *do something
computer*. If the speech recognition engine recognized the command, then
``Voice command spoken.`` will be printed in the Natlink messages window.
If you're not using Dragon, then it will be printed into the console window.

MappingRule usage example
----------------------------------------------------------------------------

A more common use of Dragonfly is the ``MappingRule`` class, which allows
defining multiple voice commands. The following example is a simple grammar
to be used when Notepad is the foreground window:

.. code-block:: python

from dragonfly import (Grammar, AppContext, MappingRule, Dictation,
Key, Text)

# Voice command rule combining spoken forms and action execution.
class NotepadRule(MappingRule):
# Define the commands and the actions they execute.
mapping = {
"save [file]": Key("c-s"),
"save [file] as": Key("a-f, a/20"),
"save [file] as ": Key("a-f, a/20") + Text("%(text)s"),
"find ": Key("c-f/20") + Text("%(text)s\n"),
}

# Define the extras list of Dragonfly elements which are available
# to be used in mapping specs and actions.
extras = [
Dictation("text")
]

# Create the grammar and the context under which it'll be active.
context = AppContext(executable="notepad")
grammar = Grammar("Notepad example", context=context)

# Add the command rule to the grammar and load it.
grammar.add_rule(NotepadRule())
grammar.load()

To use this example, save it in a command module in your module loader
directory or Natlink user directory, load it, open a Notepad window and then
say one of mapping commands. For example, saying *save* or *save file* will
cause the control and S keys to be pressed.

The example aboves don't show any of Dragonfly's exciting features, such as
dynamic speech elements. To learn more about these, please take a look at
`Dragonfly's online docs `__.

Installation
----------------------------------------------------------------------------

Dragonfly is a Python package. It can be installed as *dragonfly* using
pip:

.. code:: shell

pip install dragonfly2

If you wish to install the latest release candidate for version 1.0.0,
please run the following command instead:

.. code:: shell

pip install dragonfly2==1.0.0-rc2

These versions are more up-to-date and have fewer requirements. The
documentation for them is available at `Read the Docs (latest)
`_.

If you are installing this on Linux, you will also need to install the
`wmctrl `__, `xdotool
`__ and `xsel
`__ programs.

Please note that, on Linux, Dragonfly is only fully functional in an X11
session. Input action classes, application contexts and the ``Window``
class will **not** be functional under Wayland. It is recommended that
Wayland users switch to X11, Windows or macOS.

Dragonfly can also be installed by cloning this repository or
downloading it from `the releases
page `__ and
running the following (or similar) command in the project's root
directory:

.. code:: shell

pip install -e .

If pip fails to install *dragonfly* or any of its required or extra
dependencies, then you may need to upgrade pip with the following command:

.. code:: shell

pip install --upgrade pip

To build the dragonfly python package, run these commands in the projects root directory.

.. code:: shell

pip install build
Python -m build

Speech recognition engine back-ends
----------------------------------------------------------------------------

Installation instructions, requirements and API references for each
Dragonfly speech recognition engine are documented separately on the
following pages:

* `Natlink and DNS engine
`_
* `SAPI 5 and WSR engine
`_
* `Kaldi engine
`_
* `CMU Pocket Sphinx engine
`_
* `Text-input engine
`_

Existing command modules
----------------------------------------------------------------------------

The related resources page of Dragonfly's documentation has a section on
`command
modules `__
which lists various sources.

.. |Docs Status| image:: https://readthedocs.org/projects/dragonfly/badge/?version=latest&style=flat
:target: https://dragonfly.readthedocs.io
.. |Join Matrix/Gitter chat| image:: https://img.shields.io/matrix/dragonfly2:matrix.org.svg?label=%5BMatrix%20chat%5D
:target: https://app.gitter.im/#/room/#dragonfly2:matrix.org

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/dictation-toolbox/dragonfly

Awesome Lists containing this project

README