An open API service indexing awesome lists of open source software.

https://github.com/santhoshtr/sfst

Stuttgart Finite State Transducer system
https://github.com/santhoshtr/sfst

bison-yacc cmake computational-morphologies fst pybind11 readline

Last synced: 9 months ago
JSON representation

Stuttgart Finite State Transducer system

Awesome Lists containing this project

README

          

# SFST - Stuttgart Finite State Transducer

| CI | status |
|----------------------|--------|
| CMake builds | [![CMake Actions Status][actions-cmake-badge]][actions-cmake-link] |
| Pip builds | [![Pip Actions Status][actions-pip-badge]][actions-pip-link] |
| Wheels builds | [![Wheels Actions Status][actions-wheels-badge]][actions-wheels-link] |

[actions-cmake-link]: https://github.com/santhoshtr/sfst/actions?query=workflow%3A%22CMake
[actions-pip-link]: https://github.com/santhoshtr/sfst/actions?query=workflow%3A%22Pip
[actions-wheels-link]: https://github.com/santhoshtr/sfst/actions?query=workflow%3A%22Wheels
[actions-cmake-badge]: https://github.com/santhoshtr/sfst/workflows/CMake/badge.svg
[actions-pip-badge]: https://github.com/santhoshtr/sfst/workflows/Pip/badge.svg
[actions-wheels-badge]: https://github.com/santhoshtr/sfst/workflows/Wheels/badge.svg

## Installation

SFST can be compiled in Unix/Linux, Windows and Mac operating systems.

The SFST command line utilities has a few external requirements.

* The "Flex" scanner generator which can be downloaded from: . In linux systems they can be installed using package manager. For example, in Ubuntu, `apt install flex` installs it.
* The "Bison" parser generator available from . In linux systems they can be installed using package manager. For example, in Ubuntu, `apt install bison` installs it.

After unpacking the software package, in the top directory of source code,

```bash
mkdir build
cd build
cmake ..
cmake --build .
```

to compile the tools. Then call

```bash
cmake install
```

to install the tools in /usr/local/bin.

If you want to install the tools in a different directory, specify a different prefix using the --prefix option:

```bash
cmake --install . --prefix /my/install/prefix
```

Finall, to install the manpages in /usr/local/man/man1:

```bash
make maninstall
```

The subdirectory data contains a simple example of an English
morphological analyser, the source code of the German SMOR morphology
(with just a few sample lexicon entries), and the general morphology
XMOR which may be used as a starting point for the development of a
computational morphology.

## Usage

See the manual SFST-Manual.pdf in the doc subdirectory and the man
pages for more information on the SFST tools. doc/SFST-Tutorial.pdf
explains how computational morphologies are implemented in SFST.

If you want to implement your own tools based on the SFST code, have a
look at fst-infl.C and fst-infl2.C. They show how analysers are
implemented with the standard (fst-infl) and the compact (fst-infl2)
transducer format.

## Author

The SFST tools have been implemented by Helmut Schmid, Institute for
Computational Linguistics, University of Stuttgart, Germany and they
are available under the GNU public license version 2 or higher.

Currently maintained by Santhosh Thottingal.

Please cite the following publication if you want to refer to the SFST tools:

`A Programming Language for Finite State Transducers, Proceedings of the 5th International Workshop on Finite State Methods in Natural Language Processing (FSMNLP 2005), Helsinki, Finland.` [pdf](https://www.cis.uni-muenchen.de/~schmid/papers/SFST-PL.pdf)

Same can be cited as:

```bibtex
@inproceedings{schmid2005programming,
title={A programming language for finite state transducers},
author={Schmid, Helmut},
booktitle={Proceedings of the 5th International Workshop on Finite State Methods in Natural Language Processing (FSMNLP)},
volume={4002},
pages={308--309},
year={2005}
}
```