An open API service indexing awesome lists of open source software.

https://github.com/apertium/apertium-kir

Apertium linguistic data for Kyrgyz
https://github.com/apertium/apertium-kir

apertium-languages

Last synced: about 1 month ago
JSON representation

Apertium linguistic data for Kyrgyz

Awesome Lists containing this project

README

        

# apertium-kir: Kyrgyz for apertium

This is an Apertium monolingual language package for Kyrgyz. What
you can use this language package for:

* Morphological analysis of Kyrgyz
* Morphological generation of Kyrgyz
* Part-of-speech tagging of Kyrgyz

Requirements
-------------------------------------------------------------------------------

You will need the following software installed:

* lttoolbox (>= 3.3.0)
* apertium (>= 3.3.0)
* vislcg3 (>= 0.9.9.10297)
* hfst (>= 3.8.2)

If this does not make any sense, we recommend you look at: apertium.org.

Compiling
-------------------------------------------------------------------------------

Given the requirements being installed, you should be able to just run:

$ ./configure
$ make

You can use `./autogen.sh` instead of `./configure` if you're compiling
from source.

If you're doing development, you don't have to install the data, you
can use it directly from this directory.

If you are installing this language package as a prerequisite for an
Apertium translation pair, then do (typically as root / with sudo):

# make install

You can give a `--prefix` to `./configure` to install as a non-root user,
but make sure to use the same prefix when installing the translation
pair and any other language packages.

Testing
-------------------------------------------------------------------------------

If you are in the source directory after running make, the following
commands should work:

* Morphological analysis:

$ echo "Бул кыргызча морфологиялык талдоо" | apertium -d . kir-morph
^Бул/бул/бул/бул+э/бул+э$ ^кыргызча/кыргызча/кыргызча/кыргызча/кыргызча+э/кыргызча+э$ ^морфологиялык/морфологиялык/морфологиялык+э/морфологиялык+э$ ^талдоо/талда/талда+э/талда+э$^./.$

* Tagging (analysis + disambiguation):

$ echo "Бул кыргызча морфологиялык талдоо" | apertium -d . kir-tagger
^Бул/бул$ ^кыргызча/кыргызча$ ^морфологиялык/морфологиялык$ ^талдоо/талда$^./.$
# NOTE: Disambiguation needs some work. The correct output should be:
# ^Бул/бул$ ^кыргызча/кыргызча$ ^морфологиялык/морфологиялык$ ^талдоо/талда+э$

* Morphological generation:

$ echo "^бул$ ^кыргызча$ ^морфологиялык$ ^талда+э$" | apertium -f none -d . kir-gener
бул кыргызча морфологиялык талдоо

Files and data
-------------------------------------------------------------------------------

* `apertium-kir.kir.lexc` - Morphotactic dictionary
* `apertium-kir.kir.twol` - Morphophonological rules
* `apertium-kir.kir.rlx` - Constraint Grammar disambiguation rules
* `apertium-kir.post-kir.dix` - Post-generator
* `kir.prob` - Tagger model
* `modes.xml` - Translation modes

For more information
-------------------------------------------------------------------------------

* https://wiki.apertium.org/wiki/Installation
* https://wiki.apertium.org/wiki/apertium-kir
* https://wiki.apertium.org/wiki/Using_an_lttoolbox_dictionary

Help and support
-------------------------------------------------------------------------------

If you need help using this language pair or data, you can contact:

* Mailing list: [email protected]
* IRC: `#apertium` on `irc.oftc.net`

See also the file AUTHORS included in this distribution.