https://github.com/apertium/apertium-san
Apertium linguistic data for Sanskrit
https://github.com/apertium/apertium-san
apertium-languages
Last synced: 4 months ago
JSON representation
Apertium linguistic data for Sanskrit
- Host: GitHub
- URL: https://github.com/apertium/apertium-san
- Owner: apertium
- License: gpl-2.0
- Created: 2018-03-08T01:41:49.000Z (over 7 years ago)
- Default Branch: master
- Last Pushed: 2021-07-20T00:40:07.000Z (over 4 years ago)
- Last Synced: 2025-07-09T06:26:16.527Z (4 months ago)
- Topics: apertium-languages
- Language: XML
- Size: 1010 KB
- Stars: 1
- Watchers: 7
- Forks: 0
- Open Issues: 1
-
Metadata Files:
- Readme: README
- Changelog: ChangeLog
- License: COPYING
Awesome Lists containing this project
README
Morphological analyser for Sanskrit
===================================
==Files==
* apertium-san.san.dix - Sanskrit morphological dictionary
* apertium-san.ucs-wx.xml - Transliteration file
==Installation==
To compile it you need lttoolbox installed:
$ lt-comp lr apertium-san.san.dix san.automorf.bin
To use it:
$ echo "rAmeNa" | lt-proc -c san.automorf.bin
You should get the output:
^rAmeNa/rAma$
Note: the -c option is important to respect case.
To use the generator:
$ lt-comp rl apertium-san.san.dix san.autogen.bin
Then:
$ echo "^rAma$" | lt-proc -c -g san.autogen.bin
rAmeNa
If you want to install it and have it to work
with apertium deformatters etc. you will need to have
apertium installed.
Just run:
./autogen.sh
./configure
make
make install
==Encoding==
The dictionary is in ASCII and uses the wx-notation.
==Further reading==
You can find more information on lttoolbox on the Apertium
wiki: http://xixona.dlsi.ua.es/wiki/index.php/lttoolbox