Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/geneweb/unidecode

Convert unicode strings into its ASCII representation
https://github.com/geneweb/unidecode

Last synced: about 2 months ago
JSON representation

Convert unicode strings into its ASCII representation

Awesome Lists containing this project

README

        

# unidecode

Convert unicode strings into its ASCII representation.

The purpose of this library is the same as python's
[unidecode](https://pypi.org/project/Unidecode/) library (version
1.1.1).

Code of the initial release of this library has been extracted from
[GeneWeb](https://github.com/geneweb/geneweb) and adapted to be
released in an independent library.

## Installation

```bash
opam install unidecode
```

## License

Released under the terms of the [GNU GENERAL PUBLIC LICENSE](LICENSE).

## Limitations

- Only supports NFC normalization form.
- Transliteration targets french language
(i.e. russian `у` gives `ou` while `u` could be expected).
This will eventually be parameterizable.
- Transliteration might produce strange casing
(e.g. `У` produce `OU` while `Ou` could be expected).
Choosing between default (current) behavior, lower casing, upper
casing, and capitalization will eventually be an option.

## Instructions for developpers

```bash
dune build # build the library
dune install # install the built library
dune clean # clean compilation artifacts
dune runtest # run unit tests
dune build @runbench # compare with other libs
```