https://github.com/digitalheir/cebuano-dictionary-js
🇵🇭 A dictionary and stemmer for the Cebuano language spoken in the Philippines
cebuano cebuano-dictionary dictionary javascript philippines stemmer
- Host: GitHub
- URL: https://github.com/digitalheir/cebuano-dictionary-js
- Owner: digitalheir
- Created: 2017-01-28T13:21:11.000Z (over 9 years ago)
- Default Branch: master
- Last Pushed: 2022-11-14T11:26:20.000Z (over 3 years ago)
- Last Synced: 2024-10-10T09:25:31.009Z (over 1 year ago)
- Topics: cebuano, cebuano-dictionary, dictionary, javascript, philippines, stemmer
- Language: TypeScript
- Homepage: https://digitalheir.github.io/cebuano-dictionary-js/
- Size: 11.5 MB
- Stars: 4
- Watchers: 3
- Forks: 6
- Open Issues: 5
Metadata Files:
- Readme: readme.md
README
# Cebuano dictionary and stemmer
[Build status](https://travis-ci.org/digitalheir/cebuano-dictionary-js)
[npm package](https://www.npmjs.com/package/cebuano-stemmer)
[License](https://github.com/digitalheir/cebuano-dictionary-js/blob/master/LICENSE)
[Live demo in browser](https://digitalheir.github.io/cebuano-dictionary-js/)
A dictionary and stemmer for the Cebuano language spoken in the Philippines.
Written in TypeScript and compiled to an [ES5 CommonJS module](https://www.npmjs.com/package/cebuano-stemmer) (for use in Node.js) and a [single-file ES5 UMD module](https://github.com/digitalheir/cebuano-stemmer-js/releases) (for use in the browser).
Based on [J. Hellingman's Cebuano stemmer for Java and dictionary for Android](https://github.com/jhellingman/cebuano-dictionary-app/), which is in turn based on J. Wolff's seminal work [*A Dictionary of Cebuano Visayan*](http://www.gutenberg.org/files/40074/40074-h/40074-h.htm).
## Dictionary
The dictionary data was converted from [a SQLite database](https://github.com/jhellingman/cebuano-dictionary-app/tree/master/app/src/main/assets/databases) and loaded into a publicly readable CouchDB instance hosted at https://publicdomainreview.cloudant.com/cebuano_dictionary.
The primary index for an entry is the normalized root form of the Cebuano word plus an identification number.
You can query it with the regular [CouchDB API](http://docs.couchdb.org/en/2.0.0/api/). For instance, to get the first 20 entries starting with "k":
[https://publicdomainreview.cloudant.com/cebuano_dictionary/_all_docs?include_docs=true&limit=20&startkey="k"](https://publicdomainreview.cloudant.com/cebuano_dictionary/_all_docs?include_docs=true&limit=20&startkey=%22k%22)
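The same request can be built programmatically. The following is a minimal sketch, not part of this package: `buildAllDocsUrl` and `fetchEntries` are hypothetical helper names, the global `fetch` is assumed to be available (browsers, Node.js 18+), and the fields inside each `doc` are left untyped because they are not described here. It also assumes the hosted instance is still reachable.

```typescript
// Base URL of the dictionary database, as given in this README.
const DB_URL = "https://publicdomainreview.cloudant.com/cebuano_dictionary";

// Subset of a CouchDB `_all_docs` response. The contents of `doc` are not
// documented in this README, so they stay untyped.
interface AllDocsResponse {
  total_rows: number;
  rows: Array<{ id: string; key: string; doc?: Record<string, unknown> }>;
}

// Hypothetical helper: builds the `_all_docs` URL for entries whose ID sorts
// at or after `startKey`. CouchDB expects keys to be JSON-encoded, which is
// why the key goes through JSON.stringify before URL encoding.
function buildAllDocsUrl(startKey: string, limit = 20): string {
  const params = new URLSearchParams({
    include_docs: "true",
    limit: String(limit),
    startkey: JSON.stringify(startKey),
  });
  return `${DB_URL}/_all_docs?${params.toString()}`;
}

// Hypothetical helper: fetches one page of entries and decodes the JSON body.
async function fetchEntries(startKey: string, limit = 20): Promise<AllDocsResponse> {
  const response = await fetch(buildAllDocsUrl(startKey, limit));
  if (!response.ok) {
    throw new Error(`CouchDB request failed with status ${response.status}`);
  }
  return (await response.json()) as AllDocsResponse;
}
```

Calling `buildAllDocsUrl("k")` yields the URL linked above; `fetchEntries("k")` would then resolve to the decoded response, with the entries under `rows[i].doc`.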
### Search
The database provides full-text search for English and Cebuano. You can query using the [Lucene Query Parser Syntax](https://docs.cloudant.com/search.html#query-syntax).
Examples:
|Description|URL|
|---|---|
|English full text search through entries for "fire"|https://publicdomainreview.cloudant.com/cebuano_dictionary/_design/search/_search/fromEnglish?q=fire|
|English synonym search for "fire"|https://publicdomainreview.cloudant.com/cebuano_dictionary/_design/search/_search/fromEnglish?q=synonym:fire|
|Cebuano full text search through entries for "abat"|https://publicdomainreview.cloudant.com/cebuano_dictionary/_design/search/_search/fromCebuano?q=abat|
|Cebuano synonym search for "abat"|https://publicdomainreview.cloudant.com/cebuano_dictionary/_design/search/_search/fromCebuano?q=synonym:abat|
To include the full entry documents in the results, add the query parameter `include_docs=true`.
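The example URLs above all follow one pattern, which can be captured in a small helper. This is only a sketch: `buildSearchUrl`, `SearchIndex` and `SearchOptions` are hypothetical names, not part of this package, and the search term is passed through as-is, so any Lucene special characters in it would need escaping by the caller.

```typescript
// Search endpoint prefix, taken from the example URLs in the table above.
const SEARCH_BASE_URL =
  "https://publicdomainreview.cloudant.com/cebuano_dictionary/_design/search/_search";

// The two search indexes that appear in the examples.
type SearchIndex = "fromEnglish" | "fromCebuano";

interface SearchOptions {
  synonym?: boolean;     // restrict the query to the `synonym` field
  includeDocs?: boolean; // ask for full entry documents in the response
}

// Hypothetical helper: builds a search URL for the given index and term.
function buildSearchUrl(
  index: SearchIndex,
  term: string,
  options: SearchOptions = {},
): string {
  const query = options.synonym ? `synonym:${term}` : term;
  const params = new URLSearchParams({ q: query });
  if (options.includeDocs) {
    params.set("include_docs", "true");
  }
  return `${SEARCH_BASE_URL}/${index}?${params.toString()}`;
}
```

For example, `buildSearchUrl("fromEnglish", "fire")` reproduces the first URL in the table. With `{ synonym: true }` the colon is percent-encoded as `%3A`, which is equivalent to the `q=synonym:fire` form shown above.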
## Stemmer
A JSON list of word roots used in the stemmer is available at https://publicdomainreview.cloudant.com/cebuano_dictionary/_design/show/_list/keyset/normalized_heads_where_pos_is_not_empty_string?group_level=1
A JSON list of all heads is available at https://publicdomainreview.cloudant.com/cebuano_dictionary/_design/show/_list/keyset/heads?group_level=1