https://github.com/aditeyabaral/hashmap-lemmatizer
A simple lemmatizer made as a part of the Data Structures (UE18CS202) course at PES University. We used the concepts of graph networks combined with a HashMap to create the database of words along with their lemma.
c data-structures graph hashing lemmatization linked-list natural-language-processing
- Host: GitHub
- URL: https://github.com/aditeyabaral/hashmap-lemmatizer
- Owner: aditeyabaral
- Created: 2020-03-22T05:14:36.000Z (over 5 years ago)
- Default Branch: master
- Last Pushed: 2020-04-20T18:32:10.000Z (over 5 years ago)
- Last Synced: 2025-03-09T13:11:17.392Z (7 months ago)
- Topics: c, data-structures, graph, hashing, lemmatization, linked-list, natural-language-processing
- Language: C
- Homepage:
- Size: 20.5 KB
- Stars: 1
- Watchers: 1
- Forks: 2
- Open Issues: 3
Metadata Files:
- Readme: README.md
README
# HashMap-Lemmatizer
A simple lemmatizer made as part of the Data Structures (UE18CS202) course at PES University. We combined the concepts of graph networks with a HashMap to build a database of words along with their lemmas. Our lemmatizer ran faster than the WordNet lemmatizer in Python, so it can be used at a larger scale to lemmatize large corpora in little time.

# Dependencies
* __WordForms.py__: Scrapes content from an online database of verb forms and stores it in VerbForms.csv
* __VerbForms.csv__: Contains words along with their lemmas

# Execution
```
gcc HashMapLemma.c
./a.out
```
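
The word-to-lemma lookup described above can be illustrated with a minimal sketch of a chained hash map in C. This is not the code in `HashMapLemma.c`: the names `LemmaMap`, `lemma_map_put`, `lemma_map_get`, `lemma_map_free`, the bucket count, and the djb2 hash are all assumptions chosen for illustration, and the graph part of the project is not shown.

```c
#include <stdlib.h>
#include <string.h>

#define BUCKET_COUNT 1024  /* hypothetical table size, not taken from the project */

/* One word -> lemma pair; collisions are chained in a singly linked list. */
typedef struct Entry {
    char *word;
    char *lemma;
    struct Entry *next;
} Entry;

/* Must be zero-initialized before use, e.g. LemmaMap map = {0}; */
typedef struct {
    Entry *buckets[BUCKET_COUNT];
} LemmaMap;

/* djb2 string hash, reduced to a bucket index (an assumed choice of hash). */
static unsigned long hash_word(const char *s) {
    unsigned long h = 5381;
    while (*s) {
        h = h * 33 + (unsigned char)*s++;
    }
    return h % BUCKET_COUNT;
}

/* Copy a string onto the heap; returns NULL if allocation fails. */
static char *copy_string(const char *s) {
    size_t n = strlen(s) + 1;
    char *out = malloc(n);
    if (out) {
        memcpy(out, s, n);
    }
    return out;
}

/* Insert a pair at the head of its bucket. Returns 0 on success, -1 on failure. */
int lemma_map_put(LemmaMap *map, const char *word, const char *lemma) {
    Entry *e = malloc(sizeof *e);
    if (!e) {
        return -1;
    }
    e->word = copy_string(word);
    e->lemma = copy_string(lemma);
    if (!e->word || !e->lemma) {
        free(e->word);
        free(e->lemma);
        free(e);
        return -1;
    }
    unsigned long i = hash_word(word);
    e->next = map->buckets[i];
    map->buckets[i] = e;
    return 0;
}

/* Return the stored lemma for word, or NULL if the word is not in the map. */
const char *lemma_map_get(const LemmaMap *map, const char *word) {
    const Entry *e = map->buckets[hash_word(word)];
    for (; e; e = e->next) {
        if (strcmp(e->word, word) == 0) {
            return e->lemma;
        }
    }
    return NULL;
}

/* Release every entry and reset the buckets. */
void lemma_map_free(LemmaMap *map) {
    for (size_t i = 0; i < BUCKET_COUNT; i++) {
        Entry *e = map->buckets[i];
        while (e) {
            Entry *next = e->next;
            free(e->word);
            free(e->lemma);
            free(e);
            e = next;
        }
        map->buckets[i] = NULL;
    }
}
```

In this sketch, a caller would declare `LemmaMap map = {0};`, call `lemma_map_put(&map, "running", "run")` for each word and lemma pair (for example, pairs read from VerbForms.csv), and then look words up with `lemma_map_get`. A `NULL` result means the word is unknown, in which case falling back to the word itself is one reasonable choice.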