https://github.com/johnbumgarner/wordhoard
This Python module can be used to obtain antonyms, synonyms, hypernyms, hyponyms, homophones and definitions.
antonyms bag-of-words definitions dictionary homophones hypernyms hyponyms lexicon nlp python python3 synonyms text-analysis textual-analysis wordlists wordnet wordnets wordsearch
- Host: GitHub
- URL: https://github.com/johnbumgarner/wordhoard
- Owner: johnbumgarner
- License: mit
- Created: 2020-10-23T19:28:04.000Z (about 4 years ago)
- Default Branch: master
- Last Pushed: 2024-05-29T11:39:07.000Z (6 months ago)
- Last Synced: 2024-06-28T06:41:43.904Z (5 months ago)
- Topics: antonyms, bag-of-words, definitions, dictionary, homophones, hypernyms, hyponyms, lexicon, nlp, python, python3, synonyms, text-analysis, textual-analysis, wordlists, wordnet, wordnets, wordsearch
- Language: Python
- Homepage:
- Size: 539 KB
- Stars: 110
- Watchers: 4
- Forks: 11
- Open Issues: 0
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- Funding: .github/FUNDING.yml
- License: LICENSE
- Code of conduct: CODE_OF_CONDUCT.md
- Security: SECURITY.md
README
Overviews
![PyPI](https://img.shields.io/pypi/v/wordhoard)
![License: MIT](https://img.shields.io/github/license/johnbumgarner/wordhoard)
![GitHub issues](https://img.shields.io/github/issues/johnbumgarner/wordhoard)
![GitHub pull requests](https://img.shields.io/github/issues-pr/johnbumgarner/wordhoard)
[![Downloads](https://static.pepy.tech/personalized-badge/wordhoard?period=total&units=international_system&left_color=grey&right_color=brightgreen&left_text=Downloads)](https://pepy.tech/project/wordhoard)
Primary Use Case
Textual analysis is a broad term for various research methodologies used to qualitatively describe, interpret and understand text data. These methodologies are mainly used in academic research to analyze content related to media and communication studies, popular culture, sociology, and philosophy. Textual analysis allows these researchers to quickly obtain relevant insights from unstructured data. All types of information can be gleaned from textual data, especially from social media posts or news articles. Some of this information includes the overall concept of the subtext, symbolism within the text, assumptions being made and potential relative value to a subject (e.g. data science). In some cases it is possible to deduce the relative historical and cultural context of a body of text using analysis techniques coupled with knowledge from different disciplines, like linguistics and semiotics.
Word frequency is a technique used in textual analysis to measure how often a specific word or word grouping occurs within unstructured data. Measuring the number of word occurrences in a corpus allows a researcher to garner interesting insights about the text. A subset of word frequency is the correlation between a given word and that word's relationship to either its antonyms or its synonyms within the specific corpus being analyzed. Knowing these relationships is critical to improving word frequencies and topic modeling.
WordHoard was designed to help researchers performing textual analysis build more comprehensive lists of antonyms, synonyms, hypernyms, hyponyms and homophones.
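To make the idea concrete, here is a minimal sketch of counting word frequency and then folding synonym counts into a head word. It uses only the Python standard library. The sample corpus, the hand-written synonym list and the helper names `word_frequencies` and `grouped_frequency` are illustrative assumptions, not part of WordHoard.

```python
import re
from collections import Counter


def word_frequencies(text):
    """Count case-insensitive word occurrences in a text."""
    tokens = re.findall(r"[a-z']+", text.lower())
    return Counter(tokens)


def grouped_frequency(counts, head_word, related_words):
    """Sum the counts of a head word and its related words (e.g. synonyms)."""
    # A set avoids double counting if the head word also appears in the list.
    return sum(counts[word] for word in {head_word, *related_words})


corpus = (
    "The mother spoke first. Her mom agreed, and another mum nodded. "
    "A father and a mother left early."
)

# Hypothetical, hand-written synonym list used only for illustration.
synonyms = ["mom", "mum", "mama"]

counts = word_frequencies(corpus)
print(counts["mother"])                               # 2
print(grouped_frequency(counts, "mother", synonyms))  # 4
```

Counting "mother" alone finds two occurrences, while grouping it with its synonyms finds four, which is the kind of gap a more comprehensive word list is meant to close.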
Installation
Install the distribution via pip:

```shell
pip3 install wordhoard
```

General Package Utilization
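As a rough sketch of typical usage, the snippet below wraps a synonym query in a small helper. It assumes the `Synonyms` class with a `search_string` argument and a `find_synonyms()` method, as I understand the package's documented usage pattern; verify the names and return types against the current documentation. The helpers `lookup_synonyms` and `build_word_lists` are hypothetical and not part of WordHoard.

```python
def lookup_synonyms(word):
    """Return WordHoard synonyms for `word`, or an empty list if none come back.

    Assumes `pip3 install wordhoard` has been run and network access is available.
    """
    # Imported lazily so this module still loads where wordhoard is not installed.
    from wordhoard import Synonyms

    results = Synonyms(search_string=word).find_synonyms()
    # Assumption: a successful query yields a list; anything else is
    # treated here as "no results".
    return results if isinstance(results, list) else []


def build_word_lists(words, lookup=lookup_synonyms):
    """Map each word to a sorted, de-duplicated list of related words."""
    return {word: sorted(set(lookup(word))) for word in words}


# Example call (performs live queries against the online sources):
# word_lists = build_word_lists(["mother"])
```

Passing the lookup function as a parameter keeps the list-building logic testable offline, since a stub can stand in for the live query.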
Please reference the WordHoard Documentation for package usage guidance and parameters.
Sources
This package is currently designed to query these online sources for antonyms, synonyms, hypernyms, hyponyms and definitions:
1. classicthesaurus.com
2. collinsdictionary.com
3. merriam-webster.com
4. synonym.com
5. thesaurus.com
6. wordhippo.com
7. wordnet.princeton.edu
Dependencies
This package has these core dependencies:
1. backoff
2. BeautifulSoup
3. deckar01-ratelimit
4. deepl
5. lxml
6. requests
7. urllib3
Additional details on this package's dependencies can be found here.
Development Roadmap
If you would like to contribute to the WordHoard project please read the contributing guidelines.
Items currently under development:
- Expanding the list of hypernyms, hyponyms and homophones
- Adding part-of-speech filters in queries
Issues
This repository is actively maintained. Feel free to open any issues related to bugs, coding errors, broken links or enhancements.
You can also contact me at [John Bumgarner](mailto:[email protected]?subject=[GitHub]%20wordhoard%20project%20request) with any issues or enhancement requests.
Sponsorship
If you would like to contribute financially to the development and maintenance of the WordHoard project please read the sponsorship information.
License
The MIT License (MIT). Please see License File for more information.
Author
Copyright (c) 2020 John Bumgarner