https://github.com/yeremi/stopwords
A lightweight and efficient PHP library tailored for developers working on Natural Language Processing (NLP) tasks in Brazilian Portuguese.
Last synced: 5 months ago
- Host: GitHub
- URL: https://github.com/yeremi/stopwords
- Owner: yeremi
- License: other
- Created: 2017-10-04T12:33:33.000Z (about 8 years ago)
- Default Branch: main
- Last Pushed: 2025-01-05T02:54:50.000Z (9 months ago)
- Last Synced: 2025-03-30T18:43:57.608Z (6 months ago)
- Topics: elasticsearch, extract-information, fulltext-search, indexing-querying, natural-language-processing, php, portuguese, search-engine, snowball, stemming, stop, stop-words, stopwords
- Language: PHP
- Homepage:
- Size: 167 KB
- Stars: 0
- Watchers: 0
- Forks: 1
- Open Issues: 0
Metadata Files:
- Readme: README.md
- Funding: .github/FUNDING.yml
- License: LICENSE
README
# Stopwords
### A PHP Library for Managing Stopwords
**Stopwords** is a lightweight and efficient PHP library designed to simplify Natural Language Processing (NLP) tasks.
By default, it includes a comprehensive list of 508 stopwords in Portuguese, categorized into:

- Pronouns
- Numerals
- Temporal terms
- Locative terms
- Prepositions
- Conjunctions
- Articles
- Adverbs
- Interjections
- Contractions
- Miscellaneous terms

The library is highly flexible and can be extended to include stopword dictionaries for other languages, making it a
versatile tool for a wide range of text processing needs.

Leverage **Stopwords** to streamline text processing, enhance SEO strategies, and improve accuracy in text-based
applications.

## Table of Contents
1. [Key Features](#key-features)
2. [Installation](#installation)
3. [How to use](/docs)
   1. [Basic Examples](/docs/how-to-use)
   2. [Creating Custom Dictionary](/docs/creating-dictionary.md)
   3. [Migration v1 to v2](/docs/mirgation-v1-to-v2.md)
4. [Contributing](#contributing)
5. [License](#license)

## **Key Features**
1. **NLP Optimization:** Efficiently filter out common stopwords, saving processing time and enhancing text analysis.
2. **SEO Utility:** Reduce irrelevant data during search engine crawling or indexing, improving website performance.
3. **Customizable:** Easily extend or modify stopword lists to suit specific project requirements.
4. **Composer-Ready:** Quickly integrate the library into your project using Composer.
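
To make the filtering idea above concrete, here is a minimal, library-independent sketch of stopword removal in plain PHP. It does not use or describe the API of this package: the function name `removeStopwords` and the six-word sample list are hypothetical and chosen only for illustration (the bundled dictionary is far larger). It assumes the `mbstring` extension is available and that the source file is saved as UTF-8.

```php
<?php
declare(strict_types=1);

/**
 * Hypothetical helper (NOT part of yeremi/stopwords): returns the tokens of
 * $text that do not appear in $stopwords.
 *
 * @param string   $text      Input text, assumed to be UTF-8.
 * @param string[] $stopwords Lowercase stopwords to drop.
 * @return string[] Remaining lowercase tokens, in their original order.
 */
function removeStopwords(string $text, array $stopwords): array
{
    // Flip the list into a hash map so each lookup is O(1).
    $lookup = array_flip($stopwords);

    // Lowercase with multibyte support, then split on any run of characters
    // that are neither letters nor digits (Unicode-aware thanks to the /u flag).
    $tokens = preg_split(
        '/[^\p{L}\p{N}]+/u',
        mb_strtolower($text, 'UTF-8'),
        -1,
        PREG_SPLIT_NO_EMPTY
    ) ?: [];

    // Keep only tokens that are not stopwords, and reindex the result.
    return array_values(array_filter(
        $tokens,
        static fn (string $token): bool => !isset($lookup[$token])
    ));
}

// Tiny sample list for illustration only.
$stopwords = ['o', 'a', 'de', 'e', 'para', 'em'];

$keywords = removeStopwords('O gato subiu em a árvore de maçã', $stopwords);
// $keywords is ['gato', 'subiu', 'árvore', 'maçã']
```

Flipping the list into a map keeps each lookup constant-time, which matters once the list grows to several hundred entries. Extending the list for a specific project is then a matter of merging extra words into `$stopwords` (for example with `array_merge`) before calling the function.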
## Installation
Add the library to your project using Composer:
```bash
composer require yeremi/stopwords
```

## Contributing
Contributions are welcome! If you encounter any issues or have suggestions for improvement, feel free to open an issue
or submit a pull request.

## License
Stopwords is licensed under the MIT License. See the [LICENSE](LICENSE) file for details.