Projects in Awesome Lists tagged with text-analytics
A curated list of projects in awesome lists tagged with text-analytics.
https://github.com/dipanjans/text-analytics-with-python
Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text Analytics with Python" published by Apress/Springer.
clustering gensim natural-language natural-language-processing nltk pattern python scikit-learn semantic sentiment sentiment-analysis spacy stanford-nlp text-analytics text-classification text-summarization
Last synced: 15 May 2025
https://github.com/obsei/obsei
Obsei is a low-code, AI-powered automation tool. It can be used in various business flows such as social listening, AI-based alerting, brand image analysis, comparative studies, and more.
anonymization artificial-intelligence business-process-automation customer-engagement customer-support issue-tracking-system low-code lowcode natural-language-processing nlp process-automation python sentiment-analysis social-listening social-network-analysis text-analysis text-analytics text-classification workflow workflow-automation
Last synced: 14 Jan 2026
https://github.com/quanteda/quanteda
An R package for the Quantitative Analysis of Textual Data
corpus natural-language-processing quanteda r text-analytics
Last synced: 16 May 2025
https://github.com/meta-toolkit/meta
A Modern C++ Data Sciences Toolkit
c-plus-plus graph-algorithms inverted-index language-modeling nlp nlp-parsing pos-tag search-engine text-analysis text-analytics text-classification word-embeddings
Last synced: 15 Mar 2025
https://github.com/ahmedkhemiri95/PDFs-TextExtract
Multiple and Large PDF Documents Text Extraction.
data-science extract-text parser pdf pdf-document pdf-processing pdfminer pdfs pdfs-textextract pypdf2 python text-analytics
Last synced: 04 Apr 2025
https://github.com/SergeyShk/ruTS
A library for extracting statistics from Russian-language texts.
computational-linguistics natural-language-processing nlp russian-specific text-analytics
Last synced: 19 Jul 2025
https://github.com/senderle/topic-modeling-tool
A point-and-click tool for creating and analyzing topic models produced by MALLET.
data-science digital-humanities mallet text-analytics topic-modeling
Last synced: 21 Jan 2026
https://github.com/fareedkhan-dev/basiclingua-llm-based-nlp
LLM Based NLP Library.
anyscale api deep-learning gemini-api large-language-models llm llms machine-learning natural-language-processing nlp nlp-library openai python text-analytics
Last synced: 01 Mar 2025
https://github.com/TheCodeTraveler/azure-for-developers-workshop
The Azure cloud is huge and the vast service catalog may appear daunting at first, but it doesn’t have to be!
azure azure-active-directory azure-app-service azure-blob azure-cli azure-functions azure-resource-manager azure-storage cognitive-services text-analytics xamarin xamarin-android xamarin-forms xamarin-ios
Last synced: 18 Apr 2025
https://github.com/ibmstreams/samples
This repository contains open-source sample applications for IBM Streams.
database geofence geofencing hdfs healthcare ibm-streams samples stream-processing text-analytics timeseries
Last synced: 15 Jul 2025
https://github.com/rosette-api/rosette-elasticsearch-plugin
Document Enrichment plugin for Elasticsearch
categorization elasticsearch elasticsearch-plugin entity-extraction fuzzy-name-matching fuzzy-search identity-resolution machine-learning named-entity-recognition natural-language-processing nlp rosette-plugin sentiment-analysis text-analytics text-mining
Last synced: 17 Mar 2025
https://github.com/bartczernicki/MachineIntelligence-TextAnalytics-TPLDataFlows
Machine Intelligence using OpenAI, Semantic Kernel, Vector Search, SQL Server
dotnet-core machine-intelligence machine-learning openai semantic-kernel sql-server text-analytics
Last synced: 03 Apr 2025
https://github.com/easonlai/facebook_post_scraping_and_text_analytics
This is a demo repo to demonstrate how to scrape post data from Facebook with Python using the facebook_scraper library, and then use Azure Text Analytics to perform sentiment analysis on the post text content.
azure azure-text-analysis azure-text-analytics data-scraping facebook facebook-post facebook-scraper facebook-scraping key-phrase-extraction microsoft-azure microsoft-cognitive-services pandas python python3 scraping-python seaborn text-analytics wordcloud
Last synced: 26 Apr 2025
https://github.com/wittline/tf-idf
Term Frequency-Inverse Document Frequency from Scratch
feature-engineering python text-analytics tfidf
Last synced: 13 Apr 2025
https://github.com/easonlai/playstore_reviews_scraping_and_text_analytics
This is a demo repo to demonstrate how to scrape app review data from the Google Play Store with Python using the Google-Play-Scraper library, and then use Azure Text Analytics to perform sentiment analysis on the review content (aka comments).
azure azure-text-analysis azure-text-analytics data-scraping datascraping google-play-store google-play-store-data-analysis google-play-store-scraper microsoft-azure microsoft-cognitive-services pandas python python3 seaborn sentiment-analysis text-analytics
Last synced: 08 Aug 2025
https://github.com/liamca/medical-ner-search
Leveraging Apache cTAKES and Azure Search to Build a Medical Search App
azure azure-search ctakes medical natural-language-processing ner nlp search-engine text-analytics
Last synced: 14 May 2025
https://github.com/rosette-api/java
Rosette API Client Library for Java
entity-extraction entity-linking fuzzy-matching java machine-learning name-translation natural-language-processing nlp rosette text-analytics text-mining tokenization
Last synced: 07 Apr 2025
https://github.com/meaningcloud/meaningcloud-php
MeaningCloud's official PHP SDK
meaningcloud nlp php text-analytics
Last synced: 17 Mar 2026
https://github.com/rosette-api/csharp
Babel Street Analytics Client Library for C#
capi csharp entity-extraction language-identification machine-learning morphology name-translation natural-language-processing nlp nuget rosette text-analysis text-analytics text-embedding visual-studio
Last synced: 02 May 2025
https://github.com/rosette-api/php
Babel Street Analytics Client Library for PHP
entity-extraction language-identification lemma morphology named-entity-recognition natural-language-processing nlp php text-analytics text-embedding tokenization
Last synced: 09 May 2025
https://github.com/mathworks/hospitalreadmission_mimic_textanalytics_matlab
Predict Hospital Readmissions using Text Analytics in MATLAB
matlab mimic-iii natural-language-processing text-analytics text-classification
Last synced: 04 Oct 2025
https://github.com/rosette-api/ruby
Babel Street Analytics Client Library for Ruby
deduplication entity-extraction language-identification machine-learning morphology named-entity-recognition natural-language-processing nlp ruby sentiment-analysis text-analytics text-embedding tokenization
Last synced: 02 May 2025
https://github.com/cosmoduende/r-holy-books-sentiment-data-analysis
What's the most positive or negative religion? Sentiment and Data Analysis of Holy Books with R. Analysis of religious dogmas by exploring their Holy Books (The Bible, The Quran, The Dhammapada, and The Book of Mormon) with R.
bible book-of-mormon data-analysis data-analytics data-visualisation data-visualization dataviz dhammapada holy-scriptures quran religions-studies religious religious-studies sentiment-analysis sentiment-polarity sentimental-analysis text-analysis text-analytics text-mining text-mining-analysis
Last synced: 07 Mar 2026
https://github.com/rosette-api/shell
Shell scripts for accessing Babel Street Analytics
bash-script categorization entity-extraction entity-linking linked-entities machine-learning name-translation natural-language-processing nlp sentiment shell-script shell-scripting text-analysis text-analytics text-mining tokenization
Last synced: 02 May 2025
https://github.com/lykmapipo/us-inaugural-addresses
Python scripts to download, process, and analyze US Inaugural Addresses
beautifulsoup4 gensim joblib lykmapipo natural-language-processing nlp nltk python python-scripts requests spacy text-analysis text-analytics text-extraction text-processing web-scraping
Last synced: 16 May 2026
https://github.com/dbracewell/hermes
A Natural Language Processing framework for Java
natural-language-processing nlp text-analytics text-mining
Last synced: 14 Jan 2026
https://github.com/easonlai/demo_for_azure_cognitive_services
Demo repository for Azure Cognitive Services
azure cognitive-services computer-vision face-detection face-recognition microsoft-azure microsoft-cognitive-services speech-to-text text-analytics text-to-speech
Last synced: 15 Jun 2026
https://github.com/easonlai/content_based_product_recommendation_samples
This sample code repository leverages Azure Text Analytics to extract key phrases from product descriptions as additional product features, and then performs text relationship analysis with TF-IDF vectorization and cosine similarity for product recommendation.
azure-cognitive-services azure-text-analytics content-based-filtering content-based-recommendation cosine-similarity key-phrase-extraction machine-learning product-recommendation python python3 text-analytics tf-idf
Last synced: 15 Jun 2026
https://github.com/dajoker29/asimov
A Text Analysis Engine
text-analysis text-analytics writing-tool
Last synced: 18 Apr 2026
https://github.com/uts-cic/tap-api
A streaming version of the text analytics pipeline
akka-http akka-streams learning-analytics natural-language-processing text-analytics
Last synced: 12 May 2026
https://github.com/felipecruz91/twitter-analytics-pi
Twitter Analytics running in a Raspberry Pi 3 Model B
azure azure-cognitive-services docker influxdb raspberry-pi-3 sentiment-analysis text-analytics twitter
Last synced: 12 Apr 2026
https://github.com/rosette-api-community/rosette-for-excel
Microsoft Excel add-in that implements many endpoints through ribbon functions and formula support
entity-extraction excel machine-learning natural-language-processing nlp nlp-apis rosette text-analytics
Last synced: 28 Feb 2025
https://github.com/markomanninen/grcriddles
Study and examination of alphabetical and isopsephical riddles of the Ancient Greeks
greek jupyter-notebooks language-processing python semiotic text-analytics
Last synced: 14 May 2026
https://github.com/junioralive/sms-spam-detection
An interactive SMS Spam Detection application using Streamlit and machine learning. This app allows users to classify messages as spam or ham and view performance metrics for different models.
classification data-science machine-learning nlp sms-or-ham sms-spam sms-spam-classification sms-spam-detection spam-detection streamlit text-analytics
Last synced: 18 Apr 2026
https://github.com/aayushpatel007/docprofiler
A Python package to generate document profiles and extract metadata from text in parallel using several Docker images and NLP tools/frameworks.
aiohttp asynchronous-programming asyncio containerization docker entity-linking information-retrieval keyphrase-extraction metadata-extraction named-entity-recognition natural-language-processing python3 text-analytics text-summarization unsupervised-learning
Last synced: 05 May 2026
https://github.com/aayush-bhargav/text-analyst-app
The Text Analyst App is a versatile and user-friendly application developed using React, designed to enhance your writing and text manipulation experience.
css frontend-web html javascript react react-components react-router-dom reactjs text-analytics text-to-speech texteditor
Last synced: 27 Apr 2026
https://github.com/till-tietz/gsdmm
GSDMM Short Text Clustering via Dirichlet Mixture Models
cpp r rcpp text-analytics text-clustering
Last synced: 26 Feb 2026
https://github.com/rosette-api/curl-examples
cURL examples for Babel Street Analytics
categorization curl entity-extraction lemmatization morphology natural-language-processing nlp relation-extraction sentiment text-analytics text-embedding text-mining tokenization
Last synced: 02 May 2025
https://github.com/avidlearnerinprogress/text_analytics_101
Coursework solutions for the course COMP47600 - Text Analytics
data-science data-visualization natural-language-processing nlp python3 r text-analytics
Last synced: 05 May 2026
https://github.com/craigtrim/tfidf-zones
TF IDF Zones
artificial-intelligence natural-language-processing stylometry text-analytics tf-idf tfidf
Last synced: 05 Feb 2026
https://github.com/zenklinov/introduction_to_text_analysis
This repository contains materials and resources for learning Text Analysis. The included Jupyter Notebook demonstrates key concepts using TextBlob for sentiment analysis.
Last synced: 12 Oct 2025
https://github.com/shiyis/politics
The political text ideology classification tool provides a solution to expedite interpreting an author's intentions and subjectivity through quantitative measures.
political-analysis political-nlp text-analytics variational-inference
Last synced: 15 Feb 2026
https://github.com/cintia-shinoda/test-birdie
Data Science Internship @ Birdie
Last synced: 18 Apr 2026
https://github.com/rosette-api-community/rosette-for-docs
Google Docs add-on offering users the ability to extract entities, translate names, and research entities on Wikipedia from within their multilingual document.
entities entity-extraction extract-entities language machine-learning name-translation natural-language-processing nlp text-analytics unstructured-data
Last synced: 03 Jun 2026
https://github.com/basemax/smartfilter
Smart filtering to keep or remove characters or words in a text. (SOON)
extract extract-data extract-features extract-information extract-text extraction extractive-summarization extractor php split splitter splitting text text-analysis text-analytics text-analyzer text-mining
Last synced: 02 May 2026
https://github.com/avijay24/wsd_from_scratch
Implemented a dictionary-based Word Sense Disambiguation (WSD) system that disambiguates a word's sense by comparing the definitions of the target word to the definitions of relevant words in the context (Simple Lesk and Corpus Lesk).
corpus-lesk predictive-modeling python simple-lesk supervised-learning supervised-wsd text-analytics word-sense-disambiguation
Last synced: 29 Apr 2026
https://github.com/maastrichtu-ids/text-analytics-bootcamp-pggm
Work materials for the Text Analytics Bootcamp at PGGM
data-science data-science-bootcamp financial-data financial-sentiment-analysis nltk text-analytics
Last synced: 29 Jul 2025
https://github.com/passadis/sentiment-analysis-durable
Analyze Text with Durable Functions and AI Language
azure azureai durable-functions language-model sentiment-analysis text-analytics
Last synced: 02 May 2026
https://github.com/anwai98/extraction-based-text-summarization
Extraction-Based Text Summarization (Text Analytics) using the PageRank Algorithm
data-mining natural-language-processing nlp nltk page-rank-algorithm python text-analytics text-summarization
Last synced: 11 May 2026
https://github.com/nexmo-community/tweet-sentiment-analysis
How kind are you on Twitter? Try this React application to analyze the sentiment of your most recent tweet and send the result to your phone.
azure express nexmo react sentiment-analysis text-analytics twitter twitter-api
Last synced: 05 Apr 2025
https://github.com/epomatti/sentiment-analysis
Using Azure Cognitive Services API to identify sentiments within a text.
artificial-intelligence az-300 azure chai mocha nodejs sentiment-analysis text-analytics
Last synced: 07 Apr 2026
https://github.com/rosette-api/ruby-script
Contains Ruby scripts for accessing Babel Street Analytics
api categorization entity-extraction entity-relationship entity-resolution lemmatization machine-learning natural-language-processing nlp relation-extraction ruby ruby-script sentiment-analysis text-analytics text-mining tokenization
Last synced: 20 Jul 2025
https://github.com/m-ah07/text-analysis-service-python
A Python-based service for performing advanced text analysis,
language-detection nlp-tools open-source open-source-python python python-library python-scripts text-analysis text-analytics text-utilities word-analysis
Last synced: 10 Feb 2026