Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
spaCy
spaCy is a free library for advanced Natural Language Processing (NLP) in Python. It’s designed specifically for production use and helps you build applications that process and “understand” large volumes of text. It can be used to build information extraction or natural language understanding systems.
- GitHub: https://github.com/topics/spacy
- Wikipedia: https://en.wikipedia.org/wiki/SpaCy
- Repo: https://github.com/explosion/spaCy
- Created by: Explosion
- Related Topics: machine-learning, natural-language-processing, text-classification, named-entity-recognition, tokenization, entity-linking, dependency-parsing, relation-extraction, part-of-speech-tagging, lemmatization,
- Last updated: 2024-11-05 00:28:59 UTC
- JSON Representation
https://github.com/legendarym4x/data_science
Data Science Course
jupyter-notebook keras matplotlib nltk numpy pandas scikit-learn spacy tensorflow
Last synced: 12 Oct 2024
https://github.com/yashaswini-lankalapalli/text-summarization
Here are the two NLP models for text summarization: Abstractive NLP and Extractive NLP.
Last synced: 12 Oct 2024
https://github.com/sudeatesoglu/nlp-document-processor
An NLP tool for processing documents in different formats with functionalities of similarity score detection, highlighting given pattern and similar words between PDFs, and NER extraction.
Last synced: 31 Oct 2024
https://github.com/cmilamaya/flight-dashboard-app
This project is an application that processes attached PDF documents containing flight information and extracts relevant data. The data is stored in a PostgreSQL database and visualized on a dynamic dashboard using Streamlit.
pandas pdfplumber python spacy
Last synced: 31 Oct 2024
https://github.com/thjbdvlt/spacy-turlututu
french morphological analysis model for spacy
french french-nlp morphological-analysis nlp part-of-speech-tagging pos-tagging spacy
Last synced: 31 Oct 2024
https://github.com/woranov/spacy-lazy-docbin
Lazy-loadable and indexable spaCy DocBins
Last synced: 12 Oct 2024
https://github.com/prashver/nlp-driven-video-summarizer-and-insight-tool
An NLP-powered tool for transcribing, summarizing, and indexing podcast content, with video-to-audio conversion and multilingual support.
flask-application huggingface-transformers keyword-extraction named-entity-recognition natural-language-processing ntlk spacy speech-to-text speech-translation text-summarization topic-modeling
Last synced: 31 Oct 2024
https://github.com/nanditha-prabhu/qa-system-via-srl
Question Answering System via Semantic Role Labeling Using Token Classification and Parsing Techniques
Last synced: 31 Oct 2024
https://github.com/zevio/pcu_nlp
NLP pipeline (spacy.io) for PCU project
component natural-language-processing nlp nlp-pipeline pcu pcu-nlp pipeline python spacy
Last synced: 18 Oct 2024
https://github.com/hansalemaos/spacy2df
converts a spaCy object into a pandas DataFrame
Last synced: 31 Oct 2024
https://github.com/adishtienmetz/context-game
A context word guessing game. Try to guess the word in minimum tries!
Last synced: 09 Oct 2024
https://github.com/atharvapathak/twitter_sentiment_analysis_project
Twitter sentiment analysis is the process of analyzing tweets posted on the Twitter platform to determine the overall sentiment expressed within them. It involves using natural language processing (NLP) and machine learning techniques to classify tweets.
api bag-of-words bert cnn data gbm nltk rnn spacy twitter
Last synced: 31 Oct 2024
https://github.com/dagmawi-22/hotel-ai
Hotel Customer Support Chatbot Rest API
django nltk pyspellchecker python spacy
Last synced: 09 Oct 2024
https://github.com/kivanc57/nlp_data_visualization
This project provides Python scripts for analyzing and visualizing text data using efficient NLP methods. It includes tools for creating bar plots, histograms, pie charts, treemaps, violin plots, and word clouds, using libraries such as matplotlib, seaborn, wordcloud, spacy, and textblob.
data-science matplotlib nlp parsing plotting python spacy visualization
Last synced: 09 Oct 2024
https://github.com/ahmedkhaled404/ner-with-spacy
Named entity recognition using traditional NLP methods
machine-learning matplotlib ner nlp nlp-machine-learning python spacy
Last synced: 31 Oct 2024
https://github.com/atharvapathak/customer_service_chatbot
Customer Service Chatbot Repository includes a range of features for building custom chatbots that can handle customer service queries and support requests. These features include NLP capabilities and pre-built dialog flows that can help chatbots understand and respond to customer.
chatbot database dialogflow nlp nltk reinforcement-learning restful-api spacy tensorflow
Last synced: 31 Oct 2024
https://github.com/kazkozdev/novelgenerator
NovelGenerator - AI-powered fiction book generator that uses Ollama's LLMs to create complete novels with coherent plot structures, developed characters and multiple writing styles.
ai-novels fiction-generator nlp novel-writing ollama python spacy text-generation
Last synced: 02 Nov 2024
https://github.com/abinashsahoo007/project-resume-classification
The document classification solution should significantly reduce the manual human effort in the HRM. It should achieve a higher level of accuracy and automation with minimal human intervention.
corpus count-vectorizer label-encoding lemmitization machine-learning nltk part-of-speech-tagging resume-classification spacy stemming text-mining text-preprocessing textract tfidf-vectorizer tokenization wordcloud
Last synced: 31 Oct 2024
https://github.com/f1uctus/webanno2spacy
Convert WebAnno TSVs to spaCy's Doc-s.
spacy spacy-extension webanno webanno-tsv
Last synced: 09 Oct 2024
https://github.com/asaficontact/stack_classifier_project
We classified Stack Overflow Python questions from 2008-2016 with Natural Language Processing and Deep Learning. Using Regular Expressions, we removed HTML tags and punctuation. We also utilized spaCy to tokenize, lemmatize and remove stop words. Using Keras, we built a 4 layered artificial neural network with a 20% dropout rate using relu and softmax activation functions. We also utilized the adam optimizer and categorical cross-entropy loss function which classified 11 tags 88% successfully.
cross-entropy-loss deep-learning deep-neural-networks keras lemmatization neural-networks object-oriented-programming pandas python3 regular-expressions relu sklearn spacy spacy-nlp stackoverflow tfidf tokenization
Last synced: 05 Nov 2024
https://github.com/sydney-informatics-hub/clause-segmenter
A clause segmenting tool utilising Python's SpaCy
Last synced: 09 Oct 2024
https://github.com/shwetajanwekar/capstone-project_1
Capstone project_1 include python code for best fit regression model, SQL feature store and tableau dashboard
accuracy create engine lasso-regression-model linear-regression mysql-database pandas ridge-regression seaborn skewness sklearn-library spacy sqlalchemy standard-deviation transformation variense xtrain ytrain
Last synced: 03 Nov 2024
https://github.com/viniciusmecosta/cv_classifier
A REST API that classifies resumes into occupation fields and seniority levels using machine learning. Trained on 3,000+ resumes across 26 occupations, the API provides accurate classifications with efficient PDF text extraction.
catboost fastapi python3 sklearn spacy
Last synced: 09 Oct 2024
https://github.com/imvladikon/spacy-trankit
💥 Trankit models directly in spaCy💥
nlp spacy spacy-extension spacy-nlp spacy-pipeline trankit
Last synced: 14 Oct 2024
https://github.com/ssciwr/argumentation-management
Annotator combining different NLP pipelines.
corpus-linguistics cwb hacktoberfest natural-language-processing nlp part-of-speech python sentencizer spacy tokenization
Last synced: 12 Oct 2024
https://github.com/gabrielmazzotta/nlp-clustering--movie-similarity-from-plot-summaries
A Python-based movie recommendation system leveraging NLP and clustering techniques. This project includes data processing, vectorization of plot summaries, and the implementation of recommendation algorithms to suggest similar movies based on user input.
clustering cosine-similarity hierarchical-clustering kmeans lemmatization nlp recommendation-engine scikit-learn similarity-score spacy tokenization
Last synced: 03 Nov 2024