Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Natural language processing
Natural language processing (NLP) is a field of computer science that studies how computers and humans interact. In the 1950s, Alan Turing published an article that proposed a measure of intelligence, now called the Turing test. More modern techniques, such as deep learning, have produced results in the fields of language modeling, parsing, and natural-language tasks.
- GitHub: https://github.com/topics/nlp
- Wikipedia: https://en.wikipedia.org/wiki/Natural_language_processing
- Created by: Alan Turing
- Aliases: natural-language-processing, nlp-machine-learning, nlp-resources,
- Last updated: 2024-12-28 00:15:00 UTC
- JSON Representation
https://github.com/synyi/poplar
A web-based annotation tool for natural language processing (NLP)
Last synced: 30 Oct 2024
https://github.com/Unbabel/COMET
A Neural Framework for MT Evaluation
artificial-intelligence evaluation-metrics machine-learning machine-translation natural-language-processing nlp
Last synced: 27 Nov 2024
https://github.com/shibing624/pytextclassifier
pytextclassifier is a toolkit for text classification. 文本分类,LR,Xgboost,TextCNN,FastText,TextRNN,BERT等分类模型实现,开箱即用。
bert classification focalloss-pytorch hierarchical machine-learning nlp pytextclassifier python pytorch softmax text-classification text-classifier
Last synced: 27 Dec 2024
https://github.com/johnsnowlabs/langtest
Deliver safe & effective language models
ai-safety ai-testing artificial-intelligence benchmark-framework benchmarks ethics-in-ai large-language-models llm llm-as-evaluator llm-evaluation-toolkit llm-test llm-testing ml-safety ml-testing mlops model-assessment nlp responsible-ai trustworthy-ai
Last synced: 23 Dec 2024
https://github.com/howiehwong/trustllm
[ICML 2024] TrustLLM: Trustworthiness in Large Language Models
ai benchmark dataset evaluation large-language-models llm natural-language-processing nlp pypi-package toolkit trustworthy-ai trustworthy-machine-learning
Last synced: 27 Dec 2024
https://github.com/hamelsmu/code_search
Code For Medium Article: "How To Create Natural Language Semantic Search for Arbitrary Objects With Deep Learning"
code-search data-science deep-learning fastai keras machine-learning machine-learning-on-source-code ml-on-code natural-language-processing nlp python pytorch search search-algorithm searching-algorithms semantic-search semantic-search-engine tensorflow tutorial
Last synced: 26 Oct 2024
https://github.com/phantominsights/subreddit-analyzer
A comprehensive Data and Text Mining workflow for submissions and comments from any given public subreddit.
matplotlib nlp pandas python3 seaborn spacy wordcloud
Last synced: 23 Dec 2024
https://github.com/Brokenwind/BertSimilarity
Computing similarity of two sentences with google's BERT algorithm。利用Bert计算句子相似度。语义相似度计算。文本相似度计算。
bert nlp python semantic similarity tensorflow
Last synced: 02 Nov 2024
https://github.com/salesforce/matchbox
Write PyTorch code at the level of individual examples, then run it efficiently on minibatches.
deep-learning minibatch nlp pytorch
Last synced: 14 Nov 2024
https://github.com/dccuchile/beto
BETO - Spanish version of the BERT model
bert bert-model nlp spanish transformers transformers-library
Last synced: 22 Nov 2024
https://github.com/houbb/opencc4j
🇨🇳Open Chinese Convert is an opensource project for conversion between Traditional Chinese and Simplified Chinese.(java 中文繁简体转换)
chinese dfa java java7 nlp opencc simple-tranditional trie trie-tree
Last synced: 23 Dec 2024
https://github.com/AnubhavGupta3377/Text-Classification-Models-Pytorch
Implementation of State-of-the-art Text Classification Models in Pytorch
attention classification convolutional-neural-networks deep-learning fasttext nlp pytorch rcnn recurrent-neural-networks seq2seq transformer
Last synced: 13 Nov 2024
https://github.com/yaoguangluo/Deta_Parser
快速中文分词分析word segmentation
artificial-intelligence-algorithms binary eculid entropy-rate forest hmm multi-language nero nlp orthor parser pos quicksort science segmentation sensing sonar translator-speech-api turing-machine vpc
Last synced: 13 Nov 2024
https://github.com/phantominsights/mexican-government-report
Text Mining on the 2019 Mexican Government Report, covering from extracting text from a PDF file to plotting the results.
geopandas matplotlib nlp numpy pandas seaborn spacy
Last synced: 22 Dec 2024
https://github.com/judahpaul16/gpt-home
ChatGPT at home! Basically a better Google Nest Hub or Amazon Alexa home assistant. Built on the Raspberry Pi using the OpenAI API.
ai async automation chatgpt docker fastapi home-assistant home-automation iot llm nginx nlp nodejs openai python raspberry-pi react speech-recognition spotify typescript
Last synced: 22 Dec 2024
https://github.com/prithivirajdamodaran/styleformer
A Neural Language Style Transfer framework to transfer natural language text smoothly between fine-grained language styles like formal/casual, active/passive, and many more. Created by Prithiviraj Damodaran. Open to pull requests and other forms of collaboration.
active formal-languages informal-sentences nlp passive slang style-transfer text-style text-style-transfer text-style-transfer-benchmark
Last synced: 25 Dec 2024
https://github.com/explosion/prodigy-recipes
🍳 Recipes for the Prodigy, our fully scriptable annotation tool
active-learning annotation annotation-tool artificial-intelligence computer-vision data-annotation data-science labeling-tool machine-learning machine-teaching natural-language-processing nlp prodigy spacy
Last synced: 28 Dec 2024
https://github.com/proycon/pynlpl
PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, and less common, NLP tasks. PyNLPl can be used for basic tasks such as the extraction of n-grams and frequency lists, and to build simple language model. There are also more complex data types and algorithms. Moreover, there are parsers for file formats common in NLP (e.g. FoLiA/Giza/Moses/ARPA/Timbl/CQL). There are also clients to interface with various NLP specific servers. PyNLPl most notably features a very extensive library for working with FoLiA XML (Format for Linguistic Annotation).
computational-linguistics evaluation-metrics folia language-modelling library linguistics machine-learning natural-language-processing nlp nlp-library python search-algorithms text-processing
Last synced: 22 Dec 2024
https://github.com/microsoft/xpretrain
Multi-modality pre-training
computer-vision multimedia multimodal-learning nlp pre-training
Last synced: 21 Dec 2024
https://github.com/chengchingwen/Transformers.jl
Julia Implementation of Transformer models
attention deep-learning flux machine-learning natural-language-processing nlp transformer
Last synced: 13 Nov 2024
https://github.com/chengchingwen/transformers.jl
Julia Implementation of Transformer models
attention deep-learning flux machine-learning natural-language-processing nlp transformer
Last synced: 03 Dec 2024
https://github.com/ynqa/wego
Word Embeddings in Go!
glove go machine-learning nlp word-embeddings word2vec
Last synced: 21 Dec 2024
https://github.com/PrithivirajDamodaran/Styleformer
A Neural Language Style Transfer framework to transfer natural language text smoothly between fine-grained language styles like formal/casual, active/passive, and many more. Created by Prithiviraj Damodaran. Open to pull requests and other forms of collaboration.
active formal-languages informal-sentences nlp passive slang style-transfer text-style text-style-transfer text-style-transfer-benchmark
Last synced: 03 Nov 2024
https://github.com/CogComp/cogcomp-nlp
CogComp's Natural Language Processing Libraries and Demos: Modules include lemmatizer, ner, pos, prep-srl, quantifier, question type, relation-extraction, similarity, temporal normalizer, tokenizer, transliteration, verb-sense, and more.
big-data cogcomp data-mining dependency-parsing lemmatization lemmatizer named-entity-recognition natural-language-processing natural-language-understanding ner nlp parts-of-speech-tagging pos pos-tagging relation-extraction similarity tokenizer transliteration
Last synced: 30 Oct 2024
https://github.com/EagleW/PaperRobot
Code for PaperRobot: Incremental Draft Generation of Scientific Ideas
attention-mechanism datasets end-to-end-learning generation memory-networks natural-language-generation nlp paper-generation pytorch text-generation
Last synced: 18 Nov 2024
https://github.com/louisowen6/NLP_bahasa_resources
A Curated List of Dataset and Usable Library Resources for NLP in Bahasa Indonesia
bahasa-indonesia corpus corpus-linguistics dataset indonesian indonesian-language library natural-language-processing nlp nlp-bahasa-resources packages sentiment-analysis sentiment-analysis-dataset
Last synced: 07 Nov 2024
https://github.com/Beomi/KcBERT
🤗 Pretrained BERT model & WordPiece tokenizer trained on Korean Comments 한국어 댓글로 프리트레이닝한 BERT 모델과 데이터셋
bert bert-model korean-nlp nlp transformers
Last synced: 09 Nov 2024
https://github.com/huggingface/node-question-answering
Fast and production-ready question answering in Node.js
bert nlp nodejs question-answering tensorflow transformers typescript
Last synced: 27 Dec 2024
https://github.com/koaning/whatlies
Toolkit to help understand "what lies" in word embeddings. Also benchmarking!
Last synced: 29 Oct 2024
https://github.com/HowieHwong/TrustLLM
[ICML 2024] TrustLLM: Trustworthiness in Large Language Models
ai benchmark dataset evaluation large-language-models llm natural-language-processing nlp pypi-package toolkit trustworthy-ai trustworthy-machine-learning
Last synced: 16 Nov 2024
https://github.com/chenglongchen/kaggle-homedepot
3rd Place Solution for HomeDepot Product Search Results Relevance Competition on Kaggle.
homedepot kaggle kaggle-competition kaggle-homedepot natural-language-processing nlp product-search relevance-competition search-engine search-relevance semantic-matching semantic-similarity
Last synced: 22 Dec 2024
https://github.com/lingdong-/cope
A modern IDE for writing classical Chinese poetry 格律诗编辑程序
bag-of-words chinese chinese-poetry editor electron ide nlp poetry
Last synced: 23 Dec 2024
https://github.com/ematvey/hierarchical-attention-networks
Document classification with Hierarchical Attention Networks in TensorFlow. WARNING: project is currently unmaintained, issues will probably not be addressed.
deep-learning document-classification hierarchical-attention-networks machine-learning nlp tensorflow
Last synced: 06 Nov 2024
https://github.com/hendrikstrobelt/detecting-fake-text
Giant Language Model Test Room
Last synced: 28 Dec 2024
https://github.com/LingDong-/cope
A modern IDE for writing classical Chinese poetry 格律诗编辑程序
bag-of-words chinese chinese-poetry editor electron ide nlp poetry
Last synced: 01 Nov 2024
https://github.com/towhee-io/examples
Analyze the unstructured data with Towhee, such as reverse image search, reverse video search, audio classification, question and answer systems, molecular search, etc.
audio-classification cross-modal embeddings image-classification machine-learning nlp video-tagging
Last synced: 21 Dec 2024
https://github.com/microsoft/XPretrain
Multi-modality pre-training
computer-vision multimedia multimodal-learning nlp pre-training
Last synced: 04 Nov 2024
https://github.com/jina-ai/examples
Jina examples and demos to help you get started
deep-learning examples jina neural-search nlp onboarding python semantic-search tutorials
Last synced: 01 Nov 2024
https://github.com/filyp/autocorrect
Spelling corrector in python
autocorrect autocorrection czech english languages levenshtein-distance multilanguage multilingual nlp ocr polish portuguese python russian spanish spellchecker spelling spelling-corrector turkish ukrainian
Last synced: 29 Nov 2024
https://github.com/ruu3f/freegpt
freeGPT provides free access to text and image generation models.
ai artificial-intelligence chatgpt deep-learning freegpt gpt gpt4all gpt4free llama llm machine-learning nlp python
Last synced: 10 Oct 2024
https://github.com/imgarylai/bert-embedding
🔡 Token level embeddings from BERT model on mxnet and gluonnlp
bert gluonnlp mxnet natural-language-processing nlp word-embeddings
Last synced: 02 Nov 2024
https://github.com/KristiyanVachev/Question-Generation
Generating multiple choice questions from text using Machine Learning.
ai cosine-similarity machine-learning naive-bayes nlp question-generation question-generator questions-and-answers quiz spacy spacy-nlp word-embeddings
Last synced: 13 Nov 2024
https://github.com/huggingface/large_language_model_training_playbook
An open collection of implementation tips, tricks and resources for training large language models
cuda large-language-models llm nccl nlp performance python pytorch scalability troubleshooting
Last synced: 11 Nov 2024
https://github.com/kristiyanvachev/question-generation
Generating multiple choice questions from text using Machine Learning.
ai cosine-similarity machine-learning naive-bayes nlp question-generation question-generator questions-and-answers quiz spacy spacy-nlp word-embeddings
Last synced: 22 Dec 2024
https://github.com/gagan3012/keytotext
Keywords to Sentences
api docker huggingface-transformers keytotext keywords nlp sentences streamlit t5
Last synced: 23 Dec 2024
https://github.com/johanmodin/clifs
Contrastive Language-Image Forensic Search allows free text searching through videos using OpenAI's machine learning model CLIP
ai machine-learning nlp openai python search text video
Last synced: 22 Dec 2024
https://github.com/james-bowman/nlp
Selected Machine Learning algorithms for natural language processing and semantic analysis in Golang
feature-hash go golang latent-dirichlet-allocation latent-semantic-analysis latent-semantic-indexing lda locality-sensitive-hashing lsa lsh lsi machine-learning natural-language-processing nlp random-indexing random-projections simhash singular-value-decomposition svd tf-idf
Last synced: 26 Oct 2024
https://github.com/adbar/German-NLP
Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German
computational-linguistics corpus-linguistics german-language natural-language-processing nlp text-mining
Last synced: 26 Oct 2024
https://github.com/Guitaricet/relora
Official code for ReLoRA from the paper Stack More Layers Differently: High-Rank Training Through Low-Rank Updates
deep-learning distributed-training llama nlp peft transformer
Last synced: 29 Nov 2024
https://github.com/ayoungprogrammer/nlquery
Natural Language Engine on WikiData
Last synced: 19 Nov 2024
https://github.com/mindspore-courses/step_into_llm
MindSpore online courses: Step into LLM
bert chatglm chatglm2 chatgpt codegeex gpt gpt2 instruction-tuning large-language-models llama llama2 llm mindspore moe natural-language-processing nlp parallel-computing peft prompt-tuning rlhf
Last synced: 28 Dec 2024
https://github.com/jxmorris12/language_tool_python
a free python grammar checker 📝✅
grammar grammar-checker grammar-parser languagetool nlp python spellchecker
Last synced: 26 Dec 2024
https://github.com/Cartus/AGGCN
Attention Guided Graph Convolutional Networks for Relation Extraction (authors' PyTorch implementation for the ACL19 paper)
deep-learning graph-convolutional-networks graph-neural-networks information-extraction nlp relation-extraction
Last synced: 02 Nov 2024
https://github.com/airaria/visual-chinese-llama-alpaca
多模态中文LLaMA&Alpaca大语言模型(VisualCLA)
alpaca chinese llama llm lora multimodal nlp vision-language
Last synced: 22 Dec 2024
https://github.com/intelligo-mn/intelligo
Intelligo is powerful chatbot builder that enables anyone to create and deploy chatbots anywhere.
ai artificial-intelligence bot bot-framework bots chatbot machine-learning messenger-api messenger-bot messenger-chatbots nlp nodejs slack slack-bot
Last synced: 21 Dec 2024
https://github.com/bdbc-kg-nlp/ie-survey
北京航空航天大学大数据高精尖中心自然语言处理研究团队对信息抽取领域的调研。包括实体识别,关系抽取,属性抽取等子任务,每类子任务分别对学术界和工业界进行调研。
Last synced: 12 Nov 2024
https://github.com/modelscope/adaseq
AdaSeq: An All-in-One Library for Developing State-of-the-Art Sequence Understanding Models
bert chinese-nlp crf entity-typing information-extraction multi-modal-ner named-entity-recognition natural-language-processing natural-language-understanding ner nlp pytorch relation-extraction sequence-labeling token-classification word-segmentation
Last synced: 22 Dec 2024
https://github.com/hyunwoongko/kss
KSS: Korean String processing Suite
korean korean-nlp kss nlp sentences split-sentences
Last synced: 26 Dec 2024
https://github.com/mishushakov/dialogflow-web-v2
Dialogflow Web Integration. Supports rich components
agent bot chat chatbot conversational-agents conversational-ai conversational-ui dark-theme dialogflow dialogflow-v2 javascript nlp ui vue vue3 webchat
Last synced: 22 Dec 2024
https://github.com/abdur75648/deep-learning-specialization-coursera
This repo contains the updated version of all the assignments/labs (done by me) of Deep Learning Specialization on Coursera by Andrew Ng. It includes building various deep learning models from scratch and implementing them for object detection, facial recognition, autonomous driving, neural machine translation, trigger word detection, etc.
andrew-ng andrew-ng-machine-learning computer-vision convolutional-neural-networks coursera deep-learning deep-learning-andrew-ng deep-learning-coursera deep-learning-specialization face-recognition lstm machine-learning neural-networks nlp object-detection recurrent-neural-networks tensorflow transformer-architecture unet-segmentation updated
Last synced: 27 Dec 2024
https://github.com/airaria/Visual-Chinese-LLaMA-Alpaca
多模态中文LLaMA&Alpaca大语言模型(VisualCLA)
alpaca chinese llama llm lora multimodal nlp vision-language
Last synced: 28 Nov 2024
https://github.com/pochih/RL-Chatbot
🤖 Deep Reinforcement Learning Chatbot
chatbot deep-learning nlp reinforcement-learning seq2seq-model tensorflow
Last synced: 11 Nov 2024
https://github.com/interpretml/interpret-text
A library that incorporates state-of-the-art explainers for text-based machine learning models and visualizes the result with a built-in dashboard.
azure-sdk black-box-explanations data-analyst data-scientists explainer glass-box-explainers grey-box-explainers jupyter-notebook linear-models local-explanations microsoft-azureml nlp nlp-models nlp-scenarios npm python text-classification text-interpretability visualization-dashboard
Last synced: 23 Dec 2024
https://github.com/chancefocus/PIXIU
This repository introduces PIXIU, an open-source resource featuring the first financial large language models (LLMs), instruction tuning data, and evaluation benchmarks to holistically assess financial LLMs. Our goal is to continually push forward the open-source development of financial artificial intelligence (AI).
aifinance chatgpt fintech gpt-4 large-language-models llama machine-learning named-entity-recognition natural-language-processing nlp pixiu question-answering sentiment-analysis stock-price-prediction text-classification
Last synced: 27 Nov 2024
https://github.com/chancefocus/pixiu
This repository introduces PIXIU, an open-source resource featuring the first financial large language models (LLMs), instruction tuning data, and evaluation benchmarks to holistically assess financial LLMs. Our goal is to continually push forward the open-source development of financial artificial intelligence (AI).
aifinance chatgpt fintech gpt-4 large-language-models llama machine-learning named-entity-recognition natural-language-processing nlp pixiu question-answering sentiment-analysis stock-price-prediction text-classification
Last synced: 13 Dec 2024
https://github.com/The-FinAI/PIXIU
This repository introduces PIXIU, an open-source resource featuring the first financial large language models (LLMs), instruction tuning data, and evaluation benchmarks to holistically assess financial LLMs. Our goal is to continually push forward the open-source development of financial artificial intelligence (AI).
aifinance chatgpt fintech gpt-4 large-language-models llama machine-learning named-entity-recognition natural-language-processing nlp pixiu question-answering sentiment-analysis stock-price-prediction text-classification
Last synced: 24 Oct 2024
https://github.com/the-finai/pixiu
This repository introduces PIXIU, an open-source resource featuring the first financial large language models (LLMs), instruction tuning data, and evaluation benchmarks to holistically assess financial LLMs. Our goal is to continually push forward the open-source development of financial artificial intelligence (AI).
aifinance chatgpt fintech gpt-4 large-language-models llama machine-learning named-entity-recognition natural-language-processing nlp pixiu question-answering sentiment-analysis stock-price-prediction text-classification
Last synced: 09 Nov 2024
https://github.com/dialogflow/dialogflow-javascript-client
JavaScript Web SDK for Dialogflow
apiai javascript natural-language-processing nlp nlu sdk typescript
Last synced: 22 Dec 2024
https://github.com/philschmid/clipper.js
HTML to Markdown converter and crawler.
crawl html-to-markdown markdown nlp retrieval-augmented-generation search
Last synced: 28 Dec 2024
https://github.com/tomaarsen/spanmarkerner
SpanMarker for Named Entity Recognition
huggingface ner nlp spacy spacy-extension transformers
Last synced: 27 Dec 2024
https://github.com/llhthinker/NLP-Papers
Natural Language Processing Papers
Last synced: 10 Nov 2024
https://github.com/ai-forever/ner-bert
BERT-NER (nert-bert) with google bert https://github.com/google-research.
atis attention bert bert-model bilstm-crf classification conll-2003 elmo factrueval joint-models ner ner-task nlp nmt python python3 pytorch pytorch-model transfer-learning
Last synced: 23 Dec 2024
https://github.com/microsoft/rat-sql
A relation-aware semantic parsing model from English to SQL
dbqa nl2sql nlp program-synthesis question-answering semantic-parsing transformers
Last synced: 21 Dec 2024
https://github.com/Droidtown/ArticutAPI
API of Articut 中文斷詞 (兼具語意詞性標記):「斷詞」又稱「分詞」,是中文資訊處理的基礎。Articut 不用機器學習,不需資料模型,只用現代白話中文語法規則,即能達到 SIGHAN 2005 F1-measure 94% 以上,Recall 96% 以上的成績。
artificial-intelligence cws natural-language-processing natural-language-understanding nlp nlu part-of-speech-embdding part-of-speech-tagger pos-tagger pos-tagging
Last synced: 30 Oct 2024
https://github.com/r1j1t/contextualspellcheck
✔️Contextual word checker for better suggestions
bert chatbot help-wanted natural-language-processing nlp oov preprocessing python python-spelling-corrector spacy spacy-extension spellcheck spellchecker spelling-correction spelling-corrections
Last synced: 26 Dec 2024
https://github.com/shibing624/nlp-tutorial
自然语言处理(NLP)教程,包括:词向量,词法分析,预训练语言模型,文本分类,文本语义匹配,信息抽取,翻译,对话。
dialogue language-model machine-translation nlp seq2seq text-classification text-generation torch word-embedding
Last synced: 23 Dec 2024
https://github.com/MuQiuJun-AI/bert4pytorch
超轻量级bert的pytorch版本,大量中文注释,容易修改结构,持续更新
Last synced: 06 Nov 2024
https://github.com/erickrf/nlpnet
A neural network architecture for NLP tasks, using cython for fast performance. Currently, it can perform POS tagging, SRL and dependency parsing.
natural-language-processing neural-network nlp parsing pos-tagging semantic-role-labeling
Last synced: 15 Nov 2024
https://github.com/santhoshkolloju/Abstractive-Summarization-With-Transfer-Learning
Abstractive summarisation using Bert as encoder and Transformer Decoder
abstractive-summarization abstractive-text-summarization bert bert-model nlg nlp summarization transfer-learning transformer
Last synced: 02 Nov 2024
https://github.com/muqiujun-ai/bert4pytorch
超轻量级bert的pytorch版本,大量中文注释,容易修改结构,持续更新
Last synced: 01 Oct 2024
https://github.com/modelscope/AdaSeq
AdaSeq: An All-in-One Library for Developing State-of-the-Art Sequence Understanding Models
bert chinese-nlp crf entity-typing information-extraction multi-modal-ner named-entity-recognition natural-language-processing natural-language-understanding ner nlp pytorch relation-extraction sequence-labeling token-classification word-segmentation
Last synced: 27 Oct 2024
https://github.com/kunalj101/Data-Science-Hacks
Data Science Hacks consists of tips, tricks to help you become a better data scientist. Data science hacks are for all - beginner to advanced. Data science hacks consist of python, jupyter notebook, pandas hacks and so on.
computer-vision data data-analysis data-science data-visualization dataset hacks image-augmentation ipynb machine-learning nlp nlp-machine-learning numpy pandas pandas-dataframe pandas-python pandas-tutorial python python3 tips-and-tricks
Last synced: 13 Nov 2024
https://github.com/dsfsi/textaugment
TextAugment: Text Augmentation Library
augmentation augmentation-methods hacktoberfest low-resouce-language mixup natural-language-processing nlp nlp-augmentation synonym word2vec wordnet
Last synced: 13 Nov 2024
https://github.com/kunalj101/data-science-hacks
Data Science Hacks consists of tips, tricks to help you become a better data scientist. Data science hacks are for all - beginner to advanced. Data science hacks consist of python, jupyter notebook, pandas hacks and so on.
computer-vision data data-analysis data-science data-visualization dataset hacks image-augmentation ipynb machine-learning nlp nlp-machine-learning numpy pandas pandas-dataframe pandas-python pandas-tutorial python python3 tips-and-tricks
Last synced: 11 Oct 2024
https://github.com/erre-quadro/spikex
SpikeX - SpaCy Pipes for Knowledge Extraction
abbreviations-detection acronym-recognition clustering entity-linking named-entity-recognition nlp noun-phrase-extract sentence-splitting spacy spacy-pipes verb-phrase-extract wikigraph wikipedia wikipedia-graph
Last synced: 21 Dec 2024
https://github.com/microsoft/azureml-bert
End-to-End recipes for pre-training and fine-tuning BERT using Azure Machine Learning Service
azure-machine-learning azureml-bert bert bert-model finetuning language-model nlp pretrained-models pretraining pytorch tuning
Last synced: 21 Dec 2024
https://github.com/microsoft/AzureML-BERT
End-to-End recipes for pre-training and fine-tuning BERT using Azure Machine Learning Service
azure-machine-learning azureml-bert bert bert-model finetuning language-model nlp pretrained-models pretraining pytorch tuning
Last synced: 27 Nov 2024
https://github.com/oneflow-inc/libai
LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training
data-parallelism deep-learning distributed-training large-scale model-parallelism nlp oneflow pipeline-parallelism self-supervised-learning transformer vision-transformer
Last synced: 22 Dec 2024
https://github.com/OpenMOSS/CoLLiE
Collaborative Training of Large Language Models in an Efficient Way
deep-learning deepspeed nlp pytorch
Last synced: 16 Nov 2024
https://github.com/opencog/link-grammar
The CMU Link Grammar natural language parser
english grammar link-grammar natural-language natural-language-processing nlp parser russian
Last synced: 27 Dec 2024
https://github.com/openmoss/collie
Collaborative Training of Large Language Models in an Efficient Way
deep-learning deepspeed nlp pytorch
Last synced: 22 Dec 2024
https://github.com/msg-systems/holmes-extractor
Information extraction from English and German texts based on predicate logic
information-extraction machine-learning nlp ontology python semantics spacy spacy-extension
Last synced: 24 Dec 2024
https://github.com/thunlp/few-nerd
Code and data of ACL 2021 paper "Few-NERD: A Few-shot Named Entity Recognition Dataset"
deep-learning entity-typing few-shot-learning named-entity-recognition nlp
Last synced: 22 Dec 2024
https://github.com/Microsoft/rat-sql
A relation-aware semantic parsing model from English to SQL
dbqa nl2sql nlp program-synthesis question-answering semantic-parsing transformers
Last synced: 10 Dec 2024
https://github.com/Microsoft/AzureML-BERT
End-to-End recipes for pre-training and fine-tuning BERT using Azure Machine Learning Service
azure-machine-learning azureml-bert bert bert-model finetuning language-model nlp pretrained-models pretraining pytorch tuning
Last synced: 02 Nov 2024
https://github.com/Shixzie/nlp
[UNMANTEINED] Extract values from strings and fill your structs with nlp.
go golang natural-language-processing nlp parse text text-extraction
Last synced: 26 Oct 2024