Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

https://github.com/microsoft/MT-DNN

Multi-Task Deep Neural Networks for Natural Language Understanding

mt-dnn multi-task-learning natural-language-processing natural-language-understanding

Last synced: 03 Jul 2024

https://github.com/explosion/spacy-models

💫 Models for the spaCy Natural Language Processing (NLP) library

machine-learning machine-learning-models models natural-language-processing nlp spacy spacy-models statistical-models

Last synced: 02 Jul 2024

https://github.com/gkiril/MinSCIE

MinScIE is an Open Information Extraction system which provides structured knowledge enriched with semantic information about citations.

information-extraction natural-language-processing natural-language-toolkit natural-language-understanding nlp nlp-apis nlp-resources open-information-extraction

Last synced: 02 Jul 2024

https://github.com/allenai/allennlp

An open-source NLP research library, built on PyTorch.

data-science deep-learning natural-language-processing nlp python pytorch

Last synced: 02 Jul 2024

https://github.com/icoxfog417/awesome-financial-nlp

Researches for Natural Language Processing for Financial Domain

finance financial-analysis machine-learning natural-language-processing

Last synced: 01 Jul 2024

https://github.com/cs230-stanford/cs230-code-examples

Code examples in pyTorch and Tensorflow for CS230

computer-vision natural-language-processing pytorch tensorflow

Last synced: 01 Jul 2024

https://github.com/datasciencecampus/pyGrams

Extracts key terminology (n-grams) from any large collection of documents (>1000) and forecasts emergence

dsc-projects emergence-calculations natural-language-processing nlp nltk patents python scikit-learn tf-idf

Last synced: 01 Jul 2024

https://github.com/mindspore-courses/d2l-mindspore

《动手学深度学习》的MindSpore实现。供MindSpore学习者配合李沐老师课程使用。

computer-vision deep-learning machine-learning mindspore natural-language-processing notebook

Last synced: 01 Jul 2024

https://github.com/davikawasaki/utfpr-ce-undergrad-final-project

UTFPR Computer Engineering Undergrad Final Project - Computing Exam Questions Classification Using Natural-Language Processing

adaptive-teaching computing-classification machine-learning natural-language-processing nlp nltk python sklearn

Last synced: 30 Jun 2024

https://github.com/segment-any-text/wtpsplit

Code for Where's the Point? Self-Supervised Multilingual Punctuation-Agnostic Sentence Segmentation

deep-learning machine-learning natural-language-processing pretrained-models python sentence-boundary-detection sentence-segmentation sentence-segmenter

Last synced: 29 Jun 2024

https://github.com/anupamchugh/iowncode

A curated collection of iOS, ML, AR resources sprinkled with some UI additions

alamofire arkit computer-vision coreml coremltools ios keras ml-kit natural-language-processing nlp realitykit swift swiftui vision vision-framework

Last synced: 29 Jun 2024

https://github.com/CogComp/cogcomp-nlpy

CogComp's light-weight Python NLP annotators

data-mining natural-language-processing nlp text-mining text-processing

Last synced: 29 Jun 2024

https://github.com/the-finai/pixiu

This repository introduces PIXIU, an open-source resource featuring the first financial large language models (LLMs), instruction tuning data, and evaluation benchmarks to holistically assess financial LLMs. Our goal is to continually push forward the open-source development of financial artificial intelligence (AI).

aifinance chatgpt fintech gpt-4 large-language-models llama machine-learning named-entity-recognition natural-language-processing nlp pixiu question-answering sentiment-analysis stock-price-prediction text-classification

Last synced: 29 Jun 2024

https://github.com/ukairia777/tensorflow-nlp-tutorial

tensorflow를 사용하여 텍스트 전처리부터, Topic Models, BERT, GPT, LLM과 같은 최신 모델의 다운스트림 태스크들을 정리한 Deep Learning NLP 저장소입니다.

bert bert-ner dpo huggingface keras-tutorial llama llm lora named-entity-recognition natural-language-processing nlp nlp-tutorial question-answering sft tensorflow trainer transformers

Last synced: 29 Jun 2024

https://github.com/DaizeDong/Unified-MoE-Compression

The official implementation of the paper "Demystifying the Compression of Mixture-of-Experts Through a Unified Framework".

deep-learning large-language-models machine-learning mixture-of-experts model-compression natural-language-processing

Last synced: 28 Jun 2024

https://github.com/ART-Group-it/GASP

GASP! Dataset - Generating Abstracts of Scientific Papers from Abstracts of Cited Papers

corpus dataset machine-learning natural-language-processing nlp

Last synced: 28 Jun 2024

https://thunlp.github.io/fewrel

A Large-Scale Few-Shot Relation Extraction Dataset

few-shot-learning natural-language-processing relation-extraction

Last synced: 28 Jun 2024

https://github.com/jerryji1993/DNABERT

DNABERT: pre-trained Bidirectional Encoder Representations from Transformers model for DNA-language in genome

deep-learning dnabert-model genome gpu kmer kmer-format machine-learning natural-language-processing nlp sequence

Last synced: 28 Jun 2024

https://github.com/ymcui/cmrc2018

A Span-Extraction Dataset for Chinese Machine Reading Comprehension (CMRC 2018)

bert natural-language-processing question-answering reading-comprehension

Last synced: 28 Jun 2024

https://github.com/SamEdwardes/spacytextblob

A TextBlob sentiment analysis pipeline component for spaCy.

natural-language-processing nlp python spacy

Last synced: 27 Jun 2024

https://github.com/de-mh/persian_phonemizer

A tool for translating Persian text to IPA (International Phonetic Alphabet).

dependency-parser natural-language-processing part-of-speech-tagger persian phonemization python

Last synced: 27 Jun 2024

https://github.com/elyase/geotext

Geotext extracts country and city mentions from text

information-extraction natural-language-processing

Last synced: 27 Jun 2024

https://github.com/mojtaba-khallash/NHazm

A C# version of Hazm (Python library for digesting Persian text)

natural-language-processing persian

Last synced: 27 Jun 2024

https://github.com/neilgupta/Sherlock

Natural-language event parser for Javascript

datetime event-parser javascript natural-language-processing nlp regex

Last synced: 27 Jun 2024

https://github.com/hse-aml/natural-language-processing

Resources for "Natural Language Processing" Coursera course.

natural-language-processing

Last synced: 26 Jun 2024

https://github.com/PaddlePaddle/models

Officially maintained, supported by PaddlePaddle, including CV, NLP, Speech, Rec, TS, big models and so on.

computer-vision cv deep-learning models natural-language-processing neural-network nlp paddlepaddle recommendation speech

Last synced: 26 Jun 2024

https://github.com/RUCAIBox/TextBox

TextBox 2.0 is a text generation library with pre-trained language models

deep-learning natural-language-generation natural-language-processing pretrained-models python pytorch seq2seq text-generation

Last synced: 26 Jun 2024

https://github.com/NELSONZHAO/zhihu

This repo contains the source code in my personal column (https://zhuanlan.zhihu.com/zhaoyeyu), implemented using Python 3.6. Including Natural Language Processing and Computer Vision projects, such as text generation, machine translation, deep convolution GAN and other actual combat code.

autoencoder convolutional-neural-networks deep-learning gan machine-translation natural-language-processing recurrent-neural-networks style-transfer tensorflow-examples

Last synced: 26 Jun 2024

https://github.com/guillermoscript/repo-assistant

AI Github assistant for your repo. Your proactive GitHub bot that auto-detects duplicates using OpenAI embeddings and Supabase magic!

ai assistant bot duplicate-issues github github-bot gpt-4 issue-management llm natural-language-processing openai openai-embeddings rag supabase typescript vector-similarity

Last synced: 25 Jun 2024

https://github.com/BartJongejan/Bracmat

Programming language for symbolic computation with unusual combination of pattern matching features: Tree patterns, associative patterns and expressions embedded in patterns.

bignumbers computer-algebra differentiation epoc expression-evaluator gcc high-level-language html json language-technology natural-language-processing pattern-matching programming-language rosettacode semi-structured-data structured-data symbolic-computation tree-structure unstructured-data xml

Last synced: 24 Jun 2024

https://github.com/linuxscout/mishkal

Mishkal is an arabic text vocalization software

arabic natural-language-processing python webapp

Last synced: 24 Jun 2024

https://github.com/AliAbdelaal/ATKSpy

this repository is a python package that supports SOAP interface to communicate with the Microsoft ATKS

arabic arabic-nlp atks microsoft natural-language-processing nlp parser pos-tagger pos-tagging python3 soap-web-services

Last synced: 24 Jun 2024

https://github.com/CAMTL/CA-MTL

Conditionally Adaptive Multi-Task Learning: Improving Transfer Learning in NLP Using Fewer Parameters & Less Data

ca-mtl multitask-learning natural-language-processing natural-language-understanding

Last synced: 23 Jun 2024

https://github.com/grammarly/gector

Official implementation of the papers "GECToR – Grammatical Error Correction: Tag, Not Rewrite" (BEA-20) and "Text Simplification by Tagging" (BEA-21)

bert grammatical-error-correction natural-language-processing nlp roberta sequence-labeling text-simplification transformers xlnet

Last synced: 23 Jun 2024

https://github.com/salesforce/WikiSQL

A large annotated semantic parsing corpus for developing natural language interfaces.

database dataset machine-learning natural-language natural-language-interface natural-language-processing

Last synced: 23 Jun 2024

https://github.com/zjunlp/openue

OpenUE是一个轻量级知识图谱抽取工具 (An Open Toolkit for Universal Extraction from Text published at EMNLP2020: https://aclanthology.org/2020.emnlp-demos.1.pdf)

bert event-extraction intent-classification named-entity-recognition natural-language-processing nlp nlp-extraction-tasks openue pytorch relation-extraction slot-filling triple-extraction

Last synced: 23 Jun 2024

https://github.com/txsun1997/CoLAKE

COLING'2020: CoLAKE: Contextualized Language and Knowledge Embedding

deep-learning knowledge-embedding knowledge-graph language-model natural-language-processing

Last synced: 23 Jun 2024

https://github.com/RUCAIBox/MVP

This repository is the official implementation of our paper MVP: Multi-task Supervised Pre-training for Natural Language Generation.

data-to-text dialog multi-task-learning natural-language-generation natural-language-processing nlg nlp plm pre-trained-model question-answering question-generation seq2seq sequence-to-sequence story-generation summarization text-generation

Last synced: 23 Jun 2024

https://github.com/lafmdp/Awesome-Papers-Autonomous-Agent

A collection of recent papers on building autonomous agent. Two topics included: RL-based / LLM-based agents.

agent artificial-intelligence autonomous-agent awesome-paper-collection large-language-models machine-learning natural-language-processing reinforcement-learning

Last synced: 23 Jun 2024

https://github.com/OpenLemur/Lemur

Lemur: Open Foundation Models for Language Agents

code-generation language-model machine-learning natural-language-processing nlp text-reasoning

Last synced: 23 Jun 2024

https://github.com/wx-chevalier/AI-Notes

:books: [.md & .ipynb] Series of Artificial Intelligence & Deep Learning, including Mathematics Fundamentals, Python Practices, NLP Application, etc. 💫 人工智能与深度学习实战,数理统计篇 | 机器学习篇 | 深度学习篇 | 自然语言处理篇 | 工具实践 Scikit & Tensoflow & PyTorch 篇 | 行业应用 & 课程笔记

artificial-intelligence datascience deeplearning machinelearning natural-language-processing neural-network wx-doc

Last synced: 23 Jun 2024

https://github.com/obss/trapper

State-of-the-art NLP through transformer models in a modular design and consistent APIs.

allennlp deep-learning natural-language-processing nlp python pytorch pytorch-transformers transformer transformers

Last synced: 22 Jun 2024

https://github.com/graph4ai/graph4nlp

Graph4nlp is the library for the easy use of Graph Neural Networks for NLP. Welcome to visit our DLG4NLP website (https://dlg4nlp.github.io/index.html) for various learning resources!

deep-learning graph-neural-networks machine-learning natural-language-processing nlp pytorch

Last synced: 22 Jun 2024

https://github.com/Shujian2015/FreeML

A List of Data Science/Machine Learning Resources (Mostly Free)

data-science deep-learning machine-learning natural-language-processing

Last synced: 22 Jun 2024

https://github.com/datanada/Awesome-Korean-NLP

A curated list of resources for NLP (Natural Language Processing) for Korean

korean-nlp natural-language-processing nlp

Last synced: 22 Jun 2024

http://yomguithereal.github.io/talisman/

Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.

clustering deduplication fuzzy-matching information-retrieval machine-learning natural-language-processing record-linkage

Last synced: 22 Jun 2024

https://github.com/melvynator/ELK_twitter

This is a data pipeline for Twitter (ETL) using the elastic stack Elasticsearch, Logstash and Kibana (version 6.1)

data-collection data-visualization elasticsearch elk elk-stack kibana logstash machine-learning natural-language-processing twitter twitter-api

Last synced: 22 Jun 2024

https://github.com/delip/PyTorchNLPBook

Code and data accompanying Natural Language Processing with PyTorch published by O'Reilly Media https://amzn.to/3JUgR2L

deep-learning deep-neural-networks natural-language-processing neural-machine-translation neural-networks nlp pytorch pytorch-nlp pytorch-tutorial

Last synced: 21 Jun 2024

https://github.com/neph0s/awesome-llm-role-playing-with-persona

Awesome-llm-role-playing-with-persona: a curated list of resources for large language models for role-playing with assigned personas

agent ai aigc awesome awesome-list character chatgpt conversational-ai deep-learning large-language-models llm natural-language-processing nlp paper-list persona role-playing survey

Last synced: 21 Jun 2024

https://github.com/bnosac/sentencepiece

R package for Byte Pair Encoding / Unigram modelling based on Sentencepiece

byte natural-language-processing sentencepiece word-segmentation

Last synced: 21 Jun 2024

https://github.com/intelligo-mn/neuro

🔮 Neuro.js is machine learning library for building AI assistants and chat-bots.

ai ai-assistants bot chat-bot chat-bots chatbot machine-learning natural-language-processing nlp nodejs

Last synced: 21 Jun 2024

https://flairnlp.github.io/flair/

A very simple framework for state-of-the-art Natural Language Processing (NLP)

machine-learning named-entity-recognition natural-language-processing nlp pytorch semantic-role-labeling sequence-labeling word-embeddings

Last synced: 21 Jun 2024

https://github.com/thammegowda/mtdata

A tool that locates, downloads, and extracts machine translation corpora

dataset machine-translation multilingual natural-language-generation natural-language-processing parallel-data

Last synced: 20 Jun 2024

https://github.com/vlgiitr/DL_Topics

List of DL topics and resources essential for cracking interviews

computer-vision deep-learning generative-models linear-algebra natural-language-processing probability-statistics

Last synced: 20 Jun 2024

https://github.com/NiuTrans/ABigSurvey

A collection of 1000+ survey papers on Natural Language Processing (NLP) and Machine Learning (ML).

deep-learning machine-learning natural-language-processing neural-networks paper-list surveys

Last synced: 20 Jun 2024

https://github.com/franciellevargas/HateBR

HateBR is the first large-scale expert annotated dataset of Brazilian Instagram comments for hate speech and offensive language detection on the web and social media.

brazilian-portuguese dataset hatespeech-detection machine-learning natural-language-processing text-classification

Last synced: 20 Jun 2024

https://github.com/evelinacs/semantic_parsing_with_IRTGs

Experiments of developing an IRTG which simultaneously encodes transformations between phrase structure trees, dependency graphs and semantic graphs.

computational-linguistics dependency-graph grammar grammar-rules graph-transformation irtg natural-language-processing nlp penn-treebank phrase-structure-tree python python3 rule-based semantic-parsing surface-realization universal-dependencies

Last synced: 20 Jun 2024

https://github.com/MariPlaza/dscooking_old

Data Science Cooking is a personal project to combine Data Science and Cooking.

cooking mysql natural-language-processing network-analysis python

Last synced: 20 Jun 2024

https://github.com/M4t1ss/parallel-corpora-tools

Tools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks.

cleaning corpora corpus-tools data-processing data-science filtering language language-processing machine machine-translation natural-language natural-language-processing neural neural-machine-translation nlp nmt translation

Last synced: 20 Jun 2024

https://github.com/alphadl/inspiring_papers

Papers related to Machine Translation (continuously updating & welcome Star/Fork/PR)

machine-translation natural-language-processing nlp

Last synced: 20 Jun 2024

https://github.com/zhangshaolei1998/Awesome-Simultaneous-Translation

Paper list of simultaneous translation / streaming translation, including text-to-text machine translation and speech-to-text translation.

awesome machine-translation natural-language-processing nlp paper paperlist simultaneous-machine-translation simultaneous-translation speech-translation streaming text-translation

Last synced: 20 Jun 2024

https://github.com/NiuTrans/MTBook

《机器翻译:基础与模型》肖桐 朱靖波 著 - Machine Translation: Foundations and Models

deep-learning machine-learning machine-translation natural-language-processing neural-machine-translation statistical-machine-translation tex

Last synced: 20 Jun 2024