Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

https://github.com/wikiti/deepl-rb

A simple Ruby gem for the DeepL API

api deepl machine-translation ruby translator wrapper-api

Last synced: 03 Jul 2024

https://github.com/NELSONZHAO/zhihu

This repo contains the source code from my personal column (https://zhuanlan.zhihu.com/zhaoyeyu), implemented in Python 3.6. It includes Natural Language Processing and Computer Vision projects, such as text generation, machine translation, deep convolutional GAN, and other hands-on code.

autoencoder convolutional-neural-networks deep-learning gan machine-translation natural-language-processing recurrent-neural-networks style-transfer tensorflow-examples

Last synced: 26 Jun 2024

https://github.com/Tencent/TurboTransformers

A fast and user-friendly runtime for transformer inference (BERT, ALBERT, GPT-2, decoders, etc.) on CPU and GPU.

albert bert decoder gpt2 gpu huggingface-transformers inference machine-translation nlp pytorch roberta transformer

Last synced: 22 Jun 2024

https://nvidia.github.io/NeMo/

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

asr deeplearning generative-ai large-language-models machine-translation multimodal neural-networks speaker-diariazation speaker-recognition speech-synthesis speech-translation tts

Last synced: 21 Jun 2024

https://github.com/thammegowda/mtdata

A tool that locates, downloads, and extracts machine translation corpora

dataset machine-translation multilingual natural-language-generation natural-language-processing parallel-data

Last synced: 20 Jun 2024

https://github.com/M4t1ss/MWE-Tools

A set of tools for multiword expression extraction from parallel corpora, for use with the Moses statistical machine translation system

machine-learning machine-translation multiword-expressions

Last synced: 20 Jun 2024

https://github.com/vsetka/deepl-translator

This module provides promise-based methods for translating text using the undocumented API of DeepL Translator (https://www.deepl.com/translator).

deepl deepl-translator machine-translation node nodejs promised translate translation translator

Last synced: 20 Jun 2024

https://github.com/M4t1ss/parallel-corpora-tools

Tools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks.

cleaning corpora corpus-tools data-processing data-science filtering language language-processing machine machine-translation natural-language natural-language-processing neural neural-machine-translation nlp nmt translation

Last synced: 20 Jun 2024

https://github.com/alphadl/inspiring_papers

Papers related to Machine Translation (continuously updated; stars, forks, and PRs welcome)

machine-translation natural-language-processing nlp

Last synced: 20 Jun 2024

https://github.com/zhangshaolei1998/Awesome-Simultaneous-Translation

Paper list of simultaneous translation / streaming translation, including text-to-text machine translation and speech-to-text translation.

awesome machine-translation natural-language-processing nlp paper paperlist simultaneous-machine-translation simultaneous-translation speech-translation streaming text-translation

Last synced: 20 Jun 2024

https://github.com/NiuTrans/NiuTrans.SMT

NiuTrans.SMT is an open-source statistical machine translation system developed by a joint team from the NLP Lab at Northeastern University and the NiuTrans Team. The NiuTrans system is written entirely in C++, so it runs fast and uses less memory. It currently supports phrase-based, hierarchical phrase-based, and syntax-based (string-to-tree, tree-to-string, and tree-to-tree) models for research-oriented studies.

decoder machine-translation parsing phrase-based-translation statistical-machine-translation

Last synced: 20 Jun 2024

https://github.com/NiuTrans/NiuTrans.NMT

A fast neural machine translation system. It is developed in C++ and relies on NiuTensor for fast tensor APIs.

fast-decoding machine-translation neural-machine-translation transformer

Last synced: 20 Jun 2024

https://github.com/EdinburghNLP/nematus

Open-Source Neural Machine Translation in TensorFlow

machine-translation mt neural-machine-translation nmt sequence-to-sequence

Last synced: 20 Jun 2024

https://github.com/modernmt/modernmt

Neural Adaptive Machine Translation that adapts to context and learns from corrections.

machine-learning machine-translation mmt mt neural neural-machine-translation neural-network translation

Last synced: 20 Jun 2024

https://github.com/browsermt/bergamot-translator

Cross-platform C++ library focusing on optimized machine translation on consumer-grade devices.

cpp cross-platform emscripten machine-translation neural-machine-translation neural-networks python wasm webassembly

Last synced: 20 Jun 2024

https://github.com/NiuTrans/MTBook

Machine Translation: Foundations and Models (《机器翻译:基础与模型》), by Tong Xiao and Jingbo Zhu

deep-learning machine-learning machine-translation natural-language-processing neural-machine-translation statistical-machine-translation tex

Last synced: 20 Jun 2024

https://github.com/ymoslem/CTranslate-NMT-Web-Interface

Machine Translation (MT) Web Interface for OpenNMT and FairSeq models using CTranslate and Streamlit

machine-translation neural-machine-translation web-interface

Last synced: 20 Jun 2024

https://github.com/rsennrich/Bleualign

Machine-Translation-based sentence alignment tool for parallel text

machine-translation sentence-alignment

Last synced: 20 Jun 2024

https://github.com/vsetka/deepl-translator-cli

This command-line tool brings text translation to your console and is powered by DeepL (https://www.deepl.com/translator).

cli command-line command-line-tool deepl deepl-translator deeplearning machine-translation translate translator

Last synced: 17 Jun 2024

https://github.com/AmrHendy/programming-language-translator

An easy way to use TransCoder, released by Facebook AI Research, to convert code from one programming language to another. It relies on unsupervised neural machine translation (NMT), the deep-learning approach used to translate text from one natural language to another, and is trained only on monolingual source data.

machine-translation nlp programming-language transcoder transformer unsupervised-deep-learning unsupervised-translation

Last synced: 15 Jun 2024

https://github.com/google/seq2seq

A general-purpose encoder-decoder framework for TensorFlow

deeplearning machine-translation neural-network tensorflow translation

Last synced: 15 Jun 2024

https://github.com/opennmt/opennmt

Open Source Neural Machine Translation in Torch (deprecated)

deep-learning lua machine-translation neural-machine-translation opennmt torch

Last synced: 15 Jun 2024

https://github.com/srvk/how2-dataset

This repository contains code and metadata for the How2 dataset

corpus dataset how2-dataset language machine-translation multimodality speech-recognition video

Last synced: 13 Jun 2024

https://github.com/mit-han-lab/hardware-aware-transformers

[ACL'20] HAT: Hardware-Aware Transformers for Efficient Natural Language Processing

efficient-model hardware-aware machine-translation natural-language-processing specialization transformer

Last synced: 13 Jun 2024

https://github.com/bangoc123/transformer

Build English-Vietnamese machine translation with ProtonX Transformer. :D

machine-translation tensorflow2 transformer

Last synced: 07 Jun 2024

https://github.com/i18n-pro/solid

A lightweight, simple, and flexible internationalization tool with automatic translation for Solid

auto-translation automatic-translation i18n i18n-pro machine-translation solid solid-i18n translator

Last synced: 06 Jun 2024

https://github.com/NVIDIA/NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

asr deeplearning generative-ai large-language-models machine-translation multimodal neural-networks speaker-diariazation speaker-recognition speech-synthesis speech-translation tts

Last synced: 17 May 2024

https://github.com/tensorflow/tensor2tensor

Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.

deep-learning machine-learning machine-translation reinforcement-learning tpu

Last synced: 16 May 2024

https://github.com/nlpcloud/nlpcloud-js

NLP Cloud serves high-performance pre-trained or custom models for NER, sentiment analysis, classification, summarization, paraphrasing, intent classification, product description and ad generation, chatbot, grammar and spelling correction, keyword and keyphrase extraction, text generation, image generation, code generation, and much more.

ad-generator chatbot code-generation conversational-ai embeddings intent-classification keywords-extraction language-detection machine-translation ner nlp paraphrasing question-answering semantic-similarity sentiment-analysis text-classification text-generation text-summarization tokenization

Last synced: 16 May 2024

https://github.com/StrombergNLP/bornholmsk

NLP tools / data for Bornholmsk, NODALIDA 2019

machine-translation natural-language-processing nlp

Last synced: 15 May 2024

https://github.com/alvations/myth

Myanmar and Thai Language Resources

machine-translation myanmar nlp thai

Last synced: 15 May 2024

https://github.com/mikeroyal/NLP-Guide

A guide to Natural Language Processing (NLP), covering topics such as tokenization, part-of-speech (POS) tagging, machine translation, named entity recognition (NER), classification, and sentiment analysis.

awesome awesome-list gpt-3 langauge-model machine-translation natural-language natural-language-processing natural-language-procressing nlp nlp-keywords-extraction nlp-library nlp-machine-learning nlp-parsing nlp-resources semantic-search speech-enhancement speech-processing speech-recognition speech-synthesis

Last synced: 14 May 2024

https://github.com/AdamG012/moe-paper-models

A summary of MoE experimental setups across a number of different papers.

awesome-list language-model machine-learning machine-translation mixture-of-experts moe papers transformer transformers

Last synced: 14 May 2024

https://github.com/ictnlp/STEMM

Code for ACL 2022 main conference paper "STEMM: Self-learning with Speech-text Manifold Mixup for Speech Translation".

machine-translation speech-to-text speech-translation

Last synced: 12 May 2024

https://github.com/osdg-ai/osdg-tool

OSDG is an open-source tool that maps and connects activities to the UN Sustainable Development Goals (SDGs) by identifying SDG-relevant content in any text. The tool is available online at www.osdg.ai. API access is available for research purposes.

machine-learning machine-learning-algorithms machine-translation ml open-source osdg sdg sdg-data sdgs sustainability sustainability-score sustainable-development sustainable-development-goals united-nations

Last synced: 09 May 2024

https://github.com/THUNLP-MT/THUMT

An open-source neural machine translation toolkit developed by Tsinghua Natural Language Processing Group

deep-learning machine-translation neural-machine-translation

Last synced: 08 May 2024

https://github.com/sebastianruder/NLP-progress

Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.

dialogue machine-learning machine-translation named-entity-recognition natural-language-processing nlp-tasks

Last synced: 07 May 2024

https://github.com/argosopentech/argos-translate

Open-source offline translation library written in Python

language-models linux machine-translation nlp open-source python transformers translation

Last synced: 07 May 2024

https://github.com/FlineDev/BartyCrouch

Localization/I18n: Incrementally update/translate your Strings files from .swift, .h, .m(m), .storyboard or .xib files.

code incremental language localization machine-translation storyboard swift translation xcode xib

Last synced: 27 Apr 2024

https://github.com/i18n-pro/vue2

A lightweight, simple, and flexible internationalization tool with automatic translation for Vue 2

auto-translation automatic-translation i18n i18n-pro machine-translation translator vue vue-i18n vue2

Last synced: 27 Apr 2024

https://github.com/i18n-pro/vue

A lightweight, simple, and flexible internationalization tool with automatic translation for Vue

auto-translation automatic-translation i18n i18n-pro machine-translation translator vue vue-i18n

Last synced: 27 Apr 2024

https://github.com/sinaahmadi/KurdishMT

Towards Machine Translation for the Kurdish Language

kurdish kurdish-language-processing less-resource-languages machine-translation nlp

Last synced: 23 Apr 2024

https://github.com/keon/seq2seq

Minimal Seq2Seq model with Attention for Neural Machine Translation in PyTorch

deep-learning machine-translation seq2seq

Last synced: 19 Apr 2024

https://github.com/asyml/texar-pytorch

Integrating the Best of TF into PyTorch, for Machine Learning, Natural Language Processing, and Text Generation. This is part of the CASL project: http://casl-project.ai/

bert casl-project data-processing deep-learning dialog-systems gpt-2 machine-learning machine-translation natural-language-processing python pytorch roberta texar texar-pytorch text-data text-generation xlnet

Last synced: 19 Apr 2024

https://github.com/OpenNMT/OpenNMT-py

Open Source Neural Machine Translation and (Large) Language Models in PyTorch

deep-learning language-model llms machine-translation neural-machine-translation pytorch

Last synced: 19 Apr 2024

https://github.com/sillsdev/machine

Machine is a natural language processing library for .NET that is focused on providing tools for processing resource-poor languages.

language-translation machine-translation natural-language-processing

Last synced: 17 Apr 2024

https://github.com/joom/dilacar

A rule-based machine translation system from Ottoman Turkish to Modern Turkish.

computational-linguistics historical-linguistics machine-translation ottoman rule-based turkish turkish-language

Last synced: 13 Apr 2024

https://github.com/mkiol/dsnote

Speech Note Linux app. Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine translation.

asr flatpak-applications linux-desktop machine-translation nmt offline sailfishos speech-recognition speech-synthesis speech-to-text stt text-to-speech translation translator tts

Last synced: 12 Apr 2024

https://github.com/potamides/pantran.nvim

Use your favorite machine translation engines without having to leave your favorite editor.

apertium argos deepl google lua machine-translation neovim plugin translation translator yandex

Last synced: 12 Apr 2024

https://github.com/asyml/texar

Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://casl-project.ai/

bert casl-project data-processing deep-learning dialog-systems gpt-2 machine-learning machine-translation natural-language-processing python tensorflow texar text-data text-generation xlnet

Last synced: 11 Apr 2024

https://github.com/THUNLP-MT/MT-Reading-List

A machine translation reading list maintained by Tsinghua Natural Language Processing Group

machine-translation reading-list

Last synced: 10 Apr 2024

https://github.com/pyurbans/urbans

A tool for translating text from a source grammar to a target grammar (context-free) with a corresponding dictionary.

artificial-intelligence data-science machine-translation nlp python

Last synced: 09 Apr 2024

https://github.com/chriskonnertz/DeepLy

PHP client for the DeepL.com translation API (unofficial)

ai api client deepl i18n language laravel library machine-translation neural php translate translation translator

Last synced: 08 Apr 2024

https://github.com/Flinesoft/BartyCrouch

Localization/I18n: Incrementally update/translate your Strings files from .swift, .h, .m(m), .storyboard or .xib files.

code incremental language localization machine-translation storyboard swift translation xcode xib

Last synced: 05 Apr 2024

https://github.com/sharad461/nepali-translator

Neural Machine Translation on the Nepali-English language pair

data-cleaning machine-translation nepali-english parallel-corpus

Last synced: 02 Apr 2024

https://github.com/soumendrak/MTEnglish2Odia

Machine translation from English to the Odia language.

indic-languages machine-translation odia odia-language parallel-corpus python3

Last synced: 02 Apr 2024

https://github.com/csebuetnlp/banglanmt

This repository contains the code and data of the paper titled "Not Low-Resource Anymore: Aligner Ensembling, Batch Filtering, and New Datasets for Bengali-English Machine Translation" published in Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020), November 16 - November 20, 2020.

bangla-dataset-machine-translation bangla-machine-translation bangla-nlp emnlp-2020 low-resource-languages low-resource-machine-translation low-resource-nlp machine-translation neural-machine-translation parallel-corpora parallel-corpus

Last synced: 02 Apr 2024

https://github.com/priyanshu2103/Sanskrit-Hindi-Machine-Translation

Machine Translation from Sanskrit to Hindi using Unsupervised and Supervised Learning

fasttext-embeddings hindi machine-translation monolingual-corpora parallel-corpus sanskrit sanskrit-english

Last synced: 02 Apr 2024

https://github.com/mozilla/firefox-translations-training

Training pipelines for Firefox Translations neural machine translation models

machine-translation neural-machine-translation

Last synced: 01 Apr 2024

https://github.com/jerinphilip/ilmulti

Tooling to play around with multilingual machine translation for Indian Languages.

indian-languages machine-translation machine-translation-models multilingual-translation multilingual-translations pytorch tokenizer wrappers

Last synced: 27 Mar 2024

https://github.com/adobe/NLP-Cube

Natural Language Processing Pipeline - Sentence Splitting, Tokenization, Lemmatization, Part-of-speech Tagging and Dependency Parsing

dependency-parser dependency-parsing embeddings information-extraction language-pipeline lemmatization machine-translation nlp-cube parse part-of-speech-tagger sentence-splitting tokenization universal-dependencies

Last synced: 27 Mar 2024

https://github.com/wxjiao/ParroT

The ParroT framework enhances and regulates translation abilities during chat, based on open-source LLMs (e.g., LLaMA-7b, Bloomz-7b1-mt) and human-written translation and evaluation data.

bloomz chatgpt contrastive error-guided gpt-4 human-feedback instruction-tuning llama lora machine-translation

Last synced: 24 Mar 2024

https://github.com/kakaobrain/jejueo

Jejueo Datasets for Machine Translation and Speech Synthesis

jejueo korean language machine-translation speech-synthesis

Last synced: 20 Mar 2024

https://github.com/ictnlp/BayLing

BayLing (百聆) is a LLaMA-based English/Chinese LLM equipped with advanced language alignment, showing superior capability in English/Chinese generation, instruction following, and multi-turn interaction. It reaches 90% of ChatGPT's performance across a range of evaluations, including multilingual and general tasks.

aigc bayling chatgpt chinese cross-lingual general-language-model gpt4 human-performance instruction-tuning interactive large-language-models llama machine-translation multilingual-translation translation

Last synced: 19 Mar 2024

https://github.com/rsennrich/subword-nmt

Unsupervised Word Segmentation for Neural Machine Translation and Text Generation

bpe machine-translation neural-machine-translation nmt segmentation subword-units

Last synced: 17 Mar 2024

https://github.com/OpenNMT/OpenNMT-tf

Neural machine translation and sequence learning using TensorFlow

deep-learning machine-translation natural-language-processing neural-machine-translation opennmt python tensorflow

Last synced: 17 Mar 2024