awesome-machine-translation

A list of awesome Machine Translation frameworks, libraries, software and papers
https://github.com/maidis/awesome-machine-translation

  • Aligners 🌌

    • Getting started with Sentence Alignment - A list of sentence alignment tools.
    • yasa - Yet Another Sentence Aligner.
    • Bleualign - A machine translation based sentence alignment tool for parallel text.
    • Bleualign-cpp - A C++ sentence alignment tool based on Bleualign. Bleualign-cpp is expected to be used together with document-aligner.
    • Vecalign - An accurate sentence alignment algorithm that works in about 100 languages, without the need for a machine translation system or lexicon.
    • yalign - A sentence aligner for comparable corpora.
    • hunalign - A tool that aligns bilingual text on the sentence level.
    • LF Aligner - A tool to create translation memories from texts and their translations. It relies on Hunalign for automatic sentence pairing.
    • Web Align Toolkit - Online parallel texts aligner and format converter.
  • Applications 💻

    • Intento - A simple API to third-party machine translation services from many vendors.
    • iTranslate - A translation and dictionary app.
    • Argos Translate - An open-source offline translation library written in Python. Uses OpenNMT for translations, SentencePiece for tokenization, Stanza for sentence boundary detection, and PyQt for GUI.
    • CTranslate-NMT-Web-Interface - A Machine Translation web interface for OpenNMT and FairSeq models using CTranslate and Streamlit.
    • DesktopTranslator - A local cross-platform machine translation GUI, based on CTranslate2.
    • LibreOffice Translate - An extension providing neural machine translation for LibreOffice with a single click.
    • LibreTranslate - A free and open source machine translation API.
    • Local-NMT - A pre-trained Hugging Face machine translation engine with a UI that runs on a local computer.
    • translateLocally - Fast and secure translation on your local machine, powered by Marian and Bergamot.
    • Mantra - A tool for highly accurate automatic translation of manga.
    • Skype Translator - A real-time voice and text translator.
    • Canopy Speak - A freemium smart medical phrase mobile app.
    • Slatona Translator - A translation app for macOS that annotates word senses.
  • Books 📚

    • Learning Machine Translation - [Cyril Goutte](https://sites.google.com/site/cyrilgoutte/), [Nicola Cancedda](https://dblp.uni-trier.de/pers/hd/c/Cancedda:Nicola), [Marc Dymetman](http://www.europe.naverlabs.com/NAVER-LABS-Europe/People/Marc-Dymetman), [George Foster](http://www-labs.iro.umontreal.ca/~foster/) - 2008 - The book looks first at enabling technologies that solve problems that are not Machine Translation proper but are linked closely to the development of a Machine Translation system, and then presents some Machine Translation techniques.
    • Machine Translation - [Thierry Poibeau](http://lattice.cnrs.fr/Thierry-Poibeau) - 2018 - A concise, nontechnical overview of the development of machine translation, including the different approaches, evaluation issues, and market potential.
    • Machine Translation - [Pushpak Bhattacharyya](https://www.cse.iitb.ac.in/~pb/) - 2015 - A book that compares and contrasts the salient principles and practices of rule-based machine translation, statistical machine translation, and example-based machine translation.
    • Statistical Machine Translation - [Philipp Koehn](https://github.com/phikoehn) - An introductory text on statistical machine translation (SMT) that provides all of the theories and methods needed to build a statistical machine translator.
    • Makine Çevirisi - [Erdinç Aslan](https://avesis.marmara.edu.tr/erdinc.aslan) - 2019 - Turkish - A book that will provide a good introduction to students taking courses such as Translation Technologies and those starting to work in the field of machine translation.
    • Machine Translation: Foundations and Models - Tong Xiao, Jingbo Zhu - 2020... - Chinese - A book that gives a systematic introduction to the basic knowledge and modeling methods of machine translation and, on this basis, discusses some cutting-edge machine translation technologies. It is suitable for senior undergraduates and graduate students in computer science and artificial intelligence, and can also serve as a reference for researchers in natural language processing, especially machine translation.
    • Syntax-based Statistical Machine Translation - [Philip Williams](http://homepages.inf.ed.ac.uk/s0898777/), [Rico Sennrich](http://homepages.inf.ed.ac.uk/rsennric/), [Matt Post](https://mjpost.github.io/), [Philipp Koehn](http://www.cs.jhu.edu/~phi/) - 2016 - A comprehensive introduction to the syntax-based statistical machine translation models.
    • Neural Machine Translation - [Philipp Koehn](https://github.com/phikoehn) - 2020 - A book that introduces the challenge of machine translation and evaluation, including historical, linguistic, and applied context, then develops the core deep learning methods used for natural language applications.
  • Frameworks 🖼

    • Marian - A neural machine translation framework written in pure C++ with minimal dependencies.
    • OpenNMT - An open source initiative for neural machine translation and neural sequence modeling.
    • THUMT - An open source toolkit for neural machine translation.
    • Bergamot - A cross-platform C++ library focusing on optimized machine translation on consumer-grade devices.
    • EnglishTurkishTranslation-CPP - An English-Turkish phrase-based translation library.
    • fairseq - A sequence modeling toolkit to train custom models for translation, summarization, language modeling and other text generation tasks.
    • Joey NMT - A minimalist NMT toolkit for educational purposes.
    • ModernMT - An adaptive neural machine translation system that adapts to context and learns from corrections.
    • Moses - A statistical machine translation system that lets you automatically train translation models for any language pair.
    • Nematus - An attention-based encoder-decoder model for neural machine translation built in TensorFlow.
    • NiuTrans.NMT - A fast neural machine translation system developed in C++ that relies on NiuTensor for fast tensor APIs.
    • NiuTrans.SMT - An open source statistical machine translation system developed entirely in C++.
  • Hardware 🎧

    • ili - An instant offline translation device for travelers.
  • Online MT Services 🌐

    • Masakhane - A machine translation service for African languages.
    • ModernMT - ModernMT online demo.
    • MyDutchPal's Neural MT Gateway - A free online neural machine translation system to translate short pieces of text.
    • NiuTrans - A neural machine translation engine for 115 languages.
    • SYSTRAN Translate - A demonstrator of SYSTRAN's MT engines.
    • THUMT - THUMT online demo.
    • Ubiqus Online Text Translation - Free online translation for information purposes only in English, French, German, Spanish, Italian, Dutch. Up to 2,500 characters i.e. about 350 words.
    • Yandex.Translate - A web service provided by Yandex, intended for the translation of text or web pages into another language.
    • Bing Microsoft Translator - A service to translate texts or entire web pages into different languages.
  • Organizations and Events 🎉

    • AAMT - Asia-Pacific Association for Machine Translation.
    • AMTA - Association for Machine Translation in the Americas.
    • EAMT (European Association for Machine Translation) - An organization that serves the growing community of people interested in MT and translation tools, including users, developers, and researchers.
    • WMT18 - A conference that builds on a series of annual workshops and conferences on statistical machine translation, going back to 2006.
  • Other MT Lists

  • Parallel Texts ⏸️

    • Publicly accessible translation memories - Several online services allowing access to aggregated translation memories.
    • OmegaWiki - A collaborative project to produce a free, multilingual dictionary for every language with lexicological, terminological and thesaurus information.
  • Tools 🛠

    • SMT Corpus Tools - A tool set to process corpus files for machine translation.
  • Tutorials and Blogs 🎒

    • KantanAI - A SaaS-based Machine Translation platform.
    • Lingua Custodia - A machine translation company that specializes in finance.
    • SDL Machine Translation - Neural and statistical based machine translation services.
    • Unbabel - A company that provides AI-powered, human-refined translation for customer support.
    • Waverly Labs - A tech startup in NYC at the convergence of wearable technology and machine translation.