Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

https://github.com/stefangabos/world_countries

Constantly updated lists of world countries and their associated alpha-2, alpha-3 and numeric country codes as defined by the ISO 3166 standard, available in CSV, JSON, PHP, SQL and XML formats, in multiple languages and with national flags included; also available are the ISO 3166-2 codes of provinces/states associated with the countries

countries csv flags iso-3166-1 json multilingual mysql national-flags sql xml

Last synced: 04 Jul 2024

https://github.com/qcri/LLMeBench

Benchmarking Large Language Models

benchmarking large-language-models llm multilingual

Last synced: 04 Jul 2024

https://github.com/kkoomen/vim-doge

(Do)cumentation (Ge)nerator for nearly 20 languages 📚 Generate proper code documentation with a single keypress. ⚡️🔥

boilerplate docblock documentation doge fast generator instant languages multilingual neovim polyglot skeleton vim

Last synced: 01 Jul 2024

https://github.com/wxjiao/Is-ChatGPT-A-Good-Translator

A preliminary evaluation of ChatGPT/GPT-4 for machine translation.

chatgpt gpt-4 multilingual pivot-translation prompt robustness translation

Last synced: 30 Jun 2024

https://github.com/andre-fuchs/kerning-pairs

The ultimate list of kerning pairs for type designers

kerning kerning-pairs multilingual python typedesign

Last synced: 29 Jun 2024

https://github.com/bigscience-workshop/data-preparation

Code used for sourcing and cleaning the BigScience ROOTS corpus

dataset large-language-models multilingual

Last synced: 28 Jun 2024

https://github.com/AgentMaker/AgentOCR

An easy-to-use OCR project with multilingual support.

easy-deploy multilingual ocr onnx

Last synced: 27 Jun 2024

https://github.com/myshell-ai/MeloTTS

High-quality multi-lingual text-to-speech library by MyShell.ai. Supports English, Spanish, French, Chinese, Japanese and Korean.

chinese english french japanese korean multilingual spanish text-to-speech tts

Last synced: 23 Jun 2024

https://github.com/PhilipMay/stsb-multi-mt

Machine-translated multilingual STS benchmark dataset.

dataset multilingual nlp

Last synced: 23 Jun 2024

https://github.com/thammegowda/mtdata

A tool that locates, downloads, and extracts machine translation corpora

dataset machine-translation multilingual natural-language-generation natural-language-processing parallel-data

Last synced: 20 Jun 2024

https://github.com/paviliondev/discourse-multilingual

A Discourse Plugin that makes it easier to administer a Multilingual Forum.

discourse discourse-plugin multilingual

Last synced: 20 Jun 2024

https://github.com/cmints/cmints

CMS and Static Site Generator created with internationalization in mind

cms i18n internationalization l10n multilingual nodejs static-site static-site-generator

Last synced: 20 Jun 2024

https://github.com/omerbyrk/flutter-boilerplate

A Flutter boilerplate project that can be used for both enterprise and individual applications.

bloc boilerplate dart flutter flutter-boilerplate multilingual responsive-design

Last synced: 18 Jun 2024

https://github.com/lindera-morphology/lindera

A multilingual morphological analysis library.

analyzer library morphological multilingual tokenizer

Last synced: 17 Jun 2024

https://github.com/project-miracl/miracl

A large-scale multilingual dataset for Information Retrieval. Thorough human annotations across 18 diverse languages.

benchmark dataset information-retrieval multilingual

Last synced: 16 Jun 2024

https://github.com/DAMO-NLP-SG/M3Exam

Data and code for paper "M3Exam: A Multilingual, Multimodal, Multilevel Benchmark for Examining Large Language Models"

ai-education chatgpt evaluation gpt-4 large-language-models llms multilingual multimodal

Last synced: 16 Jun 2024

https://github.com/chen3feng/blade-build

Blade is a powerful build system from Tencent that supports many mainstream programming languages, such as C/C++, Java, Scala, Python, protobuf...

build-system build-tool cplusplus java monorepo multilingual ninja protobuf python scala

Last synced: 14 Jun 2024

https://github.com/ultrabug/mkdocs-static-i18n

MkDocs i18n plugin using static translation markdown files

i18n mkdocs mkdocs-material mkdocs-plugin multilingual

Last synced: 14 Jun 2024

https://github.com/twardoch/lorem-chatum-for-indesign

Lorem Chatum script for Adobe InDesign that uses ChatGPT to produce better lorem ipsum

adobe-uxp chatgpt chatgpt-api font-specimen indesign indesign-plugin indesign-scripts lorem-ipsum-generator multilingual uxp

Last synced: 14 Jun 2024

https://github.com/alexandrevl/supersummarizeai

Unleash the power of AI with SuperSummarizeAI! Effortlessly extract, condense, and clip content from webpages and YouTube videos using ChatGPT. Turn endless streams of content into digestible summaries.

beautifulsoup chatgpt content-analysis multilingual nlp openai papperclip text text-processing text-summarization web-scraping youtube

Last synced: 14 Jun 2024

https://github.com/carboneio/carbone

Fast and simple report generator, from JSON to pdf, xlsx, docx, odt...

carbone document-conversion javascript libreoffice microsoft-office multilingual nodejs pdf-generation report-generator template-engine

Last synced: 11 Jun 2024

https://github.com/jitsejan/python-flask-with-javascript

This repository contains an example app to communicate between JavaScript and Python.

flask javascript multilingual python

Last synced: 31 May 2024

https://github.com/bkader/skeleton

A ready-to-use CodeIgniter skeleton with tons of new features and a whole new concept of hooks (actions and filters), as well as a ready-to-use, application-free themes and plugins system.

actions codeigniter codeigniter-skeleton dashboard demo entreprise filters framework history-management hooks modular multilingual plugins skeleton starter-kit themes user-management

Last synced: 27 May 2024

https://github.com/TypiCMS/Base

Multilingual CMS built with Laravel.

cms laravel multilingual php website website-builder

Last synced: 26 May 2024

https://github.com/NouamaneTazi/bloomz.cpp

C++ implementation for BLOOM

bloom cpp multilingual

Last synced: 25 May 2024

https://github.com/google-research-datasets/wit

WIT (Wikipedia-based Image Text) Dataset is a large multimodal multilingual dataset comprising 37M+ image-text sets with 11M+ unique images across 100+ languages.

cc-by-sa-3 machine-learning multilingual multimodal nlp wikipedia

Last synced: 13 May 2024

https://github.com/gaalcaras/academic

Jekyll theme with a focus on simplicity, typography and flexibility

gem i18n jekyll jekyll-theme multilingual

Last synced: 01 May 2024

https://github.com/deanishe/alfred-searchio

Alfred workflow to auto-suggest search results from multiple search engines and languages.

alfred alfred-workflow amazon autosuggest google multilingual python search wikipedia wiktionary

Last synced: 20 Apr 2024

https://github.com/HIT-SCIR/ELMoForManyLangs

Pre-trained ELMo Representations for Many Languages

elmo multilingual nlp

Last synced: 19 Apr 2024

https://github.com/yuvalpinter/Mimick

Code for Mimicking Word Embeddings using Subword RNNs (EMNLP 2017)

convolutional-neural-networks lstm multilingual neural-network part-of-speech-tagger word-embeddings

Last synced: 19 Apr 2024

https://github.com/christos-c/bible-corpus

A multilingual parallel corpus created from translations of the Bible.

bible bible-corpus corpus multilingual translation

Last synced: 18 Apr 2024

https://github.com/wagtail/wagtailtrans

A Wagtail add-on for supporting multilingual sites

multilingual wagtail

Last synced: 15 Apr 2024

https://github.com/zelon88/HRConvert2

A self-hosted, drag-and-drop & nosql file conversion server & share tool that supports 86 file formats in 13 languages.

archiver conversion converter document-conversion extractor file-converter file-sharing format image multilingual ocr ocr-recognition pdf-converter php server virustotal

Last synced: 15 Apr 2024

https://github.com/ArtificiAI/Multilingual-Latent-Dirichlet-Allocation-LDA

A Multilingual Latent Dirichlet Allocation (LDA) Pipeline with Stop Words Removal, n-gram features, and Inverse Stemming, in Python.

clustering english french latent-dirichlet-allocation lda machine-learning multilingual natural-language-processing

Last synced: 11 Apr 2024

https://github.com/martignoni/hugo-notice

A Hugo theme component to display nice notices

hugo hugo-theme-component multilingual notice

Last synced: 10 Apr 2024

https://github.com/keevitaja/linguist

Easy multilingual URLs and redirection support for the Laravel framework

estonia i18n laravel linguist locales localization multilingual php translation

Last synced: 10 Apr 2024

https://github.com/wet-boew/wet-boew

Web Experience Toolkit (WET): Open source code library for building innovative websites that are accessible, usable, interoperable, mobile-friendly and multilingual. This collaborative open source project is led by the Government of Canada.

accessiblity framework multilingual wcag web

Last synced: 09 Apr 2024

https://github.com/pH7Software/pH7-Internationalization

🎌 pH7CMS Internationalization (I18N) package 🙊 Get new languages for your pH7CMS website!

brazilian-portuguese dutch french gettext i18n indonesian internationalisation internationalization language multilingual ph7cms php portuguese spanish translation translations

Last synced: 08 Apr 2024

https://github.com/akb89/witokit

A Python toolkit to generate a tokenized dump of Wikipedia for NLP

dump multilingual nlp tokenize wikipedia wikipedia-dump

Last synced: 06 Apr 2024

https://github.com/AkariAsai/XORQA

This is the official repository for NAACL 2021, "XOR QA: Cross-lingual Open-Retrieval Question Answering".

multilingual open-domain-qa question-answering

Last synced: 02 Apr 2024

https://github.com/bheinzerling/bpemb

Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)

embeddings multilingual natural-language-processing nlp subword-embeddings

Last synced: 02 Apr 2024

https://github.com/csebuetnlp/xl-sum

This repository contains the code, data, and models of the paper titled "XL-Sum: Large-Scale Multilingual Abstractive Summarization for 44 Languages" published in Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021.

abstractive-summarization abstractive-text-summarization dataset deep-learning low-resource-languages low-resource-summarization low-resource-text-summarizarion machine-learning multilingual multilingual-summarization multilingual-text-summarization multilinguality summarization-corpora summarization-dataset text-summarisation text-summarization text-summarization-dataset text-summarization-model

Last synced: 27 Mar 2024

https://github.com/notAI-tech/Anuvaad

State-of-the-art open-source translation for Indic languages.

hindi india indic-languages kannada malayalam marathi mt5 multilingual nlp tamil telugu transformer transformers translation

Last synced: 27 Mar 2024

https://github.com/anoopkunchukuttan/geomm

Geometry-aware Multilingual Embeddings

bilingual-word-embedding multilingual nlp translation word-embedding

Last synced: 27 Mar 2024

https://github.com/vinkla/laravel-translator

An Eloquent translator for Laravel

composer eloquent laravel multilingual php translator

Last synced: 21 Mar 2024

https://github.com/pacollins/hugo-future-imperfect-slim

Multilingual Blogging Theme for Hugo | Check the Wiki for Documentation

hugo hugo-theme multilingual staticman

Last synced: 21 Mar 2024

https://github.com/sdebacker/TypiCMS

Multilingual CMS built with Laravel 4.2

cms laravel multilingual

Last synced: 20 Mar 2024

https://github.com/Tomiinek/Multilingual_Text_to_Speech

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.

code-switching multilingual speech-synthesis text-to-speech tts voice-cloning

Last synced: 16 Mar 2024

https://github.com/jonsafari/tok-tok

A fast, simple, multilingual tokenizer

multilingual nlp tokeniser tokenizer

Last synced: 16 Mar 2024