{"id":37659178,"url":"https://github.com/avidale/encodechka","last_synced_at":"2026-01-16T11:46:02.011Z","repository":{"id":47323339,"uuid":"210292501","full_name":"avidale/encodechka","owner":"avidale","description":"The tiniest sentence encoder for Russian language","archived":false,"fork":false,"pushed_at":"2024-07-25T09:34:38.000Z","size":2904,"stargazers_count":161,"open_issues_count":6,"forks_count":10,"subscribers_count":7,"default_branch":"master","last_synced_at":"2024-07-25T10:41:16.302Z","etag":null,"topics":["natural-language-processing","nlp","python","russian-specific","sentence-encoding"],"latest_commit_sha":null,"homepage":null,"language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/avidale.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2019-09-23T07:34:09.000Z","updated_at":"2024-07-25T09:34:42.000Z","dependencies_parsed_at":"2022-08-22T10:00:35.620Z","dependency_job_id":"2df3f2a9-3586-4a12-acee-e1ee2c2c3807","html_url":"https://github.com/avidale/encodechka","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/avidale/encodechka","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/avidale%2Fencodechka","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/avidale%2Fencodechka/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/avidale%2Fencodechka/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/avidale%2Fencodechka/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/avidale","download_url":"https://codeload.github.com/avidale/encodechka/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/avidale%2Fencodechka/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":28478369,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-01-16T06:30:42.265Z","status":"ssl_error","status_checked_at":"2026-01-16T06:30:16.248Z","response_time":107,"last_error":"SSL_read: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["natural-language-processing","nlp","python","russian-specific","sentence-encoding"],"created_at":"2026-01-16T11:46:01.943Z","updated_at":"2026-01-16T11:46:02.004Z","avatar_url":"https://github.com/avidale.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# encodechka\n## encodechka-eval\n\nЭтот репозиторий - развитие подхода к оценке моделей из поста\n[Маленький и быстрый BERT для русского языка](https://habr.com/ru/post/562064), \nэволюционировавшего в [Рейтинг русскоязычных энкодеров предложений](https://habr.com/ru/post/669674/).\nИдея в том, чтобы понять, как хорошо разные модели превращают короткие тексты\nв осмысленные векторы.\n\nПохожие проекты:\n* [RussianSuperGLUE](https://russiansuperglue.com/): фокус на дообучаемых моделях\n* [MOROCCO](https://github.com/RussianNLP/MOROCCO/): RussianSuperGLUE + оценка производительности, трудновоспроизводим\n* [RuSentEval](https://github.com/RussianNLP/RuSentEval): более академические/лингвистические задачи\n* Статья от Вышки [Popov et al, 2019](https://arxiv.org/abs/1910.13291): первая научная статья на эту тему, но маловато моделей и задач\n* [SentEvalRu](https://github.com/comptechml/SentEvalRu) и [deepPavlovEval](https://github.com/deepmipt/deepPavlovEval): два хороших, но давно не обновлявшихся бенчмарка.\n* **ruMTEB** ([пост](https://habr.com/ru/companies/sberdevices/articles/831150/), [код MTEB](https://github.com/embeddings-benchmark/mteb)) - русскоязычная часть MTEB. С недавнего времени там есть 23 разнообразных задачи (включая поиск и переранжирование, которых нет в Encodechka), но, кажется, там нет замеров быстродействия. Если последнее вам не критично, рекомендуется использовать ruMTEB вместо Encodechka.\n\nПример запуска метрик – в блокноте [evaluation example](https://github.com/avidale/encodechka/blob/master/evaluation%20example.ipynb). \n\nБлокнот для воспроизведения лидерборда: [v2021](https://colab.research.google.com/drive/1fu2i7A-Yr-85Ex_NvIyeCIO7lN2R7P-k?usp=sharing), \n[v2023](https://colab.research.google.com/drive/1t956aJsp5qPnst3379vI8NNRqiqJUFMn?usp=sharing).\n\nЛидерборд на [HuggingFace Space](https://huggingface.co/spaces/Samoed/Encodechka).\n\n### Лидерборд\n\nРанжирование моделей в по среднему качеству и производительности. \nПодсвечены Парето-оптимальные модели по каждому из критериев. \n\n| model                                                       | CPU       | GPU      | size          |   Mean S | Mean S+W   |   dim |\n|:------------------------------------------------------------|:----------|:---------|:--------------|---------:|:-----------|------:|\n| deepvk/USER-bge-m3                                          | **523.4** | **22.5** | **1371.1**  |    0.799 | 0.709      |  1024 |\n| BAAI/bge-m3                                                 | 523.4     | 22.5     | 2166.0    |    0.787 | 0.696      |  1024 |\n| intfloat/multilingual-e5-large-instruct                     | 501.5     | 25.71    | 2136.0    |    0.784 | 0.684      |  1024 |\n| intfloat/multilingual-e5-large                              | 506.8     | 30.8     | 2135.9389 |    0.78  | 0.686      |  1024 |\n| deepvk/USER-base                                            | 33.1      | **12.2** | 473.2402      |    0.772 | 0.688      |   768 |    \n| sentence-transformers/paraphrase-multilingual-mpnet-base-v2 | **20.5**  | **19.9** | **1081.8485** |    0.762 |            |   768 |\n| intfloat/multilingual-e5-base                               | 130.61    | 14.39    | **1061.0**    |    0.761 | 0.669      |   768 |\n| sergeyzh/rubert-tiny-turbo                                  | **5.5**   | **3.3**  | **111.4**     |    0.749 | 0.667      |   312 |\n| intfloat/multilingual-e5-small                              | 40.86     | 12.09    | **449.0**     |    0.742 | 0.645      |   384 |\n| symanto/sn-xlm-roberta-base-snli-mnli-anli-xnli             | **20.2**  | **16.5** | **1081.8474** |    0.739 |            |   768 |\n| cointegrated/LaBSE-en-ru                                    | 133.4     | **15.3** | **489.6621**  |    0.739 | 0.668      |   768 |\n| sentence-transformers/LaBSE                                 | 135.1     | **13.3** | 1796.5078     |    0.739 | 0.667      |   768 |\n| MUSE-3                                                      | 200.1     | 30.7     | **303.0**     |    0.736 |            |   512 |\n| text-embedding-ada-002                                      | ?         |          |              |    0.734 |            |  1536 |\n| sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2 | **18.2**  | 14.9     | 479.2547      |    0.734 |            |   384 |\n| sentence-transformers/distiluse-base-multilingual-cased-v1  | **11.8**  | **8.0**  | 517.7452      |    0.722 |            |   512 |\n| SONAR                                                       | ?         | ?        | 3060.0        |    0.721 |            |  1024 |\n| facebook/nllb-200-distilled-600M                            | 252.3     | 15.9     | 1577.4828     |    0.709 | 0.64       |  1024 |\n| sentence-transformers/distiluse-base-multilingual-cased-v2  | **11.2**  | 9.2      | 517.7453      |    0.708 |            |   512 |\n| cointegrated/rubert-tiny2                                   | **5.5**   | **3.3**  | **111.3823**  |    0.704 | 0.638      |   312 |\n| ai-forever/sbert_large_mt_nlu_ru                            | 504.5     | 29.7     | 1628.6539     |    0.703 | 0.626      |  1024 |\n| laser                                                       | 192.5     | 13.5     | 200.0         |    0.699 |            |  1024 |\n| laser2                                                      | 163.4     | 8.6      | 175.0         |    0.694 |            |  1024 |\n| ai-forever/sbert_large_nlu_ru                               | 497.7     | 29.9     | 1628.6539     |    0.688 | 0.626      |  1024 |\n| clips/mfaq                                                  | 18.1      | 18.2     | 1081.8576     |    0.687 |            |   768 |\n| cointegrated/rut5-base-paraphraser                          | 137.0     | 15.6     | 412.0015      |    0.685 | 0.634      |   768 |\n| DeepPavlov/rubert-base-cased-sentence                       | 128.4     | 13.2     | 678.5215      |    0.678 | 0.612      |   768 |\n| DeepPavlov/distilrubert-base-cased-conversational           | 64.2      | 10.4     | 514.002       |    0.676 | 0.624      |   768 |\n| DeepPavlov/distilrubert-tiny-cased-conversational           | 21.2      | **3.3**  | 405.8292      |    0.67  | 0.616      |   768 |\n| cointegrated/rut5-base-multitask                            | 136.9     | 12.7     | 412.0015      |    0.668 | 0.623      |   768 |\n| ai-forever/ruRoberta-large                                  | 512.3     | 25.5     | 1355.7162     |    0.666 | 0.609      |  1024 |\n| DeepPavlov/rubert-base-cased-conversational                 | 127.5     | 16.3     | 678.5215      |    0.653 | 0.606      |   768 |\n| deepvk/deberta-v1-base                                      | 128.6     | 19.0     | 473.2402      |    0.653 | 0.591      |   768 |\n| cointegrated/rubert-tiny                                    | 7.5       | 5.9      | **44.97**     |    0.645 | 0.575      |   312 |\n| ai-forever/FRED-T5-large                                    | 479.4     | 23.3     | 1372.9988     |    0.639 | 0.551      |  1024 |\n| inkoziev/sbert_synonymy                                     | 6.9       | 4.2      | 111.3823      |    0.637 | 0.566      |   312 |\n| numind/NuNER-multilingual-v0.1                              | 186.9     | 10       | 678.0         |    0.633 | 0.572      |   768 |\n| cointegrated/rubert-tiny-toxicity                           | 10        | 5.5      | 47.2          |    0.621 | 0.553      |   312 |\n| ft_geowac_full                                              | **0.3**   |          | 1910.0        |    0.617 | 0.55       |   300 |\n| bert-base-multilingual-cased                                | 141.4     | 13.7     | 678.5215      |    0.614 | 0.565      |   768 |\n| ai-forever/ruT5-large                                       | 489.6     | 20.2     | 1277.7571     |    0.61  | 0.578      |  1024 |\n| cointegrated/rut5-small                                     | 37.6      | 8.6      | 111.3162      |    0.602 | 0.564      |   512 |\n| ft_geowac_21mb                                              | 1.2       |          | **21.0**      |    0.597 | 0.531      |   300 |\n| inkoziev/sbert_pq                                           | 7.4       | 4.2      | 111.3823      |    0.596 | 0.526      |   312 |\n| ai-forever/ruT5-base                                        | 126.3     | 12.8     | 418.2325      |    0.571 | 0.544      |   768 |\n| hashing_1000_char                                           | 0.5       |          | **1.0**       |    0.557 | 0.464      |  1000 |\n| cointegrated/rut5-base                                      | 127.8     | 15.5     | 412.0014      |    0.554 | 0.53       |   768 |\n| hashing_300_char                                            | 0.8       |          | 1.0           |    0.529 | 0.433      |   300 |\n| hashing_1000                                                | **0.2**   |          | 1.0           |    0.513 | 0.416      |  1000 |\n| hashing_300                                                 | 0.3       |          | 1.0           |    0.491 | 0.397      |   300 |\n\nРанжирование моделей по задачам.\nПодсвечены наилучшие модели по каждой из задач. \n\n| model                                                       | STS      | PI       | NLI      | SA       | TI       | IA       | IC       | ICX      | NE1      | NE2      |\n|:------------------------------------------------------------|:---------|:---------|:---------|:---------|:---------|:---------|:---------|:---------|:---------|:---------|\n| deepvk/USER-bge-m3                                          | **0.87** | **0.76** | 0.58     | **0.82** | 0.97    | 0.79     | 0.81     | **0.78** |0.28     | 0.43     |\n| BAAI/bge-m3                                                 | 0.86     | 0.75     | 0.51     | **0.82** | 0.97    | 0.79     | 0.81     | **0.78** | 0.24     | 0.42     |\n| intfloat/multilingual-e5-large-instruct                     | 0.86     | 0.74     | 0.47     | 0.81     | 0.98    | 0.8      | **0.82** | 0.77     | 0.21     | 0.35     |\n| intfloat/multilingual-e5-large                              | 0.86     | 0.73     | 0.47     | 0.81     | 0.98    | 0.8      | 0.82     | 0.77     | 0.24     | 0.37     |\n| deepvk/USER-base                                            | 0.85     | 0.74     | 0.48     | 0.81     | 0.99     | **0.81**  | 0.8    | 0.7      | 0.29     | 0.41     |\n| sentence-transformers/paraphrase-multilingual-mpnet-base-v2 | 0.85     | 0.66     | 0.54     | 0.79     | 0.95     | 0.78     | 0.79     | 0.74     |          |          |\n| intfloat/multilingual-e5-base                               | 0.83     | 0.7      | 0.46     | 0.8      | 0.96    | 0.78     | 0.8      | 0.74     | 0.23     | 0.38     |\n| sergeyzh/rubert-tiny-turbo                                  | 0.83     | 0.72     | 0.48     | 0.79     | 0.95    | 0.76     | 0.78      | 0.68     | 0.30     | 0.37     |\n| intfloat/multilingual-e5-small                              | 0.82     | 0.71     | 0.46     | 0.76     | 0.96    | 0.76     | 0.78     | 0.69     | 0.23     | 0.27     |\n| symanto/sn-xlm-roberta-base-snli-mnli-anli-xnli             | 0.76     | 0.6      | **0.86** | 0.76     | 0.91     | 0.72     | 0.71     | 0.6      |          |          |\n| cointegrated/LaBSE-en-ru                                    | 0.79     | 0.66     | 0.43     | 0.76     | 0.95     | 0.77     | 0.79     | 0.77     | 0.35     | 0.42     |\n| sentence-transformers/LaBSE                                 | 0.79     | 0.66     | 0.43     | 0.76     | 0.95     | 0.77     | 0.79     | 0.76     | 0.35     | 0.41     |\n| MUSE-3                                                      | 0.81     | 0.61     | 0.42     | 0.77     | 0.96     | 0.79     | 0.77     | 0.75     |          |          |\n| text-embedding-ada-002                                      | 0.78     | 0.66     | 0.44     | 0.77     | 0.96     | 0.77     | 0.75     | 0.73     |          |          |\n| sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2 | 0.84     | 0.62     | 0.5      | 0.76     | 0.92     | 0.74     | 0.77     | 0.72     |          |          |\n| sentence-transformers/distiluse-base-multilingual-cased-v1  | 0.8      | 0.6      | 0.43     | 0.75     | 0.94     | 0.76     | 0.76     | 0.74     |          |          |\n| SONAR                                                       | 0.71     | 0.58     | 0.41     | 0.77     | 0.98     | 0.79     | 0.78     | 0.74     |          |          |\n| facebook/nllb-200-distilled-600M                            | 0.71     | 0.54     | 0.41     | 0.76     | 0.95     | 0.76     | 0.8      | 0.75     | 0.31     | 0.42     |\n| sentence-transformers/distiluse-base-multilingual-cased-v2  | 0.79     | 0.55     | 0.42     | 0.75     | 0.91     | 0.75     | 0.76     | 0.73     |          |          |\n| cointegrated/rubert-tiny2                                   | 0.75     | 0.65     | 0.42     | 0.74     | 0.94     | 0.75     | 0.76     | 0.64     | 0.36     | 0.39     |\n| ai-forever/sbert_large_mt_nlu_ru                           | 0.78     | 0.65     | 0.4      | 0.8      | 0.98     | 0.8      | 0.76     | 0.45     | 0.3      | 0.34     |\n| laser                                                       | 0.75     | 0.6      | 0.41     | 0.73     | 0.96     | 0.72     | 0.72     | 0.7      |          |          |\n| laser2                                                      | 0.74     | 0.6      | 0.41     | 0.73     | 0.95     | 0.72     | 0.72     | 0.69     |          |          |\n| ai-forever/sbert_large_nlu_ru                              | 0.68     | 0.62     | 0.39     | 0.78     | 0.98     | 0.8      | 0.78     | 0.48     | 0.36     | 0.4      |\n| clips/mfaq                                                  | 0.63     | 0.59     | 0.35     | 0.79     | 0.95     | 0.74     | 0.76     | 0.69     |          |          |\n| cointegrated/rut5-base-paraphraser                          | 0.65     | 0.53     | 0.4      | 0.78     | 0.95     | 0.75     | 0.75     | 0.67     | 0.45     | 0.41     |\n| DeepPavlov/rubert-base-cased-sentence                       | 0.74     | 0.66     | 0.49     | 0.75     | 0.92     | 0.75     | 0.72     | 0.39     | 0.36     | 0.34     |\n| DeepPavlov/distilrubert-base-cased-conversational           | 0.7      | 0.56     | 0.39     | 0.76     | 0.98     | 0.78     | 0.76     | 0.48     | 0.4      | 0.43     |\n| DeepPavlov/distilrubert-tiny-cased-conversational           | 0.7      | 0.55     | 0.4      | 0.74     | 0.98     | 0.78     | 0.76     | 0.45     | 0.35     | 0.44     |\n| cointegrated/rut5-base-multitask                            | 0.65     | 0.54     | 0.38     | 0.76     | 0.95     | 0.75     | 0.72     | 0.59     | 0.47     | 0.41     |\n| ai-forever/ruRoberta-large                                 | 0.7      | 0.6      | 0.35     | 0.78     | 0.98     | 0.8      | 0.78     | 0.32     | 0.3      | **0.46** |\n| DeepPavlov/rubert-base-cased-conversational                 | 0.68     | 0.52     | 0.38     | 0.73     | 0.98     | 0.78     | 0.75     | 0.42     | 0.41     | 0.43     |\n| deepvk/deberta-v1-base                                      | 0.68     | 0.54     | 0.38     | 0.76     | 0.98     | 0.8      | 0.78     | 0.29     | 0.29     | 0.4      |\n| cointegrated/rubert-tiny                                    | 0.66     | 0.53     | 0.4      | 0.71     | 0.89     | 0.68     | 0.7      | 0.58     | 0.24     | 0.34     |\n| ai-forever/FRED-T5-large                                    | 0.62     | 0.44     | 0.37     | 0.78     | 0.98     | **0.81** | 0.67     | 0.45     | 0.25     | 0.15     |\n| inkoziev/sbert_synonymy                                     | 0.69     | 0.49     | 0.41     | 0.71     | 0.91     | 0.72     | 0.69     | 0.47     | 0.32     | 0.24     |\n| numind/NuNER-multilingual-v0.1                              | 0.67     | 0.53     | 0.4      | 0.71     | 0.89    | 0.72     | 0.7      | 0.46     | 0.32     | 0.34     |\n| cointegrated/rubert-tiny-toxicity                           | 0.57     | 0.44     | 0.37     | 0.68     | **1.0** | 0.78     | 0.7      | 0.43     | 0.24     | 0.32     |\n| ft_geowac_full                                              | 0.69     | 0.53     | 0.37     | 0.72     | 0.97     | 0.76     | 0.66     | 0.26     | 0.22     | 0.34     |\n| bert-base-multilingual-cased                                | 0.66     | 0.53     | 0.37     | 0.7      | 0.89     | 0.7      | 0.69     | 0.38     | 0.36     | 0.38     |\n| ai-forever/ruT5-large                                      | 0.51     | 0.39     | 0.35     | 0.77     | 0.97     | 0.79     | 0.72     | 0.38     | 0.46     | 0.44     |\n| cointegrated/rut5-small                                     | 0.61     | 0.53     | 0.34     | 0.73     | 0.92     | 0.71     | 0.7      | 0.27     | 0.44     | 0.38     |\n| ft_geowac_21mb                                              | 0.68     | 0.52     | 0.36     | 0.72     | 0.96     | 0.74     | 0.65     | 0.15     | 0.21     | 0.32     |\n| inkoziev/sbert_pq                                           | 0.57     | 0.41     | 0.38     | 0.7      | 0.92     | 0.69     | 0.68     | 0.43     | 0.26     | 0.24     |\n| ai-forever/ruT5-base                                       | 0.5      | 0.28     | 0.34     | 0.73     | 0.97     | 0.76     | 0.7      | 0.29     | 0.45     | 0.41     |\n| hashing_1000_char                                           | 0.7      | 0.53     | 0.4      | 0.7      | 0.84     | 0.59     | 0.63     | 0.05     | 0.05     | 0.14     |\n| cointegrated/rut5-base                                      | 0.44     | 0.28     | 0.33     | 0.74     | 0.92     | 0.75     | 0.58     | 0.39     | **0.48** | 0.39     |\n| hashing_300_char                                            | 0.69     | 0.51     | 0.39     | 0.67     | 0.75     | 0.57     | 0.61     | 0.04     | 0.03     | 0.08     |\n| hashing_1000                                                | 0.63     | 0.49     | 0.39     | 0.66     | 0.77     | 0.55     | 0.57     | 0.05     | 0.02     | 0.04     |\n| hashing_300                                                 | 0.61     | 0.48     | 0.4      | 0.64     | 0.71     | 0.54     | 0.5      | 0.05     | 0.02     | 0.02     |\n\n#### Задачи\n- Semantic text similarity (**STS**) на основе переведённого датасета [STS-B](https://huggingface.co/datasets/stsb_multi_mt);\n- Paraphrase identification (**PI**) на основе датасета paraphraser.ru;\n- Natural language inference (**NLI**) на датасете [XNLI](https://github.com/facebookresearch/XNLI);\n- Sentiment analysis (**SA**) на данных [SentiRuEval2016](http://www.dialog-21.ru/evaluation/2016/sentiment/).\n- Toxicity identification (**TI**) на датасете токсичных комментариев из [OKMLCup](https://cups.mail.ru/ru/contests/okmlcup2020);\n- Inappropriateness identification (**IA**) на [датасете Сколтеха](https://github.com/skoltech-nlp/inappropriate-sensitive-topics);\n- Intent classification (**IC**) и её кросс-язычная версия **ICX** на датасете [NLU-evaluation-data](https://github.com/xliuhw/NLU-Evaluation-Data), который я автоматически перевёл на русский. В IC классификатор обучается на русских данных, а в ICX – на английских, а тестируется в обоих случаях на русских.\n- Распознавание именованных сущностей на датасетах [factRuEval-2016](https://github.com/dialogue-evaluation/factRuEval-2016) (**NE1**) и [RuDReC](https://github.com/cimm-kzn/RuDReC) (**NE2**). Эти две задачи требуют получать эмбеддинги отдельных токенов, а не целых предложений; поэтому там участвуют не все модели.\n\n### Changelog\n* Август 2023 - обновил рейтинг:\n   * поправив ошибку в вычислении mean token embeddings\n   * добавил несколько моделей, включая нового лидера - `intfloat/multilingual-e5-large`\n   * по просьбам трудящихся, добавил `text-embedding-ada-002` (размер и производительность указаны от балды)\n* Лето 2022 - опубликовал первый рейтинг\n* \n\n## Цитирование\n\nЕсли вы упоминаете бенчмарк в научной работе, можете сослаться на мой пост на Хабре следующим образом (bibtex):\n```\n@misc{dale_encodechka, \n   author = \"Dale, David\",\n   title  = \"Рейтинг русскоязычных энкодеров предложений\", \n   editor = \"habr.com\", \n   url    = \"https://habr.com/ru/articles/669674/\", \n   month  = {June},\n   year   = {2022},   \n   note = {[Online; posted 12-June-2022]},\n}\n```\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Favidale%2Fencodechka","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Favidale%2Fencodechka","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Favidale%2Fencodechka/lists"}