An open API service indexing awesome lists of open source software.

https://github.com/t-systems-on-site-services-gmbh/german-stsbenchmark

German translation of the STSbenchmark dataset
https://github.com/t-systems-on-site-services-gmbh/german-stsbenchmark

Last synced: 2 months ago
JSON representation

German translation of the STSbenchmark dataset

Awesome Lists containing this project

README

        

# German STSbenchmark Dataset
This is the German translation of the [STSbenchmark dataset](https://ixa2.si.ehu.es/stswiki/index.php/STSbenchmark).

It can be used to train German [sentence embeddings](https://github.com/UKPLab/sentence-transformers) like [T-Systems-onsite/bert-german-dbmdz-uncased-sentence-stsb](https://huggingface.co/T-Systems-onsite/bert-german-dbmdz-uncased-sentence-stsb).

## Different Translations
One translation has been done with [deepl.com](https://www.deepl.com/), the other with [Amazon Translate](https://aws.amazon.com/translate/). They can be combined to get some more variation in the data.