https://github.com/t-systems-on-site-services-gmbh/german-stsbenchmark
German translation of the STSbenchmark dataset
https://github.com/t-systems-on-site-services-gmbh/german-stsbenchmark
Last synced: 2 months ago
JSON representation
German translation of the STSbenchmark dataset
- Host: GitHub
- URL: https://github.com/t-systems-on-site-services-gmbh/german-stsbenchmark
- Owner: t-systems-on-site-services-gmbh
- License: other
- Created: 2020-07-23T18:39:07.000Z (almost 5 years ago)
- Default Branch: master
- Last Pushed: 2020-08-12T20:01:10.000Z (almost 5 years ago)
- Last Synced: 2025-01-21T13:11:18.250Z (4 months ago)
- Homepage:
- Size: 863 KB
- Stars: 6
- Watchers: 3
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# German STSbenchmark Dataset
This is the German translation of the [STSbenchmark dataset](https://ixa2.si.ehu.es/stswiki/index.php/STSbenchmark).It can be used to train German [sentence embeddings](https://github.com/UKPLab/sentence-transformers) like [T-Systems-onsite/bert-german-dbmdz-uncased-sentence-stsb](https://huggingface.co/T-Systems-onsite/bert-german-dbmdz-uncased-sentence-stsb).
## Different Translations
One translation has been done with [deepl.com](https://www.deepl.com/), the other with [Amazon Translate](https://aws.amazon.com/translate/). They can be combined to get some more variation in the data.