https://github.com/paperswithcode/paperswithcode-data
The full dataset behind paperswithcode.com
https://github.com/paperswithcode/paperswithcode-data
Last synced: 8 months ago
JSON representation
The full dataset behind paperswithcode.com
- Host: GitHub
- URL: https://github.com/paperswithcode/paperswithcode-data
- Owner: paperswithcode
- Created: 2019-01-31T15:23:49.000Z (almost 7 years ago)
- Default Branch: master
- Last Pushed: 2021-10-08T09:43:41.000Z (about 4 years ago)
- Last Synced: 2025-04-15T06:43:46.618Z (8 months ago)
- Size: 2.93 KB
- Stars: 341
- Watchers: 19
- Forks: 34
- Open Issues: 14
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
- awesome-scholarly-data-analysis - Papers with Code - Links between papers and repositories and extraction of SOTA results
README
# Papers with code datasets
You can download the full dataset behind [paperswithcode.com](https://paperswithcode.com) here:
Download links for the data dumps are:
- [All papers with abstracts](https://production-media.paperswithcode.com/about/papers-with-abstracts.json.gz)
- [Links between papers and code](https://production-media.paperswithcode.com/about/links-between-papers-and-code.json.gz)
- [Evaluation tables](https://production-media.paperswithcode.com/about/evaluation-tables.json.gz)
- [Methods](https://production-media.paperswithcode.com/about/methods.json.gz)
- [Datasets](https://production-media.paperswithcode.com/about/datasets.json.gz)
The last JSON is in the [sota-extractor](https://github.com/paperswithcode/sota-extractor) format and the code
from there can be used to load in the JSON into a set of Python classes.
At the moment, data is regenerated daily.
Part of the data is coming from the sources listed in the [sota-extractor README](https://github.com/paperswithcode/sota-extractor).
## Licence
All data is licenced under [CC-BY-SA](https://creativecommons.org/licenses/by-sa/4.0/).