https://github.com/openlegaldata/awesome-legal-data
Collection of Datasets for Legal Text Processing
List: awesome-legal-data
Last synced: 3 months ago
- Host: GitHub
- URL: https://github.com/openlegaldata/awesome-legal-data
- Owner: openlegaldata
- Created: 2019-02-19T11:15:03.000Z (over 5 years ago)
- Default Branch: master
- Last Pushed: 2023-06-26T14:46:26.000Z (over 1 year ago)
- Last Synced: 2024-05-20T08:00:50.899Z (6 months ago)
- Homepage: https://openlegaldata.io
- Size: 18.6 KB
- Stars: 69
- Watchers: 6
- Forks: 3
- Open Issues: 1
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
- ultimate-awesome - awesome-legal-data - Collection of Datasets for Legal Text Processing. (Other Lists / PowerShell Lists)
README
# awesome-legal-data
[![Awesome](https://cdn.rawgit.com/sindresorhus/awesome/d7305f38d29fed78fa85652e3a63e154dd8e8829/media/badge.svg)](https://github.com/sindresorhus/awesome)
A curated list of resources dedicated to legal data.
The collection contains data sets, tools and other links related to the legal domain.
Most resources are openly available.

## United States
- [Caselaw Access Project by Harvard Law School](https://case.law/)
- [CourtListener](https://courtlistener.com) - Search millions of opinions by case name, topic, or citation. 403 jurisdictions. Sponsored by the non-profit [Free Law Project](https://free.law).
- [H2O Open Case Book](https://opencasebook.org/)
## UK
- [British and Irish Legal Information Institute](http://www.bailii.org/)
## Canada
- [Canadian Legal Information Institute](https://www.canlii.org/en/)
## Australia
- [Australasian Legal Information Institute](http://www.austlii.edu.au/)
- [Open Australian Legal Corpus: The First Multijurisdictional Open Corpus of Australian Legislative and Judicial Documents](https://huggingface.co/datasets/umarbutler/open-australian-legal-corpus)
## Germany
- [OpenJur](https://openjur.de/)
- [Open Legal Data](https://openlegaldata.io/)
- [A Dataset of German Legal Documents for Named Entity Recognition (Lynx Project)](https://github.com/elenanereiss/Legal-Entity-Recognition)
- [GerDaLIR: A German Dataset for Legal Information Retrieval](https://github.com/lavis-nlp/GerDaLIR) [(Paper)](https://aclanthology.org/2021.nllp-1.13.pdf)
- [gesp: Download all available German court decisions straight from the command line](https://github.com/niklaswais/gesp)
- [German Legal Sentences (GLS): Semantic sentence matching and citation recommendation](https://huggingface.co/datasets/lavis-nlp/german_legal_sentences)
## Switzerland
- [entscheidsuche.ch](https://entscheidsuche.ch/)
- [Swiss-Judgment-Prediction: A Multilingual Legal Judgment Prediction Benchmark](https://arxiv.org/abs/2110.00806)
## Netherlands
- [rechtspraak.nl](https://www.rechtspraak.nl/)
## Norway
- [rettspraksis.no](https://rettspraksis.no/wiki/Forside)
## Poland
- [mojeprawo.io](https://mojeprawo.io/)
## Czech Republic
- [Czech Court Decisions Corpus](https://lindat.mff.cuni.cz/repository/xmlui/handle/11372/LRT-3052) [(Paper)](https://arxiv.org/pdf/1910.09513.pdf)
## Finland
- [FinLex](https://www.finlex.fi/en/)
## France
- [Legifrance](https://www.legifrance.gouv.fr/Traductions/en-English)
## EU
- [EUR-Lex](https://eur-lex.europa.eu/)
- [MultiEURLEX - A multi-lingual and multi-label legal document classification dataset for zero-shot cross-lingual transfer](https://arxiv.org/abs/2109.00904)
- [Mining Legal Arguments in Court Decisions - Data and software (European Court of Human Rights (ECHR))](https://github.com/trusthlt/mining-legal-arguments)
## Japan
- [Competition on Legal Information Extraction/Entailment (COLIEE 2020)](https://sites.ualberta.ca/~rabelo/COLIEE2020/)
---
## Other datasets
- [LexGLUE: A Benchmark Dataset for Legal Language Understanding in English](https://github.com/coastalcph/lex-glue)
- [PileOfLaw](https://github.com/Breakend/PileOfLaw)
## Tools
- [Blackstone - A spaCy pipeline and model for NLP on unstructured legal text.](https://github.com/ICLRandD/Blackstone)
- [Pseudo-anonymization of French legal cases](https://github.com/ELS-RD/anonymisation)
- [Scripts to crawl English legal corpora](https://github.com/iliaschalkidis/LegalCrawler)
- [LEGAL-BERT: The Muppets straight out of Law School](https://arxiv.org/abs/2010.02559)
- [Law-OMNI-BERT-Project](https://github.com/Lukas-Justen/Law-OMNI-BERT-Project)
## Other links
- [Liquid-Legal-Institute/Legal-Text-Analytics](https://github.com/Liquid-Legal-Institute/Legal-Text-Analytics)
- [Natural Legal Language Processing Workshop](https://nllpw.org/)
## License
[![CC0](http://mirrors.creativecommons.org/presskit/buttons/88x31/svg/cc-zero.svg)](https://creativecommons.org/publicdomain/zero/1.0/)