Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

https://github.com/openlegaldata/awesome-legal-data

Collection of Datasets for Legal Text Processing
https://github.com/openlegaldata/awesome-legal-data

List: awesome-legal-data

Last synced: 3 months ago
JSON representation

Collection of Datasets for Legal Text Processing

Awesome Lists containing this project

README

        

# awesome-legal-data

[![Awesome](https://cdn.rawgit.com/sindresorhus/awesome/d7305f38d29fed78fa85652e3a63e154dd8e8829/media/badge.svg)](https://github.com/sindresorhus/awesome)

A curated list of resources dedicated to legal data.
The collection contains data sets, tools and other links related to the legal domain.
Most resources are openly available.

## United States

- [Caselaw Access Project by Harvard Law School](https://case.law/)
- [CourtListner](https://courtlistener.com) - Search millions of opinions by case name, topic, or citation. 403 Jurisdictions. Sponsored by the Non-Profit [Free Law Project](https://free.law).
- [H2O Open Case Book](https://opencasebook.org/)

## UK

- [British and Irish Legal Information Institute](http://www.bailii.org/)

## Canada

- [Canadian Legal Information Institute](https://www.canlii.org/en/)

## Australia

- [Australasian Legal Information Institute](http://www.austlii.edu.au/)
- [Open Australian Legal Corpus: The First Multijurisdictional Open Corpus of Australian Legislative and Judicial Documents](https://huggingface.co/datasets/umarbutler/open-australian-legal-corpus)

## Germany

- [OpenJur](https://openjur.de/)
- [Open Legal Data](https://openlegaldata.io/)
- [A Dataset of German Legal Documents for Named Entity Recognition (Lynx Project)](https://github.com/elenanereiss/Legal-Entity-Recognition)
- [GerDaLIR: A German Dataset for Legal Information Retrieval](https://github.com/lavis-nlp/GerDaLIR) [(Paper)](https://aclanthology.org/2021.nllp-1.13.pdf)
- [gesp: Download all available German court decisions straight from the command line](https://github.com/niklaswais/gesp)
- [German Legal Sentences (GLS): Semantic sentence matching and citation recommendation](https://huggingface.co/datasets/lavis-nlp/german_legal_sentences)

## Switzerland

- [entscheidsuche.ch](https://entscheidsuche.ch/)
- [Swiss-Judgment-Prediction: A Multilingual Legal Judgment Prediction Benchmark](https://arxiv.org/abs/2110.00806)

## Netherlands

- [rechtspraak.nl](https://www.rechtspraak.nl/)

## Norway

- [rettspraksis.no](https://rettspraksis.no/wiki/Forside)

## Poland

- [mojeprawo.io](https://mojeprawo.io/)

## Czech

- [Czech Court Decisions Corpus](https://lindat.mff.cuni.cz/repository/xmlui/handle/11372/LRT-3052) [(Paper)](https://arxiv.org/pdf/1910.09513.pdf)

## Finland

- [FinLex](https://www.finlex.fi/en/)

## France

- [Legifrance](https://www.legifrance.gouv.fr/Traductions/en-English)

## EU

- [EUR-Lex](https://eur-lex.europa.eu/)
- [MultiEURLEX - A multi-lingual and multi-label legal document classification dataset for zero-shot cross-lingual transfer](https://arxiv.org/abs/2109.00904)
- [Mining Legal Arguments in Court Decisions - Data and software (European Court of Human Rights (ECHR))](https://github.com/trusthlt/mining-legal-arguments)

## Japan

- [Competition on Legal Information Extraction/Entailment (COLIEE 2020)](https://sites.ualberta.ca/~rabelo/COLIEE2020/)

---

## Other datasets

- [LexGLUE: A Benchmark Dataset for Legal Language Understanding in English](https://github.com/coastalcph/lex-glue)
- [PileOfLaw](https://github.com/Breakend/PileOfLaw)

## Tools

- [Blackstone - A spaCy pipeline and model for NLP on unstructured legal text.](https://github.com/ICLRandD/Blackstone)
- [Pseudo-anonymization of French legal cases](https://github.com/ELS-RD/anonymisation)
- [Scripts to crawl English legal corpora](https://github.com/iliaschalkidis/LegalCrawler)
- [LEGAL-BERT: The Muppets straight out of Law School](https://arxiv.org/abs/2010.02559)
- [Law-OMNI-BERT-Project](https://github.com/Lukas-Justen/Law-OMNI-BERT-Project)

## Other links

- [Liquid-Legal-Institute/Legal-Text-Analytics](https://github.com/Liquid-Legal-Institute/Legal-Text-Analytics)
- [Natural Legal Language Processing Workshop](https://nllpw.org/)

## License

[![CC0](http://mirrors.creativecommons.org/presskit/buttons/88x31/svg/cc-zero.svg)](https://creativecommons.org/publicdomain/zero/1.0/)