{"id":18289152,"url":"https://github.com/quanteda/quanteda.corpora","last_synced_at":"2025-04-05T09:31:36.329Z","repository":{"id":56421243,"uuid":"114780415","full_name":"quanteda/quanteda.corpora","owner":"quanteda","description":"A collection of corpora for quanteda","archived":false,"fork":false,"pushed_at":"2020-11-09T06:10:16.000Z","size":209698,"stargazers_count":19,"open_issues_count":3,"forks_count":5,"subscribers_count":7,"default_branch":"master","last_synced_at":"2025-03-21T02:21:23.916Z","etag":null,"topics":["quanteda","text-analysis"],"latest_commit_sha":null,"homepage":"","language":"R","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/quanteda.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2017-12-19T15:19:05.000Z","updated_at":"2024-11-06T10:41:55.000Z","dependencies_parsed_at":"2022-08-15T18:20:45.871Z","dependency_job_id":null,"html_url":"https://github.com/quanteda/quanteda.corpora","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/quanteda%2Fquanteda.corpora","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/quanteda%2Fquanteda.corpora/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/quanteda%2Fquanteda.corpora/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/quanteda%2Fquanteda.corpora/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/quanteda","download_url":"https://codeload.github.com/quanteda/quanteda.corpora/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":247318228,"owners_count":20919456,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["quanteda","text-analysis"],"created_at":"2024-11-05T14:04:55.277Z","updated_at":"2025-04-05T09:31:31.313Z","avatar_url":"https://github.com/quanteda.png","language":"R","funding_links":[],"categories":[],"sub_categories":[],"readme":"[![CRAN status](https://www.r-pkg.org/badges/version/quanteda.corpora)](https://cran.r-project.org/package=quanteda.corpora)\n[![Travis build status](https://travis-ci.org/quanteda/quanteda.corpora.svg?branch=master)](https://travis-ci.org/quanteda/quanteda.corpora)\n\n# Corpora for quanteda\n\nPackage to provide easy access to large corpora for [**quanteda**](http://github.com/quanteda/quanteda).\n\n## How to Install\n\nYou can download the files and build the package from source, or you can use the devtools library to install the package directly from GitHub. This is done as follows:\n\n```r\ndevtools::install_github(\"quanteda/quanteda.corpora\")\n```\n\n## Available corpora\n\nCorpora contained in the package are the following:\n\nCorpus | Name\n--|--\nAmicus curiae briefs from Bakke (1978) and Bollinger (2008) | data_corpus_amicus\nAnnual budget speeches from the Irish Dáil, 2008-2012 | data_corpus_irishbudgets\nUK news articles from 2014 that mention immigration | data_corpus_immigrationnews\nMovie reviews from Pang, Lee, and Vaithyanathan (2002) | _moved to_ **quanteda.textmodels** \nUS State of the Union addresses from 1790 to present | data_corpus_sotu\nUK political party manifestos, 1945-2005 | data_corpus_ukmanifestos\nUN General Debate speeches, 2017 | data_corpus_ungd2017\nUniversal Declaration of Human Rights in 464 languages | data_corpus_udhr\n\nLarger corpora are also available from online locations using `download()`:\n\nCorpus | Name\n--|--\n_Guardian_ newspaper articles in politics, economy, society and international sections from 2012 to 2016 | data_corpus_guardian\nTranscripts of speeches at Japan's Committee on Foreign Affairs and Defense of the lower house (Shugiin) from 1947 to 2017 | data_corpus_foreignaffairscommittee\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fquanteda%2Fquanteda.corpora","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fquanteda%2Fquanteda.corpora","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fquanteda%2Fquanteda.corpora/lists"}