{"id":13724570,"url":"https://github.com/lter/lterdatasampler","last_synced_at":"2025-07-06T15:05:37.516Z","repository":{"id":41052273,"uuid":"377568895","full_name":"lter/lterdatasampler","owner":"lter","description":"LTER data samples to teach environmental data science","archived":false,"fork":false,"pushed_at":"2023-10-03T18:53:21.000Z","size":47113,"stargazers_count":51,"open_issues_count":16,"forks_count":7,"subscribers_count":3,"default_branch":"main","last_synced_at":"2025-06-10T10:49:46.897Z","etag":null,"topics":["data-science","ecology","lter-science","r","r-package"],"latest_commit_sha":null,"homepage":"https://lter.github.io/lterdatasampler/","language":"R","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/lter.png","metadata":{"files":{"readme":"README.Rmd","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null}},"created_at":"2021-06-16T16:59:13.000Z","updated_at":"2025-04-07T19:50:31.000Z","dependencies_parsed_at":"2023-09-24T15:06:16.121Z","dependency_job_id":"6bdfcb83-bccf-4e70-bc2c-de6e6ef41618","html_url":"https://github.com/lter/lterdatasampler","commit_stats":{"total_commits":179,"total_committers":6,"mean_commits":"29.833333333333332","dds":0.3966480446927374,"last_synced_commit":"ff784768e090bbbb3e49690c8b679ec3d02b8e66"},"previous_names":[],"tags_count":1,"template":false,"template_full_name":null,"purl":"pkg:github/lter/lterdatasampler","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/lter%2Flterdatasampler","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/lter%2Flterdatasampler/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/lter%2Flterdatasampler/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/lter%2Flterdatasampler/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/lter","download_url":"https://codeload.github.com/lter/lterdatasampler/tar.gz/refs/heads/main","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/lter%2Flterdatasampler/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":259981382,"owners_count":22941143,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["data-science","ecology","lter-science","r","r-package"],"created_at":"2024-08-03T01:01:59.589Z","updated_at":"2025-07-06T15:05:37.461Z","avatar_url":"https://github.com/lter.png","language":"R","funding_links":[],"categories":["Biosphere"],"sub_categories":["Conservation and Restoration"],"readme":"---\noutput: github_document\neditor_options: \n  markdown: \n    wrap: 72\n---\n\n```{r setup, include=FALSE}\nknitr::opts_chunk$set(\n  collapse = TRUE,\n  comment = \"#\u003e\",\n  fig.path = \"man/figures/README-\",\n  out.width = \"75%\",\n  warning = FALSE,\n  message = FALSE,\n  fig.retina = 2,\n  fig.align = 'center'\n)\n\nlibrary(gt)\nlibrary(tidyverse)\n```\n\n\u003c!-- badges: start --\u003e\n\n[![Package\nSite](https://github.com/lter/lterdatasampler/workflows/pkgdown/badge.svg)](https://github.com/lter/lterdatasampler/actions)\n[![R-CMD-check](https://github.com/lter/lterdatasampler/workflows/R-CMD-check/badge.svg)](https://github.com/lter/lterdatasampler/actions)\n[![CRAN status](https://www.r-pkg.org/badges/version/lterdatasampler)](https://cran.r-project.org/package=lterdatasampler)\n[![CRAN RStudio mirror downloads](http://cranlogs.r-pkg.org/badges/lterdatasampler)](http://www.r-pkg.org/pkg/lterdatasampler)\n\n\n\u003c!-- badges: end --\u003e\n\n# lterdatasampler \u003ca href='https://lter.github.io/lterdatasampler/'\u003e\u003cimg src=\"man/figures/logo.png\" id=\"home_logo\" align=\"right\" height=\"180\"/\u003e\u003c/a\u003e\n\nThe mission of the [Long Term Ecological Research program (LTER)\nNetwork](https://lternet.edu/) is to “*provide the scientific community,\npolicy makers, and society with the knowledge and predictive\nunderstanding necessary to conserve, protect, and manage the nation’s\necosystems, their biodiversity, and the services they provide.*” A\nspecific goal of the LTER is [education and\ntraining](https://lternet.edu/education-and-training/) - “*to promote\ntraining, teaching, and learning about long-term ecological research and\nthe Earth’s ecosystems, and to educate a new generation of scientists.*”\n\nThe goal of this package is to provide a sampler to gather feedback from\nthe community of what will be a larger package containing 28 datasets -\none from each of the existing [US LTER\nsites](https://lternet.edu/site/). Those datasets are subsets of the\noriginal data and have been updated - sometimes substantially - from the\nraw data. They are aimed to be useful for teaching and training in\nenvironmental data science. **This content is thus not suitable for\nresearch and should only be used for teaching purposes**.\n\nWe encourage you to explore existing LTER [teaching and training\ninitiatives](https://lternet.edu/education-and-training/), and the\n**many** other available LTER datasets which can be accessed via the\n[Environmental Data\nInitiative](https://edirepository.org/). Please contact\ncited researchers directly to discuss using data for research purposes\nor in publication.\n\n## Installation\n\nYou can install the CRAN version of `lterdatasampler` with:\n\n``` r\ninstall.packages(\"lterdatasampler\")\n```\n\n\nYou can install the development version of `lterdatasampler` from\nGitHub with:\n\n``` r\n# install.packages(\"remotes\")\nremotes::install_github(\"lter/lterdatasampler\")\n```\n\n## The dataset samples\n\nDataset samples currently included in the package are summarized below;\nsee individual Articles for data and source details. Note: the three\nletter prefix for each dataset indicates the LTER site (see full list of\n[site abbreviations](https://lternet.edu/site/)).\n\n-   [`and_vertebrates`](https://lter.github.io/lterdatasampler/reference/and_vertebrates.html):\n    Records for aquatic vertebrates (cutthroat trout and salamanders) in\n    Mack Creek, Andrews Experimental Forest, Oregon (1987 - present)\n-   [`arc_weather`](https://lter.github.io/lterdatasampler/reference/arc_weather.html):\n    Daily meteorological (e.g. air temperature, precipitation) records\n    from Toolik Field Station, Alaska (1988 - present)\n-   [`hbr_maples`](https://lter.github.io/lterdatasampler/reference/hbr_maples.html):\n    Sugar maple seedlings at Hubbard Brook Experimental Forest (New\n    Hampshire) in calcium-treated and reference watersheds in August\n    2003 and June 2004\n-   [`knz_bison`](https://lter.github.io/lterdatasampler/reference/knz_bison.html):\n    Bison masses recorded for the herd at Konza Prairie Biological\n    Station LTER\n-   [`luq_streamchem`](https://lter.github.io/lterdatasampler/reference/luq_streamchem.html):\n    stream chemistry data for the Quebrada Sonadora (QS) location part of the Luqillo tropical\n    forest LTER site\n-   [`ntl_icecover`](https://lter.github.io/lterdatasampler/reference/ntl_icecover.html):\n    Ice freeze and thaw dates for Madison, Wisconsin Area lakes (1853 -\n    2019), North Temperate Lakes LTER\n-   [`ntl_airtemp`](https://lter.github.io/lterdatasampler/reference/ntl_airtemp.html):\n    Daily average air temperature data for Madison, Wisconsin (1869 -\n    2019), North Temperate Lakes LTER\n-   [`nwt_pikas`](https://lter.github.io/lterdatasampler/reference/nwt_pikas.html):\n    Pika observations for habitat and stress analysis at Niwot Ridge\n    LTER, Colorado\n-   [`pie_crab`](https://lter.github.io/lterdatasampler/reference/pie_crab.html):\n    Fiddler crab body size recorded summer 2016 in salt marshes from\n    Florida to Massachusetts including Plum Island Ecosystem LTER,\n    Virginia Coast LTER, and NOAA’s National Estuarine Research Reserve\n    System\n\n## Which data sample should I use?\n\nThese data samples are selected because they have features we feel are\ncommonly useful in introductory environmental data science and\nstatistics courses.\n\nIn the table below, we list some introductory methods / skills, then\nshare which data samples in this package we think are well-suited to use\nwhen teaching or learning them! It is not comprehensive - there are\n*many* different analyses \u0026 skills that these data samples would\nfacilitate. Here we highlight a few that we think would be commonly\nuseful\n\n```{r, echo = FALSE}\n# Create the table contents\ntable_contents \u003c- tribble(\n  ~method, ~datasample, ~data_description, ~link,\n  \"Linear relationships\", \"`pie_crab`\", \"Model the relationship between fiddler crab size and latitude using `pie_crab` , while learning about Bergmann's Rule!\", \"https://lter.github.io/lterdatasampler/articles/pie_crab_vignette.html\",\n  \"Linear relationships\", \"`ntl_icecover`\", \"Investigate the relationship between winter temperatures and ice cover duration for Wisconsin lakes using `ntl_icecover`\", \"https://lter.github.io/lterdatasampler/articles/ntl_icecover_vignette.html\",\n  \"Linear relationships\", \"`hbr_maples`\", \"Explore seedling height-mass relationships for sugar maples using `hbr_maples`\", \"https://lter.github.io/lterdatasampler/articles/hbr_maples_vignette.html\",\n  \"Non-linear relationships\", \"`knz_bison`\", \"Model the relationship between bison age and mass for male and female bison using `knz_bison`, for example estimating parameters in the Gompertz model\", \"https://lter.github.io/lterdatasampler/articles/knz_bison_vignette.html\",\n  \"Non-linear relationships\", \"`and_vertebrates`\", \"Model the length-mass relationships for cutthroat trout and salamanders in Mack Creek, Oregon\", \"https://lter.github.io/lterdatasampler/articles/and_vertebrates_vignette.html\", \n  \"Time series analysis\", \"`arc_weather`\", \"Explore seasonality, wrangling dates, or practice forecasting using daily meteorological records from Toolik Station, Alaska\", \"https://lter.github.io/lterdatasampler/articles/arc_weather_vignette.html\", \n  \"Time series analysis\", \"`luq_streamchem`\", \"Investigate the impact of a hurricane on stream water chemistry\", \"https://lter.github.io/lterdatasampler/articles/luq_streamchem_vignette.html\",\n  \"Spatial data introduction\", \"`nwt_pikas`\", \"Introduce basics of spatial data (e.g. CRS, projections) and tools for working with spatial data by visualizing pika locations at Niwot Ridge in the Colorado Rockies\", \"https://lter.github.io/lterdatasampler/articles/nwt_pikas_vignette.html\", \n  \"Comparing groups\", \"`hbr_maples`\", \"Compare sugar maple seedling heights in previously calcium-treated versus untreated watersheds using `hbr_maples`, using the exercise as an opportunity to think about acid rain and soil acidification\", \"https://lter.github.io/lterdatasampler/articles/hbr_maples_vignette.html\", \n  \"Comparing groups\", \"`and_vertebrates`\", \"Explore differences in size and abundance of cutthroat trout and salamanders in old growth versus previously clear cut forest sections (2 groups) or in different conditions (\u003e 2 groups, e.g. pool, cascade, riffle) of Mack Creek, Oregon\", \"https://lter.github.io/lterdatasampler/articles/and_vertebrates_vignette.html\"\n) %\u003e% \n  mutate(full_link = sprintf('\u003ca href = \"%s\"\u003e%s\u003c/a\u003e', link, datasample),\n         full_link = map(full_link, gt::html))\n\n\ntable_contents %\u003e% \n  select(method, full_link, data_description) %\u003e% \n  gt(groupname_col = \"method\") %\u003e% \n  tab_header(\n    title = \"Recommended data samples for introducing selected topics\",\n  ) %\u003e% \n  cols_label(method = \"Topic\", full_link = \"Data sample\", data_description = \"For example you could:\") %\u003e%\n  tab_options(row_group.as_column = TRUE) %\u003e% \n  tab_style(\n    style = \"vertical-align:middle\",\n    locations = cells_row_groups()\n  ) \n  \n```\n\n## How to provide feedback\n\nThe best way to provide feedback on this package is to open an\n[issue](https://github.com/lter/lterdatasampler/issues) and assign the\n`feedback` label. Thank you!\n\n## Acknowledgements\n\nThank you to the amazing students who contributed to this project: *Sam\nGuo, Adhitya Logan, Lia Ran, Sophia Sternberg, Karen Zhao* as part of\ntheir [UCSB Data Science capstone\nproject](https://ucsb-ds-capstone-2021.github.io/projects/nceas/update3.html).\nThank you also go to their Course Advisor Prof. Sang-yun Oh.\n\nPeople / organizations who supported this project:\n\n-   LTER Network Office\n-   LTER Information Managers\n-   LTER Education Committee\n-   All the LTER Researchers and Site PIs\n-   Cyber-infrastructures:\n    [EDI](https://edirepository.org/) and\n    [DataONE](https://www.dataone.org/)\n\nWe gratefully acknowledge all authors and contributors of the\n[`roxygen2`](https://roxygen2.r-lib.org/),\n[`usethis`](https://usethis.r-lib.org/),\n[`pkgdown`](https://pkgdown.r-lib.org/),\n[`devtools`](https://devtools.r-lib.org/),\n[`tidyverse`](https://www.tidyverse.org/) and\n[`metajam`](https://github.com/NCEAS/metajam/) packages. This website\nrelies heavily on themes created by Dr. Desirée DeLeon and Dr. Alison\nHill.\n\n\u003chr\u003e\n\n\u003cimg src=\"man/figures/institutions_logo.png\" width=\"100%\" align=\"center\" style=\"display: block; margin: auto;\"/\u003e\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Flter%2Flterdatasampler","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Flter%2Flterdatasampler","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Flter%2Flterdatasampler/lists"}