{"id":24319389,"url":"https://github.com/btskinner/downloadipeds","last_synced_at":"2025-09-27T04:31:27.892Z","repository":{"id":84089671,"uuid":"76143190","full_name":"btskinner/downloadipeds","owner":"btskinner","description":"Script to batch download IPEDS survey files","archived":false,"fork":false,"pushed_at":"2024-12-20T22:09:40.000Z","size":23,"stargazers_count":15,"open_issues_count":1,"forks_count":5,"subscribers_count":4,"default_branch":"main","last_synced_at":"2024-12-20T23:20:13.206Z","etag":null,"topics":["batch-download","ipeds","r","sas","spss","stata"],"latest_commit_sha":null,"homepage":null,"language":"R","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/btskinner.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2016-12-10T23:20:49.000Z","updated_at":"2024-12-20T22:09:44.000Z","dependencies_parsed_at":"2024-12-20T23:20:11.754Z","dependency_job_id":"c6912f3f-906f-4435-930a-7a4f4fd6f90a","html_url":"https://github.com/btskinner/downloadipeds","commit_stats":null,"previous_names":[],"tags_count":8,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/btskinner%2Fdownloadipeds","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/btskinner%2Fdownloadipeds/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/btskinner%2Fdownloadipeds/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/btskinner%2Fdownloadipeds/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/btskinner","download_url":"https://codeload.github.com/btskinner/downloadipeds/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":234384318,"owners_count":18823656,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["batch-download","ipeds","r","sas","spss","stata"],"created_at":"2025-01-17T15:32:49.156Z","updated_at":"2025-09-27T04:31:22.605Z","avatar_url":"https://github.com/btskinner.png","language":"R","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Purpose\n\nUse this script to batch download [Integrated Postsecondary Education\nData System (IPEDS)](http://nces.ed.gov/ipeds/) files. The downloaded\nfiles are not unzipped or processed in any way. This script simply\nsaves you the trouble of having to point and click your way through\nthe data center.\n\n(You can also download database files\n[here](https://nces.ed.gov/ipeds/Section/accessdatabase/), but you\nneed MS Access to open them and they only go back to 2004-2005.)\n\nOnly those files listed in `ipeds_file_list.txt` will be\ndownloaded. The default behavior is to download each of the following\nfiles into their own subdirectories:\n \n1. Data file  \n2. Dictionary file\n\nYou can also choose to download other data versions and/or program files:  \n  \n1. Data file (STATA version)  \n2. STATA program file (default if you ask for STATA version data)  \n3. SPSS program file  \n4. SAS program file\n\n# IMPORTANT NOTE\n\nThe default behavior is to download **ALL OF IPEDS**. If you don't\nwant everything, modify `ipeds_file_list.txt` to include only those\nfiles that you want. Simply erase those you don't want, keeping one\nfile name per row or comment them out using a hash symbol (#). Since\nthe downloading script ignores lines starting with hashes, you can add\nnotes or better section headers to the file if you want.\n\nI try to keep `ipeds_file_list.txt` updated, but if I've missed a file\nor haven't updated in a while, just add the name of the file or files,\none to a line. If you notice missing files, feel free to submit a pull\nrequest with an updated version of `ipeds_file_list.txt`.\n\nYou also have the option of whether you wish to overwrite existing\nfiles.  If you do, change the `overwrite` option to `TRUE`. The\ndefault behavior is to only download files listed in\n`ipeds_file_list.txt` that have not already been downloaded. Keep in\nmind that NCES revises old files from time to time, so it may be best\nto overwrite old files unless you have a reason to prefer the old\nversion. \n\n# To Run\n\nFor the default options, run the script `downloadipeds.R` as is. It\nwill create subdirectories as needed to store data and program\nfiles. You may wish to place the folder in its own directory. Just\nmake sure that `ipeds_file_list.txt` is in the same directory. For\nexample, let's say you place both files in a directory called `ipeds`:\n\n```\n./ipeds\n|__ downloadipeds.R\n|__ ipeds_file_list.txt\n```\n\nIf you run it with the default options, two new subdirectories will be\ncreated, one for the `data` files and one for the `dictionary`\nfiles. Using the above example, your directory will look like this\nafter it is finished:\n\n```\n./ipeds\n|__ downloadipeds.R\n|__ ipeds_file_list.txt\n|__ /data\n|   |__ HD2015.zip\n|   |__ IC2015.zip\n|   |__ \u003c...\u003e\n|\n|__ /dictionary\n    |__ HD2015_Dict.zip\n    |__ IC2015_Dict.zip\n    |__ \u003c...\u003e\n```\n\nTo download other program scripts or Stata versions of the data,\nchange the following commands in `downloadipeds.R` from `FALSE` to\n`TRUE`:\n\n```\n## -----------------------------------------------------------------------------\n## CHOOSE WHAT YOU WANT (TRUE == Yes, FALSE == No)\n## -----------------------------------------------------------------------------\n\n## default\nprimary_data = TRUE\ndictionary = TRUE\n\n## STATA version\n## (NB: downloading Stata version of data will also get Stata program files)\nstata_data = FALSE\n\n## other program files\nprog_spss = FALSE\nprog_sas  = FALSE\n\n## overwrite already downloaded files\noverwrite = FALSE\n```\n\nTo change output directory, modify `out_dir` (default is the current directory):\n\n```\n## -----------------------------------------------------------------------------\n## CHOOSE OUTPUT DIRECTORY (DEFAULT == '.', which is current directory)\n## -----------------------------------------------------------------------------\n\nout_dir = '.'\n\n```\n\n# Data size\n\nAs of 25 September 2023, downloading all IPEDS files (setting all\noptions to `TRUE`) requires approximately 2.38 GB of disk\nspace. Granted, you probably don't need both regular and Stata\nversions of the data files (which are the bulk of the directory\nsize). Here are the approximate subdirectory file sizes if you\ndownload all data files from all years:\n\n|Subdirectory|Approximate Size|\n|:--|:-:|\n|`./data`|1.16 GB|\n|`./dictionary`|35.9 MB|\n|`./sas_prog`|4.7 MB|\n|`./spss_prog`|4.3 MB|\n|`./stata_data`|1.17 GB|\n|`./stata_prog`|5 MB|\n\n# Combine\n\nTo combine multiple IPEDS data files into a single dataset, you may\nfind the following script useful: [Combine\nIPEDS](https://gist.github.com/btskinner/f42c87507169d0ba773c)\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fbtskinner%2Fdownloadipeds","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fbtskinner%2Fdownloadipeds","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fbtskinner%2Fdownloadipeds/lists"}