{"id":22928653,"url":"https://github.com/uva-bi-sdad/stem_edu","last_synced_at":"2025-07-18T03:33:30.573Z","repository":{"id":70054200,"uuid":"186703989","full_name":"uva-bi-sdad/stem_edu","owner":"uva-bi-sdad","description":null,"archived":false,"fork":false,"pushed_at":"2019-10-31T15:02:18.000Z","size":92309,"stargazers_count":4,"open_issues_count":0,"forks_count":0,"subscribers_count":5,"default_branch":"master","last_synced_at":"2024-12-27T19:07:34.203Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"R","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/uva-bi-sdad.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2019-05-14T21:36:30.000Z","updated_at":"2023-03-09T19:45:45.000Z","dependencies_parsed_at":null,"dependency_job_id":"08afd487-484c-4886-9852-1d2cfd8a8330","html_url":"https://github.com/uva-bi-sdad/stem_edu","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/uva-bi-sdad%2Fstem_edu","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/uva-bi-sdad%2Fstem_edu/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/uva-bi-sdad%2Fstem_edu/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/uva-bi-sdad%2Fstem_edu/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/uva-bi-sdad","download_url":"https://codeload.github.com/uva-bi-sdad/stem_edu/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":239500863,"owners_count":19649341,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-12-14T09:27:21.780Z","updated_at":"2025-02-18T15:49:43.493Z","avatar_url":"https://github.com/uva-bi-sdad.png","language":"R","readme":"# Stem Education\n\n# Datasets\n- coursera_comp_sci.csv \n  - produced from: `~/stem_edu/src/hsbb8/working/webscraping/coursera/cs_coursera_scraping.Rmd`\n  - stored in: `~/stem_edu/data/stem_edu/final/webscraping_data/`\n  - the final data set with all of the Coursera inforamtion from web scraping Coursera computer science courses\n  \n- coursera_keyword_counts.csv\n  - produced from: `~/stem_edu/src/hsbb8/working/keyword_search/coursera_keyword_search.R`\n  - stored in: `~/stem_edu/data/stem_edu/final/webscraping_data/`\n  - counts of the number of Coursera.com courses that contain certain keywords of jobs skills (keywords from: O*NET)\n\n- indeed_keyword_counts.csv\n  - produced from: `~/stem_edu/src/hsbb8/working/keyword_search/Indeed-JobAdsWebscrapping.ipynb`\n  - stored in: `~/stem_edu/data/stem_edu/final/webscraping_data/`\n  - counts of the number of Indeed.com job ads that contain certain keywords of job skills (keywords from: O*NET)\n  \n- keywords_list.csv\n  - from O*NET.com: technology skills listed for Software Developer, Applications\n  - stored in: `~/stem_edu/data/stem_edu/original/hsbb8/`\n  - list of keywords of skills that could be needed in junior software developer jobs\n  - used to calculate the number of job ads or courses that these skills appear in \n    - used in: `coursera_keyword_counts.csv and indeed_keyword_counts.csv`\n  \n- virginia_res.csv\n  - produced from: `~/stem_edu/src/chanida3/indeed_virginia_webscraping.R`\n  - stored in: `~/stem_edu/data/stem_edu/chanida3/virginia_res.csv`\n  - information scraped from dental hygienist resumes in Virginia from indeed.com\n  \n-IPEDS_2010_degrees.zip -- IPEDS_2015_degrees.zip\n    -./data/stem_edu/original/IPEDS_data_zipped\n    -IPEDS data on STEM award counts by institution, award type (Master's, associates, certification, etc), race/ethnicity,and gender\n    -each count of award type by race is a separate column\n    -The ipedsM (melt table) makes the count by race into an observation\n    -This data set should be re-downloaded from IPEDS data center\n        -Download ALL degrees awarded for race/ethnicity and gender from 2010-2015 and then subset in R\n        -I made the mistake of selecting variables to try to subset data from the IPEDS data center and may have made a mistake in PhD awards for 2010\n\n-meltTable.RData (ipedsM is the object name)\n    -./data/stem_edu/working/meltTable.RData\n    -this is the melted version that turns the counts by race and gender into observations\n    -Lines 187-193 in trend_IPEDS.R creates and saves this table\n    \n- ./data/stem_edu/original/IPUMS\n  - Attachment_C_STEM.csv\n      - List of SOC codes considered STEM occupations by BLS.\n  - ipums.csv\n      - 2015 ACS/Census data table downloaded from IPUMS *AFTER* running 'do' file on original download in STATA.\n      - ipums does not preserve the sample you select for more than a couple of weeks. This was selected using Maddie Arnsbarger's account.\n      - Must make personal account with IPUMS and select the variables/year/sources you want to use.\n  - ipums_data_dictionary.csv\n      - variable definitions and valid values for each variable in ipums.csv sample.\n      - original data dictionary is in the dspg 2017 nsf stem project folder\n  - origin\n      - these 3 csv files compose ipums_data_dictionary.csv\n      - source: `./src/maddieka/01.1-ipums_data_dict.R`\n      \n- ./data/stem_edu/original/NCES_ELS\n  - byf3_dictionary.csv\n      - csv version of Layout_BYF1SCH.txt\n      - variable definitions for the els_02_12_byf3stu_v1_0.rdata file.\n  - Codebook.txt\n      - Comprehensive data dictionary for all 4 tables downloaded from NCES.\n  - Layout_....txt\n      - Data dictionaries for each corresponding .rdata file.\n      - Includes variable definitions, valid values, and valid value definitions.\n  - els_02...._v1_0.rdata\n      - rdata files downloaded from NCES ELS website.\n      - stu = student, inst = institution, sch = school.\n\n# Figures/ Plots\n\n- coursera_final_fixed_3.png\n  - produced from: `~/stem_edu/src/hsbb8/working/keyword_search/Indeed-JobAdsWebscrapping.ipynb`\n  - stored in: `~/stem_edu/output/keyword_search_graphs/`\n  - bar plot of the percentages of courses that certain job skills appear in\n    - used for the final poster\n  \n- indeed_final_plot_fixed.png\n  - produced from: `~/stem_edu/src/hsbb8/working/keyword_search/Indeed-JobAdsWebscrapping.ipynb`\n  - stored in: `~/stem_edu/output/keyword_search_graphs/`\n  - bar plot of the percentages of job ads that certain job skills appear in\n    - used for the final poster\n    \n- ./output/sankey_diagrams/ipums_women_deglevel_3.selected.occupations.html\n  - description: sankey of ipums data (only women; degree level to stem categorization for occupation + 3 selected case study occupations.)\n  - source: `src/maddieka/01.2-sankey_ipums.R\n  \n- ./output/sankey_diagrams/ipums_women_degreelevel_high.super.stem.occs.html\n  - description: sankey of ipums data (only women; degree level to high/super stem occupation categories.)\n  - source: `src/maddieka/poster_plotsDSPG2017.R`\n\n- ./output/sankey_diagrams/nces_sankey.html\n  - description: sankey of nces data\n  - source: `src/maddieka/02.1-sankey_nces.R`\n\n- ./output/sankey_diagrams/ipums_women_degreelevel_all.stem.occs.html\n  - description: DSPG 2017 POSTER SANKEY -- sankey of ipums data \n        (only women; degree level to stem occupation categories (using Biania's stem score calculations))\n  - source: `src/maddieka/poster_plotsDSPG2017.R`\n\n- ./output/sankey_diagrams/ipums_women_degtype_only3selected.occs.html\n  - description: sankey of ipums data (only women; degree level to 3 selected case study occupations.)\n  - source: `src/maddieka/01.2-sankey_ipums.R`\n    \n-./output/mind_maps/\n  -Dental_Hygienist.pdf\n  -Junior_Software_Developer_Applications.pdf\n  -Environmental_Engineer.pdf\n    -Mind maps for the three occupations\n    -The information contained comes from O*NET profile for each occupation\n  \n-./output/ipeds_trend/\n  -AssociatesDegreeTrend.png\n  -BachelorsDegreeTrend.png\n  -MastersDegreeTrend.png\n  -PHDTrend.png\n  -TotalStemTrend.png\n    -These are trend plots of STEM degrees earned each year as a percentage of all STEM degrees earned in a 6 year period, by race/ethnicity\n    -Code to generate them is in ./src/benjs23/trend_IPEDS.R\n\n-./output/ipeds_trend/\n  -2010.png \n    -Shows all of the award types earned in 2010 by race/ethnicity\n    -Code to generate them is in ./src/benjs23/bianica_raceAwardCrosstab.R\n      -Code can produce plots for all years 2010-2015\n","funding_links":[],"categories":[],"sub_categories":[],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fuva-bi-sdad%2Fstem_edu","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fuva-bi-sdad%2Fstem_edu","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fuva-bi-sdad%2Fstem_edu/lists"}