{"id":22879296,"url":"https://github.com/utdata/thcic-pudf","last_synced_at":"2025-03-31T14:33:40.230Z","repository":{"id":92281340,"uuid":"237714286","full_name":"utdata/thcic-pudf","owner":"utdata","description":"Analysis of the Texas Inpatient Public Use Data File","archived":false,"fork":false,"pushed_at":"2020-12-03T15:14:51.000Z","size":38989,"stargazers_count":1,"open_issues_count":1,"forks_count":0,"subscribers_count":1,"default_branch":"master","last_synced_at":"2025-02-06T20:44:24.866Z","etag":null,"topics":["r"],"latest_commit_sha":null,"homepage":"https://utdata.github.io/thcic-pudf/","language":"Stata","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/utdata.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2020-02-02T03:46:03.000Z","updated_at":"2020-12-03T15:54:07.000Z","dependencies_parsed_at":"2023-06-08T06:45:27.289Z","dependency_job_id":null,"html_url":"https://github.com/utdata/thcic-pudf","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/utdata%2Fthcic-pudf","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/utdata%2Fthcic-pudf/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/utdata%2Fthcic-pudf/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/utdata%2Fthcic-pudf/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/utdata","download_url":"https://codeload.github.com/utdata/thcic-pudf/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":246483554,"owners_count":20784912,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["r"],"created_at":"2024-12-13T16:38:45.533Z","updated_at":"2025-03-31T14:33:40.212Z","avatar_url":"https://github.com/utdata.png","language":"Stata","funding_links":[],"categories":[],"sub_categories":[],"readme":"---\noutput:\n  html_document:\n    df_print: paged\nknit: (function(inputFile, encoding) { rmarkdown::render(\n    inputFile,\n    encoding = encoding,\n    output_dir = \"docs\",\n    output_file='index.html'\n  ) })\n---\n\n# Birth outcomes for Texas\n\nThis is an analysis for a series of stories about birth outcomes. The data used is the [Texas Hospital Inpatient Discharge Public Use Data File](https://www.dshs.texas.gov/thcic/hospitals/Inpatientpudf.shtm), [1q2016 - 4q2019]. Texas Department of State Health Services, Center for Health Statistics, Austin, Texas.\n\nNo raw data is made public through this project. The data is typically sold by the Center for Health Statistics. Details at the link above. This analysis uses files for discharges from 2016 to 2019, though a typically very small number of discharges for the fourth quarter of 2019 may not be included because of allowed reporting delays.\n\nNo patients or doctors have been revealed in the data or analysis. Suppressions have been applied for smaller aggregations to protect privacy.\n\nThis analysis is by:\n\n[**CHRISTIAN MCDONALD**](https://journalism.utexas.edu/faculty/christian-mcdonald), Assistant Professor of Practice\\\nSchool of Journalism and Media, Moody College of Communication\\\nThe University of Texas at Austin\n\n## Stories\n\n- [Birth on demand:](https://www.expressnews.com/projects/2020/off-the-chart/cesarean-texas-mothers/) Whether a woman delivers by cesarean has less to do with her health than the hospital she goes to. Case in point: Doctors Hospital of Laredo, where rates of surgical intervention during childbirth are way above the norm. Experts say something isn’t right.\n- [More than ‘a little snip’:](https://www.expressnews.com/projects/2020/off-the-chart/episiotomy-texas-mothers/) Two Texas hospitals performed episiotomies at rates four to six times the recommended level last year. But women giving birth should know: You have the final say.\n- [Database:](https://www.expressnews.com/projects/2020/off-the-chart/cesarean-episiotomy-texas-hospitals-database/) C-section and episiotomy rates in Texas hospitals.\n\n## Methodology\n\n- [The methodology](https://utdata.github.io/thcic-pudf/methodology.html) of the measures and calculations used in the analysis.\n\n## Notebooks\n\n### Primary Cesareans study\n\n- [02-analysis-pcsec](https://utdata.github.io/thcic-pudf/02-analysis-pcsec.html) is an analysis of Primary Cesarean rates vs all uncomplicated births.\n\n### Vaginal Births after Cesarean study\n\n- [02-analysis-vbac](https://utdata.github.io/thcic-pudf/02-analysis-vbac.html) is an analysis of Vaginal births after Ceseareans by hospital.\n\n### Episiotomy study\n\n- [02-analysis-episiotomy](https://utdata.github.io/thcic-pudf/02-analysis-episiotomy.html) is an analysis of episiotomy procedures.\n\n### Deliveries study\n\n- [02-analysis-deliveries](https://utdata.github.io/thcic-pudf/02-analysis-deliveries.html) looks at overall delivery rates at hospitals, including simple rates for Cesarean and episiotomies.\n\n### Summaries\n\n- [03-analysis-summaries](https://utdata.github.io/thcic-pudf/03-analysis-summaries.html) gives statewide rates and by-hospital averages by year for our key measures.\n\n\n## Data processing notebooks\n\n- [00-process-lists](https://utdata.github.io/thcic-pudf/00-process-lists.html) defines various lists of codes and columns used within other notebooks.\n- [0101-process-ahrq-del-loop](https://utdata.github.io/thcic-pudf/0101-process-ahrq-del-loop.html) is the method used to find all deliveries from the raw THCIC files.\n- [0101-process-ahrq-del-test](https://utdata.github.io/thcic-pudf/0101-process-ahrq-del-test.html) shows the processing logic in more detail using test data.\n- [0102-process-ahrq-del-compare](https://utdata.github.io/thcic-pudf/0102-process-ahrq-del-compare.html) compares the results of the loop script to the test script when both are processed using test data.\n- [0101-process-lf-epi-loop](https://utdata.github.io/thcic-pudf/0101-process-lf-epi-loop.html) is the method used to capture vaginal births using Leapfrog's methods.\n- [0101-process-lf-epi-test](https://utdata.github.io/thcic-pudf/0101-process-lf-epi-test.html) shows the processing logic in more detail using test data.\n- [0103-process-ahrq-providers](https://utdata.github.io/thcic-pudf/0103-process-ahrq-providers.html) creates a list of the most recent names for hospitals in the AHRQ data to assist analysis. Can't be run until the deliveries data is generated.\n- [0104-process-lf-providers](https://utdata.github.io/thcic-pudf/0104-process-lf-providers.html) creates a list of the most recent names for hospitals in the Leapfrog to assist analysis. Can't be run until the the AHRQ providers are processed.\n\n## Exports\n\nA series of aggregated data that may prove useful. In the `exports/` folder:\n\n- [Interactive Data Matrix](https://utdata.github.io/thcic-pudf/interactive.html) maps specific files/data to the mockup.\n\n### From 02-analysis-pcsec\n\n- ahrq_pcsec_rate_county_yr: (Primary cesarean rates for uncomplicated deliveries by patient county.) Grouped by PAT_COUNTY, COUNTY. Aggregations are NPCSEC_CNT (Count not primary Cesarean), PCSEC_CNT (Count primary Cesarean), TOTAL, PCRATE (Primary Cesarean rate).\n\n### From 02-analysis-episiotomy\n\n- lf_epi_rate_county_yr: (Episiotomy rates for uncomplicated deliveries by patient county.) Grouped by YR, PAT_COUNTY, COUNTY. Aggregations are EPIF (Count Epi False), EPIT (Count Epi True), TOTAL (Count of all vaginal deliveries), EPIRATE.\n\n### From 02-analysis-deliveries\n\n- blurbs_2019.json: Data: Hospital, birth per doc rate (weekly), Medicaid percentage, Maternal level.\n\n### From 03-analysis-summaries\n\n- table_2019.csv: Data: Hospital, location, episiotomy and primary cesarean rates for 2019.\n- chart_data.json: Data: Year, ID, Hospital, episiotomy and primary cesarean rates. There is a v2 version as well.\n- chart_data_print.csv: Same as `chart_data` but as csv for print.\n- summary_data.csv: Summary data for the major categories.\n\n### From 0102-process-providers\n\n- providers_full: Names and addresses of all the hospitals in the analysis.\n- providers_current: The most recent list of facilites, per the Center for Health Statistics.\n\n## Notes\n\nThere was a data correction sent out 2020-08-14 to replace some fields for some records in the `PUDF_base1_3Q2019_tab.txt` data file. The replacement file was not applied because we are not using the fields noted, `TOTAL_CHARGES` and `TOTAL_CHARGES_ACCOMM`.\n\n## Music to get you through it\n\nA good portion of this project was made possible through [Iron Maiden Radio](https://www.pandora.com/station/play/847050113183194133) on Pandora.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Futdata%2Fthcic-pudf","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Futdata%2Fthcic-pudf","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Futdata%2Fthcic-pudf/lists"}