{"id":16274078,"url":"https://github.com/trinker/read_docx","last_synced_at":"2025-10-08T08:57:23.007Z","repository":{"id":23368562,"uuid":"26729810","full_name":"trinker/read_docx","owner":"trinker","description":null,"archived":false,"fork":false,"pushed_at":"2014-11-16T22:19:42.000Z","size":116,"stargazers_count":5,"open_issues_count":0,"forks_count":1,"subscribers_count":2,"default_branch":"master","last_synced_at":"2025-04-04T08:39:08.985Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"R","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/trinker.png","metadata":{"files":{"readme":"README.Rmd","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2014-11-16T22:08:59.000Z","updated_at":"2018-05-18T18:08:34.000Z","dependencies_parsed_at":"2022-08-05T22:16:14.993Z","dependency_job_id":null,"html_url":"https://github.com/trinker/read_docx","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/trinker/read_docx","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/trinker%2Fread_docx","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/trinker%2Fread_docx/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/trinker%2Fread_docx/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/trinker%2Fread_docx/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/trinker","download_url":"https://codeload.github.com/trinker/read_docx/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/trinker%2Fread_docx/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":278916432,"owners_count":26068090,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-10-08T02:00:06.501Z","response_time":56,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-10-10T18:26:59.807Z","updated_at":"2025-10-08T08:57:22.992Z","avatar_url":"https://github.com/trinker.png","language":"R","funding_links":[],"categories":[],"sub_categories":[],"readme":"A function developed by [Bryan Goodrich](https://www.linkedin.com/in/bryangoodrich) for reading .docx files into R:\n\n```{r, echo=FALSE}\nlibrary(knitr)\n## knitr::knit2html(\"README.Rmd\", \"README.md\")\n```\n\n\n**The Code**\n\n```{r}\nread_docx \u003c- function (file, skip = 0) {\n    tmp \u003c- tempfile()\n    if (!dir.create(tmp)) stop(\"Temporary directory could not be established.\")\n    unzip(file, exdir = tmp)\n    xmlfile \u003c- file.path(tmp, \"word\", \"document.xml\")\n    doc \u003c- XML::xmlTreeParse(xmlfile, useInternalNodes = TRUE)\n    unlink(tmp, recursive = TRUE)\n    nodeSet \u003c- XML::getNodeSet(doc, \"//w:p\")\n    pvalues \u003c- sapply(nodeSet, XML::xmlValue)\n    pvalues \u003c- pvalues[pvalues != \"\"]\n    if (skip \u003e 0) pvalues \u003c- pvalues[-seq(skip)]\n    pvalues\n}\n```\n\n**In Action...**\n\n```{r, message=FALSE}\nlibrary(qdapRegex); library(qdap)\ninput \u003c- rm_non_ascii(read_docx(\"LRA2014AdvocacyProposal.docx\"))\nrm_citation(unbag(input), extract=TRUE)\n```\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Ftrinker%2Fread_docx","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Ftrinker%2Fread_docx","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Ftrinker%2Fread_docx/lists"}