{"id":14066946,"url":"https://github.com/mjfii/R-NameParser-Lib","last_synced_at":"2025-07-30T00:31:47.203Z","repository":{"id":90851979,"uuid":"79484320","full_name":"mjfii/R-NameParser-Lib","owner":"mjfii","description":"An R library allowing parsing of surname, first name, and gender based on US census data. ","archived":false,"fork":false,"pushed_at":"2024-08-12T14:36:27.000Z","size":1971,"stargazers_count":6,"open_issues_count":2,"forks_count":0,"subscribers_count":2,"default_branch":"master","last_synced_at":"2024-08-12T16:56:37.819Z","etag":null,"topics":["algorithm","census-data","determination","gender","library","parse","r"],"latest_commit_sha":null,"homepage":"","language":"R","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"agpl-3.0","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/mjfii.png","metadata":{"files":{"readme":"readme.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":"CITATION.cff","codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2017-01-19T18:54:59.000Z","updated_at":"2024-08-12T14:36:31.000Z","dependencies_parsed_at":null,"dependency_job_id":"22502b01-74ad-48de-9b07-71823cb6980e","html_url":"https://github.com/mjfii/R-NameParser-Lib","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/mjfii%2FR-NameParser-Lib","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/mjfii%2FR-NameParser-Lib/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/mjfii%2FR-NameParser-Lib/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/mjfii%2FR-NameParser-Lib/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/mjfii","download_url":"https://codeload.github.com/mjfii/R-NameParser-Lib/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":228065137,"owners_count":17863945,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["algorithm","census-data","determination","gender","library","parse","r"],"created_at":"2024-08-13T07:05:20.765Z","updated_at":"2024-12-04T07:30:28.393Z","avatar_url":"https://github.com/mjfii.png","language":"R","readme":"  \n# R Name Parser  \n  \nThis R package, `name.parser`, uses U.S. Census data to parse full names of individuals by identifying surnames, stripping salutations and suffixes, and processing common naming conventions.  Additionally, the names are evaluated for gender and the confidence level of that determination.\n\n## Motivation\n\nIt is a common occurrence to receive data with respect to individuals names in a single string, or single attribute.  Stripping this name into a 'first' name, 'middle' name, 'last' name, etc, is essential for comparison and other analytic endeavors.  Since, a simple algorithm, i.e. left most 'word' is the 'first' name, does not always work, even within the same data set, this algorithm was built to 'pull' apart a persons name into a 'best guess' set of strings.  Additionally, non-alpha characters, duplicate spacing, control characters, etc, are required to be removed in the processing of the string.\n\n## Prerequisites\n\nThe two required packages, `data.table` and `parallel` - both of which are installing when this library is loaded.  The census data utilizes the `data.table` library for look-ups and aggregation, while the `parallel` library is used when multiple names are required to process.\n\n## Installation\n\nUsing the `devtools` function, install with the below:\n\n```r\ninstall_github('mjfii/Name-Parser')\nlibrary('name.parser')\n```\n\n## Examples\n\nTo parse a name:\n\n```r\n# returns a single pipe (`|`) delimted string, e.g. \"salutation|first|middle|last|suffix|gender|confidence\".\nx \u003c- 'livingston III,  Mr. MICHAEL JOHN9'\nparse.name(x)\n# or, for multiple names in a `data.table` with similar attributes\nparse.names(x)\n```\n\nTo 'prepare' a name:\n```r\nx \u003c- 'livingston III,  Mr. MICHAEL JOHN9'\nprep.name(x)  \n```\n\nTo get the census data:\n```r\nx \u003c- 'livingston III,  Mr. MICHAEL JOHN9'\nx \u003c- prep.name(x)\nx \u003c- strsplit(x,' ')[[1]]\nget.census.data(x)\n```\n\nTo determine surname (last name) ordinal:\n```r\nx \u003c- 'livingston III,  Mr. MICHAEL JOHN9'\nx \u003c- prep.name(x)\nx \u003c- strsplit(x,' ')[[1]]\ncd \u003c- get.census.data(x)\nprint(x)\ndetermine.surname(cd)\n```\n\nTo determine gender:\n```r\nx \u003c- 'livingston III,  Mr. MICHAEL JOHN9'\nx \u003c- prep.name(x)\nx \u003c- strsplit(x,' ')[[1]]\ncd \u003c- get.census.data(x)\ndetermine.gender(cd)\n```\n\n## Contributors\n\nMichael Flanigan  \n email: [mick.flanigan@gmail.com](mick.flanigan@gmail.com)  \n twitter: [@mjfii](https://twitter.com/mjfii)  \n\n# Versioning\n\n0.0.0.9000 - Initial deployment (2017-02-10)\n","funding_links":[],"categories":["R"],"sub_categories":[],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmjfii%2FR-NameParser-Lib","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fmjfii%2FR-NameParser-Lib","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fmjfii%2FR-NameParser-Lib/lists"}