{"id":16881549,"url":"https://github.com/benbjohnson/matlock","last_synced_at":"2025-07-31T10:10:16.346Z","repository":{"id":6225730,"uuid":"7457206","full_name":"benbjohnson/matlock","owner":"benbjohnson","description":"Simple name extraction utility.","archived":false,"fork":false,"pushed_at":"2015-06-03T14:16:04.000Z","size":768,"stargazers_count":2,"open_issues_count":0,"forks_count":4,"subscribers_count":1,"default_branch":"master","last_synced_at":"2025-07-22T02:04:14.208Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":"","language":"Ruby","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/benbjohnson.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2013-01-05T16:15:10.000Z","updated_at":"2021-04-21T15:16:39.000Z","dependencies_parsed_at":"2022-08-20T22:31:08.929Z","dependency_job_id":null,"html_url":"https://github.com/benbjohnson/matlock","commit_stats":null,"previous_names":[],"tags_count":3,"template":false,"template_full_name":null,"purl":"pkg:github/benbjohnson/matlock","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/benbjohnson%2Fmatlock","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/benbjohnson%2Fmatlock/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/benbjohnson%2Fmatlock/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/benbjohnson%2Fmatlock/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/benbjohnson","download_url":"https://codeload.github.com/benbjohnson/matlock/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/r
epositories/benbjohnson%2Fmatlock/sbom","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":267455521,"owners_count":24089995,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-07-28T02:00:09.689Z","response_time":68,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-10-13T16:03:28.599Z","updated_at":"2025-07-31T10:10:16.285Z","avatar_url":"https://github.com/benbjohnson.png","language":"Ruby","funding_links":[],"categories":[],"sub_categories":[],"readme":"Matlock\n=======\n\n## Overview\n\nMatlock is a simple name extraction utility.\nIt's useful for pulling out all the full names from a string.\nFor example, pass in the HTML for a web page and it will return an array of the full names mentioned.\n\nIts premise is simple, and it is geared toward names in the United States.\nMatlock extracts all bigrams (first name, surname) from the string where either the first name or surname is in the US Census list of common names.\nThe US Census data has been cleaned to remove common words and the names of US cities and states.\nYou can see the full list of name data in the `lib/matlock/data` directory.\n\n\n## Getting Started\n\nTo use Matlock, install the gem:\n\n```bash\n$ gem install matlock\n```\n\nThen, in your script, require the gem and extract names from a string:\n\n```ruby\nrequire \"matlock\"\nrequire 'open-uri'\n\nhtml = 
URI.open(\"http://cnn.com\").read\n\nmatlock = Matlock.new\nnames = matlock.extract_names(html)\n\n#=\u003e [\"Lance Armstrong\", \"Anderson Cooper\", ...]\n```\n\nMatlock will produce some false positives and some false negatives.\nIt's not meant to be an \"end all, be all\" name extraction tool.\nIt's meant to be quick and dirty, which is effective enough for many use cases.","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fbenbjohnson%2Fmatlock","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fbenbjohnson%2Fmatlock","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fbenbjohnson%2Fmatlock/lists"}