{"id":13411499,"url":"https://github.com/jamesturk/jellyfish","last_synced_at":"2025-05-14T22:05:46.996Z","repository":{"id":971225,"uuid":"766701","full_name":"jamesturk/jellyfish","owner":"jamesturk","description":"🪼 a python library for doing approximate and phonetic matching of strings.","archived":false,"fork":false,"pushed_at":"2025-04-07T04:20:42.000Z","size":3674,"stargazers_count":2130,"open_issues_count":6,"forks_count":159,"subscribers_count":41,"default_branch":"main","last_synced_at":"2025-05-07T21:57:58.384Z","etag":null,"topics":["fuzzy-search","hacktoberfest","hamming","jaro-winkler","levenshtein","metaphone","python","soundex"],"latest_commit_sha":null,"homepage":"https://jamesturk.github.io/jellyfish/","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/jamesturk.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":".github/FUNDING.yml","license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":"CITATION.cff","codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null},"funding":{"github":"jamesturk","patreon":null,"open_collective":null,"ko_fi":null,"tidelift":null,"community_bridge":null,"liberapay":null,"issuehunt":null,"otechie":null,"custom":null}},"created_at":"2010-07-09T20:41:11.000Z","updated_at":"2025-05-04T10:26:17.000Z","dependencies_parsed_at":"2023-02-18T09:00:54.810Z","dependency_job_id":"e4b33478-30ec-4f68-a9eb-3a0c54e472f6","html_url":"https://github.com/jamesturk/jellyfish","commit_stats":{"total_commits":446,"total_committers":33,"mean_commits":"13.515151515151516","dds":0.5381165919282511,"last_synced_commit":"2ebc2c647eafce3036d12814b0247cdd55e0be4d"},"previous_names":[],"tags_count":55,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/jamesturk%2Fjellyfish","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/jamesturk%2Fjellyfish/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/jamesturk%2Fjellyfish/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/jamesturk%2Fjellyfish/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/jamesturk","download_url":"https://codeload.github.com/jamesturk/jellyfish/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":254235687,"owners_count":22036962,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["fuzzy-search","hacktoberfest","hamming","jaro-winkler","levenshtein","metaphone","python","soundex"],"created_at":"2024-07-30T20:01:14.093Z","updated_at":"2025-05-14T22:05:46.952Z","avatar_url":"https://github.com/jamesturk.png","language":"Jupyter Notebook","funding_links":["https://github.com/sponsors/jamesturk"],"categories":["Jupyter Notebook","Data Processing","Python","文本数据和NLP","Feature Extraction","Open-Source Software"],"sub_categories":["Data Similarity","General-Purpose Machine Learning","Text/NLP","String Comparison"],"readme":"# Overview\n\n**jellyfish** is a library for approximate \u0026 phonetic matching of strings.\n\nSource: [https://github.com/jamesturk/jellyfish](https://github.com/jamesturk/jellyfish)\n\nDocumentation: [https://jamesturk.github.io/jellyfish/](https://jamesturk.github.io/jellyfish/)\n\nIssues: [https://github.com/jamesturk/jellyfish/issues](https://github.com/jamesturk/jellyfish/issues)\n\n[![PyPI badge](https://badge.fury.io/py/jellyfish.svg)](https://badge.fury.io/py/jellyfish)\n[![Test badge](https://github.com/jamesturk/jellyfish/workflows/Python%20package/badge.svg)](https://github.com/jamesturk/jellyfish/actions?query=workflow%3A%22Python+package)\n[![Coveralls](https://coveralls.io/repos/jamesturk/jellyfish/badge.png?branch=master)](https://coveralls.io/r/jamesturk/jellyfish)\n![Test Rust](https://github.com/jamesturk/rust-jellyfish/workflows/Test%20Rust/badge.svg)\n\n## Included Algorithms\n\nString comparison:\n\n* Levenshtein Distance\n* Damerau-Levenshtein Distance\n* Jaccard Index\n* Jaro Distance\n* Jaro-Winkler Distance\n* Match Rating Approach Comparison\n* Hamming Distance\n\nPhonetic encoding:\n\n* American Soundex\n* Metaphone\n* NYSIIS (New York State Identification and Intelligence System)\n* Match Rating Codex\n\n## Example Usage\n\n``` python\n\u003e\u003e\u003e import jellyfish\n\u003e\u003e\u003e jellyfish.levenshtein_distance('jellyfish', 'smellyfish')\n2\n\u003e\u003e\u003e jellyfish.jaro_similarity('jellyfish', 'smellyfish')\n0.89629629629629637\n\u003e\u003e\u003e jellyfish.damerau_levenshtein_distance('jellyfish', 'jellyfihs')\n1\n\n\u003e\u003e\u003e jellyfish.metaphone('Jellyfish')\n'JLFX'\n\u003e\u003e\u003e jellyfish.soundex('Jellyfish')\n'J412'\n\u003e\u003e\u003e jellyfish.nysiis('Jellyfish')\n'JALYF'\n\u003e\u003e\u003e jellyfish.match_rating_codex('Jellyfish')\n'JLLFSH'\n```\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fjamesturk%2Fjellyfish","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fjamesturk%2Fjellyfish","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fjamesturk%2Fjellyfish/lists"}