{"id":20170787,"url":"https://github.com/pear/text_languagedetect","last_synced_at":"2025-05-07T15:08:09.366Z","repository":{"id":2996043,"uuid":"4012791","full_name":"pear/Text_LanguageDetect","owner":"pear","description":"PHP library to identify human languages from text samples.","archived":false,"fork":false,"pushed_at":"2023-02-27T20:55:38.000Z","size":315,"stargazers_count":50,"open_issues_count":1,"forks_count":18,"subscribers_count":17,"default_branch":"master","last_synced_at":"2025-05-07T15:08:07.164Z","etag":null,"topics":["detect-language","languages","php"],"latest_commit_sha":null,"homepage":"http://pear.php.net/package/Text_LanguageDetect","language":"PHP","has_issues":false,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/pear.png","metadata":{"files":{"readme":"README.rst","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null}},"created_at":"2012-04-13T05:41:24.000Z","updated_at":"2025-04-29T01:23:03.000Z","dependencies_parsed_at":"2023-07-06T11:47:08.863Z","dependency_job_id":null,"html_url":"https://github.com/pear/Text_LanguageDetect","commit_stats":{"total_commits":106,"total_committers":6,"mean_commits":"17.666666666666668","dds":0.5471698113207547,"last_synced_commit":"655b269959782416426cf0aa7bd47e19fed63210"},"previous_names":[],"tags_count":11,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/pear%2FText_LanguageDetect","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/pear%2FText_LanguageDetect/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/pear%2FText_LanguageDetect/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/pear%2FText_LanguageDetect/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/pear","download_url":"https://codeload.github.com/pear/Text_LanguageDetect/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":252902614,"owners_count":21822261,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["detect-language","languages","php"],"created_at":"2024-11-14T01:21:05.206Z","updated_at":"2025-05-07T15:08:09.342Z","avatar_url":"https://github.com/pear.png","language":"PHP","funding_links":[],"categories":[],"sub_categories":[],"readme":"*******************\nText_LanguageDetect\n*******************\nPHP library to identify human languages from text samples.\nReturns confidence scores for each.\n\n\nInstallation\n============\n\nPEAR\n----\n::\n\n    $ pear install Text_LanguageDetect\n\nComposer\n--------\n::\n\n    $ composer require pear/text_languagedetect\n\n\nUsage\n=====\nAlso see the examples in the ``docs/`` directory and\nthe `official documentation`__.\n\n__ http://pear.php.net/package/Text_LanguageDetect/docs\n\nLanguage detection\n------------------\nSimple language detection::\n\n    \u003c?php\n    require_once 'Text/LanguageDetect.php';\n\n    $text = 'Was wäre, wenn ich Ihnen das jetzt sagen würde?';\n\n    $ld = new Text_LanguageDetect();\n    $language = $ld-\u003edetectSimple($text);\n\n    echo $language;\n    //output: german\n\nShow the three most probable languages with their confidence score::\n\n    \u003c?php\n    require_once 'Text/LanguageDetect.php';\n\n    $text = 'Was wäre, wenn ich Ihnen das jetzt sagen würde?';\n\n    $ld = new Text_LanguageDetect();\n    //3 most probable languages\n    $results = $ld-\u003edetect($text, 3);\n\n    foreach ($results as $language =\u003e $confidence) {\n        echo $language . ': ' . number_format($confidence, 2) . \"\\n\";\n    }\n\n    //output:\n    //german: 0.35\n    //dutch: 0.25\n    //swedish: 0.20\n    ?\u003e\n\n\nLanguage code\n-------------\nInstead of returning the full language name, ISO 639-2 two and three\nletter codes can be returned::\n\n    \u003c?php\n    require_once 'Text/LanguageDetect.php';\n    $ld = new Text_LanguageDetect();\n\n    //will output the ISO 639-1 two-letter language code\n    // \"de\"\n    $ld-\u003esetNameMode(2);\n    echo $ld-\u003edetectSimple('Das ist ein kleiner Text') . \"\\n\";\n\n    //will output the ISO 639-2 three-letter language code\n    // \"deu\"\n    $ld-\u003esetNameMode(3);\n    echo $ld-\u003edetectSimple('Das ist ein kleiner Text') . \"\\n\";\n    ?\u003e\n\n\nSupported languages\n===================\n- albanian\n- arabic\n- azeri\n- bengali\n- bulgarian\n- cebuano\n- croatian\n- czech\n- danish\n- dutch\n- english\n- estonian\n- farsi\n- finnish\n- french\n- german\n- hausa\n- hawaiian\n- hindi\n- hungarian\n- icelandic\n- indonesian\n- italian\n- kazakh\n- kyrgyz\n- latin\n- latvian\n- lithuanian\n- macedonian\n- mongolian\n- nepali\n- norwegian\n- pashto\n- pidgin\n- polish\n- portuguese\n- romanian\n- russian\n- serbian\n- slovak\n- slovene\n- somali\n- spanish\n- swahili\n- swedish\n- tagalog\n- turkish\n- ukrainian\n- urdu\n- uzbek\n- vietnamese\n- welsh\n\n\nLinks\n=====\nHomepage\n  http://pear.php.net/package/Text_LanguageDetect\nBug tracker\n  http://pear.php.net/bugs/search.php?cmd=display\u0026package_name[]=Text_LanguageDetect\nDocumentation\n  http://pear.php.net/package/Text_LanguageDetect/docs\nUnit test status\n  https://travis-ci.org/pear/Text_LanguageDetect\n\n  .. image:: https://travis-ci.org/pear/Text_LanguageDetect.svg?branch=master\n     :target: https://travis-ci.org/pear/Text_LanguageDetect\n\n\nNotes\n=====\nWhere are the data from?\n\n I don't recall where I got the original data set.\n It's just the frequencies of 3-letter combinations in each supported language.\n It could be generated from a few random wikipedia pages from each language.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fpear%2Ftext_languagedetect","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fpear%2Ftext_languagedetect","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fpear%2Ftext_languagedetect/lists"}