{"id":21064141,"url":"https://github.com/somenath203/language-identifier-using-tensorflow","last_synced_at":"2026-04-28T16:32:54.145Z","repository":{"id":230644535,"uuid":"779715619","full_name":"somenath203/Language-Identifier-using-Tensorflow","owner":"somenath203","description":"Click below to checkout the website","archived":false,"fork":false,"pushed_at":"2024-05-13T12:48:35.000Z","size":297,"stargazers_count":1,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"main","last_synced_at":"2025-01-20T20:50:11.149Z","etag":null,"topics":["gradio","gru","huggingface","keras","language-classification","lstm","rnn","sequential","tensorflow","textcnn"],"latest_commit_sha":null,"homepage":"https://som11-language-predictor.hf.space","language":"Jupyter Notebook","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/somenath203.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2024-03-30T15:22:26.000Z","updated_at":"2024-07-16T07:08:58.000Z","dependencies_parsed_at":"2024-05-13T13:58:36.859Z","dependency_job_id":"90459cd1-673a-478a-8227-df139fc72af1","html_url":"https://github.com/somenath203/Language-Identifier-using-Tensorflow","commit_stats":null,"previous_names":["somenath203/language-identifier-using-tensorflow"],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/somenath203%2FLanguage-Identifier-using-Tensorflow","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/somenath203%2FLanguage-Identifier-using-Tensorflow/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/somenath203%2FLanguage-Identifier-using-Tensorflow/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/somenath203%2FLanguage-Identifier-using-Tensorflow/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/somenath203","download_url":"https://codeload.github.com/somenath203/Language-Identifier-using-Tensorflow/tar.gz/refs/heads/main","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":243506938,"owners_count":20301779,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["gradio","gru","huggingface","keras","language-classification","lstm","rnn","sequential","tensorflow","textcnn"],"created_at":"2024-11-19T17:48:26.074Z","updated_at":"2025-12-30T16:34:31.017Z","avatar_url":"https://github.com/somenath203.png","language":"Jupyter Notebook","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Language Identifier\n\n## Introduction\nThis is a deep learning project created with the help of Tensorflow that predicts the language of a given text snippet. Currently, this language prediction model supports a total of 22 languages as of now, which include: Arabic, Chinese, Dutch, English, Estonian, French, Hindi, Indonesian, Japanese, Korean, Latin, Persian, Portuguese, Pashto, Romanian, Russian, Spanish, Swedish, Tamil, Thai, Turkish, and Urdu.\n\n## Dataset used in this project\n\nThe dataset used in this project is taken from kaggle: https://www.kaggle.com/datasets/zarajamshaid/language-identification-datasst\n\n## Models used in this project\n\n1) Vanilla Sequential model\n2) TextCNN model\n3) Bidirectional SimpleRNN model\n4) Bidirectional LSTM model\n5) Bidirectional GRU model\n6) Ensemble Learning(Bidirectional LSTM + Bidirectional GRU) model\n\n**Out of the all the above models, textCNN proved to be the most effective one with a training accuracy of around 78.99% and testing accuracy of around 73.65%**\n\n## About the web application of the deep learning model\n\nThe deep learning model of this project is connected with an application created with Gradio for real time prediction and it is deployed on HuggingFace Spaces.\n\n## Links\n\nLive Preview: https://som11-language-predictor.hf.space/\n\n## Warning\nWhile the model of this project can classify languages correctly, but in some cases, the model may misclassify languages, therefore, it is strongly advised not to rely solely on the output of this model.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fsomenath203%2Flanguage-identifier-using-tensorflow","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fsomenath203%2Flanguage-identifier-using-tensorflow","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fsomenath203%2Flanguage-identifier-using-tensorflow/lists"}