{"id":13779221,"url":"https://github.com/pharo-ai/Polyglot","last_synced_at":"2025-05-11T12:32:56.590Z","repository":{"id":45398668,"uuid":"189589232","full_name":"pharo-ai/Polyglot","owner":"pharo-ai","description":"A library for Natural Language Processing","archived":true,"fork":false,"pushed_at":"2023-09-25T12:47:04.000Z","size":83,"stargazers_count":7,"open_issues_count":8,"forks_count":3,"subscribers_count":6,"default_branch":"master","last_synced_at":"2024-05-18T21:52:26.916Z","etag":null,"topics":["natural-language-processing","nlp","pharo"],"latest_commit_sha":null,"homepage":"","language":"Smalltalk","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/pharo-ai.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2019-05-31T12:20:14.000Z","updated_at":"2024-07-05T18:25:14.892Z","dependencies_parsed_at":"2024-07-05T18:40:33.095Z","dependency_job_id":null,"html_url":"https://github.com/pharo-ai/Polyglot","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/pharo-ai%2FPolyglot","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/pharo-ai%2FPolyglot/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/pharo-ai%2FPolyglot/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/pharo-ai%2FPolyglot/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/pharo-ai","download_url":"https://codeload.github.com/pharo-ai/Polyglot/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":253567358,"owners_count":21928818,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["natural-language-processing","nlp","pharo"],"created_at":"2024-08-03T18:01:02.745Z","updated_at":"2025-05-11T12:32:56.260Z","avatar_url":"https://github.com/pharo-ai.png","language":"Smalltalk","funding_links":[],"categories":["Natural Language Processing"],"sub_categories":[],"readme":"# Polyglot\n[![Build Status](https://travis-ci.org/PolyMathOrg/Polyglot.svg?branch=master)](https://travis-ci.org/PolyMathOrg/Polyglot)\n[![Build status](https://ci.appveyor.com/api/projects/status/nk84odcludj242lw?svg=true)](https://ci.appveyor.com/project/nikhilpinnaparaju/Polyglot)\n[![Coverage Status](https://coveralls.io/repos/github/PolyMathOrg/Polyglot/badge.svg?branch=master)](https://coveralls.io/github/PolyMathOrg/Polyglot?branch=master)\n[![License](https://img.shields.io/badge/license-MIT-blue.svg)](https://raw.githubusercontent.com/PolyMathOrg/Polyglot/master/LICENSE)\n[![Pharo version](https://img.shields.io/badge/Pharo-6.1-%23aac9ff.svg)](https://pharo.org/download)\n[![Pharo version](https://img.shields.io/badge/Pharo-7.0-%23aac9ff.svg)](https://pharo.org/download)\n[![Pharo version](https://img.shields.io/badge/Pharo-8.0-%23aac9ff.svg)](https://pharo.org/download)\n\nThis repository is marked as a public archive and it will be deleted in the future. It contains duplicated code that it can be found in the other pharo-ai repositories. We encourage you to look into the other NLP repositories that we have inside pharo-ai.\n\nA library for Natural Language Processing implemented in Pharo. To get more information, check out the [Polyglot Booklet](https://github.com/SquareBracketAssociates/Booklet-Polyglot).\n\n## Installation\nTo install Polyglot, go to the Playground (`Ctrl+OW`) in your fresh Pharo image and execute the following Metacello script (select it and press Do-it button or `Ctrl+D`):\n\n```smalltalk\nMetacello new\n  baseline: 'Polyglot';\n  repository: 'github://PolyMathOrg/Polyglot/src';\n  load.\n```\n\n## List of Supported Features\n\n- Tokenization\n- N-grams\n- Term Frequency-Inverse Document Frequency Scoring\n- N-Gram Language Modelling\n- Stemming\n- Part of Speech Tagging\n- Named Entity Recognizer\n- Dependency Parser\n- Modified Atlas Bridge\n- Common Vector Metrics\n\n## Google Summer of Code 2019 Report\n### Author: Nikhil Pinnaparaju\n\nOrganisation: [Pharo](https://pharo.org/)\n\nProject: [Polyglot](https://github.com/PolyMathOrg/Polyglot)\n\nMentors: Oleksandr Zaitsev, Alexandre Bergel\n\n\nA library for Natural Language Processing implemented in Pharo. \n\n## Features Implemented\n- Tokenization\n- N-grams\n- Term Frequency-Inverse Document Frequency Scoring\n- N-Gram Language Modelling\n- Stemming\n- Part of Speech Tagging\n- Named Entity Recognizer\n- Dependency Parser\n- Modified Atlas Bridge\n- Common Vector Metrics\n\n\n## Code Contribution\n- [Commits - Polyglot](https://github.com/PolyMathOrg/Polyglot/commits/master?author=nikhilpinnaparaju).\n\n- [Pull Requests - Polyglot](https://github.com/PolyMathOrg/Polyglot/pulls?utf8=%E2%9C%93\u0026q=is%3Apr+author%3Anikhilpinnaparaju+).\n\n- [Issues Raised - PolyMath](https://github.com/PolyMathOrg/PolyMath/issues?utf8=%E2%9C%93\u0026q=is%3Aissue+author%3Anikhilpinnaparaju+).\n\n- [Pull Requests - PolyMath](https://github.com/PolyMathOrg/PolyMath/pulls?utf8=%E2%9C%93\u0026q=is%3Apr+author%3Anikhilpinnaparaju+).\n\n## Documentation\n\n### Blog Posts\n\n- [Representing Documents as Vectors and Visualizing them Using Polyglot in Pharo](https://medium.com/@nikhilpinnaparaju/representing-documents-as-vectors-and-visualizing-them-using-polyglot-in-pharo-73887e8bb418)\n- [Stemming in Polyglot](https://medium.com/@nikhilpinnaparaju/stemming-in-polyglot-2672a349e15)\n- [Working with the Atlas Pharo-Python Bridge](https://medium.com/@nikhilpinnaparaju/working-with-the-atlas-pharo-python-bridge-1ad6ba356f7)\n- [Polyglot for Large Corpora](https://medium.com/@nikhilpinnaparaju/polyglot-for-large-corpora-71267c525876)\n- [Introducing Polyglot](https://link.medium.com/XrrMmBsfPX)\n- [Tokenization  — GSoC with Pharo Consortium](https://link.medium.com/YlaK5QtfPX)\n- [Community Bonding Period — GSoC with Pharo Consortium](https://link.medium.com/WyjlwqwfPX)\n- [Architecture Design For an NLP Library](https://link.medium.com/Az8fikxfPX)\n- [PCA in Pharo using PolyMath, DataFrame and Roassal](https://link.medium.com/qcRTM0yfPX)\n- [My Journey Into Google Summer of Code — 2019](https://link.medium.com/PZ6Zd4zfPX)\n\n### Booklets\n\n- [The Polyglot Booklet](https://github.com/SquareBracketAssociates/Booklet-Polyglot)\n- [Documentation for Polyglot](https://github.com/nikhilpinnaparaju/Polyglot-Documentation)\n\n## Project Demonstration/Presentation\n- [Polyglot ESUG Presention V1.0](https://drive.google.com/file/d/18J2bgdrJ6Dhbaxg1-N3hC8-ZD7y0P9yp/view?usp=sharing)\n- [Polyglot ESUG Presention V2.0](https://drive.google.com/file/d/1PgNS1xpwos1txeclhbzFfVlbrbs54pGs/view?usp=sharing)\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fpharo-ai%2FPolyglot","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fpharo-ai%2FPolyglot","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fpharo-ai%2FPolyglot/lists"}