{"id":13395052,"url":"https://github.com/jekyll/classifier-reborn","last_synced_at":"2025-05-16T14:07:44.913Z","repository":{"id":19569191,"uuid":"22818441","full_name":"jekyll/classifier-reborn","owner":"jekyll","description":"A general classifier module to allow Bayesian and other types of classifications. A fork of cardmagic/classifier.","archived":false,"fork":false,"pushed_at":"2024-05-27T18:32:55.000Z","size":701,"stargazers_count":557,"open_issues_count":28,"forks_count":111,"subscribers_count":19,"default_branch":"master","last_synced_at":"2025-05-06T23:03:20.487Z","etag":null,"topics":["bayesian-classifier","ruby","rubyml"],"latest_commit_sha":null,"homepage":"https://jekyll.github.io/classifier-reborn/","language":"Ruby","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"lgpl-2.1","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/jekyll.png","metadata":{"files":{"readme":"README.markdown","changelog":"History.markdown","contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2014-08-10T20:54:45.000Z","updated_at":"2025-04-14T17:47:28.000Z","dependencies_parsed_at":"2024-06-12T06:43:40.014Z","dependency_job_id":"81043a5b-b4dc-4859-9d61-1333bb67215d","html_url":"https://github.com/jekyll/classifier-reborn","commit_stats":{"total_commits":326,"total_committers":47,"mean_commits":"6.9361702127659575","dds":0.665644171779141,"last_synced_commit":"d8872fe18ff80fa6f6a4b66384a3a9a58090a493"},"previous_names":[],"tags_count":10,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/jekyll%2Fclassifier-reborn","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/jekyll%2Fclassif
ier-reborn/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/jekyll%2Fclassifier-reborn/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/jekyll%2Fclassifier-reborn/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/jekyll","download_url":"https://codeload.github.com/jekyll/classifier-reborn/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":254544146,"owners_count":22088807,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["bayesian-classifier","ruby","rubyml"],"created_at":"2024-07-30T17:01:40.492Z","updated_at":"2025-05-16T14:07:42.981Z","avatar_url":"https://github.com/jekyll.png","language":"Ruby","readme":"# Classifier Reborn\n\n[![Gem Version](https://badge.fury.io/rb/classifier-reborn.svg)](https://rubygems.org/gems/classifier-reborn)\n[![Build Status](https://img.shields.io/travis/jekyll/classifier-reborn/master.svg)](https://travis-ci.org/jekyll/classifier-reborn)\n---\n\n## [Read the Docs](https://jekyll.github.io/classifier-reborn/)\n\n## Getting Started\n\nClassifier Reborn is a general classifier module to allow Bayesian and other types of classifications.\nIt is a fork of [cardmagic/classifier](https://github.com/cardmagic/classifier) under more active development.\nCurrently, it has [Bayesian Classifier](https://en.wikipedia.org/wiki/Naive_Bayes_classifier) and [Latent Semantic Indexer (LSI)](https://en.wikipedia.org/wiki/Latent_semantic_analysis) implemented.\n\nHere is a quick illustration of the Bayesian 
classifier.\n\n```bash\n$ gem install classifier-reborn\n$ irb\nirb(main):001:0\u003e require 'classifier-reborn'\nirb(main):002:0\u003e classifier = ClassifierReborn::Bayes.new 'Ham', 'Spam'\nirb(main):003:0\u003e classifier.train \"Ham\", \"Sunday is a holiday. Say no to work on Sunday!\"\nirb(main):004:0\u003e classifier.train \"Spam\", \"You are the lucky winner! Claim your holiday prize.\"\nirb(main):005:0\u003e classifier.classify \"What's the plan for Sunday?\"\n#=\u003e \"Ham\"\n```\n\nNow, let's build an LSI, classify some text, and find a cluster of related documents.\n\n```bash\nirb(main):006:0\u003e lsi = ClassifierReborn::LSI.new\nirb(main):007:0\u003e lsi.add_item \"This text deals with dogs. Dogs.\", :dog\nirb(main):008:0\u003e lsi.add_item \"This text involves dogs too. Dogs!\", :dog\nirb(main):009:0\u003e lsi.add_item \"This text revolves around cats. Cats.\", :cat\nirb(main):010:0\u003e lsi.add_item \"This text also involves cats. Cats!\", :cat\nirb(main):011:0\u003e lsi.add_item \"This text involves birds. Birds.\", :bird\nirb(main):012:0\u003e lsi.classify \"This text is about dogs!\"\n#=\u003e :dog\nirb(main):013:0\u003e lsi.find_related(\"This text is around cats!\", 2)\n#=\u003e [\"This text revolves around cats. Cats.\", \"This text also involves cats. 
Cats!\"]\n```\n\nThere is much more that can be done using Bayes and LSI beyond these quick examples.\nFor more information, read the following documentation topics.\n\n* [Installation and Dependencies](https://jekyll.github.io/classifier-reborn/)\n* [Bayesian Classifier](https://jekyll.github.io/classifier-reborn/bayes)\n* [Latent Semantic Indexer (LSI)](https://jekyll.github.io/classifier-reborn/lsi)\n* [Classifier Validation](https://jekyll.github.io/classifier-reborn/validation)\n* [Development and Contributions](https://jekyll.github.io/classifier-reborn/development) (*Optional Docker instructions included*)\n\n### Notes on JRuby support\n\n```ruby\ngem 'classifier-reborn-jruby', platforms: :java\n```\n\nWhile experimental, this gem should work on JRuby without any additional changes. Unfortunately, you will **not** be able to use C bindings to GNU/GSL or similar performance-enhancing native code. Additionally, we do not use `fast_stemmer`, but rather [an implementation](https://tartarus.org/martin/PorterStemmer/java.txt) of the [Porter Stemming](https://tartarus.org/martin/PorterStemmer/) algorithm. Stemming will differ between MRI and JRuby; however, you may choose to [disable stemming](https://tartarus.org/martin/PorterStemmer/) and do your own manual preprocessing (or use some other [popular Java library](https://opennlp.apache.org/)). 
\n\nIf you encounter a problem, please submit your issue with `[JRuby]` in the title.\n\n## Code of Conduct\n\nIn order to have a more open and welcoming community, `Classifier Reborn` adheres to the `Jekyll`\n[code of conduct](https://github.com/jekyll/jekyll/blob/master/CODE_OF_CONDUCT.markdown) adapted from the `Ruby on Rails` code of conduct.\n\nPlease adhere to this code of conduct in any interactions you have in the `Classifier` community.\nIf you encounter someone violating these terms, please let [Chase Gilliam](https://github.com/Ch4s3) know and we will address it as soon as possible.\n\n## Authors and Contributors\n\n* [Lucas Carlson](mailto:lucas@rufy.com)\n* [David Fayram II](mailto:dfayram@gmail.com)\n* [Cameron McBride](mailto:cameron.mcbride@gmail.com)\n* [Ivan Acosta-Rubio](mailto:ivan@softwarecriollo.com)\n* [Parker Moore](mailto:email@byparker.com)\n* [Chase Gilliam](mailto:chase.gilliam@gmail.com)\n* and [many more](https://github.com/jekyll/classifier-reborn/graphs/contributors)...\n\nThe Classifier Reborn library is released under the terms of the [GNU LGPL-2.1](https://github.com/jekyll/classifier-reborn/blob/master/LICENSE).\n","funding_links":[],"categories":["Scientific","Ruby","Machine Learning Libraries"],"sub_categories":["Frameworks","Text-to-Speech-to-Text"],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fjekyll%2Fclassifier-reborn","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fjekyll%2Fclassifier-reborn","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fjekyll%2Fclassifier-reborn/lists"}