{"id":32955186,"url":"https://github.com/aurelian/ruby-stemmer","last_synced_at":"2025-11-12T22:00:56.880Z","repository":{"id":444520,"uuid":"66944","full_name":"aurelian/ruby-stemmer","owner":"aurelian","description":"Expose libstemmer_c to Ruby","archived":true,"fork":false,"pushed_at":"2022-05-12T10:57:26.000Z","size":191,"stargazers_count":250,"open_issues_count":7,"forks_count":22,"subscribers_count":5,"default_branch":"master","last_synced_at":"2025-11-08T07:11:55.873Z","etag":null,"topics":["c","ruby","ruby-extension","rubynpl","stemmer"],"latest_commit_sha":null,"homepage":"http://locknet.ro/archive/2009-10-29-ann-ruby-stemmer.html","language":"C","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/aurelian.png","metadata":{"files":{"readme":"README.rdoc","changelog":null,"contributing":null,"funding":null,"license":"MIT-LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2008-10-23T20:02:59.000Z","updated_at":"2025-05-04T15:16:40.000Z","dependencies_parsed_at":"2022-07-04T16:31:20.177Z","dependency_job_id":null,"html_url":"https://github.com/aurelian/ruby-stemmer","commit_stats":null,"previous_names":[],"tags_count":9,"template":false,"template_full_name":null,"purl":"pkg:github/aurelian/ruby-stemmer","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/aurelian%2Fruby-stemmer","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/aurelian%2Fruby-stemmer/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/aurelian%2Fruby-stemmer/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/aurelian%2Fruby-stemmer/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/aurelian","download_url":"https://codeload.github
.com/aurelian/ruby-stemmer/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/aurelian%2Fruby-stemmer/sbom","scorecard":{"id":216518,"data":{"date":"2025-08-11","repo":{"name":"github.com/aurelian/ruby-stemmer","commit":"05192b09d6652fbae5e8dc1ba92762c46444e971"},"scorecard":{"version":"v5.2.1-40-gf6ed084d","commit":"f6ed084d17c9236477efd66e5b258b9d4cc7b389"},"score":3.2,"checks":[{"name":"Packaging","score":-1,"reason":"packaging workflow not detected","details":["Warn: no GitHub/GitLab publishing workflow detected."],"documentation":{"short":"Determines if the project is published as a package that others can easily download, install, easily update, and uninstall.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#packaging"}},{"name":"Code-Review","score":1,"reason":"Found 3/17 approved changesets -- score normalized to 1","details":null,"documentation":{"short":"Determines if the project requires human code review before pull requests (aka merge requests) are merged.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#code-review"}},{"name":"Maintained","score":0,"reason":"project is archived","details":["Warn: Repository is archived."],"documentation":{"short":"Determines if the project is \"actively maintained\".","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#maintained"}},{"name":"Token-Permissions","score":-1,"reason":"No tokens found","details":null,"documentation":{"short":"Determines if the project's workflows follow the principle of least privilege.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#token-permissions"}},{"name":"Dangerous-Workflow","score":-1,"reason":"no workflows found","details":null,"documentation":{"short":"Determines if the project's GitHub Action workflows avoid dangerous 
patterns.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#dangerous-workflow"}},{"name":"Binary-Artifacts","score":10,"reason":"no binaries found in the repo","details":null,"documentation":{"short":"Determines if the project has generated executable (binary) artifacts in the source repository.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#binary-artifacts"}},{"name":"Pinned-Dependencies","score":-1,"reason":"no dependencies found","details":null,"documentation":{"short":"Determines if the project has declared and pinned the dependencies of its build process.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#pinned-dependencies"}},{"name":"CII-Best-Practices","score":0,"reason":"no effort to earn an OpenSSF best practices badge detected","details":null,"documentation":{"short":"Determines if the project has an OpenSSF (formerly CII) Best Practices Badge.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#cii-best-practices"}},{"name":"Security-Policy","score":0,"reason":"security policy file not detected","details":["Warn: no security policy file detected","Warn: no security file to analyze","Warn: no security file to analyze","Warn: no security file to analyze"],"documentation":{"short":"Determines if the project has published a security policy.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#security-policy"}},{"name":"Fuzzing","score":0,"reason":"project is not fuzzed","details":["Warn: no fuzzer integrations found"],"documentation":{"short":"Determines if the project uses fuzzing.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#fuzzing"}},{"name":"Vulnerabilities","score":10,"reason":"0 existing vulnerabilities 
detected","details":null,"documentation":{"short":"Determines if the project has open, known unfixed vulnerabilities.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#vulnerabilities"}},{"name":"License","score":10,"reason":"license file detected","details":["Info: project has a license file: MIT-LICENSE:0","Info: FSF or OSI recognized license: MIT License: MIT-LICENSE:0"],"documentation":{"short":"Determines if the project has defined a license.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#license"}},{"name":"Signed-Releases","score":-1,"reason":"no releases found","details":null,"documentation":{"short":"Determines if the project cryptographically signs release artifacts.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#signed-releases"}},{"name":"Branch-Protection","score":0,"reason":"branch protection not enabled on development/release branches","details":["Warn: branch protection not enabled for branch 'master'"],"documentation":{"short":"Determines if the default and release branches are protected with GitHub's branch protection settings.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#branch-protection"}},{"name":"SAST","score":0,"reason":"SAST tool is not run on all commits -- score normalized to 0","details":["Warn: 0 commits out of 21 are checked with a SAST tool"],"documentation":{"short":"Determines if the project uses static code 
analysis.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#sast"}}]},"last_synced_at":"2025-08-17T01:44:48.521Z","repository_id":444520,"created_at":"2025-08-17T01:44:48.521Z","updated_at":"2025-08-17T01:44:48.521Z"},"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":283961754,"owners_count":26923662,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-11-11T02:00:06.610Z","response_time":65,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["c","ruby","ruby-extension","rubynpl","stemmer"],"created_at":"2025-11-12T22:00:41.102Z","updated_at":"2025-11-12T22:00:56.872Z","avatar_url":"https://github.com/aurelian.png","language":"C","readme":"= Notice @aurelian May 2022\n\n👋 This project started in 2008 mostly as a means for me to learn how to build C extensions for Ruby, exposing a library that I needed at the time for a real-life project.\nIt's 2022 and many things have changed since. 
Most important is my lack of time to keep up with recent libstemmer_c versions and to release builds compatible with various versions of Windows.\n\nWith this in mind, it is fair to archive this project.\n\n= Ruby Stemmer\n\nRuby-Stemmer exposes the SnowBall API to Ruby.\n\n{Travis CI Status}[https://api.travis-ci.org/aurelian/ruby-stemmer.png]\n\nThis package includes the libstemmer_c library, released under the BSD licence and available for free {here}[https://snowballstem.org/download.html].\n\nSupport for the Latin language is also included; it was generated with the snowball compiler using the {schinke contribution}[https://snowballstem.org/otherapps/schinke/].\n\nFor more details about libstemmer_c, please visit the {SnowBall website}[https://snowballstem.org/].\n\n== Usage\n\n  require 'rubygems'\n  require 'lingua/stemmer'\n\n  stemmer = Lingua::Stemmer.new(:language =\u003e \"ro\")\n  stemmer.stem(\"netăgăduit\") #=\u003e netăgădu\n\n=== Alternative\n\n  require 'rubygems'\n  require 'lingua/stemmer'\n\n  Lingua.stemmer( %w(incontestabil neîndoielnic), :language =\u003e \"ro\" ) #=\u003e [\"incontest\", \"neîndoieln\"]\n  Lingua.stemmer(\"installation\") #=\u003e \"instal\"\n  Lingua.stemmer(\"installation\", :language =\u003e \"fr\", :encoding =\u003e \"ISO_8859_1\") do | word |\n    puts \"~\u003e #{word}\" #=\u003e \"instal\"\n  end # =\u003e #\u003cLingua::Stemmer:0x102501e48\u003e\n\n=== Gemfile\n\n  gem 'ruby-stemmer', '\u003e=2.0.0', :require =\u003e 'lingua/stemmer'\n\n=== More details\n\n* Complete API in {RDoc format}[http://rdoc.info/github/aurelian/ruby-stemmer/master/frames]\n* More usage examples in the {test file}[https://github.com/aurelian/ruby-stemmer/blob/master/test/lingua/test_stemmer.rb]\n\n== Install\n\n gem install ruby-stemmer\n\n==== Windows\n\nThere's also a Windows build (fat binary):\n\n gem install ruby-stemmer --platform=x86-mingw32\n\nAs far as I know, the above should work with {rubyinstaller}[http://rubyinstaller.org/]. 
If it fails, you could try:\n\n gem install ruby-stemmer --platform=x86-mswin32\n\n{It's known}[https://cl.ly/BX9o] to work under Windows XP.\n\n=== Development version\n\n  $ git clone git://github.com/aurelian/ruby-stemmer.git\n  $ cd ruby-stemmer\n  $ rake -T #\u003c== see what we've got\n  $ rake compile #\u003c== builds the extension do'h\n  $ rake test\n\n==== Cross Compiling\n\nInstall {rake-compiler-dock}[https://github.com/rake-compiler/rake-compiler-dock] and follow the setup.\n\nThen, inside the Docker image:\n\n  $ AR=i686-w64-mingw32-ar CC=i686-w64-mingw32-gcc LD=i686-w64-mingw32-ld rake cross native gem\n\nOr, build the lib first, then compile:\n\n  $ cd libstemmer_c\n  $ AR=i686-w64-mingw32-ar CC=i686-w64-mingw32-gcc LD=i686-w64-mingw32-ld make\n  $ cd ../\n  $ rake cross native gem\n\n== NOT A BUG\n\nStemming is an algorithm that finds the stem of a word, not its root.\nFor further reference on stem vs. root, please check the Wikipedia articles on the topic:\n\n* https://en.wikipedia.org/wiki/Word_stem\n* https://en.wikipedia.org/wiki/Root_(linguistics)\n\n== TODO\n\n* {Open issues}[https://github.com/aurelian/ruby-stemmer/issues]\n\n== Note on Patches/Pull Requests\n \n* Fork the project from {github}[https://github.com/aurelian/ruby-stemmer]\n* Make your feature addition or {bug fix}[https://github.com/aurelian/ruby-stemmer/issues]\n* Add tests for it. This is important so I don't break it in a\n  future version unintentionally.\n* Commit, but do not mess with the Rakefile, version, or history.\n\n  If you want to have your own version, that is fine, but\n  bump the version in a commit by itself so I can ignore it when I pull.\n* Send me a pull request. 
Bonus points for topic branches.\n\n== Alternative Stemmers for Ruby\n\n* {stemmer4r}[https://rubygems.org/gems/stemmer4r] (ext)\n* {fast-stemmer}[https://rubygems.org/gems/fast-stemmer] (ext)\n* {uea-stemmer}[https://rubygems.org/gems/uea-stemmer] (ext)\n* {stemmer}[https://rubygems.org/gems/stemmer] (pure ruby)\n* add yours\n\n== Copyright\n\nCopyright (c) 2008-2020 {Aurelian Oancea}[http://locknet.ro]. See MIT-LICENSE for details.\n\n== Contributors\n\n* {Aurelian Oancea}[https://github.com/aurelian]\n* {Yury Korolev}[https://github.com/yury] - various bug fixes\n* {Aaron Patterson}[https://github.com/tenderlove] - rake compiler (windows support), code cleanup\n* {Damián Silvani}[https://github.com/munshkr] - Ruby 1.9 encoding\n\n# encoding: utf-8\n","funding_links":[],"categories":["Data Processing and ETL","Ruby","[](https://github.com/josephmisiti/awesome-machine-learning/blob/master/README.md#ruby)Ruby","NLP Pipeline Subtasks"],"sub_categories":["General-Purpose Machine Learning","Lexical Processing"],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Faurelian%2Fruby-stemmer","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Faurelian%2Fruby-stemmer","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Faurelian%2Fruby-stemmer/lists"}