{"id":17166697,"url":"https://github.com/willf/segment_ruby","last_synced_at":"2025-07-26T15:32:10.731Z","repository":{"id":56894715,"uuid":"70846912","full_name":"willf/segment_ruby","owner":"willf","description":"SegmentRuby is a module for segmenting (English) text based on various language models.","archived":false,"fork":false,"pushed_at":"2016-11-02T23:55:19.000Z","size":11086,"stargazers_count":2,"open_issues_count":0,"forks_count":1,"subscribers_count":2,"default_branch":"master","last_synced_at":"2024-12-01T13:45:29.833Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":"","language":"Ruby","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/willf.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2016-10-13T20:47:45.000Z","updated_at":"2021-08-17T19:16:17.000Z","dependencies_parsed_at":"2022-08-21T01:20:33.818Z","dependency_job_id":null,"html_url":"https://github.com/willf/segment_ruby","commit_stats":null,"previous_names":[],"tags_count":4,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/willf%2Fsegment_ruby","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/willf%2Fsegment_ruby/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/willf%2Fsegment_ruby/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/willf%2Fsegment_ruby/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/willf","download_url":"https://codeload.github.com/willf/segment_ruby/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github
","repositories_count":227691977,"owners_count":17805174,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-10-14T23:06:26.179Z","updated_at":"2024-12-02T07:41:09.782Z","avatar_url":"https://github.com/willf.png","language":"Ruby","funding_links":[],"categories":[],"sub_categories":[],"readme":"# SegmentRuby\n\nSegmentRuby is a module for segmenting (English)\ntext based on various language models.\n\n## Installation\n\nAdd this line to your application's Gemfile:\n\n```ruby\ngem 'segment_ruby'\n```\n\nAnd then execute:\n\n    $ bundle\n\nOr install it yourself as:\n\n    $ gem install segment_ruby\n\n## Usage\n\n```ruby\nrequire 'segment_ruby'\nt = SegmentRuby::Analyzer.new(:twitter)\nt.segment(\"theboywholived\")\n# =\u003e [\"the\", \"boy\", \"who\", \"lived\"]\n```\n\nModels include:\n\n- `:norvig`: based on Google web data\n- `:google_books`: based on Google books data\n- `:anchor`: based on Web anchor text\n- `:twitter`: based on Twitter data\n- `:small`: smaller version of the Google books data\n- `:us_names`: US names, based on SSI data\n\nThe default model is `:small`. Use it if it seems to work for you.\n\n## Development\n\nAfter checking out the repo, run `bin/setup` to install dependencies. Then, run `rake spec` to run the tests. You can also run `bin/console` for an interactive prompt that will allow you to experiment.\n\nTo install this gem onto your local machine, run `bundle exec rake install`. 
To release a new version, update the version number in `version.rb`, and then run `bundle exec rake release`, which will create a git tag for the version, push git commits and tags, and push the `.gem` file to [rubygems.org](https://rubygems.org).\n\n## Contributing\n\nBug reports and pull requests are welcome on GitHub at https://github.com/willf/segment_ruby.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fwillf%2Fsegment_ruby","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fwillf%2Fsegment_ruby","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fwillf%2Fsegment_ruby/lists"}