{"id":13609099,"url":"https://github.com/snowballstem/snowball","last_synced_at":"2025-10-20T12:58:35.892Z","repository":{"id":7082708,"uuid":"8372403","full_name":"snowballstem/snowball","owner":"snowballstem","description":"Snowball compiler and stemming algorithms","archived":false,"fork":false,"pushed_at":"2025-10-19T09:47:15.000Z","size":2545,"stargazers_count":812,"open_issues_count":19,"forks_count":189,"subscribers_count":37,"default_branch":"master","last_synced_at":"2025-10-20T12:58:28.113Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":"https://snowballstem.org/","language":"C","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"bsd-3-clause","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/snowballstem.png","metadata":{"files":{"readme":"README.rst","changelog":"NEWS","contributing":"CONTRIBUTING.rst","funding":null,"license":"COPYING","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":"AUTHORS","dei":null,"publiccode":null,"codemeta":null,"zenodo":null,"notice":null,"maintainers":null,"copyright":null,"agents":null,"dco":null,"cla":null}},"created_at":"2013-02-23T07:17:42.000Z","updated_at":"2025-10-19T19:40:40.000Z","dependencies_parsed_at":"2024-01-15T15:08:52.025Z","dependency_job_id":"dfa08cdc-a5e1-4bd2-afcd-bab12a3d6174","html_url":"https://github.com/snowballstem/snowball","commit_stats":{"total_commits":1044,"total_committers":36,"mean_commits":29.0,"dds":0.3132183908045977,"last_synced_commit":"a749dfa9fec90b9081b0902eadd4e22bd2b4b8a7"},"previous_names":[],"tags_count":5,"template":false,"template_full_name":null,"purl":"pkg:github/snowballstem/snowball","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/snowballstem%2Fsnowball","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/snowballstem%2Fsnowball/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/snowballstem%2Fsnowball/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/snowballstem%2Fsnowball/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/snowballstem","download_url":"https://codeload.github.com/snowballstem/snowball/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/snowballstem%2Fsnowball/sbom","scorecard":{"id":834867,"data":{"date":"2025-08-11","repo":{"name":"github.com/snowballstem/snowball","commit":"1cbef80e6b120e3332a53c788c1ac1a579270cb0"},"scorecard":{"version":"v5.2.1-40-gf6ed084d","commit":"f6ed084d17c9236477efd66e5b258b9d4cc7b389"},"score":4.4,"checks":[{"name":"Dangerous-Workflow","score":10,"reason":"no dangerous workflow patterns detected","details":null,"documentation":{"short":"Determines if the project's GitHub Action workflows avoid dangerous patterns.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#dangerous-workflow"}},{"name":"Packaging","score":-1,"reason":"packaging workflow not detected","details":["Warn: no GitHub/GitLab publishing workflow detected."],"documentation":{"short":"Determines if the project is published as a package that others can easily download, install, easily update, and uninstall.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#packaging"}},{"name":"Code-Review","score":0,"reason":"Found 1/30 approved changesets -- score normalized to 0","details":null,"documentation":{"short":"Determines if the project requires human code review before pull requests (aka merge requests) are merged.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#code-review"}},{"name":"Maintained","score":10,"reason":"30 commit(s) and 11 issue activity found in the last 90 days -- score normalized to 10","details":null,"documentation":{"short":"Determines if the project is \"actively maintained\".","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#maintained"}},{"name":"Binary-Artifacts","score":10,"reason":"no binaries found in the repo","details":null,"documentation":{"short":"Determines if the project has generated executable (binary) artifacts in the source repository.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#binary-artifacts"}},{"name":"Token-Permissions","score":0,"reason":"detected GitHub workflow tokens with excessive permissions","details":["Warn: no topLevel permission defined: .github/workflows/ci.yml:1","Info: no jobLevel write permissions found"],"documentation":{"short":"Determines if the project's workflows follow the principle of least privilege.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#token-permissions"}},{"name":"CII-Best-Practices","score":0,"reason":"no effort to earn an OpenSSF best practices badge detected","details":null,"documentation":{"short":"Determines if the project has an OpenSSF (formerly CII) Best Practices Badge.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#cii-best-practices"}},{"name":"Pinned-Dependencies","score":0,"reason":"dependency not pinned by hash detected -- score normalized to 0","details":["Warn: GitHub-owned GitHubAction not pinned by hash: .github/workflows/ci.yml:113: update your workflow using https://app.stepsecurity.io/secureworkflow/snowballstem/snowball/ci.yml/master?enable=pin","Warn: third-party GitHubAction not pinned by hash: .github/workflows/ci.yml:160: update your workflow using https://app.stepsecurity.io/secureworkflow/snowballstem/snowball/ci.yml/master?enable=pin","Warn: third-party GitHubAction not pinned by hash: .github/workflows/ci.yml:169: update your workflow using https://app.stepsecurity.io/secureworkflow/snowballstem/snowball/ci.yml/master?enable=pin","Warn: third-party GitHubAction not pinned by hash: .github/workflows/ci.yml:175: update your workflow using https://app.stepsecurity.io/secureworkflow/snowballstem/snowball/ci.yml/master?enable=pin","Warn: GitHub-owned GitHubAction not pinned by hash: .github/workflows/ci.yml:194: update your workflow using https://app.stepsecurity.io/secureworkflow/snowballstem/snowball/ci.yml/master?enable=pin","Warn: pipCommand not pinned by hash: .github/workflows/ci.yml:184","Info:   0 out of   2 GitHub-owned GitHubAction dependencies pinned","Info:   0 out of   3 third-party GitHubAction dependencies pinned","Info:   0 out of   1 pipCommand dependencies pinned"],"documentation":{"short":"Determines if the project has declared and pinned the dependencies of its build process.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#pinned-dependencies"}},{"name":"Vulnerabilities","score":10,"reason":"0 existing vulnerabilities detected","details":null,"documentation":{"short":"Determines if the project has open, known unfixed vulnerabilities.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#vulnerabilities"}},{"name":"Security-Policy","score":0,"reason":"security policy file not detected","details":["Warn: no security policy file detected","Warn: no security file to analyze","Warn: no security file to analyze","Warn: no security file to analyze"],"documentation":{"short":"Determines if the project has published a security policy.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#security-policy"}},{"name":"License","score":10,"reason":"license file detected","details":["Info: project has a license file: COPYING:0","Info: FSF or OSI recognized license: BSD 3-Clause \"New\" or \"Revised\" License: COPYING:0"],"documentation":{"short":"Determines if the project has defined a license.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#license"}},{"name":"Fuzzing","score":0,"reason":"project is not fuzzed","details":["Warn: no fuzzer integrations found"],"documentation":{"short":"Determines if the project uses fuzzing.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#fuzzing"}},{"name":"Signed-Releases","score":-1,"reason":"no releases found","details":null,"documentation":{"short":"Determines if the project cryptographically signs release artifacts.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#signed-releases"}},{"name":"Branch-Protection","score":0,"reason":"branch protection not enabled on development/release branches","details":["Warn: branch protection not enabled for branch 'master'"],"documentation":{"short":"Determines if the default and release branches are protected with GitHub's branch protection settings.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#branch-protection"}},{"name":"SAST","score":0,"reason":"SAST tool is not run on all commits -- score normalized to 0","details":["Warn: 0 commits out of 1 are checked with a SAST tool"],"documentation":{"short":"Determines if the project uses static code analysis.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#sast"}}]},"last_synced_at":"2025-08-23T18:43:52.436Z","repository_id":7082708,"created_at":"2025-08-23T18:43:52.436Z","updated_at":"2025-08-23T18:43:52.436Z"},"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":280094904,"owners_count":26271003,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-10-20T02:00:06.978Z","response_time":62,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-08-01T19:01:32.432Z","updated_at":"2025-10-20T12:58:35.887Z","avatar_url":"https://github.com/snowballstem.png","language":"C","funding_links":[],"categories":["C","文本数据和NLP","Feature Extraction","🔧 Utilities \u0026 Miscellaneous"],"sub_categories":["Text/NLP"],"readme":"Snowball is a small string processing language for creating stemming algorithms\nfor use in Information Retrieval, plus a collection of stemming algorithms\nimplemented using it.\n\nSnowball was originally designed and built by Martin Porter.  Martin retired\nfrom development in 2014 and Snowball is now maintained as a community project.\nMartin originally chose the name Snowball as a tribute to SNOBOL, the excellent\nstring handling language from the 1960s.  It now also serves as a metaphor for\nhow the project grows by gathering contributions over time.\n\nThe Snowball compiler translates a Snowball program into source code in another\nlanguage - currently Ada, ISO C, C#, Dart, Go, Java, Javascript, Object Pascal,\nPHP, Python and Rust are supported.\n\nThis repository contains the source code for the snowball compiler and the\nstemming algorithms.  The snowball compiler is written in ISO C - you'll need\na C compiler which support C99 to build it (but the C code it generates should\nwork with any ISO C compiler).\n\nSee https://snowballstem.org/ for more information about Snowball.\n\nWhat is Stemming?\n=================\n\nStemming maps different forms of the same word to a common \"stem\" - for\nexample, the English stemmer maps *connection*, *connections*, *connective*,\n*connected*, and *connecting* to *connect*.  So a search for *connected*\nwould also find documents which only have the other forms.\n\nThis stem form is often a word itself, but this is not always the case as this\nis not a requirement for text search systems, which are the intended field of\nuse.  We also aim to conflate words with the same meaning, rather than all\nwords with a common linguistic root (so *awe* and *awful* don't have the same\nstem), and over-stemming is more problematic than under-stemming so we tend not\nto stem in cases that are hard to resolve.  If you want to always reduce words\nto a root form and/or get a root form which is itself a word then Snowball's\nstemming algorithms likely aren't the right answer.\n\nBuilding Snowball\n=================\n\nGNU make is required to build Snowball.\n\nThe build system is currently structured as two separate stages for many of the\ntarget languages.\n\nThe first stage builds the Snowball compiler and runs it to create target\nlanguage code (and it can also run tests on each stemmer).  The expectation is\nthat you then create a \"distribution\" tarballs of this code with ``make dist``\n(or to create one for a specific target language, e.g.  ``make\ndist_libstemmer_c`` for C).  These tarballs are created in the ``dist/``\nsubdirectory.\n\nTo actually build the libstemmer library you then unpack and build the\ndistribution tarball, e.g. for C::\n\n    tar xf dist/libstemmer_c-3.0.0.tar.gz\n    cd libstemmer_c-3.0.0\n    make\n\nCross-compiling\n---------------\n\nIf cross-compiling starting from the git repo, the Snowball compiler needs to\nbe built with a native compiler then libstemmer with the cross-compiler.  For\nexample::\n\n    make CC=cc dist_libstemmer_c\n    tar xf dist/libstemmer_c-3.0.0.tar.gz\n    cd libstemmer_c-3.0.0\n    make CC=riscv64-unknown-linux-gnu-gcc\n\nIf you are cross-compiling to or from Microsoft Windows, you'll need to also\nwork around an assumption in libstemmer's ``Makefile`` which sets ``EXEEXT``\nbased on the OS you are building on::\n\n    ifeq ($(OS),Windows_NT)\n    EXEEXT=.exe\n    endif\n\nFor example, if cross-compiling from Linux to Microsoft Windows, use something\nlike this for the libstemmer build::\n\n    make CC=x86_64-w64-mingw32-gcc EXEEXT=.exe\n\nWhen going the other way, you'll need to use ``EXEEXT=``.\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fsnowballstem%2Fsnowball","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fsnowballstem%2Fsnowball","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fsnowballstem%2Fsnowball/lists"}