{"id":27881782,"url":"https://github.com/src-d/vecino","last_synced_at":"2026-03-09T07:31:49.126Z","repository":{"id":62587171,"uuid":"94335054","full_name":"src-d/vecino","owner":"src-d","description":"Vecino is a command line application to discover Git repositories which are similar to the one that the user provides.","archived":false,"fork":false,"pushed_at":"2019-08-20T09:52:52.000Z","size":53,"stargazers_count":49,"open_issues_count":2,"forks_count":13,"subscribers_count":10,"default_branch":"master","last_synced_at":"2025-11-28T12:12:29.005Z","etag":null,"topics":["babelfish","git"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"other","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/src-d.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":"CONTRIBUTING.md","funding":null,"license":"LICENSE.md","code_of_conduct":"CODE_OF_CONDUCT.md","threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2017-06-14T13:39:03.000Z","updated_at":"2025-02-03T21:50:55.000Z","dependencies_parsed_at":"2022-11-03T22:41:44.993Z","dependency_job_id":null,"html_url":"https://github.com/src-d/vecino","commit_stats":null,"previous_names":[],"tags_count":7,"template":false,"template_full_name":null,"purl":"pkg:github/src-d/vecino","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/src-d%2Fvecino","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/src-d%2Fvecino/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/src-d%2Fvecino/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/src-d%2Fvecino/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/src-d","download_url":"https://codeload.github.com/src-d/vecino/tar.gz/refs/heads/mast
er","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/src-d%2Fvecino/sbom","scorecard":{"id":843557,"data":{"date":"2025-08-11","repo":{"name":"github.com/src-d/vecino","commit":"80bde1f2213790061ff76238877c7f9934b0dc01"},"scorecard":{"version":"v5.2.1-40-gf6ed084d","commit":"f6ed084d17c9236477efd66e5b258b9d4cc7b389"},"score":3.2,"checks":[{"name":"Code-Review","score":4,"reason":"Found 7/17 approved changesets -- score normalized to 4","details":null,"documentation":{"short":"Determines if the project requires human code review before pull requests (aka merge requests) are merged.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#code-review"}},{"name":"Packaging","score":-1,"reason":"packaging workflow not detected","details":["Warn: no GitHub/GitLab publishing workflow detected."],"documentation":{"short":"Determines if the project is published as a package that others can easily download, install, easily update, and uninstall.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#packaging"}},{"name":"Maintained","score":0,"reason":"0 commit(s) and 0 issue activity found in the last 90 days -- score normalized to 0","details":null,"documentation":{"short":"Determines if the project is \"actively maintained\".","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#maintained"}},{"name":"Dangerous-Workflow","score":-1,"reason":"no workflows found","details":null,"documentation":{"short":"Determines if the project's GitHub Action workflows avoid dangerous patterns.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#dangerous-workflow"}},{"name":"Token-Permissions","score":-1,"reason":"No tokens found","details":null,"documentation":{"short":"Determines if the project's workflows follow the principle of least 
privilege.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#token-permissions"}},{"name":"Binary-Artifacts","score":10,"reason":"no binaries found in the repo","details":null,"documentation":{"short":"Determines if the project has generated executable (binary) artifacts in the source repository.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#binary-artifacts"}},{"name":"CII-Best-Practices","score":0,"reason":"no effort to earn an OpenSSF best practices badge detected","details":null,"documentation":{"short":"Determines if the project has an OpenSSF (formerly CII) Best Practices Badge.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#cii-best-practices"}},{"name":"Security-Policy","score":0,"reason":"security policy file not detected","details":["Warn: no security policy file detected","Warn: no security file to analyze","Warn: no security file to analyze","Warn: no security file to analyze"],"documentation":{"short":"Determines if the project has published a security policy.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#security-policy"}},{"name":"Vulnerabilities","score":10,"reason":"0 existing vulnerabilities detected","details":null,"documentation":{"short":"Determines if the project has open, known unfixed vulnerabilities.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#vulnerabilities"}},{"name":"Fuzzing","score":0,"reason":"project is not fuzzed","details":["Warn: no fuzzer integrations found"],"documentation":{"short":"Determines if the project uses fuzzing.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#fuzzing"}},{"name":"License","score":9,"reason":"license file detected","details":["Info: project has a license file: 
LICENSE.md:0","Warn: project license file does not contain an FSF or OSI license."],"documentation":{"short":"Determines if the project has defined a license.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#license"}},{"name":"Signed-Releases","score":-1,"reason":"no releases found","details":null,"documentation":{"short":"Determines if the project cryptographically signs release artifacts.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#signed-releases"}},{"name":"Pinned-Dependencies","score":0,"reason":"dependency not pinned by hash detected -- score normalized to 0","details":["Warn: containerImage not pinned by hash: Dockerfile:1: pin your Docker image by updating ubuntu:18.04 to ubuntu:18.04@sha256:152dc042452c496007f07ca9127571cb9c29697f42acbfad72324b2bb2e43c98","Warn: containerImage not pinned by hash: reference/Dockerfile:1: pin your Docker image by updating ubuntu:16.04 to ubuntu:16.04@sha256:1f1a2d56de1d604801a9671f301190704c25d604a416f59e03c04f5c6ffee0d6","Warn: downloadThenRun not pinned by hash: Dockerfile:5-13","Warn: pipCommand not pinned by hash: Dockerfile:5-13","Warn: pipCommand not pinned by hash: Dockerfile:16","Warn: downloadThenRun not pinned by hash: reference/Dockerfile:5-18","Warn: pipCommand not pinned by hash: reference/Dockerfile:5-18","Info:   0 out of   2 containerImage dependencies pinned","Info:   0 out of   2 downloadThenRun dependencies pinned","Info:   0 out of   3 pipCommand dependencies pinned"],"documentation":{"short":"Determines if the project has declared and pinned the dependencies of its build process.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#pinned-dependencies"}},{"name":"Branch-Protection","score":0,"reason":"branch protection not enabled on development/release branches","details":["Warn: branch protection not enabled for branch 
'master'"],"documentation":{"short":"Determines if the default and release branches are protected with GitHub's branch protection settings.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#branch-protection"}},{"name":"SAST","score":0,"reason":"SAST tool is not run on all commits -- score normalized to 0","details":["Warn: 0 commits out of 22 are checked with a SAST tool"],"documentation":{"short":"Determines if the project uses static code analysis.","url":"https://github.com/ossf/scorecard/blob/f6ed084d17c9236477efd66e5b258b9d4cc7b389/docs/checks.md#sast"}}]},"last_synced_at":"2025-08-23T20:58:23.378Z","repository_id":62587171,"created_at":"2025-08-23T20:58:23.379Z","updated_at":"2025-08-23T20:58:23.379Z"},"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":30287425,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-03-09T02:57:19.223Z","status":"ssl_error","status_checked_at":"2026-03-09T02:56:26.373Z","response_time":61,"last_error":"SSL_read: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["babelfish","git"],"created_at":"2025-05-05T05:05:08.631Z","updated_at":"2026-03-09T07:31:49.101Z","avatar_url":"https://github.com/src-d.png","language":"Python","readme":"# Vecino [![Build Status](https://travis-ci.org/src-d/vecino.svg)](https://travis-ci.org/src-d/vecino) 
[![codecov](https://codecov.io/github/src-d/vecino/coverage.svg?branch=master)](https://codecov.io/gh/src-d/vecino) [![PyPI](https://img.shields.io/pypi/v/vecino.svg)](https://pypi.python.org/pypi/vecino)\n\nVecino is a command line application to discover Git repositories which are similar\nto the one that the user provides.\n\n```\n$ vecino https://github.com/apache/spark\n...\n                                    apache/spark\t4.07\n                                   amplab/graphx\t5.80\n                               EclairJS/eclairjs\t5.84\n                       EclairJS/eclairjs-nashorn\t5.87\n                                 cloudera/impyla\t6.01\n                           databricks/spark-perf\t6.26\n                                forward3d/rbhive\t6.29\n                                     apache/hive\t6.29\n                              ondra-m/ruby-spark\t6.31\n                        SnappyDataInc/snappydata\t6.31\n```\n\nFinding related open source software can be hard. Sometimes using a search engine is not enough.\nOne reliable way to find projects that are close to yours is to look into\nthe source code and let it judge. Vecino defines similarity through matching or synonymous\nsource code identifiers.\n\nVecino uses id2vec, source{d}'s source code identifier embeddings, and much of the\n[ast2vec](https://github.com/src-d/ast2vec) engine. Parsing is performed with [Babelfish](http://doc.bblf.sh).\nThe suggested repositories are taken from the loaded NBOW model - the only one currently available\nis from October 2016.\n\n### Please note\n\nThe currently available public models were converted, are outdated, and are not fully compatible with\nthe preprocessing in ast2vec. Thus the results can be imprecise. 
The original results can be reproduced in\nthe [reference notebook](reference/nearest_repos.ipynb).\n\nIn addition, because Babelfish currently supports only Python and Java, it is impossible to query\nrepositories written in other languages.\n\n### Installation\n\n```\npip3 install vecino\n```\n\nAs with the other ML projects at source{d}, only Python 3 is supported; Python 2 never will be.\n\n### Usage\n\nCommand line:\n\n```\n$ vecino apache/spark\n```\n\nPython API:\n\n```python\nimport vecino\n\nengine = vecino.SimilarRepositories()\nprint(engine.query(\"https://github.com/apache/spark\"))\n```\n\n### Docker image\n\n```\ndocker build -t srcd/vecino .\ndocker run -d --privileged -p 9432:9432 --name bblfshd bblfsh/bblfshd\ndocker exec -it bblfshd bblfshctl driver install --all\ndocker run -it --rm srcd/vecino https://github.com/apache/spark\n```\n\nIn order to cache the downloaded models:\n\n```\ndocker run -it --rm -v /path/to/cache/on/host:/root srcd/vecino https://github.com/apache/spark\n```\n\n### Contributions\n\n...are welcome! See [CONTRIBUTING](CONTRIBUTING.md) and [code of conduct](CODE_OF_CONDUCT.md).\n\n### License\n\n[Apache 2.0](LICENSE.md)\n","funding_links":[],"categories":["Software"],"sub_categories":[],"project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fsrc-d%2Fvecino","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fsrc-d%2Fvecino","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fsrc-d%2Fvecino/lists"}