{"id":16905589,"url":"https://github.com/vibhoothi/vsm","last_synced_at":"2025-03-20T16:21:34.416Z","repository":{"id":120169901,"uuid":"184474223","full_name":"vibhoothi/vsm","owner":"vibhoothi","description":"Vector Space Model Calculation using NLTK ","archived":false,"fork":false,"pushed_at":"2019-05-02T09:46:18.000Z","size":28,"stargazers_count":1,"open_issues_count":0,"forks_count":0,"subscribers_count":3,"default_branch":"master","last_synced_at":"2025-01-25T15:13:03.080Z","etag":null,"topics":["information-retrieval","vector-space-model"],"latest_commit_sha":null,"homepage":null,"language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/vibhoothi.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2019-05-01T20:02:52.000Z","updated_at":"2019-10-23T10:00:53.000Z","dependencies_parsed_at":null,"dependency_job_id":"98de1e13-5335-4c7a-a60a-775b9ca04a3c","html_url":"https://github.com/vibhoothi/vsm","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/vibhoothi%2Fvsm","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/vibhoothi%2Fvsm/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/vibhoothi%2Fvsm/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/vibhoothi%2Fvsm/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/vibhoothi","download_url":"https://codeload.github.com/vibhoothi/vsm/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":244647093,"owners_count":20487010,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["information-retrieval","vector-space-model"],"created_at":"2024-10-13T18:38:59.019Z","updated_at":"2025-03-20T16:21:34.393Z","avatar_url":"https://github.com/vibhoothi.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# Vector Space Model\n\n## What are different types of models in Information Retrival \nThere are two types of models\n* Boolean Retrival \n* Vector Space Model\n\n## Disadvantages of Boolean Retrival Model \n  * Similarity function is boolean\n    * Exact-match only, no partial matches\n    * Retrieved documents not ranked\n  * All terms are equally important\n    * Boolean operator usage has much more influence than a critical word\n  * Query language is expressive but complicated\n\n## What is Vector Space Model \n* In Vector Space Model both Documents and queries are vectors each w(i,j) is a weight for term j in document i\n* \"bag-of-words representation\"\n* Similarity of a document vector to a query vector = cosine of the angle between them\n* Cosine is a normalized dot product\n* Documents ranked by decreasing cosine value\n* Formula is ![](https://i.imgur.com/wdlDQQd.png)\n * sim(d,q) = 1 when d = q\n * sim(d,q) = 0 when d and q share no terms\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fvibhoothi%2Fvsm","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fvibhoothi%2Fvsm","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fvibhoothi%2Fvsm/lists"}