https://github.com/IndexStorm/git-rec-back
Backend code for GitHub Recommendation Extension
ai faiss flask machine-learning sbert
Last synced: 10 months ago
- Host: GitHub
- URL: https://github.com/IndexStorm/git-rec-back
- Owner: IndexStorm
- License: mit
- Created: 2022-08-08T12:01:33.000Z (almost 4 years ago)
- Default Branch: public
- Last Pushed: 2022-08-11T07:52:20.000Z (almost 4 years ago)
- Last Synced: 2024-11-22T22:33:18.233Z (over 1 year ago)
- Topics: ai, faiss, flask, machine-learning, sbert
- Language: Python
- Homepage: https://indexstorm.com/
- Size: 9.77 KB
- Stars: 28
- Watchers: 1
- Forks: 2
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE.md
README
# GitHub Recommendations Backend

This is the official repository for the backend of the GitHub Recommender extension.
Link to extension: Chrome and Firefox
Repository with extension code
## Motivation
We find it hard to explore the world of GitHub repositories and are afraid of missing out on useful repos. That is why we built this extension: to help people discover hidden GitHub gems.
Once you load a GitHub repo page, the extension adds a list of similar repositories based on their titles and descriptions. We do not collect any data, and the product is open source for everyone.
## Explanation
Using [SBERT](https://github.com/UKPLab/sentence-transformers) and scraped repository descriptions, we constructed vector embeddings for over 100,000 repositories. When the client sends the description of the current repo, we build a vector from it and search for the nearest embeddings in the vector space. You can read more about [vector cosine similarity](https://www.pinecone.io/learn/vector-embeddings/). The embeddings and scraped repository descriptions can be shared upon request.
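The lookup described above can be sketched as follows. This is a minimal, dependency-free illustration, not the code from this repository: a toy bag-of-words `embed` function stands in for the SBERT model, a brute-force scan stands in for the FAISS index, and the repository names and descriptions are made up for the example.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy stand-in for an SBERT model (assumption): a bag-of-words count vector."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as Counters."""
    dot = sum(a[token] * b[token] for token in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty description is similar to nothing
    return dot / (norm_a * norm_b)


def build_index(repos):
    """Embed every known description once, ahead of any query."""
    return [(name, embed(description)) for name, description in repos.items()]


def most_similar(index, description, k=3):
    """Embed the incoming description and return the k closest repo names."""
    query = embed(description)
    scored = [(cosine(query, vector), name) for name, vector in index]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [name for _, name in scored[:k]]


# Hypothetical corpus; the real one holds over 100,000 scraped descriptions.
REPOS = {
    "example/vector-search": "library for fast similarity search of dense vectors",
    "example/web-framework": "lightweight web framework for building http servers",
    "example/sentence-embeddings": "sentence embeddings for semantic text similarity",
}
INDEX = build_index(REPOS)

if __name__ == "__main__":
    print(most_similar(INDEX, "fast vector similarity search", k=2))
```

The structure is the part that carries over: descriptions are embedded once up front, and each request only embeds the query and ranks the stored vectors. A sentence-embedding model would additionally match descriptions that share meaning but not words, which this word-count stand-in cannot do.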
## Tech/frameworks used
The backend was built on **Python 3.9** using:
- [Flask](https://github.com/pallets/flask) for the server
- [SBERT](https://github.com/UKPLab/sentence-transformers) for building repository embeddings
- [FAISS](https://github.com/facebookresearch/faiss) for fast cosine similarity search
Made by indexStorm
Follow on Twitter @index_storm
Authors: