Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/ansonyuu/matchmaking
Embedding space of names clustered based on their interests using the sentence-transformers all-MiniLM-L6-v2 model
https://github.com/ansonyuu/matchmaking
Last synced: 13 days ago
JSON representation
Embedding space of names clustered based on their interests using the sentence-transformers all-MiniLM-L6-v2 model
- Host: GitHub
- URL: https://github.com/ansonyuu/matchmaking
- Owner: ansonyuu
- Created: 2023-05-24T23:46:47.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2024-11-01T16:03:08.000Z (19 days ago)
- Last Synced: 2024-11-01T17:17:52.723Z (19 days ago)
- Language: Jupyter Notebook
- Homepage:
- Size: 480 KB
- Stars: 121
- Watchers: 1
- Forks: 9
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# Embedding Matchmaking
_"Words can't describe how unique your interests are... but coordinates can" - Sean Ashley, circa 2023_
A flattened embedding space of names clustered based on their interests using the sentence-transformers all-MiniLM-L6-v2 model. Created for the UW Startups S23 Kickoff event with guidance from [Jacky Zhao](https://jzhao.xyz/) and [Sean Ashley](https://www.linkedin.com/in/sean-ashley). [Simha Kalimipalli](https://github.com/Simha-Kalimipalli) later aded interactivity!
![Sample output of script](https://github.com/ansonyuu/matchmaking/blob/main/sample.png?raw=true)
## Instructions for use
1. Collect or format your data in the following format
| Name | What are your interests? (or varying permutations of this question) |
| ----- | ------------------------------------------------------------------- |
| Alice | I love being the universal placeholder for every CS joke ever |
| Bob | I too love being the universal placeholder for every CS joke |2. Clone the repository
3. Install all required packages using pip or conda:- `umap-learn`
- `scikit-learn`
- `scipy`
- `sentence-transformers`
- `matplotlib`
- `pyvis`
- `pandas`
- `numpy`
- `seaborn`
- `branca`4. Replace `attendees.csv` in `visualizer.ipynb` with the path to your downloaded data
5. Run all cells
6. Bask in the glory of having an awesome new poster
7. Make two (!) cool interactive visualizations