https://github.com/chanind/reddit-words
What have Spacy's sense2vec 2019 word vectors learned from Reddit?
- Host: GitHub
- URL: https://github.com/chanind/reddit-words
- Owner: chanind
- License: mit
- Created: 2021-05-25T19:23:18.000Z (over 3 years ago)
- Default Branch: main
- Last Pushed: 2022-11-20T18:40:16.000Z (about 2 years ago)
- Last Synced: 2024-10-16T11:18:37.832Z (3 months ago)
- Topics: sense2vec, spacy, spacy-nlp, word2vec
- Language: TypeScript
- Homepage: https://chanind.github.io/reddit-words/
- Size: 1.82 MB
- Stars: 3
- Watchers: 3
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
README
# reddit-words
What have Spacy's sense2vec 2019 word vectors learned from Reddit?
Live demo: [chanind.github.io/reddit-words](https://chanind.github.io/reddit-words)
## How does this work?
This project examines what the spaCy sense2vec Reddit word vectors have learned about words. It takes two words that differ along a single understandable dimension, like "east" vs "west", and subtracts one word vector from the other to get a direction vector. It then projects every word vector onto that direction and shows the words with the largest and smallest scores.
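
The projection idea can be illustrated with a minimal pure-Python sketch. It is not the code from the `server` directory: the toy 3-dimensional vectors, the function names, and the choice to normalize the direction vector are all assumptions made for illustration, and real sense2vec vectors have many more dimensions.

```python
from math import sqrt

# Toy 3-dimensional vectors standing in for real sense2vec embeddings.
# The values are made up for illustration only.
TOY_VECTORS = {
    "east": [1.0, 0.2, 0.0],
    "west": [-1.0, 0.2, 0.0],
    "sunrise": [0.8, 0.5, 0.1],
    "sunset": [-0.7, 0.5, 0.1],
    "table": [0.0, 0.1, 0.9],
}


def direction(vectors, positive, negative):
    """Return the unit vector pointing from `negative` towards `positive`."""
    diff = [p - n for p, n in zip(vectors[positive], vectors[negative])]
    norm = sqrt(sum(x * x for x in diff))
    return [x / norm for x in diff]


def rank_by_projection(vectors, positive, negative):
    """Sort words by their dot product with the direction, highest first."""
    axis = direction(vectors, positive, negative)
    scores = {
        word: sum(a * b for a, b in zip(vec, axis))
        for word, vec in vectors.items()
    }
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


ranked = rank_by_projection(TOY_VECTORS, "east", "west")
print(ranked[:2])   # words scoring highest along the east-west direction
print(ranked[-2:])  # words scoring lowest along that direction
```

With these toy values, "east" and "sunrise" land at the high end of the ranking and "sunset" and "west" at the low end, while an unrelated word like "table" sits near zero.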
## Project structure
This project has two parts: a Python server that performs the projection and ranking calculations, and a React web frontend that provides a UI for exploring and navigating the data. The server is in the `server` directory, and the website is in the `website` directory.
## Contributing
If you have an idea to improve this project, pull requests are always welcome!
## Work with me!
I'm interested in doing a PhD in NLP in 2022, and would love to work with
researchers doing NLP work in the meantime. Please reach out at
[email protected] if you have a
project I can be a part of!