https://github.com/jaanli/food2vec
- Host: GitHub
- URL: https://github.com/jaanli/food2vec
- Owner: jaanli
- License: mit
- Created: 2017-01-17T19:28:52.000Z (over 9 years ago)
- Default Branch: master
- Last Pushed: 2022-06-10T10:22:40.000Z (about 4 years ago)
- Last Synced: 2025-04-09T22:19:10.813Z (about 1 year ago)
- Topics: embeddings, food, machine-learning
- Language: Jupyter Notebook
- Homepage:
- Size: 83 MB
- Stars: 225
- Watchers: 9
- Forks: 48
- Open Issues: 2
Metadata Files:
- Readme: README.md
- License: LICENSE
# food2vec
Food vectors. A live demo is at [https://altosaar.github.io/food2vec/](https://altosaar.github.io/food2vec/), and a blog post with more information and plots is at https://jaan.io/food2vec-augmented-cooking-machine-intelligence/
## Usage
To train a model on the recipes dataset and replicate the results from the [blog post](https://jaan.io/food2vec-augmented-cooking-machine-intelligence/):
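```
# clone first: environment.yml is assumed to live at the repository root
git clone git@github.com:altosaar/food2vec.git
cd food2vec
conda env create -f environment.yml
conda activate food2vec
# register the submodule (skip this step if .gitmodules already lists it)
echo "[submodule \"src/sentence_word2vec\"]
    path = src/sentence_word2vec
    url = https://github.com/altosaar/sentence_word2vec.git" >> .gitmodules
git submodule update --init
cd src
./run_fasttext.sh
```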
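Once training finishes, a quick sanity check is to look up the nearest neighbors of an ingredient. The sketch below is not part of the repository. It assumes `run_fasttext.sh` leaves behind vectors in fastText's text `.vec` format (a `count dim` header followed by one `token v1 ... vdim` row per line); the file name `model.vec` and the helper names `parse_vec`, `cosine`, and `nearest` are hypothetical.

```python
import math


def parse_vec(lines):
    """Parse fastText's text .vec format: a "count dim" header line, then
    one "token v1 v2 ... vdim" row per line."""
    rows = iter(lines)
    _count, dim = (int(x) for x in next(rows).split())
    vectors = {}
    for row in rows:
        parts = row.rstrip().split(" ")
        if len(parts) != dim + 1:
            continue  # skip blank or malformed rows
        vectors[parts[0]] = [float(x) for x in parts[1:]]
    return vectors


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def nearest(vectors, query, k=5):
    """Return the k tokens most similar to `query`, excluding the query itself."""
    target = vectors[query]
    scored = [(cosine(target, vec), tok) for tok, vec in vectors.items() if tok != query]
    return [tok for _score, tok in sorted(scored, reverse=True)[:k]]


# Toy rows standing in for a real file. With real output you would pass
# open("model.vec", encoding="utf-8") instead (hypothetical file name).
toy = ["3 2", "garlic 1.0 0.1", "onion 0.9 0.2", "sugar -0.1 1.0"]
vectors = parse_vec(toy)
print(nearest(vectors, "garlic", k=1))
```

With the toy rows, `onion` is the closest token to `garlic` because their vectors point in nearly the same direction, while `sugar` is orthogonal to it.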
## Visualization & embedding exploration tools
```
# run t-SNE and make the plots for the ingredient embeddings (from the repository root)
jupyter notebook ./src/plot_ingredients_recipes.ipynb
```
## Embedding plot.ly plots and hosting them yourself
See this gist: https://gist.github.com/altosaar/67d8456ad28acd1abb497f1950d8de8a
## Contributing
Pull requests and all feedback welcome! Please file an issue if you run into problems replicating the results.
## Ideas on next steps
* get more data
* convert the Jupyter notebook for plotting into a single Python script
* write scripts to figure out the right vocabulary
* fit a better model (e.g. multi-class regression in PyTorch) -- if you manage to get better results than the live demo at https://altosaar.github.io/food2vec/, just submit a pull request with the new `assets/data/wordVecs.js` and I'll happily update it :)
* compare the above model embeddings to the current embeddings
* make the UI of the website more user-friendly and mobile-friendly
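
For anyone trying the "fit a better model" idea, new embeddings have to end up in `assets/data/wordVecs.js`. The sketch below shows one way to produce such a file. It rests on an unverified assumption: that the demo reads a global JavaScript object named `wordVecs` mapping each token to its vector. Check the existing file's layout before using it. The helper `to_wordvecs_js` and its `precision` parameter are hypothetical.

```python
import json


def to_wordvecs_js(vectors, precision=4):
    """Serialize {token: [floats]} as JavaScript source defining `wordVecs`.

    Assumption: the demo expects a global object named `wordVecs`; verify
    against the existing assets/data/wordVecs.js before relying on this.
    """
    # Rounding keeps the file small; 4 decimals is an arbitrary choice.
    rounded = {tok: [round(x, precision) for x in vec] for tok, vec in vectors.items()}
    return "var wordVecs = " + json.dumps(rounded, sort_keys=True) + ";\n"


# Toy vectors standing in for a trained model's output.
vectors = {"garlic": [0.123456, -0.5], "onion": [0.25, 0.75]}
js = to_wordvecs_js(vectors)
print(js)

# To write the file for a pull request:
# with open("assets/data/wordVecs.js", "w", encoding="utf-8") as f:
#     f.write(js)
```

Because the payload is plain JSON wrapped in an assignment, the same data can be read back in Python by stripping the `var wordVecs = ` prefix and the trailing semicolon, which makes it easy to compare new embeddings against the current ones.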
## Acknowledgments
Thanks to Anthony for open-sourcing a [JavaScript embedding browser](https://github.com/turbomaze/word2vecjson) -- the one here is heavily based on it.