https://github.com/gofynd/mildnet

Visual Similarity research at Fynd. Contains code to reproduce 2 of our research papers.
https://github.com/gofynd/mildnet

ai artificial-intelligence computer-vision deep-learning ecommerce feature-extraction image-retrieval keras machine-learning machinelearning ml recommendation-system recommender-system tensorflow visual-recommendations visual-search

Last synced: 19 days ago
JSON representation

Visual Similarity research at Fynd. Contains code to reproduce 2 of our research papers.

Host: GitHub
URL: https://github.com/gofynd/mildnet
Owner: gofynd
License: apache-2.0
Created: 2019-02-28T07:02:11.000Z (about 6 years ago)
Default Branch: master
Last Pushed: 2023-03-24T22:36:06.000Z (about 2 years ago)
Last Synced: 2025-03-31T10:01:32.850Z (about 2 months ago)
Topics: ai, artificial-intelligence, computer-vision, deep-learning, ecommerce, feature-extraction, image-retrieval, keras, machine-learning, machinelearning, ml, recommendation-system, recommender-system, tensorflow, visual-recommendations, visual-search
Language: Jupyter Notebook
Homepage: https://arxiv.org/abs/1903.00905
Size: 2.61 MB
Stars: 83
Watchers: 8
Forks: 33
Open Issues: 11
Metadata Files:
- Readme: README.md
- Changelog: HISTORY.md
- Contributing: CONTRIBUTING.md
- License: LICENSE.txt

Awesome Lists containing this project

README

[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/retrieving-similar-e-commerce-images-using/image-retrieval-street2shop-topwear)](https://paperswithcode.com/sota/image-retrieval-street2shop-topwear?p=retrieving-similar-e-commerce-images-using) [![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/mildnet-a-lightweight-single-scaled-deep/image-retrieval-street2shop-topwear)](https://paperswithcode.com/sota/image-retrieval-street2shop-topwear?p=mildnet-a-lightweight-single-scaled-deep)

# MILDNet

This repo cantains the training code used during Visual Similarity research at [Fynd](https://www.fynd.com/). One can easily reproduce our state of the art models [MILDNet](https://arxiv.org/abs/1903.00905) and [Ranknet](https://arxiv.org/abs/1901.03546) and 2 other research works from the past. 25 configs are present which constitutes configurations of most critical experiments by us.

For more details, refer to the **[Colab Notebook](https://colab.research.google.com/github/gofynd/mildnet/blob/master/MILDNet_on_Colab.ipynb) (execute training on Free GPUs or TPUs provided by Google in just 2 clicks)** or head to our research papers on Arxiv:
- [MILDNet: A Lightweight Single Scaled Deep Ranking Architecture](https://arxiv.org/abs/1903.00905)
- [Retrieving Similar E-Commerce Images Using Deep Learning](https://arxiv.org/abs/1901.03546)

We have also open-sourced 8 of our top experiment results with weights [here](https://console.cloud.google.com/storage/browser/fynd-open-source/research/MILDNet/). To analyze and compare all the results (**training**) head to [this Colab notebook](https://colab.research.google.com/drive/1u7mZCU6AYXATWVu-sc_xpzE4ZRtVoSt9). To get an idea on using any of our open-sourced models models to find n similar items (**inferencing**) from your entire dataset head to [this Colab notebook](https://colab.research.google.com/drive/1j1pTFA2vNizQJeWPYgwoQYvXUMtdJ2-n).

## Introduction
Visual Recommendation is a crucial feature for any ecommerce platform. It gives the platform power of instantly suggesting similar looking products to what a user is browsing, thus capturing his/her immediate intent which could result in higher customer engagment (CTR) and hence the conversion.

The task of identifying similar products is not trivial as the details concerned here (pattern, structure etc.) are complexely grained in the product image pixels and these product comes in various variety even within the same class. CNNs have showed great understanding and results in this task.

The base of such a system is a CNN extracting key features from product images and returning a vector respresenting those features. When these embeddings for all the products are mapped on an n-dimensional space, it places similar products closer to non-similar items. The nearest neighbours are then the top most visual similar items. Below diagram gives a brief overview:

![](https://storage.googleapis.com/ml_shared_bucket/MILDNet/doc_imgs/VS_Basic_Inference_Flow.jpg)

## Repo Overview

- [execute.py](execute.py): Execute this to run training locally or on Google Cloud ML Engine.
- [MILDNet_on_Colab.ipynb](https://colab.research.google.com/github/gofynd/mildnet/blob/master/MILDNet_on_Colab.ipynb): Google Colaboratory notebook describes the task and contains training, exploration and inference code.
- [requirements-local-cpu.txt](requirements-local-cpu.txt)/[requirements-local-gpu.txt](requirements-local-gpu.txt): Requirement files **only** need to execute when running locally.
- settings.cfg: Global configs to setup:
-- MILDNET_JOB_DIR (**mandatory**): Requires directory path to store training outputs. Either pass path of local directory or Google cloud storage (gs://.....)
-- MILDNET_REGION (**optional**): Only needed when running on ML Engine (e.g. us-east1)
-- MILDNET_DATA_PATH (**mandatory**): Path where training data is stored. Change only when using custom data.
-- HYPERDASH_KEY: Hyperdash is a nice tool to log system out or to track training metrics. One can easily monitor all the jobs running using their Android app or webpage.
- [job_configs](job_configs): Contains 25 configs defines the basic job configs like the training model architecture, loss function, optimizer, number of epoch, learning rate etc.
- [trainer](trainer): Contains all script needed for training.
- [training_configs](training_configs): Training related environment config, only needed when running on ML Engine. Defines the cluster type, gpu type etc.

## Job Configs:
We carried out various experiments to study the performace of 4 research works (including ours). 8 of those variants can be readily tested here by this notebook:

- Multiscale-Alexnet: Multiscale model with base convnet as Alexnet and 2 shallow networks. We couldn’t find a good implementation of Alexnet on Tensorflow, so we used Theano to train this network.
- Visnet: Visnet Multiscale model with base convnet as VGG16 and 2 shallow networks. Without LRN2D layer from Caffe.
- Visnet-LRN2D: Visnet Multiscale model with base convnet as VGG16 and 2 shallow networks. Contains LRN2D layer from Caffe.
- RankNet: Multiscale model with base convnet as VGG19 and 2 shallow networks. Hinge Loss is used here.
- MILDNet: Single VGG16 architecture with 4 skip connections
- MILDNet-Contrastive: Single VGG16 architecture with 4 skip connections, uses contrastive loss.
- MILDNet-512-No-Dropout: MILDNet: Single VGG16 architecture with 4 skip connections. Dropouts are not used after feature concatenation.
- MILDNet-MobileNet: MILDNet: Single MobileNet architecture with 4 skip connections.

Based on this experiments, below is the list of all the configs available to try out:

- Default Models:
- alexnet.cnf
- ranknet.cnf
- vanila_vgg16.cnf
- visnet.cnf
- mildnet.cnf
- visnet-lrn2d.cnf
- Mildnet Ablation Study
- mildnet_skip_3.cnf
- mildnet_skip_2.cnf
- mildnet_skip_4.cnf
- mildnet_skip_1.cnf
- Mildnet Low Features
- mildnet_512_512.cnf
- mildnet_1024_512.cnf
- mildnet_512_no_dropout.cnf
- Mildnet Other Losses
- mildnet_hinge_new.cnf
- mildnet_angular_2.cnf
- mildnet_contrastive.cnf
- mildnet_lossless.cnf
- mildnet_angular_1.cnf
- Mildnet Other Variants
- mildnet_without_skip_big.cnf
- mildnet_vgg19.cnf
- mildnet_vgg16_big.cnf
- mildnet_without_skip.cnf
- mildnet_mobilenet.cnf
- mildnet_all_trainable.cnf
- mildnet_cropped.cnf

**Note that mildnet_contrastive.cnf and the Default Models configs are the models compared in the research paper.**

## Training

- [execute.py](execute.py): Single point entry for running training job on **local** or **Google Cloud ML Engine**. Asks user whether to run the training locally or on ML Engine. The user then need to select a config from a list of configs. Finally, the script executes [gcloud.local.run.keras.sh](gcloud.local.run.keras.sh) if user selects to run locally or [gcloud.remote.run.keras.sh](gcloud.remote.run.keras.sh) if user selects to run on Google Cloud ML Engine. **Make sure to setup [settings.cfg](settings.cfg) if running on ML Engine.**

- [MILDNet_on_Colab.ipynb](https://colab.research.google.com/github/gofynd/mildnet/blob/master/MILDNet_on_Colab.ipynb): Google Colaboratory notebook, gives a brief introduction of the task. Also one can execute training in just 2 clicks:
-- 1. Open notebook on google colab.
-- 2. From menu select Runtime -> Run all

### Installation (only when running locally)

Make sure to have gsutil installed and the user to be logged in:

curl https://sdk.cloud.google.com | bash

exec -l $SHELL

gcloud init

Install requirements:
- Running training on CPU:
```pip install -r requirements-local-cpu.txt```
- Running training on GPU:
```pip install -r requirements-local-gpu.txt```

Set below configs in settings.cfg:
- MILDNET_JOB_DIR=gs://....
- MILDNET_REGION=us-east1
- MILDNET_DATA_PATH=gs://fynd-open-source/research/MILDNet/
- HYPERDASH_KEY=your_hyperdash_key

## Run Training on Custom Job Config

- Config for the job to be trained need to be added in job_config folder

- Run ```python execute.py```

## View Logs from ML Engine

- Stream logs on terminal using:

gcloud ml-engine jobs stream-logs {{job_name}}

- Check tensorboard of ongoing training using:

tensorboard --logdir=gs://fynd-open-source/research/MILDNet/top_jobs/{{job_name}} --port=8080

- Hyperdash: Either use [Hyperdash Website](https://hyperdash.io/dashboard/models) or [Android App](https://play.google.com/store/apps/details?id=com.hyperdash)/[iOS App](https://itunes.apple.com/us/app/hyperdash-machine-learning-monitoring/id1257582233) to monitor logs.

## Contributing

Please read [CONTRIBUTING.md](CONTRIBUTING.md) and [CONDUCT.md](CONDUCT.md) for details on our code of conduct, and the process for submitting pull requests to us.

## Versioning

Please see [HISTORY.md](HISTORY.md). For the versions available, see the [tags on this repository](https://github.com/gofynd/mildnet/tags).

## Authors

* **Anirudha Vishvakarma** - *Initial work* ([[email protected]]([email protected]))

## License

This project is licensed under the Apache License - see the [LICENSE.txt](LICENSE.txt) file for details

## Acknowledgments

* All aspiring research works cited in our research paper
* [Google Colaboratory](https://colab.research.google.com/) to provide free GPUs/TPUs, helped us lot with experimentations and reporting.
* [Annoy](https://github.com/spotify/annoy): Easy to use and fast Approximate Nearest Neighbour library.
* [convnets-keras](https://github.com/heuritech/convnets-keras): Github repo contains Alexnet implementation on Keras, helped us to evaluate Alexnet base models as presented in [this](https://arxiv.org/abs/1404.4661) research work.
* [image-similarity-deep-ranking](https://github.com/akarshzingade/image-similarity-deep-ranking): Github repo helped us to use triplet data in keras.
* [Hyperdash](https://hyperdash.io/): Free monitoring tool for ML tasks.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/gofynd/mildnet

Awesome Lists containing this project

README