https://github.com/krshrimali/modanet-minorproject



# ModaNet-MinorProject-

**Downloading Data** (and reading)

1. `wget https://github.com/kyamagu/paperdoll/raw/master/data/chictopia/chictopia.sql.gz`

Once done, do this:

* `gunzip -c chictopia.sql.gz | sqlite3 chictopia.sqlite3`

(This will take a LOT of time; keep it running until the command terminates. **When it terminates, take a screenshot of the terminal window** to confirm there was no error.)
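Besides the screenshot, a quick sanity check of the imported database could look like the sketch below. It assumes the output file is named `chictopia.sqlite3` as in the command above. `list_tables` is a hypothetical helper name, and no table names are assumed: the script simply reports whatever tables it finds.

```python
import os
import sqlite3


def list_tables(db_path):
    """Return {table_name: row_count} for every table in the SQLite file."""
    conn = sqlite3.connect(db_path)
    try:
        names = [
            row[0]
            for row in conn.execute(
                "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
            )
        ]
        # Counting rows scans each table, so this can be slow on a large import.
        return {
            name: conn.execute(f'SELECT COUNT(*) FROM "{name}"').fetchone()[0]
            for name in names
        }
    finally:
        conn.close()


if __name__ == "__main__":
    # Assumed file name, taken from the gunzip | sqlite3 command above.
    db_file = "chictopia.sqlite3"
    if os.path.exists(db_file):
        for name, count in list_tables(db_file).items():
            print(name, count)
```

An empty result or a missing table would suggest the import stopped early and should be rerun.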

Update: (Feb 13 2019) **Step-1 Done!**

Next stop - **Download LMDB data (~40 GB).**

2. Download `photos.lmdb.tar` with one of these commands:

* `wget https://s3-ap-northeast-1.amazonaws.com/kyamagu-public/chictopia2/photos.lmdb.tar`
* `wget http://vision.is.tohoku.ac.jp/chictopia2/photos.lmdb.tar`

Update: (Feb 14 2019) **Step-2 Done!**

MD5SUM is: `a404789687fc3906d7d843942c802f53`
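To compare a download against this checksum, something like the following sketch could be used. It assumes the MD5SUM above belongs to `photos.lmdb.tar` (the notes here do not say which file it was computed for). `md5_of_file` is a hypothetical helper that reads the file in chunks, so the ~40 GB archive is never loaded into memory at once.

```python
import hashlib
import os


def md5_of_file(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading 1 MiB at a time."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


# Checksum quoted above; which file it refers to is an assumption.
EXPECTED_MD5 = "a404789687fc3906d7d843942c802f53"

if __name__ == "__main__":
    archive = "photos.lmdb.tar"  # assumed file name, from the wget step
    if os.path.exists(archive):
        actual = md5_of_file(archive)
        print("OK" if actual == EXPECTED_MD5 else f"MISMATCH: {actual}")
```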

3. Then extract the LMDB content:

`tar xf photos.lmdb.tar`

> Metadata are stored in chictopia.sql.gz, and images are stored in separate LMDB as encoded binaries.
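Since the images live in LMDB as encoded binaries, a first look at the store might go like the sketch below. It makes several assumptions: the third-party `lmdb` package (py-lmdb) is installed, the tarball extracts to a directory named `photos.lmdb`, and the values are ordinary image files such as JPEG. `peek_lmdb` and `guess_image_format` are hypothetical helper names. The key format is not assumed, so the keys are just printed as found.

```python
import itertools
import os


def guess_image_format(blob):
    """Guess the image type of an encoded binary from its magic bytes."""
    if blob.startswith(b"\xff\xd8\xff"):
        return "jpeg"
    if blob.startswith(b"\x89PNG\r\n\x1a\n"):
        return "png"
    if blob.startswith(b"GIF8"):
        return "gif"
    return "unknown"


def peek_lmdb(path="photos.lmdb", limit=5):
    """Return (key, size_in_bytes, guessed_format) for the first few entries."""
    import lmdb  # third-party: pip install lmdb

    env = lmdb.open(path, readonly=True, lock=False)
    try:
        with env.begin() as txn:
            # Iterating a cursor yields (key, value) pairs in key order.
            return [
                (key, len(value), guess_image_format(value))
                for key, value in itertools.islice(txn.cursor(), limit)
            ]
    finally:
        env.close()


if __name__ == "__main__":
    # Assumed directory name after `tar xf photos.lmdb.tar`.
    if os.path.isdir("photos.lmdb"):
        for entry in peek_lmdb("photos.lmdb"):
            print(entry)
```

If the guessed format comes back as `unknown`, the values are probably not raw image files and the stored layout needs a closer look before decoding.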

# NOTES:
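1. Handling .mdb files: https://stackoverflow.com/questions/3620539/how-to-deal-with-mdb-access-files-with-python
   (That thread covers Microsoft Access `.mdb` files, which are a different format from the `data.mdb` file inside an LMDB directory.)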