Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/neural-nuts/cam2caption
[DEPRECATED] An Android application which converts camera feed to captions in real time
https://github.com/neural-nuts/cam2caption
android-application convolutional-neural-networks image-captioning recurrent-neural-networks tensorflow-android tensorflow-models
Last synced: about 2 months ago
JSON representation
[DEPRECATED] An Android application which converts camera feed to captions in real time
- Host: GitHub
- URL: https://github.com/neural-nuts/cam2caption
- Owner: neural-nuts
- License: bsd-3-clause
- Archived: true
- Created: 2017-10-27T10:58:38.000Z (about 7 years ago)
- Default Branch: master
- Last Pushed: 2018-02-02T14:33:08.000Z (almost 7 years ago)
- Last Synced: 2024-09-26T21:20:24.804Z (about 2 months ago)
- Topics: android-application, convolutional-neural-networks, image-captioning, recurrent-neural-networks, tensorflow-android, tensorflow-models
- Language: Java
- Homepage:
- Size: 30.4 MB
- Stars: 53
- Watchers: 7
- Forks: 18
- Open Issues: 5
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# Cam2Caption
An Android application which converts camera feed to natural language captions in real time.
The app uses our customized pre-trained model generated through [image-caption-generator](https://github.com/neural-nuts/image-caption-generator).
Using this model the app takes **1-2 second(s)** to caption a live camera frame on Huawei Honor 6x.The trained model to run this app can be obtained [here](https://drive.google.com/open?id=0ByhzM2YklhADNmk4cEN2MTA5U0E).
## Software Pre-Requisites
1. Android-Sdk for > Kitkat
2. Android-Studio
3. Tensorflow Java Library
- Already provided build #44 in this repository. Latest nightly builds can be obtained frome [here](https://ci.tensorflow.org/view/Nightly/job/nightly-android/)
- **Warning**: Did not test this app with builds other that #44## Data Pre-Requisites
1. Trained model from [image-caption-generator](https://github.com/neural-nuts/image-caption-generator)
2. Word IDs to Word map pickle from [image-caption-generator](https://github.com/neural-nuts/image-caption-generator) currently provided in `Application/src/main/assets`## Instructions
To build this app for your android phone-
1. Clone this repository
2. Download the trained model from [here](https://drive.google.com/open?id=0ByhzM2YklhADNmk4cEN2MTA5U0E).
3. Add the downloaded pre-trained model to `Application/src/main/assets` folder in the repository.
4. Open the repository in Android Studio
5. Build the app on your device using Android Studio## Working
The app is just a prototype, which uses our optimized and skimmed-down model from [image-caption-generator](https://github.com/neural-nuts/image-caption-generator), we also use a faster encoder CNN- Google's Inception v4.and finally use an end-to-end pre-trained model as balackbox in this app for quickly generating captions in real time.Note: Due to lack of computation power our model is not very well trained.
## Preview
Here is a quick preview of the app which was made by pointing the device camera towards a slideshow running on a screen and some real-life scenes.
#TO-DO: Create a real preview by testing the app on streets.## Notes
1. To create a tensorflow android app from scratch please follow this brilliant [tutorial](https://omid.al/posts/2017-02-20-Tutorial-Build-Your-First-Tensorflow-Android-App.html) by Omid Alemi.
2. Currently the app is tested for Huawei Honor 6x only.## Citation
If you use our model or code in your research, please cite the paper:
```
@article{Mathur2017,
title={Camera2Caption: A Real-time Image Caption Generator},
author={Pranay Mathur and Aman Gill and Aayush Yadav and Anurag Mishra and Nand Kumar Bansode},
journal={IEEE Conference Publication},
year={2017}
}
```