Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/SCLinDennis/Weakly-Supervised-Dense-Video-Captioning
This repo is not completed
https://github.com/SCLinDennis/Weakly-Supervised-Dense-Video-Captioning
Last synced: about 1 month ago
JSON representation
This repo is not completed
- Host: GitHub
- URL: https://github.com/SCLinDennis/Weakly-Supervised-Dense-Video-Captioning
- Owner: SCLinDennis
- Created: 2018-08-30T11:40:23.000Z (over 6 years ago)
- Default Branch: master
- Last Pushed: 2018-11-03T07:07:59.000Z (about 6 years ago)
- Last Synced: 2024-08-02T20:44:31.259Z (5 months ago)
- Language: Python
- Size: 29.3 KB
- Stars: 7
- Watchers: 4
- Forks: 4
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
- Awesome-Video-Captioning - [tf-code
README
# Weakly-Supervised-Dense-Video-Captioning
This repo try to implement ***[Weakly Supervised Dense Video Captioning](https://arxiv.org/abs/1704.01502)*** in tensorflow but not complete yet.## Requirement
- Python 3
- Keras 2.2
- Tensorflow 1.8## Usage
- Run ```lexical_Res.py``` for training FCN with MIML loss while saving weights with the lowest loss.
- Run ```region_selection.py``` to generate most informative and coherrence region sequence.
- Run ```TRY3/model_seq2seq.py``` to train language model.
- While using ```TRY3/s2vt_predict_v2.py``` to inference the model.## Guide
1. ```extract_frames.py```: Uniform sampling 30 frames for each video.
2. ```load_data.py```: Create label vector and word dictionary.
3. ```Res_video_bag.py```: Lexical FCN(Resnet50) with a frame as an instance.
4. ```lexical_Res.py```: Lexical FCN(Resnet50) with a region as an instance.
5. ```region_selection.py```: Region sequence generator, which cound form one region sequence now.
6. dic/: Where to put ix2word, word2ix, word_counts.
7. frames/: Where to put frames extracted by ```extract_frames.py```.
8. MSRVTT/: Where to put training/testing labels and region sequences generated by ```region_selection.py```.
9. videos/: Where to put the [***MSR-VTT***](http://ms-multimedia-challenge.com/dataset) videos.
10. Weight_Resnet50/: Where to put weight save from ```lexical_Res.py```.
11. Weight_Resnet50_vasbag/: Where to put weight save from ```Res_video_bag.py```
12. TRY3/```s2vt_train.py```: Language model using S2VT.(train)
13. TRY3/```s2vt.py```: S2VT model graph.
14. TRY3/```s2vt_inference.py```: Language model using S2VT.(inference)## Reference
- [Weakly Supervised Dense Video Captioning](https://arxiv.org/abs/1704.01502)
- [s2vt](https://github.com/thtang/ADLxMLDS2017)## Contact
Shih-Chen Lin ([email protected])Any discussions and suggestions are welcome!