Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/agermanidis/thingscoop
Search and filter videos based on objects that appear in them using convolutional neural networks
https://github.com/agermanidis/thingscoop
Last synced: 3 months ago
JSON representation
Search and filter videos based on objects that appear in them using convolutional neural networks
- Host: GitHub
- URL: https://github.com/agermanidis/thingscoop
- Owner: agermanidis
- License: mit
- Created: 2015-07-28T23:48:15.000Z (over 9 years ago)
- Default Branch: master
- Last Pushed: 2016-05-11T16:26:37.000Z (over 8 years ago)
- Last Synced: 2024-10-08T16:18:48.326Z (3 months ago)
- Language: Python
- Homepage:
- Size: 5.55 MB
- Stars: 358
- Watchers: 21
- Forks: 61
- Open Issues: 1
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- awesome-tensorflow - GoogleNet Convolutional Neural Network Groups Movie Scenes By Setting - Search, filter, and describe videos based on objects, places, and other things that appear in them (Models/Projects)
- Awesome-TensorFlow-Chinese - GoogleNet Convolutional Neural Network Groups Movie Scenes By Setting - Search, filter, and describe videos based on objects, places, and other things that appear in them (模型项目 / 微信群)
- awesome-tensorflow - GoogleNet Convolutional Neural Network Groups Movie Scenes By Setting - Search, filter, and describe videos based on objects, places, and other things that appear in them (Models/Projects)
- fucking-awesome-tensorflow - GoogleNet Convolutional Neural Network Groups Movie Scenes By Setting - Search, filter, and describe videos based on objects, places, and other things that appear in them (Models/Projects)
README
## Thingscoop: Utility for searching and filtering videos based on their content
### Description
Thingscoop is a command-line utility for analyzing videos semantically - that means searching, filtering, and describing videos based on objects, places, and other things that appear in them.
When you first run thingscoop on a video file, it uses a [convolutional neural network](https://en.wikipedia.org/wiki/Convolutional_neural_network) to create an "index" of what's contained in the every second of the input by repeatedly performing image classification on a frame-by-frame basis. Once an index for a video file has been created, you can search (i.e. get the start and end times of the regions in the video matching the query) and filter (i.e. create a [supercut](https://en.wikipedia.org/wiki/Supercut) of the matching regions) the input using arbitrary queries. Thingscoop uses a very basic query language that lets you to compose queries that test for the presence or absence of labels with the logical operators `!` (not), `||` (or) and `&&` (and). For example, to search a video the presence of the sky *and* the absence of the ocean: `thingscoop search 'sky && !ocean' `.
Right now two models are supported by thingscoop: `vgg_imagenet` uses the architecture described in ["Very Deep Convolutional Networks for Large-Scale Image Recognition"](http://arxiv.org/abs/1409.1556) to recognize objects from the [ImageNet](http://www.image-net.org/) database, and `googlenet_places` uses the architecture described in ["Going Deeper with Convolutions"](http://arxiv.org/abs/1409.4842) to recognize settings and places from the [MIT Places](http://places.csail.mit.edu/) database. You can specify which model you'd like to use by running `thingscoop models use `, where `` is either `vgg_imagenet` or `googlenet_places`. More models will be added soon.
Thingscoop is based on [Caffe](http://caffe.berkeleyvision.org/), an open-source deep learning framework.
### Installation
1. Install ffmpeg, imagemagick, and ghostscript: `brew install ffmpeg imagemagick ghostscript` (Mac OS X) or `apt-get install ffmpeg imagemagick ghostscript` (Ubuntu).
1. Follow the installation instructions on the [Caffe Installation page](http://caffe.berkeleyvision.org/installation.html).
2. Make sure you build the Python bindings by running `make pycaffe` (on Caffe's directory).
3. Set the environment variable CAFFE_ROOT to point to Caffe's directory: `export CAFFE_ROOT=[Caffe's directory]`.
4. Install thingscoop: `easy_install thingscoop` or `pip install thingscoop`.### Usage
#### `thingscoop search `
Print the start and end times (in seconds) of the regions in `` that match ``. Creates an index for `` using the current model if it does not exist.
Example output:
```
$ thingscoop search violin waking_life.mp4
/Users/anastasis/Downloads/waking_life.mp4 148.000000 162.000000
/Users/anastasis/Downloads/waking_life.mp4 176.000000 179.000000
/Users/anastasis/Downloads/waking_life.mp4 180.000000 186.000000
/Users/anastasis/Downloads/waking_life.mp4 189.000000 190.000000
/Users/anastasis/Downloads/waking_life.mp4 192.000000 200.000000
/Users/anastasis/Downloads/waking_life.mp4 211.000000 212.000000
/Users/anastasis/Downloads/waking_life.mp4 222.000000 223.000000
/Users/anastasis/Downloads/waking_life.mp4 235.000000 243.000000
/Users/anastasis/Downloads/waking_life.mp4 247.000000 249.000000
/Users/anastasis/Downloads/waking_life.mp4 251.000000 253.000000
/Users/anastasis/Downloads/waking_life.mp4 254.000000 258.000000
```####`thingscoop filter `
Generate a video compilation of the regions in the `` that match ``. Creates index for `` using the current model if it does not exist.
Example output:
#### `thingscoop sort `
Create a compilation video showing examples for every label recognized in the video (in alphabetic order). Creates an index for `` using the current model if it does not exist.
Example output:
#### `thingscoop describe `
Print every label that appears in `` along with the number of times it appears. Creates an index for `` using the current model if it does not exist.
#### `thingscoop preview `
Create a window that plays the input video `` while also displaying the labels the model recognizes on every frame.
```
$ thingscoop describe koyaanisqatsi.mp4 -m googlenet_places
sky 405
skyscraper 363
canyon 141
office_building 130
highway 78
lighthouse 66
hospital 64
desert 59
shower 49
volcano 45
underwater 44
airport_terminal 43
fountain 39
runway 36
assembly_line 35
aquarium 34
fire_escape 34
music_studio 32
bar 28
amusement_park 28
stage 26
wheat_field 25
butchers_shop 25
engine_room 24
slum 20
butte 20
igloo 20
...etc
```#### `thingscoop index `
Create an index for `` using the current model if it does not exist.
#### `thingscoop models list`
List all models currently available in Thingscoop.
```
$ thingscoop models list
googlenet_imagenet Model described in the paper "Going Deeper with Convolutions" trained on the ImageNet database
googlenet_places Model described in the paper "Going Deeper with Convolutions" trained on the MIT Places database
vgg_imagenet 16-layer model described in the paper "Return of the Devil in the Details: Delving Deep into Convolutional Nets" trained on the ImageNet database
```#### `thingscoop models info `
Print more detailed information about ``.
```
$ thingscoop models info googlenet_places
Name: googlenet_places
Description: Model described in the paper "Going Deeper with Convolutions" trained on the MIT Places database
Dataset: MIT Places
```#### `thingscoop models freeze`
List all models that have already been downloaded.
```
$ thingscoop models freeze
googlenet_places
vgg_imagenet
```#### `thingscoop models current`
Print the model that is currently in use.
```
$ thingscoop models current
googlenet_places
```#### `thingscoop models use `
Set the current model to ``. Downloads that model locally if it hasn't been downloaded already.
#### `thingscoop models download `
Download the model `` locally.
#### `thingscoop models remove `
Remove the model `` locally.
#### `thingscoop models clear`
Remove all models stored locally.
#### `thingscoop labels list`
Print all the labels used by the current model.
```
$ thingscoop labels list
abacus
abaya
abstraction
academic gown
accessory
accordion
acorn
acorn squash
acoustic guitar
act
actinic radiation
action
activity
adhesive bandage
adjudicator
administrative district
admiral
adornment
adventurer
advocate
...
```#### `thingscoop labels search `
Print all the labels supported by the current model that match the regular expression ``.
```
$ thingscoop labels search instrument$
beating-reed instrument
bowed stringed instrument
double-reed instrument
free-reed instrument
instrument
keyboard instrument
measuring instrument
medical instrument
musical instrument
navigational instrument
negotiable instrument
optical instrument
percussion instrument
scientific instrument
stringed instrument
surveying instrument
wind instrument
...```
### Full usage options
```
thingscoop - Command-line utility for searching and filtering videos based on their contentUsage:
thingscoop filter ... [-o ] [-m ] [-s ] [-c ] [--recreate-index] [--gpu-mode] [--open]
thingscoop search ... [-o ] [-m ] [-s ] [-c ] [--recreate-index] [--gpu-mode]
thingscoop describe [-n ] [-m ] [--recreate-index] [--gpu-mode] [-c ]
thingscoop index [-m ] [-s ] [-c ] [-r ] [--recreate-index] [--gpu-mode]
thingscoop sort [-m ] [--gpu-mode] [--min-confidence ] [--max-section-length ] [-i ] [--open]
thingscoop preview [-m ] [--gpu-mode] [--min-confidence ]
thingscoop labels list [-m ]
thingscoop labels search [-m ]
thingscoop models list
thingscoop models info
thingscoop models freeze
thingscoop models current
thingscoop models use
thingscoop models download
thingscoop models remove
thingscoop models clearOptions:
--version Show version.
-h --help Show this screen.
-o --output Output file for supercut
-s --sample-rate How many frames to classify per second (default = 1)
-c --min-confidence Minimum prediction confidence required to consider a label (default depends on model)
-m --model Model to use (use 'thingscoop models list' to see all available models)
-n --number-of-words Number of words to describe the video with (default = 5)
-t --max-section-length Max number of seconds to show examples of a label in the sorted video (default = 5)
-r --min-occurrences Minimum number of occurrences of a label in video required for it to be shown in the sorted video (default = 2)
-i --ignore-labels Labels to ignore when creating the sorted video video
--title Title to show at the beginning of the video (sort mode only)
--gpu-mode Enable GPU mode
--recreate-index Recreate object index for file if it already exists
--open Open filtered video after creating it (OS X only)
```### CHANGELOG
#### 0.2 (8/16/2015)
* Added `sort` option for creating a video compilation of all labels appearing in a video
* Now using JSON for the index files#### 0.1 (8/5/2015)
* Conception
### License
MIT