https://github.com/davidstutz/hierarchical-graph-based-video-segmentation

Implementation of the hierarchical graph-based video segmentation algorithm proposed by Grundmann et al. [1] based on the image segmentation algorithm by Felzenswalb and Huttenlocher [2].
https://github.com/davidstutz/hierarchical-graph-based-video-segmentation

computer-vision opencv video-segmentation

Last synced: 8 months ago
JSON representation

Implementation of the hierarchical graph-based video segmentation algorithm proposed by Grundmann et al. [1] based on the image segmentation algorithm by Felzenswalb and Huttenlocher [2].

Host: GitHub
URL: https://github.com/davidstutz/hierarchical-graph-based-video-segmentation
Owner: davidstutz
Created: 2015-06-11T17:00:34.000Z (over 10 years ago)
Default Branch: master
Last Pushed: 2018-11-28T13:56:51.000Z (about 7 years ago)
Last Synced: 2025-04-02T21:51:20.727Z (9 months ago)
Topics: computer-vision, opencv, video-segmentation
Language: C++
Size: 54.7 MB
Stars: 24
Watchers: 3
Forks: 14
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

# Hierarchical Graph-Based Video Segmentation

[![Build Status](https://travis-ci.org/davidstutz/hierarchical-graph-based-video-segmentation.svg?branch=master)](https://travis-ci.org/davidstutz/hierarchical-graph-based-video-segmentation)

This is an implementation of the hierarchical graph-based video segmentation algorithm proposed by Grundmann et al. [1] based on the graph-based image segmentation algorithm by Felzenswalb and Huttenlocher [2].

[1] M. Grundmann, V. Kwatra, M. Han, and I. A. Essa.
Efficient hierarchical graph-based video segmentation.
In Conference on Computer Vision and Pattern Recognition, pages 2141–2148,
San Francisco, 2010, June 2010.
[2] P. F. Felzenszwalb and D. P. Huttenlocher.
Efficient graph-based image segmentation.
International Journal of Computer Vision, 59(2):167–181, September 2004.

Further, evaluation metrics based on the Precision-Recall Framework for videos [3,4], Undersegmentation Error [4,5] and Achievable Segmentation Accuracy [3] are provided.

[3] C. Xu and J. J. Corso.
Evaluation of super-voxel methods for early video processing.
In Computer Vision and Pattern Recognition, Conference on,
pages 1202–1209, Providence, RI, June 2012.
[4] F. Galasso, N. S. Nagaraja, T. J. Cardenas, T. Brox and B.Schiele.
A Unified Video Segmentation Benchmark: Annotation, Metrics and Analysis.
In International Conference on Computer Vision,
pages 3527-3534, Sydney, Australia, December 2013.
[5] P. Neubert and P. Protzel.
Superpixel benchmark and comparison.
In Forum Bildverarbeitung, Regensburg, Germany, November 2012.

The algorithm is based on optical flow, therefore a command line tool to pre-compute optical flow using OpenCV is provided. Further, the `alley_1` sequence form the [Sintel dataset]() [6] is provided. Pre-computed flow (in the correct format, see `alley_1_flow` and `optical_flow_cli`) as well as ground truth is provided (see `alley_1_gt`).

[6] D. J. Butler, J. Wulff, G. B. Stanley and M. J. Black.
A naturalistic open source movie for optical flow evaluation.
In European Conference on Computer Vision, pages 611--625, October 2012.

![Example: original sequence, initial oversegmentation and a selected hierarchy level.](results/alley_1_full_spaced_690.png?raw=true "Example: original sequence, initial oversegmentation and a selected hierarchy level.")

## Compile

The implementation is based on [OpenCV](http://opencv.org/), see [here](http://docs.opencv.org/doc/tutorials/introduction/linux_install/linux_install.html#linux-installation) for installation instructions, [CMake](http://www.cmake.org/) and [Boost](http://www.boost.org/). The implementation has been tested on Ubuntu 14.04 and 14.10:

$ sudo apt-get install build-essential cmake libboost-all-dev
$ git clone https://github.com/davidstutz/hierarchical-graph-based-video-segmentation.git
$ cd hierarchical-graph-based-video-segmentation
$ mkdir build
$ cd build
$ cmake ..
$ make
Scanning dependencies of target segment
[ 16%] Building CXX object lib_segment/CMakeFiles/segment.dir/graph_segmentation.cpp.o
[ 33%] Building CXX object lib_segment/CMakeFiles/segment.dir/evaluation.cpp.o
[ 50%] Building CXX object lib_segment/CMakeFiles/segment.dir/io.cpp.o
[ 66%] Building CXX object lib_segment/CMakeFiles/segment.dir/io_util.cpp.o
Linking CXX static library libsegment.a
[ 66%] Built target segment
Scanning dependencies of target segment_cli
[ 83%] Building CXX object segment_cli/CMakeFiles/segment_cli.dir/main.cpp.o
Linking CXX executable segment_cli
[ 83%] Built target segment_cli
Scanning dependencies of target optical_flow_cli
[100%] Building CXX object optical_flow_cli/CMakeFiles/optical_flow_cli.dir/main.cpp.o
Linking CXX executable optical_flow_cli
[100%] Built target optical_flow_cli
# Run video segmentation on the alley_1 sequence and save visualization to build/output_vis:
$ ./segment_cli/segment_cli ../alley_1 ../alley_1_flow/ --vis-dir output_vis --input-gt ../alley_1_gt/
----- Level 0
Built graph (1.43415).
Oversegmented graph (3.04033).
Enforced minimum region size (2.0374).
3D Boundary Recall: 0.990899
3D Undersegmentation Error: 0.0906389
3D Achievable Segmentation Accuracy: 0.954518
# ...

The two command line tools provide the following options:

$ ./segment_cli/segment_cli --help
Allowed options:
--help produce help message
--input-video arg directory of input video (provided as
sequence of individual images)
--input-flow arg input flow directory (text files in
cv::Storage format, see io.h)
--input-gt arg input ground truth; if given, some metrics
will be computed and displayed
--length arg (=10) length of video to oversegment (may be lower
than the actual sequence length)
--flow-weight arg (=0.200000003) weight on flow angle for edge weight
computation
--threshold arg (=0.0199999996) threshold (is multiplied by 1.3 for each
additional hierarchy level)
--hierarchies arg (=40) number of hierarchies
--vis-dir arg visualization directory (default will not
visualize the result!)
--output-dir arg (=output) output directory

# Compute OpenCV optical flow and save in the correct format.
$ ./optical_flow_cli/optical_flow_cli --help
Allowed options:
--help produce help message
--input-dir arg input directory
--pyramid-scale arg (=0.5) pyramid scale
--pyramid-levels arg (=1) pyramid levels
--window-size arg (=7) window size
--num-iterations arg (=10) number of iterations at each pyramid
level
--polynomial-neighborhood-size arg (=5)
size of pixel neighborhood to find
polynomial expansion
--polynomial-sigma arg (=1.10000002) standard deviation of Gaussian used to
smooth derivatives
--gaussian-filter use a Gaussian filter isntead of a box
filter
--output-dir arg (=output) output directory

## Usage

An usage example can be found in `segment_cli/main.cpp`. Example:

// The video is expected to be provided as sequence of images, see alley_1 as example.
// The same holds for the optical flow, which is provided using text files formatted
// according to cv::FileStorage, see io.h.
boost::filesystem::path in_dir("path/to/video");
boost::filesystem::path flow_dir("path/to/flow");

// Length of the seuqnce to read (may be smaller than the actual length).
int length = 10;
Video video = IO::readVideo(in_dir, length);
FlowVideo flowVideo = IO::readFlowVideo(flow_dir, length);

assert(video.getFrameNumber() > 0);
assert(video.getFrameNumber() == flowVideo.getFrameNumber());

// Parameters:
int M = 300; // Minimum segment size, enforced after segmentation.
int L = 40; // Number of hierarchy levels.
float c = 0.02; // Threshold used for the initial oversegmentation.
float beta = 0.25; // Importance of flow information for edge weight computation.
float alpha = 1 - beta; // ... importance of color.

GraphSegmentationMagic* magic = new GraphSegmentationMagicThreshold(c);
GraphSegmentationDistance* distance = new GraphSegmentationEuclideanRGBFlowAngle(alpha, beta);

GraphSegmentation segmenter(distance, magic);

// Setup the video graph.
segmenter.buildGraph(video, flowVideo);
segmenter.buildEdges();

// Oversegment the video.
segmenter.oversegmentGraph();

// Enforce minimum segment size.
segmenter.enforceMinimumSegmentSize(M);

// Get the corresponding segmentation video (that is containing the labels.
// The video contains length three channel images where the labels are encoded
// as 24 bit numbers, see io_util.h
SegmentationVideo sv_video = segmenter.deriveLabels();
IO::writeSegmentationVideo(out_dir / boost::filesystem::path("0"), sv_video);

// Visualize by randonly coloring segments.
IO::writeColoredSegmentationVideo(vis_dir / boost::filesystem::path("0"), sv_video);

// Each new hierarchy level, the threshold is raised by the factor 1.3.
GraphSegmentationHierarchyMagic* hmagic = new GraphSegmentationHierarchyMagicThreshold(c, 1.3);
GraphSegmentationHierarchyDistance* hdistance =
new GraphSegmentationHierarchyRGBChiSquareFlowAngle(alpha, beta);

segmenter.setHierarchyMagic(hmagic);
segmenter.setHierarchyDistance(hdistance);

for (int l = 0; l < L; l++) {
// Build the region graph.
segmenter.buildRegionGraph();

// Segment the region graph.
segmenter.addHierarchyLevel();

// Enforce minimum segment size.
segmenter.enforceMinimumSegmentSize(l/2 * M);

SegmentationVideo sv_video = segmenter.deriveLabels();
IO::writeSegmentationVideo(out_dir / boost::filesystem::path(std::to_string(l + 1)),
sv_video);

IO::writeColoredSegmentationVideo(vis_dir / boost::filesystem::path(std::to_string(l + 1)),
sv_video);
}

Further documentation can be found in the corresponding header files: `graph_segmentation.h`, `evaluation.h`, `io.h`.

## License

**Please read carefully the following terms and conditions and any accompanying documentation before you download and/or use this software and associated documentation files (the "Software").**

The authors hereby grant you a non-exclusive, non-transferable, free of charge right to copy, modify, merge, publish, distribute, and sublicense the Software for the sole purpose of performing non-commercial scientific research, non-commercial education, or non-commercial artistic projects.

Any other use, in particular any use for commercial purposes, is prohibited. This includes, without limitation, incorporation in a commercial product, use in a commercial service, or production of other artefacts for commercial purposes.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

You understand and agree that the authors are under no obligation to provide either maintenance services, update services, notices of latent defects, or corrections of defects with regard to the Software. The authors nevertheless reserve the right to update, modify, or discontinue the Software at any time.

The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software. You agree to refer to this repository in documents and papers that report on research using the Software.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/davidstutz/hierarchical-graph-based-video-segmentation

Awesome Lists containing this project

README