https://github.com/bbc/rd-apmm-python-lib-mediagrains

A python library for handling grain-based media
https://github.com/bbc/rd-apmm-python-lib-mediagrains
rd-cloudfit rd-project rd-section-apmm rd-stability-green
Last synced: about 2 months ago
JSON representation
A python library for handling grain-based media
Host: GitHub
URL: https://github.com/bbc/rd-apmm-python-lib-mediagrains
Owner: bbc
License: apache-2.0
Created: 2018-02-02T13:56:08.000Z (over 7 years ago)
Default Branch: main
Last Pushed: 2025-02-26T09:15:07.000Z (3 months ago)
Last Synced: 2025-03-22T06:51:15.848Z (2 months ago)
Topics: rd-cloudfit, rd-project, rd-section-apmm, rd-stability-green
Language: Python
Size: 1.51 MB
Stars: 7
Watchers: 13
Forks: 0
Open Issues: 1
Metadata Files:
- Readme: README.md
- License: COPYING
- Codeowners: .github/CODEOWNERS
Awesome Lists containing this project

README

        # mediagrains

A python library for handling grain-based media in a python-native

style.

## Introduction

Provides constructor functions for various types of grains and classes

that nicely wrap those grains, as well as a full serialisation and

deserialisation library for Grain Sequence Format (GSF) format, a lightweight

binary wrapper format for various media types including uncompressed video.

Please read the pydoc documentation for more details. The GSF format is

documented in [gsf_docs/gsf.md](gsf_docs/gsf.md).

Some useful tools for handling the Grain Sequence Format (GSF) file format

are also included - see [Tools](#tools).

## Installation

### Requirements

* A working Python 3.10+ installation

* The tool [Docker](https://docs.docker.com/engine/install/) is needed to run the tests, but not required to use the library.

### Steps

```bash

# Install from pip

$ pip install mediagrains

# Install directly from source repo

$ git clone [email protected]:bbc/rd-apmm-python-lib-mediagrains.git

$ cd rd-apmm-python-lib-mediagrains

$ make install

```

## Usage

As an example of using this in your own code the following is a simple

program to load the contents of a GSF file and print the timestamp of

each grain.

```Python console

>>> from mediagrains.gsf import load

>>> f = open('examples/video.gsf', "rb")

>>> (head, segments) = load(f)

>>> print('\n'.join(str(grain.origin_timestamp) for grain in segments[1]))

1420102800:0

1420102800:20000000

1420102800:40000000

1420102800:60000000

1420102800:80000000

1420102800:100000000

1420102800:120000000

1420102800:140000000

1420102800:160000000

1420102800:180000000

```

Alternatively to create a new video grain in 10-bit planar YUV 4:2:0 and fill

it with colour-bars:

```Python console

>>> from mediagrains import VideoGrain

>>> from uuid import uuid1

>>> from mediagrains.cogenum import CogFrameFormat, CogFrameLayout

>>> src_id = uuid1()

>>> flow_id = uuid1()

>>> grain = VideoGrain(src_id=src_id, flow_id=flow_id, cog_frame_format=CogFrameFormat.S16_422_10BIT, width=1920, height=1080)

>>> colours = [

...      (0x3FF, 0x000, 0x3FF),

...      (0x3FF, 0x3FF, 0x000),

...      (0x3FF, 0x000, 0x000),

...      (0x3FF, 0x3FF, 0x3FF),

...      (0x3FF, 0x200, 0x3FF),

...      (0x3FF, 0x3FF, 0x200) ]

>>> x_offset = [0, 0, 0]

>>> for colour in colours:

...     i = 0

...     for c in grain.components:

...             for x in range(0,c.width//len(colours)):

...                     for y in range(0,c.height):

...                             grain.data[c.offset + y*c.stride + (x_offset[i] + x)*2 + 0] = colour[i] & 0xFF

...                             grain.data[c.offset + y*c.stride + (x_offset[i] + x)*2 + 1] = colour[i] >> 8

...             x_offset[i] += c.width//len(colours)

...             i += 1

```

(a more natural interface for accessing data exists in the form of numpy arrays. See later.)

The object grain can then be freely used for whatever video processing

is desired, or it can be serialised into a GSF file as follows:

```Python console

>>> from mediagrains.gsf import dump

>>> f = open('dummyfile.gsf', 'wb')

>>> dump(f, [grain])

>>> f.close()

```

The encoding module also supports a "progressive" mode where an

encoder object is created and a dump started, then grains can be added

and will be written to the output file as they are added.

```Python console

>>> from uuid import uuid1

>>> from mediagrains import Grain

>>> from mediagrains.gsf import GSFEncoder

>>> src_id = uuid1()

>>> flow_id = uuid1()

>>> f = open('dummyfile.gsf', 'wb')

>>> enc = GSFEncoder(f)

>>> seg = enc.add_segment()  # This must be done before the call to start_dump

>>> enc.start_dump()  # This writes the file header and starts the export

>>> seg.add_grain(Grain(src_id=src_id, flow_id=flow_id))  # Adds a grain and writes it to the file

>>> seg.add_grain(Grain(src_id=src_id, flow_id=flow_id))  # Adds a grain and writes it to the file

>>> seg.add_grain(Grain(src_id=src_id, flow_id=flow_id))  # Adds a grain and writes it to the file

>>> enc.end_dump()  # This ends the export and finishes off the file

>>> f.close()

```

If the underlying file is seekable then the end_dump call will upade all segment

metadata to list the correct grain count, otherwise the counts will be

left at -1.

### Comparing Grains

In addition the library contains a relatively rich grain comparison

mechanism in the submodule `comparison`. This submodule exposes methods

for comparing two grains, and comparing the outputs of two grain iterators.

Information on advanced comparison features such as how to include attributes,

exclude attributes, expect differences on attributes and compare PSNR values

can be found in the [comparison submodules documentation](mediagrains/comparison/README.md).

An example of usage is as follows:

```python

>>> from mediagrains.comparison import compare_grain

>>> print(compare_grain(a, b))

❌   Grains do not match

  ✅   .grain_type == 'video'

  ✅   .source_id == UUID('9d0a2518-8f39-11ec-bcdd-737806a40a30')

  ✅   .flow_id == UUID('a1269208-8f39-11ec-bcdd-737806a40a30')

  ✅   .rate == Fraction(25, 1)

  ✅   .duration == Fraction(1, 25)

  ✅   .length == 6220800

  ❌   a.origin_timestamp - b.origin_timestamp == mediatimestamp.immutable.TimeOffset.from_sec_nsec('-0:160000000'), not the expected mediatimestamp.immutable.TimeOffset.from_sec_nsec('0:0')

  ❌   a.sync_timestamp - b.sync_timestamp == mediatimestamp.immutable.TimeOffset.from_sec_nsec('-0:160000000'), not the expected mediatimestamp.immutable.TimeOffset.from_sec_nsec('0:0')

  ◯   a.creation_timestamp - b.creation_timestamp == mediatimestamp.immutable.TimeOffset.from_sec_nsec('0:0') as expected

  ✅   Lists match

    ✅   len(.timelabels) == 0

  ✅   .cog_frame_format == CogFrameFormat.U8_444

  ✅   .width == 1920

  ✅   .height == 1080

  ✅   .cog_frame_layout == CogFrameLayout.FULL_FRAME

  ✅   Binary data .data are equal

```


This output gives a relatively detailed breakdown of the differences

between two grains, both as a printed string (as seen above) and also

in a data-centric fashion as a tree structure which can be

interrogated in code.

### Numpy arrays

An additional feature is provided in the form of numpy array access to the data in a grain. As such the above example of creating colourbars can be done more easily:

```Python console

>>> from mediagrains.numpy import VideoGrain

>>> from uuid import uuid1

>>> from mediagrains.cogenums import CogFrameFormat, CogFrameLayout

>>> src_id = uuid1()

>>> flow_id = uuid1()

>>> grain = VideoGrain(src_id=src_id, flow_id=flow_id, cog_frame_format=CogFrameFormat.S16_422_10BIT, width=1920, height=1080)

>>> colours = [

...      (0x3FF, 0x000, 0x3FF),

...      (0x3FF, 0x3FF, 0x000),

...      (0x3FF, 0x000, 0x000),

...      (0x3FF, 0x3FF, 0x3FF),

...      (0x3FF, 0x200, 0x3FF),

...      (0x3FF, 0x3FF, 0x200) ]

>>> for c in range(0, 3):

...     for x in range(0, grain.components[c].width):

...         for y in range(0, grain.components[c].height):

...             grain.component_data[c][x, y] = colours[x*len(colours)//grain.components[c].width][c]

```

## Documentation

The API is well documented in the docstrings of the module mediagrains. A rendered version of this documentation is available [here](https://bbc.github.io/rd-apmm-python-lib-mediagrains/mediagrains/mediagrains.html).

Instructions for using the 'new style' grains can be find in [new_style_grains.md](new_style_grains.md)

## Tools

Some tools are installed with the library to make working with the Grain Sequence Format (GSF) file format easier.

* `wrap_video_in_gsf` - Provides a means to read raw video essence and generate a GSF file.

* `wrap_audio_in_gsf` - As above, but for audio.

* `extract_from_gsf` - Read a GSF file and dump out the raw essence within.

* `gsf_probe` - Read metadata about the segments in a GSF file.

For example, to generate a GSF file containing a test pattern from `ffmpeg`, dump the metadata and then play it out

again:

```bash

ffmpeg -f lavfi -i testsrc=duration=20:size=1920x1080:rate=25 -pix_fmt yuv422p10le -c:v rawvideo -f rawvideo - | \

wrap_video_in_gsf - output.gsf --size 1920x1080 --format S16_422_10BIT --rate 25

gsf_probe output.gsf

extract_gsf_essence output.gsf - | ffplay -f rawvideo -pixel_format yuv422p10 -video_size 1920x1080 -framerate 25 pipe:0

```

To do the same with a sine wave:

```bash

ffmpeg -f lavfi -i "sine=frequency=1000:duration=5" -f s16le -ac 2 - | wrap_audio_in_gsf - output_audio.gsf --sample-rate 44100

gsf_probe output_audio.gsf

extract_gsf_essence output_audio.gsf - | ffplay -f s16le -ac 2 -ar 44100 pipe:0

```

The tools have been packaged into the `mediagrains` docker image for convenience. The `mediagrains` image containing all the tools can be built using:

```bash

make tools

```

You can make use of the tools image with the following command:

```bash

$(make -s run-cmd)  

```

Running the command without `` or `` will list the available tools. Running without `` will provide help.

## Development

### Commontooling

This repository uses a library of makefiles, templates, and other tools for development tooling and CI workflows. To discover operations that may be run against this repo, run the following in the top level of the repo:

```bash

$ make

```

### Testing

To run the unittests for this package in a docker container follow these steps:

```bash

$ git clone [email protected]:bbc/rd-apmm-python-lib-mediagrains.git

$ cd rd-apmm-python-lib-mediagrains

$ make test

```

### Continuous Integration

This repository includes [GitHub Actions workflows](./.github/workflows/) for CI. The shared workflows are centrally managed and should not be modified.

## Versioning

We use [Semantic Versioning](https://semver.org/) for this repository

## Contributing

All contributions are welcome, before submitting you must read and sign a copy of the [Individual Contributor License Agreement](ICLA.md)

Please ensure you have run the test suite before submitting a Pull Request, and include a version bump in line with our [Versioning](#versioning) policy.

## Authors

* James Weaver

* Philip deNier

* Sam Mesterton-Gibbons

* Alex Rawcliffe

* James Sandford

For further information, contact 

## License

See [LICENSE.md](LICENSE.md)
ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/bbc/rd-apmm-python-lib-mediagrains

Awesome Lists containing this project

README