https://github.com/sun-lab-nbb/ataraxis-video-system
Interfaces with a wide range of cameras to flexibly record visual stream data as images or videos.
https://github.com/sun-lab-nbb/ataraxis-video-system
camera-interface ffmpeg gentl harvesters opencv video-recording
Last synced: 6 months ago
JSON representation
Interfaces with a wide range of cameras to flexibly record visual stream data as images or videos.
- Host: GitHub
- URL: https://github.com/sun-lab-nbb/ataraxis-video-system
- Owner: Sun-Lab-NBB
- License: gpl-3.0
- Created: 2024-05-28T17:28:30.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2025-03-04T19:40:32.000Z (7 months ago)
- Last Synced: 2025-03-26T06:43:57.406Z (7 months ago)
- Topics: camera-interface, ffmpeg, gentl, harvesters, opencv, video-recording
- Language: Python
- Homepage:
- Size: 2.25 MB
- Stars: 0
- Watchers: 0
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# ataraxis-video-system
A Python library that interfaces with a wide range of cameras to flexibly record visual stream data as images or videos.


[](https://github.com/astral-sh/uv)
[](https://github.com/astral-sh/ruff)




___## Detailed Description
This library provides an interface for efficiently acquiring and saving visual data from cameras in real time. To
achieve this, the library internally binds OpenCV and GeniCam backends to grab frames from a wide range of consumer,
industrial, and scientific cameras using USB and Gigabit interfaces. To save the acquired frames, the library uses
FFMPEG to recruit CPUs or GPUs and supports H264 and H265 codecs. The library abstracts setup, acquisition, and cleanup
procedures via a simple API exposed by the VideoSystem interface class, while allowing for extensive configuration of
all managed elements. To optimize runtime efficiency, the library uses multithreading and multiprocessing, where
appropriate.
___## Features
- Supports Windows, Linux, and macOS.
- Uses OpenCV or GeniCam (Harvesters) to interface with a wide range of consumer, industrial and scientific cameras.
- Uses FFMPEG to efficiently encode acquired data as videos or images in real time.
- Highly customizable and can be extensively fine-tuned for quality or throughput.
- Pure-python API.
- GPL 3 License.
___## Table of Contents
- [Dependencies](#dependencies)
- [Installation](#installation)
- [Usage](#usage)
- [API Documentation](#api-documentation)
- [Developers](#developers)
- [Authors](#authors)
- [License](#license)
- [Acknowledgements](#Acknowledgments)
___## Dependencies
- [FFMPEG](https://www.ffmpeg.org/download.html). Make sure that the installed FFMPEG is available in your system’s
path, and Python has permissions to call FFMPEG. We recommend using the latest stable release of FFMPEG, although the
minimal requirement is support for H254 and H265 codecs.
- [Harvesters Dependencies](https://github.com/genicam/harvesters/blob/master/docs/INSTALL.rst) if you intend to use
Harvesters camera backend. Primarily, this only includes the ***GenTL Producer CTI***, as all other dependencies are
installed automatically. If your camera vendor supplies a .cti file for that camera, use that file. Otherwise, we
recommend using the [MvImpactAcquire](https://assets-2.balluff.com/mvIMPACT_Acquire/). This producer requires
licensing for versions at or above **3.0.0**, but lower versions seem to function well with all tested hardware. This
library has been tested using version **2.9.2**.For users, all other library dependencies are installed automatically by all supported installation methods
(see [Installation](#installation) section).For developers, see the [Developers](#developers) section for information on installing additional development
dependencies.
___## Installation
### Source
Note, installation from source is ***highly discouraged*** for everyone who is not an active project developer.
Developers should see the [Developers](#Developers) section for more details on installing from source. The instructions
below assume you are ***not*** a developer.1. Download this repository to your local machine using your preferred method, such as Git-cloning. Use one
of the stable releases from [GitHub](https://github.com/Sun-Lab-NBB/ataraxis-video-system/releases).
2. Unpack the downloaded zip and note the path to the binary wheel (`.whl`) file contained in the archive.
3. Run ```python -m pip install WHEEL_PATH```, replacing 'WHEEL_PATH' with the path to the wheel file, to install the
wheel into the active python environment.### pip
Use the following command to install the library using pip: ```pip install ataraxis-video-system```.
___## Usage
### OS support status
This library was primarily written on and for Linux systems. It is extensively tested on Linux and performs well under
all test conditions. It is very likely that Linux users will not experience any issues outside the usual possibility of
encountering bugs missed during initial testing.Windows bindings also work as expected, but require additional setup. Make sure FFMPEG is updated to the latest version,
as we found issues using FFMPEG built in 2023, where it drastically slowed the speed of video encoding to the point
where VideoSystem could not operate for prolonged periods of time. We also had to disable the OpenCV MSMF HW
transformations, as MSMF took a very long time to establish camera connection with this flag enabled. This latter issue
is well known to the community.macOS mostly works as expected except for live frame displaying, which does not work. This is a known
issue with the OpenCV backend used to render the frame data. Similar issue is observed for other backends, such as
Qt5 and Tkinter that were tested as a potential replacement. Displaying the frames on macOS is disabled until we find
a working backend.Overall, the library in its current state should be stable. Most of the outstanding issues should be resolved in the
future with updates to OpenCV and other dependencies.### Quickstart
This is a minimal example of how to use this library. It is also available as a [script](examples/quickstart.py).
This example is intentionally kept minimal, consult
[API documentation](https://ataraxis-video-system-api-docs.netlify.app/) for more details.
```
from pathlib import Pathimport numpy as np
from ataraxis_time import PrecisionTimer
from ataraxis_data_structures import DataLoggerfrom ataraxis_video_system import VideoSystem, InputPixelFormats
# Since most classes used in this example use multiprocessing, they have to run inside the __main__ guard
if __name__ == "__main__":
# The directory where to output the recorded frames and the acquisition timestamps
output_directory = Path("/Users/InfamousOne/Desktop/vid_test")# The DataLogger is used to save frame acquisition timestamps to disk. During runtime, it logs frame timestamps as
# uncompressed NumPy arrays (.npy), and after runtime it can compress log entries into one .npz archive.
logger = DataLogger(output_directory=output_directory, instance_name="webcam", exist_ok=True)# DataLogger uses a parallel process to write log entries to disk. It has to be started before it can save any log
# entries.
logger.start()# The VideoSystem minimally requires an ID and a DataLogger instance. The ID is critical, as it is used to identify
# the log entries generated by the VideoSystem. For VideoSystems that will be saving frames, output_directory is
# also required
vs = VideoSystem(system_id=np.uint8(101), data_logger=logger, output_directory=output_directory)# By default, all added cameras are interfaced with using OpenCV backend. Check the API documentation to learn about
# all available camera configuration options and supported backends. This camera is configured to save frames and to
# display the live video feed to the user at 25 fps. The display framerate can be the same or lower as the
# acquisition framerate.
vs.add_camera(
save_frames=True,
acquisition_frame_rate=30,
display_frames=True,
display_frame_rate=15,
color=False, # Acquires images in MONOCHROME mode
)# To save the frames acquired by the system, we need to add a saver. Here, we demonstrate adding a video saver, but
# you can also use the add_image_saver() method to output frames as images. The default video saver uses CPU and
# H265 codec to encode frames as an MP4 video.
vs.add_video_saver(input_pixel_format=InputPixelFormats.MONOCHROME)# Calling this method arms the video system and starts frame acquisition. However, the frames are not initially
# saved to disk.
vs.start()timer = PrecisionTimer("s")
timer.delay_noblock(delay=2) # During this delay, camera frames are displayed to the user but are not saved# Begins saving frames to disk as an MP4 video file
vs.start_frame_saving()
timer.delay_noblock(delay=5) # Records frames for 5 seconds, generating ~150 frames
vs.stop_frame_saving()# Frame acquisition can be started and stopped as needed, although all frames will be written to the same output
# video file. If you intend to cycle frame acquisition, it may be better to use an image saver backend.# Stops the VideoSystem runtime and releases all resources
vs.stop()# Stops the DataLogger and compresses acquired logs into a single .npz archive. This step is required for being
# able to parse the data with the VideoSystem API
logger.stop()
logger.compress_logs(remove_sources=True)# Extracts the list of frame timestamps from the compressed log generated above. The extraction function
# automatically uses VideoSystem ID, DataLogger name, and the output_directory to resolve the archive path.
timestamps = vs.extract_logged_data() # Returns a list of timestamps, each is given in microseconds since epoch
# onset# Computes and prints the actual framerate of the camera based on saved frames.
timestamp_array = np.array(timestamps, dtype=np.uint64)
time_diffs = np.diff(timestamp_array)
fps = 1 / (np.mean(time_diffs) / 1e6)
print(fps)# You can also check the output directory for the created video.
```### Image to Video Conversion
Some use cases may require capturing frames with an ImageSaver class, rather than the default VideoSaver. This approach
often achieves the highest acquisition rate and data quality as it allows capturing frames with no processing overhead.
However, this format is extremely space-inefficient, so it may be desirable to encode captured frames as a video
post-runtime, when runtime speed is no longer a critical factor. The VideoSystem supports such conversions via the
encode_video_from_images() method:
```
from ataraxis_data_structures import DataLogger
from ataraxis_video_system import VideoSystem
import numpy as np
from tempfile import mkdtemp
from pathlib import Path# Initializes the logger, which is not used for anything other than initializing the VideoSystem
output_directory = Path("/home/cybermouse/Desktop/vid_test")
logger = DataLogger(output_directory)# When the VideoSystem is used to encode images as video, it only needs to be equipped with the VideoSaver.
vs = VideoSystem(system_id=np.uint(1), data_logger=logger)
vs.add_video_saver()# The path should point to a folder with png, tiff or jpg files named in a way that allows deducing the sequence they
# were acquired in (e.g: 0001.png). The VideoSystem will then encode the frames as video and output the generated file
# in the output_directory
image_directory = Path("/home/cybermouse/Desktop/vid_test/images")
vs.encode_video_from_images(directory=image_directory, target_fps=15, video_name='test_video')```
### Data Logging
Like some other Ataraxis libraries, this library relies on the
[DataLogger](https://github.com/Sun-Lab-NBB/ataraxis-data-structures#datalogger) class to save frame acquisition
timestamps to disk during runtime. The timestamps are serialized and saved as `.npy` files. It is **highly** advised
to study the documentation for the class before using this library, especially if you want to parse the logged data
manually instead of using the method exposed by the VideoSystem class.The DataLogger may be shared by multiple Ataraxis classes that generate log entries, such as
[MicroControllerInterface](https://github.com/Sun-Lab-NBB/ataraxis-communication-interface) classes. To support using
the same logger class for multiple sources, each source (class) active at the same time has to use a unique byte-ID
(system id). These id-codes are used to identify the source class in log files and during further processing.#### Log entries format
Each timestamp is logged as a one-dimensional numpy uint8 array (.npy file). Inside the array, the data is organized in
the following order:
1. The uint8 id of the data source. For this library, the source ID is the ID code of the VideoSystem that submits the
data to be logged. The ID occupies the first byte of each logged array.
2. The uint64 timestamp that specifies the number of microseconds relative to the **onset** timestamp (see below). The
timestamp occupies **8** bytes following the ID byte. This is the frame acquisition timestamp.**Note:** Timestamps are generated at frame acquisition but are only submitted to the logger when the corresponding
frame is saved to disk. Therefore, the timestamps always match the order the saved frames appear in the video file or
are saved to disk as image files.#### Onset timestamp:
Each VideoSystem that logs its data generates an `onset` timestamp as part of its `start()` method runtime.
This log entry uses a modified data order and stores the current UTC time, accurate to microseconds. All further log
entries for the same source use the timestamp section of their payloads to communicate the number of microseconds
elapsed since the onset timestamp acquisition. The onset log entries follow the following order:
1. The uint8 id of the data source.
2. The uint64 value **0** that occupies 8 bytes following the source id. This is the only time when the timestamp value
of a log entry can be set to 0.
3. The uint64 value that stores the number of microseconds elapsed since the UTC epoch. This value specifies the
current time when the onset timestamp was generated.#### Starting and stopping logging
Until the DataLogger is started through its `start()` method, the log entries will be buffered in the multiprocessing
queue, which uses the host-computer’s RAM. To avoid running out of buffer space, **make sure** the DataLogger's
`start()` method is called before calling the `start()` method of any VideoSystem class. Once ***all*** sources
using the same DataLogger have finished their runtime, call the `stop()` method to end log saving and then call the
`compress_logs()` method to compress all individual `.npy` entries into an `.npz` archive. Compressing the logs is
required to later parse the frame acquisition timestamps for further analysis (see [quickstart](#quickstart)).#### Reading timestamps from logs
The VideoSystem class exposes the `extract_logged_data()` method that allows parsing the timestamps for all frames
saved by the class during runtime.***Note:*** to parse logged data, the VideoSystem has to be initialized and provided with an initialized DataLogger
class. Overall, it is advised to parse logged data immediately after finishing the acquisition runtime, as the class
would be configured correctly for the parsing to work as intended.***Attention!*** Since version 1.1.0 the library exposes a global, multiprocessing-safe, and instance-independent
function `extract_logged_video_system_data()`. This function behaves exactly like the instance-bound log extraction
method does, but can be used to parse logged data without the need to have an initialized VideoSystem instance. You can
use the `log_path` property of an initialized VideoSystem instance to get the path to the .npz archive that stores
logged data after compression, which is a required argument for the instance-independent log extraction function.
___## API Documentation
See the [API documentation](https://ataraxis-video-system-api-docs.netlify.app/) for the
detailed description of the methods and classes exposed by components of this library.
___## Developers
This section provides installation, dependency, and build-system instructions for the developers that want to
modify the source code of this library.### Installing the library
The easiest way to ensure you have the most recent development dependencies and library source files is to install the
python environment for your OS (see below). All environments used during development are exported as .yml files and as
spec.txt files to the [envs](envs) folder. The environment snapshots were taken on each of the three explicitly
supported OS families: Windows 11, OSx Darwin, and GNU Linux.**Note!** Since the OSx environment was built for the Darwin platform (Apple Silicon), it may not work on Intel-based
Apple devices.1. If you do not already have it installed, install [tox](https://tox.wiki/en/latest/user_guide.html) into the active
python environment. The rest of this installation guide relies on the interaction of local tox installation with the
configuration files included in with this library.
2. Download this repository to your local machine using your preferred method, such as git-cloning. If necessary, unpack
and move the project directory to the appropriate location on your system.
3. ```cd``` to the root directory of the project using your command line interface of choice. Make sure it contains
the `tox.ini` and `pyproject.toml` files.
4. Run ```tox -e import``` to automatically import the os-specific development environment included with the source
distribution. Alternatively, you can use ```tox -e create``` to create the environment from scratch and automatically
install the necessary dependencies using pyproject.toml file.
5. If either step 4 command fails, use ```tox -e provision``` to fix a partially installed environment.**Hint:** while only the platforms mentioned above were explicitly evaluated, this project will likely work on any
common OS but may require additional configuration steps.### Additional Dependencies
In addition to installing the required python packages, separately install the following dependencies:
1. [Python](https://www.python.org/downloads/) distributions, one for each version that you intend to support. These
versions will be installed in-addition to the main Python version installed in the development environment.
The easiest way to get tox to work as intended is to have separate python distributions, but using
[pyenv](https://github.com/pyenv/pyenv) is a good alternative. This is needed for the 'test' task to work as
intended.### Development Automation
This project comes with a fully configured set of automation pipelines implemented using
[tox](https://tox.wiki/en/latest/user_guide.html). Check [tox.ini file](tox.ini) for details about
available pipelines and their implementation. Alternatively, call ```tox list``` from the root directory of the project
to see the list of available tasks.**Note!** All commits to this project have to successfully complete the ```tox``` task before being pushed to GitHub.
To minimize the runtime for this task, use ```tox --parallel```.For more information, you can also see the 'Usage' section of the
[ataraxis-automation project](https://github.com/Sun-Lab-NBB/ataraxis-automation#Usage) documentation.### Automation Troubleshooting
Many packages used in 'tox' automation pipelines (uv, mypy, ruff) and 'tox' itself are prone to various failures. In
most cases, this is related to their caching behavior. Despite a considerable effort to disable caching behavior known
to be problematic, in some cases it cannot or should not be eliminated. If you run into an unintelligible error with
any of the automation components, deleting the corresponding .cache (.tox, .ruff_cache, .mypy_cache, etc.) manually
or via a cli command is very likely to fix the issue.___
## Versioning
We use [semantic versioning](https://semver.org/) for this project. For the versions available, see the
[tags on this repository](https://github.com/Sun-Lab-NBB/ataraxis-video-system/tags).---
## Authors
- Ivan Kondratyev ([Inkaros](https://github.com/Inkaros))
- Jacob Groner ([Jgroner11](https://github.com/Jgroner11))
- Natalie Yeung___
## License
This project is licensed under the GPL3 License: see the [LICENSE](LICENSE) file for details.
___## Acknowledgments
- All Sun lab [members](https://neuroai.github.io/sunlab/people) for providing the inspiration and comments during the
development of this library.
- The creators of all other projects used in our development automation pipelines [see pyproject.toml](pyproject.toml).---