https://github.com/wizenink/bring

High-performance, lock-free SPSC ring buffer
https://github.com/wizenink/bring

concurrent cpp lock-free ring-buffer spsc

Last synced: 3 months ago
JSON representation

High-performance, lock-free SPSC ring buffer

Host: GitHub
URL: https://github.com/wizenink/bring
Owner: wizenink
License: mit
Created: 2026-01-02T09:06:44.000Z (5 months ago)
Default Branch: master
Last Pushed: 2026-01-05T10:18:01.000Z (5 months ago)
Last Synced: 2026-01-08T08:26:02.474Z (5 months ago)
Topics: concurrent, cpp, lock-free, ring-buffer, spsc
Language: CMake
Homepage:
Size: 35.2 KB
Stars: 1
Watchers: 0
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

          # Bring - Lock-Free SPSC Ring Buffer

[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)

[![C++23](https://img.shields.io/badge/C%2B%2B-23-blue.svg)](https://en.cppreference.com/w/cpp/23)

A high-performance, lock-free **Single Producer Single Consumer (SPSC)** ring buffer implementation in modern C++23. Designed for maximum throughput and minimal latency in concurrent scenarios.

## Features

- **Lock-Free**: Uses atomics with acquire-release semantics for thread safety

- **Cache-Optimized**: False sharing prevention with 64-byte alignment

- **Zero-Copy**: Move semantics and in-place construction support

- **Type-Safe**: C++23 concepts for compile-time validation

- **Header-Only**: Easy integration, just include and use

## Quick Start

### Requirements

- C++23 compatible compiler (GCC 12+, Clang 16+, MSVC 2022+)

- CMake 3.20+

### Installation

#### Header-Only

Simply copy `include/bring/ring_buffer.hpp` to your project:

```cpp

#include 

bring::RingBuffer buffer;

```

#### CMake Integration

```cmake

include(FetchContent)

FetchContent_Declare(

  bring

  GIT_REPOSITORY https://github.com/wizenink/bring.git

  GIT_TAG master

)

FetchContent_MakeAvailable(bring)

target_link_libraries(your_target PRIVATE bring::bring)

```

### Basic Usage

```cpp

#include 

#include 

// Create a ring buffer (capacity must be power of 2)

bring::RingBuffer buffer;

// Producer thread

std::thread producer([&buffer]() {

    for (int i = 0; i < 1000; ++i) {

        while (!buffer.try_push(i)) {

            std::this_thread::yield(); // Buffer full, retry

        }

    }

});

// Consumer thread

std::thread consumer([&buffer]() {

    for (int i = 0; i < 1000; ++i) {

        auto value = buffer.try_pop();

        while (!value.has_value()) {

            std::this_thread::yield(); // Buffer empty, retry

            value = buffer.try_pop();

        }

        // Process value.value()

    }

});

producer.join();

consumer.join();

```

## API Reference

### Construction

```cpp

bring::RingBuffer buffer;

```

- `T`: Element type (must be move-constructible and destructible)

- `Capacity`: Buffer size (must be power of 2, > 1)

### Core Operations

#### `try_push(item) -> bool`

Try to push an item into the buffer. Returns `true` on success, `false` if buffer is full.

```cpp

if (buffer.try_push(42)) {

    // Success

}

```

#### `try_pop() -> std::optional`

Try to pop an item from the buffer. Returns `std::optional` with value on success, `std::nullopt` if empty.

```cpp

auto result = buffer.try_pop();

if (result.has_value()) {

    int value = result.value();

}

```

#### `try_pop_ip(T& out) -> bool`

In-place pop for better performance (avoids `std::optional` overhead).

```cpp

int value;

if (buffer.try_pop_ip(value)) {

    // value contains popped element

}

```

#### `try_consume(Func&& processor) -> bool`

Process element with callback without extracting it.

```cpp

buffer.try_consume([](int&& value) {

    // Process value

});

```

#### `emplace(Args&&... args) -> bool`

Construct element in-place. Returns `true` on success.

```cpp

buffer.emplace(arg1, arg2, arg3);

```

### Query Operations

#### `is_empty() -> bool`

Check if buffer is empty.

#### `is_full() -> bool`

Check if buffer is full.

#### `get_state() -> BufferState`

Atomically check both empty and full state from a consistent snapshot.

```cpp

auto state = buffer.get_state();

if (state.empty) {

    // Buffer is empty

}

if (state.full) {

    // Buffer is full

}

```

## Performance

Benchmarks show exceptional performance for SPSC scenarios:

- **Single-threaded**: ~1-2 ns per push/pop operation

- **Multi-threaded (SPSC)**: ~3-10 ns per operation depending on buffer size

- **Cache performance**: >99.999% L1 data cache hit rate (validated with cachegrind)

- **Throughput**: Up to 300M+ operations/second on modern hardware

Run `cmake --build build --target run_benchmarks` to see performance on your system.

## Design Decisions

### Why Power-of-2 Capacity?

Enables fast modulo operations using bitwise AND: `(index + 1) & (Capacity - 1)`

### Memory Ordering

- **Acquire-Release**: Ensures visibility of data between producer/consumer

- **Relaxed**: Used for same-thread operations where ordering not critical

### Cache Line Alignment

Head and tail pointers are aligned to 64-byte boundaries to prevent false sharing between producer and consumer threads.

### One-Slot Reservation

The buffer reserves one slot to distinguish between full and empty states (when `head == tail`).

## Building & Testing

### Running Tests

```bash

# Configure with testing enabled

cmake -B build -DBUILD_TESTING=ON

# Build

cmake --build build

# Run all tests using CMake target

cmake --build build --target run_tests

# Or run tests individually

./build/unit_tests        # Single-threaded tests

./build/mt_tests          # Multi-threaded stress tests

# Or use CTest

cd build && ctest --output-on-failure

```

### Running Benchmarks

```bash

# Configure with benchmarks enabled

cmake -B build -DBUILD_BENCHMARKS=ON -DCMAKE_BUILD_TYPE=Release

# Build and run benchmarks using CMake target

cmake --build build --target run_benchmarks

# Or run directly

./build/benchmarks

# Run with specific parameters

./build/benchmarks --benchmark_filter=BM_SPSC --benchmark_min_time=1.0s

```

### Cache Performance Analysis

```bash

# Build the cachegrind benchmark

cmake -B build -DBUILD_TESTING=ON

# Run with valgrind cachegrind

valgrind --tool=cachegrind --cachegrind-out-file=cachegrind.out ./build/cachegrind_bench

# View results

cg_annotate cachegrind.out | head -50

```

## Thread Safety

**SPSC Only**: This ring buffer is designed for exactly one producer thread and one consumer thread. Using it with multiple producers or consumers will result in race conditions.

For MPSC/MPMC scenarios, use a different data structure or add external synchronization.

## License

This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/wizenink/bring

Awesome Lists containing this project

README