Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

https://github.com/penberg/awesome-low-latency

Patterns and resources of low latency programming.
https://github.com/penberg/awesome-low-latency

List: awesome-low-latency

awesome low-latency

Last synced: about 1 month ago
JSON representation

Patterns and resources of low latency programming.

Host: GitHub
URL: https://github.com/penberg/awesome-low-latency
Owner: penberg
Created: 2022-05-31T07:13:58.000Z (about 2 years ago)
Default Branch: main
Last Pushed: 2023-01-04T09:49:09.000Z (over 1 year ago)
Last Synced: 2024-05-15T01:11:13.798Z (about 1 month ago)
Topics: awesome, low-latency
Homepage:
Size: 28.3 KB
Stars: 374
Watchers: 20
Forks: 10
Open Issues: 0
Metadata Files:
- Readme: README.md

Lists

lists - awesome-low-latency
awesome-stars - penberg/awesome-low-latency - Patterns and resources of low latency programming. (Others)
fucking-lists - awesome-low-latency

README

# Awesome Low Latency

Low latency programming is increasingly important across a variety of use cases. Still, many of the tips and tricks of low latency are only part of developer folklore.
This document attempts to codify that knowledge for people to (re)discover the art of low-latency programming.

## Patterns

### How to Measure Latency Correctly

* Latency is a distribution
* Avoid coordinated omission

### Avoid Data Movement

* Co-locate compute and data e.g. Processing-In-Memory or Processing-Near-Memory
* Replicate data for faster access
* Maximize cache hit rate
* Control memory access patterns

### Avoid Work

* Avoid dynamic memory management
* Avoid demand paging to prevent memory thrashing e.g. by using larger memory pages (hugepages on Linux, superpages on FreeBSD, ...)
* Avoid as much work as possible (for example, avoid function call overhead by using inlining)
* Avoid CPU intensive computation.

### Avoid Waiting

* Partition data to avoid sharing (and, therefore, synchronization)
* Make shared data structures read-only (when possible)
* Reduce head-of-line blocking
* Avoid context switching
* Use wait-free data synchronization
* Use busy-polling instead of wakeups
* Disable Nagle's algorithm
* Use non-blocking I/O

### Hide Latency

* Parallelize requests to different services
* Request hedging (send redundant requests to multiple replicase, use response from fastest one)
* Use optimized SIMD instructions for suitable problems
* Multiprocessing and multithreading

### Tune for Low Latency

* Use preemptible kernel
* Interrupt and process affinity
* Watch out for bad device drivers

### Advanced Topics

* Use kernel-bypass networking such as DPDK or XDP
* Use hardware offload with accelerators and FPGA

## Blogs

* [11 Best Practices for Low Latency Systems](https://bdarfler.medium.com/11-best-practices-for-low-latency-systems-a00fc6e0dfda) by Ben Darfler (2014).
* [Optimizing web servers for high throughput and low latency](https://dropbox.tech/infrastructure/optimizing-web-servers-for-high-throughput-and-low-latency) by Alexey Ivanov (2017).

## Publications

* [The Tail at Scale](https://cacm.acm.org/magazines/2013/2/160173-the-tail-at-scale/fulltext) by Jeffrey Dean and Luiz André Barroso (2013)
* [Tales of the Tail: Hardware, OS, and Application-level Sources of Tail Latency](https://drkp.net/papers/latency-socc14.pdf) by Jialin Li et al (2014)
* [Amdahl’s Law for Tail Latency](https://www.csl.cornell.edu/~delimitrou/papers/2018.cacm.amdahlsTail.pdf) by Christina Delimitrou and Christos Kozyrakis (2018)

## Conferences

* [P99 CONF](https://www.p99conf.io)