An open API service indexing awesome lists of open source software.

https://github.com/vsadov/nonblocking

Implementation of a lock-free dictionary on .Net.
https://github.com/vsadov/nonblocking

dictionary fast high-performance lock-free wait-free

Last synced: about 1 year ago
JSON representation

Implementation of a lock-free dictionary on .Net.

Awesome Lists containing this project

README

          

# NonBlocking
Implementation of a non-blocking dictionary.
https://www.nuget.org/packages/NonBlocking

## Overview

NonBlocking dictionary:

- NonBlocking dictionary has the same API as ConcurrentDictionary.
- No locks are taken during any operation including Get, Add, Remove, internal resizes etc...
- While multiple threads accessing NonBlocking dictionary will help each other in operations such as table resizing, there is no dependency on such behavior. If any thread get unscheduled or delayed for whatever reason, other threads will be able to make progress independently.

## Implementation notes
Core algorithms are based on NonBlockingHashMap, written and released to the public domain by Dr. Cliff Click.
A good overview is here https://www.youtube.com/watch?v=HJ-719EGIts

Most differences of this implementation are motivated by the differences in public APIs on .Net platform:
- .Net has structs and reified generics.
- platform APIs such as Interlocked.CompareExchange have differences.
- ConcurrentDictionary API differs from a typical java dictionary.

## Tested on platforms:
* x86
* x64
* arm
* arm64

## Performance

On most operations NonBlocking dictionary is faster than ConcurrentDictionary. It is particularly faster in write-heavy scenarios.
Read/Write oprations scale linearly with the number of active threads if hardware/OS permits.

## Benchmarks

The test machine is
AMD EPYC 7V12 64-Core Processor

Base speed: 2.44 GHz
Sockets: 2
Cores: 128
Logical processors: 256

Configured as: 4 NUMA nodes

The following sample benchmarks perform various operations on {int --> string} dictionaries and run as 64bit process on net6.0 runtime.

![Get](/../pictures//Graphs/Read.png?raw=true "Random Get")
The machine is configured as 4-node NUMA and Windows scheduler would use cores from one node, then their HT siblings, and only then use another node. Thus we see "steps".

![Write](/../pictures//Graphs/Write.png?raw=true "Random Write")
Not taking locks makes writes cheaper and scale better as number of threads increases.