Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/sintef/hybridmap
Rust hybrid map using smallvec and the std hashmap
https://github.com/sintef/hybridmap
crate hashmap map rust smallvec
Last synced: about 2 months ago
JSON representation
Rust hybrid map using smallvec and the std hashmap
- Host: GitHub
- URL: https://github.com/sintef/hybridmap
- Owner: SINTEF
- License: apache-2.0
- Created: 2024-01-29T21:34:55.000Z (11 months ago)
- Default Branch: main
- Last Pushed: 2024-08-01T14:11:42.000Z (5 months ago)
- Last Synced: 2024-08-08T21:10:48.056Z (5 months ago)
- Topics: crate, hashmap, map, rust, smallvec
- Language: Rust
- Homepage: https://crates.io/crates/hybridmap
- Size: 22.5 KB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# HybridMap
HybridMap is a Rust™ hybrid map implementation that uses a vector on the memory stack for small maps and a hash map overwise.
As with most hybrid technologies, including two components instead of one is one too many. However, the hybrid solution can provide some value for specific use cases.
HybridMap can be slightly faster for tiny maps, especially short-lived ones living on the memory stack, usually up to 16 entries and without too many lookups.
## Example
HybridMap can be used like most other maps.
```rust
use hybridmap::HybridMap;let mut map = HybridMap::::new();
map.insert(1, "one");
map.insert(2, "two");assert_eq!(map.get(&1), Some(&"one"));
assert_eq!(map.len(), 2);
```## Benchmarks
The benchmark is unlikely to be representative of your use cases. You might see some of the gains shown below if you create many short-lived small maps. You may also get worse performances than a standard hash map.
You could adapt the benchmarks to your use cases. If you don't know whether you should use this hybrid map or a hashmap, you should go with a hashmap. As the numbers show, the performance gain is not that great.
*Results on a Macbook Pro M1:*
| Type | Map | Size | Median Time (ns) | Performance Gain |
|--------|----------------|------|------------------|------------------|
| i64 | HashMap | 1 | 248 | |
| i64 | **HybridMap** | 1 | 194 | x1.28 |
| i64 | HashMap | 4 | 1 117 | |
| i64 | **HybridMap** | 4 | 822 | x1.36 |
| i64 | HashMap | 16 | 4 581 | |
| i64 | **HybridMap** | 16 | 3 241 | x1.41 |
| i64 | HashMap | 128 | 36 593 | |
| i64 | **HybridMap** | 128 | 36 629 | x1.0 |
| uuid | HashMap | 1 | 335 | |
| uuid | **HybridMap** | 1 | 235 | x1.43 |
| uuid | HashMap | 4 | 1 610 | |
| uuid | **HybridMap** | 4 | 941 | x1.71 |
| uuid | HashMap | 16 | 6 346 | |
| uuid | **HybridMap** | 16 | 6 424 | x0.99 |
| uuid | HashMap | 128 | 49 799 | |
| uuid | **HybridMap** | 128 | 49 841 | x1.0 |
| string | HashMap | 1 | 1 176 | |
| string | **HybridMap** | 1 | 1 113 | x1.06 |
| string | HashMap | 4 | 5 313 | |
| string | **HybridMap** | 4 | 4 695 | x1.13 |
| string | HashMap | 16 | 21 626 | |
| string | **HybridMap** | 16 | 21 009 | x1.03 |
| string | HashMap | 128 | 156 010 | |
| string | **HybridMap** | 128 | 156 880 | x0.99 |In this benchmark, the HybridMap switches to a HashMap internally once it has more than `16` entries. This benchmark is not a very robust benchmark. Benchmarking HybridMap correctly is hard and requires more effort than implementing the crate. As the license says, use at your own risk.
*However for tiny maps, that are short-lived, the performance gain could be more interesting:*
| Type | Len | Median Time (ns) | Performance Gain |
|-----------------------|---------|------------------|------------------|
| HashMap | 1 | 130 | |
| HashMap | 2 | 173 | |
| HybridMap | 1 | 50 | x2.61 |
| HybridMap | 2 | 174 | x0.99 |
| HybridMap | 1 | 53 | x2.45 |
| HybridMap | 2 | 80 | x2.17 |```bash
# Run the benchmarks
cargo bench --bench=hybridmap_bench -- --quick --quiet# Run this command instead if you have more patience
cargo bench --bench=hybridmap_bench# Open the results in a browser
open target/criterion/report/index.html
# or
xdg-open target/criterion/report/index.html
```## Memory Usage
HybridMap has a small memory overhead, the enum variant between the vector and the hashmap and a vector pre-allocated on the stack.
The default vector size on the stack is `8` entries. You may save a tiny bit of memory by adapting the vector size to the number of entries you expect to store in the maps. But a large vector will very quickly be a waste of resources. Consider staying below `20`.
For maps containing very few entries, one or two, memory usage can be one order of magnitude smaller than a hashmap. Otherwise, the memory usage is similar to a normal hashmap.
You can adapt the `benches/hybridmap_memory.rs` file to your use case.
```bash
# Run the memory benchmark
# You will probably have to run it many times without things in the background
# to get a coherent result.
cargo bench --bench=hybridmap_memory
```## Why ?
I started benchmarking tiny maps to check whether I should switch from HashMap to BTreeMap for my use case. I also had a naive Vec implementation that was faster despite for small maps. Thus, I made this crate for fun.
The energy savings this crate may bring probably do not compensate for the energy I used to boil water for my tea while implementing this crate. But it was fun.
## License
This project is licensed under the Apache License, Version 2.0 - see the [LICENSE](LICENSE) file for details.
## Acknowledgements
* Inspired by [robjtede/tinymap/](https://github.com/robjtede/tinymap/).
* Use [smallvec](https://github.com/servo/rust-smallvec).