Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/Srekel/zig-sparse-set
Sparse sets for zig, supporting both SOA and AOS style
https://github.com/Srekel/zig-sparse-set
public-domain sparse-sets zig
Last synced: 2 months ago
JSON representation
Sparse sets for zig, supporting both SOA and AOS style
- Host: GitHub
- URL: https://github.com/Srekel/zig-sparse-set
- Owner: Srekel
- License: other
- Created: 2019-11-15T21:18:21.000Z (about 5 years ago)
- Default Branch: master
- Last Pushed: 2024-06-15T16:45:56.000Z (7 months ago)
- Last Synced: 2024-08-03T23:16:52.259Z (6 months ago)
- Topics: public-domain, sparse-sets, zig
- Language: Zig
- Size: 758 KB
- Stars: 62
- Watchers: 5
- Forks: 2
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE.txt
Awesome Lists containing this project
- awesome-zig - zig-sparse-set๐๏ธSparse sets for zig, supporting both SOA and AOS style
- awesome-zig - Srekel/zig-sparse-set - sparse-set ๐ก. Sparse sets for zig, supporting both SOA and AOS style. (Data Structure and Algorithm / Linker)
README
# :ferris_wheel: zig-sparse-set :ferris_wheel:
An implementation of Sparse Sets for Zig.
## :confused: What is a Sparse Set? :confused:
A Sparse Set - well, this implementation technique specifically at least - is a fairly simple data structure with some properties that make it especially useful for some aspects in game development, but you know... it's probably interesting for other areas too.
Here's a good introduction: https://research.swtch.com/sparse
Basically, sparse sets solve this problem: You have a bunch of ***sparse*** handles, but you want to loop over the values they represent linearly over memory.
## :point_down: Example :point_down:
Maybe your game has a few hundred **entities** with a certain **component** (specific piece of game data) at any given time. An entity is a 16 bit ID (handle) and the entities containing this component eventually gets spread out randomly over 0..65535 during runtime.
:anguished: In your frame update function, it would be nice to **not** do this... :anguished:
```zig
for (active_entities) |entity| {
var some_big_component = &self.big_components[entity];
some_big_component.x += 3;
}
```... because it would waste memory. Or, in the case that the component isn't huge, but it's expensive to instantiate (i.e. you can't just zero-init it, for example), you need to do that 65k times at startup. Additionally, you'll be skipping over swaths of memory, so every access will likely be a cache miss.
:worried: Similarly, you might want to avoid this... :worried:
```zig
for (self.components) |some_component| {
if (some_component.enabled) {
some_component.x += 3;
}
}
```...because you'll be doing a lot of looping over data that isn't of interest. Also, you need to store a flag for every component. So potentially cache misses for the `+=` operation here too.
(Note that the flag could be implemented as a bitset lookup, for example, which would probably be better, but still have the same underlying problem).
:heart_eyes: With a sparse set, you can always simply loop over the data linearly: :heart_eyes:
```zig
for (self.component_set.toValueSlice()) |*some_component| {
some_component.x += 3;
}
```## :sunglasses: But wait, there's more! :sunglasses:
1) **O(1)** Lookup from sparse to dense, and vice versa.
2) **O(1)** **Has,** **Add**, and **Remove**.
3) **O(1)** **Clear** (remove all elements).
4) **O(d)** iteration (dense list).
5) Elements of sparse and dense lists do not need to be (and are not) initialized upon startup - they are undefined.
6) Supports SOA-style component layout. (See **References** below if you're unfamiliar with what that is)
7) Supports AOS-style too by optionally storing and managing a value array internally.
8) Can be inspected "easily" in a debugger.
9) Optional error-handling.
10) Even if you don't need to loop over the values, a sparse set is a potential alternative to a hash map.
11) Optionally growable.:star: [1] This is nice and important because you can then do:
```zig
for (self.component_set.toValueSlice()) |*some_component, dense_index| {
some_component.x += 3;
var entity = self.component_set.getByDense(dense_index);
self.some_other_system.doStuffWithEntity(entity, some_component.x, 1234567);
}
```:star: [2] The O(1) remove is important. It is solved by swapping in the last element into the removed spot. So there's two things to consider there:
1) Is it cheap to copy the data? Like, if your component is large or needs some kind of allocation logic on copy.
2) Is it OK that the list of components are **unsorted**? If not, sparse sets are not a good fit.:star: [5] Special care has been taken (depending on a couple of coming Zig changes) to ensure that neither Valgrind nor Zig will complain about possibly accessing uninitialized memory.
:star: [6] With the standard SparseSet implementation, it doesn't actually store any data - you have to do that manually. If you want to, you can store it in an SOA - "Structure of Arrays" manner, like so:
(**Note:** For both the following SOA example and the AOS example below, you can look at src/test.zig for the whole example.)
```zig
const Entity = u32;
const Vec3 = struct {
x: f32 = 0,
y: f32 = 0,
z: f32 = 0,
};
const MyPositionSystemSOA = struct {
component_set: sparse_set.SparseSet(Entity, u8) = undefined,
xs: [256]f32 = [_]f32{0} ** 256,
ys: [256]f32 = [_]f32{0} ** 256,
zs: [256]f32 = [_]f32{0} ** 256,
```The trick then is to **make sure** you handle the dense indices:
```zig
const MyPositionSystem = struct {
// ...pub fn addComp(self: *Self, ent: Entity, pos: Vec3) void {
var dense = self.component_set.add(ent);
self.xs[dense] = pos.x;
self.ys[dense] = pos.y;
self.zs[dense] = pos.z;
}pub fn removeComp(self: *Self, ent: Entity) void {
var dense_old: u8 = undefined;
var dense_new: u8 = undefined;
self.component_set.removeWithInfo(ent, &dense_old, &dense_new);
self.xs[dense_new] = self.xs[dense_old];
self.ys[dense_new] = self.ys[dense_old];
self.zs[dense_new] = self.zs[dense_old];
}pub fn updateComps(self: *Self) void {
for (self.component_set.toSparseSlice()) |ent, dense| {
self.xs[dense] += 3;
}
}
};
```:star: [7] With SparseSetAOS, things are simplified for you, and this will probably be the most common use case. It has the same API but has a few additional functions, and also stores an internal list of all the data.
```zig
const MyPositionSystemAOS = struct {
component_set: sparse_set_aos.SparseSetAOS(Entity, u8, Vec3) = undefined,// ...
pub fn addComp(self: *Self, ent: Entity, pos: Vec3) void {
_ = self.component_set.add(ent, pos);
}pub fn removeComp(self: *Self, ent: Entity) void {
self.component_set.remove(ent);
}pub fn getComp(self: *Self, ent: Entity) Vec3 {
return self.component_set.getValueBySparse(ent).*;
}pub fn updateComps(self: Self) void {
for (self.component_set.toValueSlice()) |*value, dense| {
value.x += 3;
}
}
```:star: [8] Compared to a sparse-handle-as-index lookup, you don't have a long list of "duds" between each valid element. And compared to a hash map, it should be more straightforward to introspect the sparse set's linear list.
:star: [9] All functions that can assert from bad usage (e.g. adding more handles than the capacity, or indexing out of bounds) also has a corresponding "OrError" function that does Zig-style error handling.
## :smiling_imp: Hold up... What's the catch? :smiling_imp:
1) Well, there's the unsorted thing.
2) If you remove things frequently, and/or your objects are expensive to copy, it may outweigh the benefit of having the data continuous in memory.
3) The lookup requires `@sizeOf(DenseT) * MaxSparseValue` bytes. So for the example above, if you know that you will never have more than 256 of a specific component, then you can store the dense index as a `u8`. This would result in you needing `65536 * 1 byte = 64 kilobytes`. If you need more than 256 components, and say only 16k entities, you'd need 32 kilobytes.
* **Note:** The dense -> sparse lookup is likely significantly smaller: `@sizeof(SparseT) * MaxDenseValue`, so for example `256 * 2 bytes = 512 bytes`.
4) If you don't need to loop over the elements, and are starved for memory, a hash map might be a better option.
5) Compared to looking the value up directly using the sparse handle as an array index, there's an extra indirection.
6) Using uninitialized memory may cause some validators to complain. As mentioned above, Valgrind and Zig should be fine.So, all in all, there are of course benefits and drawbacks to sparse sets. You'll have to consider this on a case-by-case basis.
## :page_with_curl: License :page_with_curl:
Pick your license: Public Domain (Unlicense) or MIT.
## :statue_of_liberty: Examples :statue_of_liberty:
See `src/test.zig`. Especially the MyPositionSystem ones.
Here is the "unit test" that is used for generating documentation, it uses all of the functionality:
```zig
test "docs" {
const Entity = u32;
const DenseT = u8;
const DocValueT = i32;
const DocsSparseSet = SparseSet(.{
.SparseT = Entity,
.DenseT = DenseT,
.ValueT = DocValueT,
.allow_resize = .NoResize,
.value_layout = .InternalArrayOfStructs,
});var ss = DocsSparseSet.init(std.debug.global_allocator, 128, 8) catch unreachable;
defer ss.deinit();var ent1: Entity = 1;
var ent2: Entity = 2;
_ = try ss.addOrError(ent1);
_ = try ss.addValueOrError(ent2, 2);
std.testing.expectEqual(@as(DenseT, 2), ss.len());
try ss.removeOrError(ent1);
var old: DenseT = undefined;
var new: DenseT = undefined;
try ss.removeWithInfoOrError(ent2, &old, &new);
_ = ss.toSparseSlice();
_ = ss.toValueSlice();
std.testing.expectEqual(@as(DenseT, 0), ss.len());
ss.clear();
std.testing.expectEqual(@as(DenseT, 8), ss.remainingCapacity());_ = try ss.addValueOrError(ent1, 10);
std.testing.expectEqual(@as(DenseT, 0), try ss.getBySparseOrError(ent1));
std.testing.expectEqual(@as(DocValueT, 10), (try ss.getValueBySparseOrError(ent1)).*);
std.testing.expectEqual(@as(Entity, ent1), try ss.getByDenseOrError(0));
std.testing.expectEqual(@as(DocValueT, 10), (try ss.getValueByDenseOrError(0)).*);
}
```## :paperclip: References :paperclip:
* [incrediblejr's](https://gist.github.com/incrediblejr) C implementation [ijss](https://gist.github.com/incrediblejr/931efb7587e1ab328fa65ecc94d1009f).
* The above [article](https://research.swtch.com/sparse) describing this technique.
* The [Zig](https://ziglang.org/) language.
* Jonathan Blow SOA/AOS [video](https://youtu.be/YGTZr6bmNmk) overview.
* My [twitter](https://twitter.com/Srekel).