Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/zetzit/zz
πΊπ ZetZ a zymbolic verifier and tranzpiler to bare metal C
https://github.com/zetzit/zz
Last synced: about 1 month ago
JSON representation
πΊπ ZetZ a zymbolic verifier and tranzpiler to bare metal C
- Host: GitHub
- URL: https://github.com/zetzit/zz
- Owner: zetzit
- License: mit
- Archived: true
- Created: 2019-06-20T19:03:13.000Z (over 5 years ago)
- Default Branch: master
- Last Pushed: 2022-06-17T01:37:37.000Z (over 2 years ago)
- Last Synced: 2024-05-23T05:32:12.068Z (6 months ago)
- Language: Rust
- Homepage:
- Size: 2.03 MB
- Stars: 1,602
- Watchers: 40
- Forks: 52
- Open Issues: 32
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- awesome-rust-formalized-reasoning - zz - zymbolic verifier and tranzpiler to bare metal C. (Programming Language / Libraries)
README
![logo](logo2.png?raw=true)
ZetZ is for systems without dynamic memory, where C is and will remain the defacto standard system interface.
Target bare metal MCUs, embedded linux, WASM, and embed it in other languages.You can also use it to build cross platform libraries, with a clean portable C-standard api.
Zetz plays nice with others and does not require rewriting everything in zetz to be useful in large projects.A major innovative feature is that all code is formally verified by symbolic execution in a virtual machine,
at compile time.### quick quick start
1. Install https://github.com/Z3Prover/z3 usually through a distro package
2. Get the latest binary from http://bin.zetz.it## Discord Community
https://discord.gg/EsMxjWtcf5
## editor support
- Emacs: [zetz-mode](https://github.com/damon-kwok/zetz-mode)
- Vim: [zz.vim](https://github.com/zetzit/vim)### how it looks
ZZ has some go and rust aesthetics, but remains a C dialect at the core.
```C++
using ::{printf}struct Random {
u32 num;
}fn rng(Random *self) u32 {
return self->num;
}export fn main() int {
let r = Random{
num: 42,
};
printf("your lucky number: %u\n", r.rng());
return 0;
}```
### the basic ideas
#### plain C ABI
ZZ emits plain C and it will always do that. It is one of the main reasons it exists.
It will always emit C into a C compiler which will then emit the binary.most modern languages have their own ABI, deviating from C, so you have to use glue tools to play nice with others.
with ZZ being emitted as C, all you do is include the header.There is no stack unwinding (C++, rust), and no coroutines (go), so all code emits to plain ansi C
with no requirements towards compiler features.ZZ works nicely with vendor provided closed source compiler for obscure systems.
Like arduino, esp32, propriatary firmware compilers, and integrates nicely into existing industry standard microkernels like zephyr, freertos, etc.#### safety and correctness with symbolic execution
Unlike other modern languages, ZZ has raw unchecked pointers as default. Nothing prevents you from doing crazy things,
as long as you have mathematical proof that what you're doing is defined behaviour according to the C standard.Checking is done by executing your code in SSA form at compile time within a virtual machine in an SMT prover.
None of the checks are emitted into runtime code.The standard library is fully stack based and heap allocation is strongly discouraged.
ZZ has convenience tools to deal with the lack of flexibility that comes with that, such as checked tail pointers.#### namespaces, autogenerated headers and declaration ordering
No modern language has headers or semantically relevant declaration order and neither does ZZ.
but since it interacts with raw C nicely, it also emits headers, and orders declarations so a c compiler likes them.ZZ puts module namespaces into the C symbol using underscores instead of mangling.
so my::lib::hello becomes my_lib_hello, which is C convention.### language reference
#### mutability: const, mut
by default, everything is const. this is the opposite of C. the mut keyword is used to make a global variable, or function argument mutable.
#### polymorphism
ZZ follows the C model of polymorphism: any struct can be cast to the same type as its first member.
In ZZ the cast is implicit because it is always safe.```C++
struct Vehicle {
int wheels;
}struct Car {
Vehicle base;
}fn allowed_entry(Vehicle *self) bool {
return self->wheels <= 2;
}fn main() {
Car c{
base: Vehicle {
wheels: 4,
}
};
assert(!c.allowed_entry());
}```
#### where and model
ZZ requires that all memory access is mathematically proven to be defined.
Defined as in the opposite of "undefined behaviour" in the C specification.
In other words, undefined behaviour is not allowed in ZZ.You will quite often be told that by the compiler that something is not provable,
like indexing into an array.this is not ok:
```C
fn bla(int * a) {
a[2];
}
```you must tell the compiler that accessing the array at position 2 is defined. quick fix for this one:
```C
fn bla(int * a)
where len(a) == 3
{
a[2];
}
```this will compile. its not a very useful function tho, because trying to use it in any context where the array is not len 3 will not be allowed.
here's a better example:```C
fn bla(int * a, int l)
where len(a) >= l
{
if l >= 3 {
a[2];
}
}
```thanks to the underlying SMT solver, the ZZ symbolic executor will know that `a[2]` is only executed in the case where `len(a) >= l >= 3`, so it is defined.
The where keyword requires behaviour in the callsite, and the model keyword declares how the function itself will behave.
```C
fn bla(int a) int
model return == 2 * a
{
return a * a;
}
```In this simple example, we can declare that a function returns 2 times its input.
But it actually does not, so this won't compile.### theory
we can use annotations to define states for types, which neatly lets you define which calls are legal on which
type at a given time in the program without ANY runtime code.```C++
theory is_open(int*) bool;
fn open(int mut* a)
model is_open(a)
{
static_attest(is_open(a));
*a = 1;
}fn read(int require mut* a)
where is_open(a)
model is_open(a)
{
return *a;
}fn close(int mut* a)
{
*a = 0;
}
```the above example defines a type state transition that is legal: open -> read -> close
any other combination will lead to a compile error, such as read before open.#### storage: const, static, atomic and thread_local
const and static work exactly like in rust, but with C syntax.
```C
export const uint32_t foo = 3;
static mutable float blarg = 2.0/0.3;
thread_local mutable bool bob = true;
atomic mutable int marvin = 0;
```const is inlined in each module and therefore points to different memory in each module.
static has a global storage location, but is private to the current module.in effect, there is no way to declare a shared global writable variable.
ZZ has no borrowchecker, and the restriction has nothing to do with preventing multithread races.
Instead the declarations are selected so that the resulting exported binary interface can be mapped to any other language.if you need to export a global writeable memory location (which is still a bad idea, because threads),
you can define a function that returns a pointer to the local static.thread_local and atomic are mapped directly to the C11 keywords.
ZZ can use nicer keywords because there are no user defined names at the top level.#### visibility: pub, export
by default all declarations are private to a module
"export" can be used to make sure the declaration ends in the final result. that is in the binary and the export header.
"pub" marks a declaration as local to the project. it is usable in other zz modules, but not exported into the resulting binary
#### struct initializationTo prepare for type elision, all expressions have to have a known type.
```C
struct A {
int a;
int b;
}fn main() {
A a = A{
a : 2,
};
}
```#### conditional compilation / preprocessor
Like in rust, the prepro is not a string processor, but rather executed on the AST **after** parsing.
This makes it behave very different than C, even if the syntax is the same as C.The right hand side of #if is evaluated immediately and can only access preprocessor scope.
```C
struct A {
int a;
#if def("TEST")
uint proc;
#elif def("MAYBE")
int proc;
#else
void* proc;
#endif
}
```Every branch of an #if / #else must contain a completed statement,
and can only appear where a statement would be valid,
so this is not possible:```C
pub fn foo(
#if os("unix")
)
#endif
```note that even code that is disabled by conditions must still be valid syntax. It can however not be type checked,
#### a note on west-const vs east-const
ZZ enforces east-const. C is not a formally correct language, so in order to make ZZ formally correct, we have to make some syntax illegal.
In this case we sacrifice west-const, which is incosistent and difficult to comprehend anyway.west-const with left aligned star reads as if the pointer is part of the type, and mutability is a property of the pointer (which it is).
```C++
int mut* foo;
foo = 0; // compile error
*foo = 0 // valid
```unless you want to apply mutability to the local storage named foo
```C++
void * mut foo;
foo = 0; // valid
*foo = 0 // compile error
```Coincidentally this is roughly equivalent to Rust, so Rust devs should feel right at home.
Even if not, you will quickly learn how pointer tags works by following the compiler errors.#### closures
function pointers are difficult to do nicely but also make them emit to all languages well, so they don't really exist in ZZ.
instead you declare closures, which are automatically casted from and to functions```C++
closure rand_t() int;fn secure_random() int {
return 42;
}fn main() {
rand_t rand = secure_random;
}```
#### metaprogramming or templates: tail variants
technically zz does not have metaprogramming. template functions blow up code size and are difficult to export as plain C types.
instead zz makes code reusable by allowing allocations of structs to be larger than their member sizes.We call this the "tail". And it can be used to make functions on fixed size arrays reusable for other sizes.
here's the String type:
```C++
export struct String+ {
usize len;
char mem[];
}
```A + sign behind the name indicates this type has a tail.
The tail here is mem, which is specified as array with no size.a length function could be implemented with this signature:
```C++
fn len(String+t mut * self) {
return t;
}
```again, the + indicates a tail. in this case, the tail size is bound to a local variable named t,
which can be used in the function body.when allocating a new stack variable of type String, you also allocate a tail on the same stack
```C++
String+100 s = {0};
string::append(&s, "hello");
string::append(&s, "world");
printf("%.*s", s.len, s.mem);
```again, + means tail, but here we specify an integer value of exact numbers of char we would like to add.
the tail is measured in number of elements of whatever is the last unsized element in the struct, not in bytes.String can dynamically expand within the tail memory. in this case, we append some stuff to the string, without ever allocating any heap.
simply returning from the current function will clear up any memory used, without the need for destructor ordering or signal safety.#### symbols
Symbols are a big global enum that lets you create unique values from anywhere in your code.
```C++
using symbols;symbol Car;
symbol Bike;fn drive_this(usize sym)
where symbol(sym)
{
if sym == Car {
printf("bzzzz\n");
} else {
printf("what do i do with a %s?\n", symbols::nameof(sym));
}
}
```note that you cannot make assumptions about the integer value of a symbol,
as it depends on compilation and loading order#### new constructors
ZZ autogenerates bindings to more languages than C, and some languages are not fully compatible with C abi.
Specifically they don't allow returning structs from functions, a standard way to create "constructor" functions in C.intstead you should be using a function that takes a mut pointer as its first argument, and call it on a zero initialized stack variable.
zz has syntactic sugar for this with the 'new' keyword.```C++
struct A {
int a;
int b;
}fn empty(A mut new * self, int a)
{
self->a = a;
}fn main() {
new a = empty(3);
assert(a.a == 3)
assert(a.b == 0)
}
```new creates a new local variable with the correct size and passes it as self argument to the constructor
to create a local with a tail, use new like this:
```C++
new+100 foo = string::empty();
```#### procedural macros
macros in zz are fully compiled and executed at compile time for each call.
this allows constructing arbitrary complex macros using regular zz code.unlike C prepro macros, macros must emit complete expressions.
for example you cannot emit an open brace without a closing brace.a macro is compiled to a standalone executable, automatically including all dependencies.
the call arguments and derive context is passed as json to stdin,
and the macro is expected to print zz code to stdout.```C++
/! creates literal string with arg0 repeated arg1 times
export macro repeat() {new+1000 a = ast::from_macro();
err::assert2(a.args[0].t == ast::Expression::LiteralString, "expected arg0: string");
err::assert2(a.args[1].t == ast::Expression::Literal, "expected arg1: number");
let num = (int)atoi(a.args[1].v.string);printf("\"");
for int mut i = 0; i < num; i++ {
printf("%s", a.args[0].v.string);
}
printf("\"");
}export fn main() int {
printf("hello %s\n", repeat("world ", 32));
return 0;
}
```#### inline included C source
ZZ supports importing C source with the `using` keyword. Imported C
source can be inlined at the call site by using the `inline` keyword. In certain
cases the imported C source may depend on symbols defined in a ZZ
module. ZZ symbols can be given to an imported C file with the `needs`
keyword.In the example below the `native.h` header file depends on the
`example_Container` type to be defined. The `lib.zz` file imports the
header file in line and exposes the `Container` type to the header file
by specifying it with the `needs` keyword. The `Container` type is made
available as `example_Container` because the project name is
`example` and the type is `Container`.```c
// native.h
#include
void print(example_Container *self) {
printf("%s\n", (char *) self->value);
}void init(example_Container *self, void const *value) {
self->value = value;
}
``````c++
// lib.zz
inline using (needs Container) "native.h" as nativeexport struct Container {
void *value;
}export fn create_container(Container new mut *self, void *value) {
native::init(self, value);
}pub fn print(Container *self) {
native::print(self);
}
``````C++
using examplefn main() int {
new container = example::create_container("hello");
container.print();
return 0;
}
```#### packed structs and unions
ZZ supports packed structs and unions with the `packed` modifier. This
modifier, when used with `struct` or `union`, omits alignment padding.
This is typically used for architecture independant serialization at the
cost of causing alingment faultsBelow is an example of a packed and unpacked struct and their static
sizes printed to stdout.```C++
using ::{ printf }struct Packed packed {
u8 a;
u8 b;
int b;
}struct Unpacked {
u8 a;
u8 b;
int b;
}fn main() int {
printf("sizeof(Packed) == lu\n", sizeof(Packed)); // 6
printf("sizeof(Unpacked) == lu\n", sizeof(Unpacked)); // 8
return 0;
}
```#### environment variables
##### `ZZ_MODULE_PATHS`
When ZZ imports other ZZ modules it will look in a projects `modules/`
directory by default. The search path can be extended by defining the
`ZZ_MODULE_PATHS` environment variable much like
[`PATH`](https://en.wikipedia.org/wiki/PATH_(variable)) environment
variable where multiple paths can be defined separated by a colon (`:`)
on POSIX systems and a semi-coloon (`;`) on Windows.```sh
ZZ_MODULE_PATHS="$PWD/path/to/modules:/usr/share/zz/modules" zz build
```[gcc-attributes]: https://gcc.gnu.org/onlinedocs/gcc-4.0.2/gcc/Type-Attributes.html