An open API service indexing awesome lists of open source software.

https://github.com/xiaodaigh/parquet-format-technical-notes


https://github.com/xiaodaigh/parquet-format-technical-notes

Last synced: 2 months ago
JSON representation

Awesome Lists containing this project

README

        

# parquet-format-technical-notes

## Script

The first and last 4 bytes of a parquet file are "PAR1".

The 8th-5th last bytes are the metadata size = `meta_len`.

You can read the metadata by seeking to

```
filesize - 8 - metadatalen
```

where

```julia
sz = filesize(io)
data_size = (sz - meta_len - 2length("PAR1")-length(size of meta : Int32)) = sz - meta_len - 12
```