An open API service indexing awesome lists of open source software.

https://github.com/gregorybchris/tinytable

Dataset compression tool
https://github.com/gregorybchris/tinytable

Last synced: 5 months ago
JSON representation

Dataset compression tool

Awesome Lists containing this project

README

          

# TinyTable

Empirical tool to find the best file format for caching a particular dataset.

## Installation

Install the [current PyPI release](https://pypi.org/project/tinytable/):

```bash
pip install tinytable
```

Or install from source:

```bash
pip install git+https://github.com/gregorybchris/tinytable
```

## Basic Usage

```bash
# Run the tinytable command to get file size statistics
tinytable --experiment run --dataset --trials 3

# Generate charts to visualize the best file format
tinytable --experiment plot

# Delete experiment data
tinytable --experiment clean
```

## Advanced Usage

```bash
# Subsample the dataset before serializing
tinytable run -d --sample .5

# Don't write the results to a file, just print them
tinytable -e exp1 run -d data/data.csv --no_write

# Specify the directory to which experiments should be saved
tinytable -e exp1 --experiments_dir run -d data/data.csv
```