https://github.com/gregorybchris/tinytable
Dataset compression tool
- Host: GitHub
- URL: https://github.com/gregorybchris/tinytable
- Owner: gregorybchris
- License: apache-2.0
- Created: 2020-01-28T08:01:56.000Z (almost 6 years ago)
- Default Branch: master
- Last Pushed: 2020-02-22T23:41:30.000Z (almost 6 years ago)
- Last Synced: 2025-03-02T02:33:23.886Z (11 months ago)
- Language: Python
- Size: 17.6 KB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 4
Metadata Files:
- Readme: README.md
- License: LICENSE.txt
# TinyTable
An empirical tool for finding the best file format for caching a particular dataset.
## Installation
Install the [current PyPI release](https://pypi.org/project/tinytable/):
```bash
pip install tinytable
```
Or install from source:
```bash
pip install git+https://github.com/gregorybchris/tinytable
```
## Basic Usage
```bash
# Run the tinytable command to get file size statistics
tinytable --experiment run --dataset --trials 3
# Generate charts to visualize the best file format
tinytable --experiment plot
# Delete experiment data
tinytable --experiment clean
```
## Advanced Usage
```bash
# Subsample the dataset before serializing
tinytable run -d --sample .5
# Don't write the results to a file, just print them
tinytable -e exp1 run -d data/data.csv --no_write
# Specify the directory to which experiments should be saved
tinytable -e exp1 --experiments_dir run -d data/data.csv
```
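## Illustrative Sketch

The commands above report file size statistics after serializing a dataset, optionally on a subsample. The general idea can be sketched in plain Python as shown below. This is not TinyTable's implementation: the function names, the set of formats (CSV, JSON, pickle, each with and without gzip), and the `sample` and `seed` parameters are assumptions chosen for illustration, using only the standard library.

```python
import csv
import gzip
import io
import json
import pickle
import random


def serialize_csv(rows):
    """Encode rows as CSV text and return the UTF-8 bytes."""
    buffer = io.StringIO()
    csv.writer(buffer).writerows(rows)
    return buffer.getvalue().encode("utf-8")


def serialize_json(rows):
    """Encode rows as a JSON array of arrays."""
    return json.dumps(rows).encode("utf-8")


def serialize_pickle(rows):
    """Encode rows with Python's pickle protocol."""
    return pickle.dumps(rows)


# Hypothetical format registry; a real tool would likely cover more formats.
SERIALIZERS = {
    "csv": serialize_csv,
    "json": serialize_json,
    "pickle": serialize_pickle,
}


def compare_formats(rows, sample=1.0, seed=0):
    """Return {format_name: size_in_bytes} for raw and gzip-compressed encodings.

    `sample` is the fraction of rows to keep before serializing.
    """
    if sample < 1.0:
        rng = random.Random(seed)
        keep = max(1, int(len(rows) * sample))
        rows = rng.sample(rows, keep)

    sizes = {}
    for name, serialize in SERIALIZERS.items():
        payload = serialize(rows)
        sizes[name] = len(payload)
        sizes[name + ".gz"] = len(gzip.compress(payload))
    return sizes


if __name__ == "__main__":
    # Synthetic stand-in for a dataset: 1000 rows of (int, str, float).
    rows = [[i, f"user{i % 10}", i * 0.5] for i in range(1000)]
    sizes = compare_formats(rows, sample=0.5)
    # Print formats from smallest to largest encoded size.
    for name, size in sorted(sizes.items(), key=lambda item: item[1]):
        print(f"{name:>10}: {size} bytes")
```

The sketch measures sizes in memory rather than writing files, which keeps it self-contained. The ranking depends on the data, which is why a comparison like this has to be run on the actual dataset.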