https://github.com/makcymal/arrow-view
CLI preview of Apache Arrow files
https://github.com/makcymal/arrow-view
apache-arrow cli
Last synced: 11 months ago
JSON representation
CLI preview of Apache Arrow files
- Host: GitHub
- URL: https://github.com/makcymal/arrow-view
- Owner: makcymal
- Created: 2024-12-19T16:33:12.000Z (about 1 year ago)
- Default Branch: master
- Last Pushed: 2025-01-30T04:59:54.000Z (about 1 year ago)
- Last Synced: 2025-02-07T05:31:10.960Z (about 1 year ago)
- Topics: apache-arrow, cli
- Language: C++
- Homepage:
- Size: 214 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# CLI preview of Apache Arrow files
This provides a way of previewing data in CLI similar to `head()`, `describe()`, `info()` of `pandas.DataFrame`. The following assumes that file contains only one table.
The output is formatted as table with [single-header `tabulate` lib](https://github.com/p-ranav/tabulate). Place it in `src/lib`.
## Example
Suppose you have a dataset on aircraft prices in Apache Arrow `aircrafts.arrow`:
#### See the first 5 rows:
```
arrow-head aircrafts.arrow
```
#### See the descriptive statistics on numerical fields:
```
arrow-desc aircrafts.arrow
```
#### See the list of columns, dtypes and null counts:
```
arrow-info aircrafts.arrow
```
## arrow-head
Write first rows of table into console.
The columns width is equal among data columns and is adjusted automatically
to fit screen entirely. However, in case of tables with a lot of columns,
this may result in very narrow columns and bad readability.
Usage:
```
arrow-head dataset.arrow
```
## arrow-desc
Compute descriptive statistics on numerical fields.
The columns width is equal among data columns and is adjusted automatically
to fit screen entirely. However, in case of tables with a lot of columns,
this may result in very narrow columns and bad readability.
Usage:
```
arrow-desc data.arrow
```
## arrow-info
List columns, dtypes and null counts
Usage:
```
arrow-info dataset.arrow
```