https://github.com/apahl/ds_tools
Misc helpers for datasciencing, written in Codon.
- Host: GitHub
- URL: https://github.com/apahl/ds_tools
- Owner: apahl
- License: mit
- Created: 2025-02-19T08:31:01.000Z (over 1 year ago)
- Default Branch: main
- Last Pushed: 2025-02-19T19:37:41.000Z (over 1 year ago)
- Last Synced: 2025-03-29T08:07:56.471Z (about 1 year ago)
- Topics: codon, codon-lang
- Language: Shell
- Homepage:
- Size: 5.86 KB
- Stars: 0
- Watchers: 1
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE
# DS Tools
A collection of small helper binaries for doing data science,
mostly for pre- and post-processing of datasets.
Just for fun, the tools are written in the experimental language [Codon](https://github.com/exaloop/codon/).
Codon is a compiled language with a syntax that is very close to Python's.
Unfortunately, Codon is currently only available on Linux and macOS.
To compile the tools in this repository, you must have the `codon` compiler in your path,
then run `./build.sh`. The resulting binaries are then in the `bin` folder.
The tools are written mainly for my personal use.
Please feel free to clone and use, but at this time I am not really looking for contributions.
## `split_csv`
```
Split a large CSV / TSV file into smaller chunks.
USAGE:
    csv_split <in_file> <num_files>

    in_file:   large input file to split (Required)
    num_files: number of files to split into (Required)
```
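To illustrate the idea behind the tool, here is a minimal Python sketch of splitting a delimited file into a fixed number of chunks. It is not the Codon source from this repository. The function name `split_file`, the output naming scheme (`<name>_1.csv`, `<name>_2.csv`, ...), and the choice to repeat the header line in every chunk are all assumptions made for the example.

```python
import os


def split_file(in_file: str, num_files: int) -> list[str]:
    """Split a CSV / TSV file into up to num_files chunks (hypothetical helper).

    Assumptions: the first line is a header that is repeated in every chunk,
    and each record occupies exactly one line (no quoted embedded newlines).
    """
    if num_files < 1:
        raise ValueError("num_files must be at least 1")

    # First pass: remember the header and count the data rows.
    with open(in_file, "r", encoding="utf-8") as f:
        header = f.readline()
        num_rows = sum(1 for _ in f)

    # Spread the rows as evenly as possible: the first `extra` chunks
    # receive one row more than the others.
    base_size, extra = divmod(num_rows, num_files)
    base, ext = os.path.splitext(in_file)

    out_paths = []
    # Second pass: stream the rows into the chunk files.
    with open(in_file, "r", encoding="utf-8") as f:
        f.readline()  # skip the header, it is written per chunk below
        for i in range(num_files):
            size = base_size + (1 if i < extra else 0)
            if size == 0:
                break  # fewer rows than requested files
            out_path = f"{base}_{i + 1}{ext}"  # assumed naming scheme
            with open(out_path, "w", encoding="utf-8") as out:
                out.write(header)
                for _ in range(size):
                    out.write(f.readline())
            out_paths.append(out_path)
    return out_paths
```

Calling `split_file("data.csv", 3)` would write `data_1.csv`, `data_2.csv` and `data_3.csv` next to the input file. The two-pass approach keeps memory use low for large inputs, at the cost of reading the file twice.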