https://github.com/bigmlcom/gen-data
A tiny project for generating synthetic data
https://github.com/bigmlcom/gen-data
Last synced: 9 months ago
JSON representation
A tiny project for generating synthetic data
- Host: GitHub
- URL: https://github.com/bigmlcom/gen-data
- Owner: bigmlcom
- License: other
- Created: 2016-01-29T19:44:13.000Z (almost 10 years ago)
- Default Branch: master
- Last Pushed: 2016-02-05T01:23:46.000Z (almost 10 years ago)
- Last Synced: 2024-04-16T10:58:59.997Z (over 1 year ago)
- Language: Clojure
- Size: 9.77 KB
- Stars: 1
- Watchers: 11
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# gen-data
A tiny project to generate synthetic data. Each class will be
representd by a randomly projected and translated gaussian. Example
usage:
```console
lein run --output foo.csv --classes 100 --fields 20 --rows 10000 --seed foo
```
# Parameters
The possible parameters for a `lein run` task are:
- *--output* : The output file.
- *--classes* : The number of classes (with associated clusters) in the data.
- *--fields* : The number of numeric fields the data.
- *--hidden* : The number of hidden fields. The hidden fields are
part of the cluster projections, but not visible in
the final data.
- *--rows* : The number of rows.
- *--seed* : A seed for random number generator.