https://github.com/clojure-finance/clojask-examples
Examples of Clojask usage
https://github.com/clojure-finance/clojask-examples
big-data clojure dataframe parallel-computing
Last synced: 10 months ago
JSON representation
Examples of Clojask usage
- Host: GitHub
- URL: https://github.com/clojure-finance/clojask-examples
- Owner: clojure-finance
- License: mit
- Created: 2021-11-20T13:29:44.000Z (about 4 years ago)
- Default Branch: main
- Last Pushed: 2022-12-10T17:50:03.000Z (about 3 years ago)
- Last Synced: 2023-04-20T22:21:46.466Z (over 2 years ago)
- Topics: big-data, clojure, dataframe, parallel-computing
- Language: Clojure
- Homepage: https://clojure-finance.github.io/clojask-website
- Size: 68.4 KB
- Stars: 2
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
## Clojask Examples
Some commonly used examples for Clojure parallel data processing framework [Clojask](https://github.com/clojure-finance/clojask).
### Usage
The example codes are stored in `src/clojask_examples`.
Change the value of `:main` in `project.clj` to the corresponding namespace of the example you want to run.
Run the example using
```bash
lein run
```
### List of Examples
- [Basic tutorial](src/clojask_examples/basic_tutorial.clj)
Cover the basic APIs in the Clojask library. Also cover how to read and write to different file formats.
- [Groupby Aggregate](src/clojask_examples/groupby_aggregate.clj)
Group-by, then aggregate v.s. direct aggregate.
- [Join](src/clojask_examples/ordinary_join.clj)
Natural inner join, left join and right join.
- [Store Results in memory](src/clojask_examples/in_memory.clj)
For datasets that are smaller than memory, you can store the result in memory and reuse it faster. This function is also necessary to read and write excel files.
- [Connection with `tech.ml.dataset`](src/clojask_examples/techmldataset.clj)
Convert from and to the popular Clojure DataFrame library `tech.ml.dataset`.
- [Rolling Join](src/clojask_examples/rolling_join.clj)
Forward and backward rolling join with thresholds. See the definition [here](https://www.r-bloggers.com/2016/06/understanding-data-table-rolling-joins/).
- [Enhanced Reshape](src/clojask_examples/enhanced_reshape.clj)
Cbind, rbind, melt and dcast. See the [definition](https://cran.r-project.org/web/packages/data.table/vignettes/datatable-reshape.html#enhanced-new-functionality) of them in R.
- [Timezone](https://github.com/clojure-finance/clojask-examples/blob/main/src/clojask_examples/timezone.clj)
How to define parsers and formatters for fields of zoned datetime.
- [Outer Join](https://github.com/clojure-finance/clojask-examples/blob/main/src/clojask_examples/outer_join.clj)
How to do Outer Join / Cartesian Product.