Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/aws/random-cut-forest-by-aws
An implementation of the Random Cut Forest data structure for sketching streaming data, with support for anomaly detection, density estimation, imputation, and more.
https://github.com/aws/random-cut-forest-by-aws
algorithms anomalydetection streaming
Last synced: about 1 month ago
JSON representation
An implementation of the Random Cut Forest data structure for sketching streaming data, with support for anomaly detection, density estimation, imputation, and more.
- Host: GitHub
- URL: https://github.com/aws/random-cut-forest-by-aws
- Owner: aws
- License: apache-2.0
- Created: 2019-11-15T19:48:13.000Z (almost 5 years ago)
- Default Branch: main
- Last Pushed: 2024-08-06T06:51:02.000Z (3 months ago)
- Last Synced: 2024-10-04T21:06:38.000Z (about 1 month ago)
- Topics: algorithms, anomalydetection, streaming
- Language: Java
- Homepage: https://github.com/aws/random-cut-forest-by-aws
- Size: 3.14 MB
- Stars: 211
- Watchers: 12
- Forks: 34
- Open Issues: 11
-
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- License: LICENSE
- Code of conduct: CODE_OF_CONDUCT.md
Awesome Lists containing this project
README
# Random Cut Forest by AWS
This repository contains implementations of the Random Cut Forest (RCF) probabilistic data structure.
RCFs were originally developed at Amazon to use in a nonparametric anomaly detection algorithm for
streaming data. Later new algorithms based on RCFs were developed for density estimation, imputation,
and forecasting.The different directories correspond to equivalent implementations in different languages, and bindings to
to those base implementations, using language specific features for greater flexibility of use.RandomCutForest in the randomcutforest-core package provides an estimation (say anomaly score, or extrapolation over a forecast horizon)
and using that raw estimation can be challenging. The randomcutforest-parkservices package provides
several capabilities (ThresholdedRandomCutForest, RCFCaster, respectively) for distilling the scores to a determination of
a potential anomaly or calibrated forecast respectively.
The package randomcutforest-examples showcases several example scenarios for using the repository.
They also provide examples for some of the parameter settings. Many of these examples are built in tests.## Documentation
* Guha, S., Mishra, N., Roy, G., & Schrijvers, O. (2016, June). Robust random cut forest based anomaly detection on streams. In *International conference on machine learning* (pp. 2712-2721).
## Code of Conduct
This project has adopted an [Open Source Code of Conduct](https://aws.github.io/code-of-conduct).
## Security issue notifications
If you discover a potential security issue in this project we ask that you notify AWS/Amazon Security via our [vulnerability reporting page](http://aws.amazon.com/security/vulnerability-reporting/). Please do **not** create a public GitHub issue.
## Licensing
See the [LICENSE](./LICENSE) file for our project's licensing. We will ask you to confirm the licensing of your contribution.
## Copyright
Copyright 2019-2020 Amazon.com, Inc. or its affiliates. All Rights Reserved.