An open API service indexing awesome lists of open source software.

https://github.com/mamba413/cdcsis

Conditional Distance Correlation based Statistical Method
https://github.com/mamba413/cdcsis

conditional-dependence feature-selection variable-selection

Last synced: 3 months ago
JSON representation

Conditional Distance Correlation based Statistical Method

Awesome Lists containing this project

README

        

CDC Statistics
===========
[![Travis build status](https://travis-ci.org/Mamba413/cdcsis.svg?branch=master)](https://travis-ci.org/Mamba413/cdcsis)
[![CRAN Status Badge](http://www.r-pkg.org/badges/version/cdcsis)](https://CRAN.R-project.org/package=cdcsis)
![CRAN Downloads overall](https://cranlogs.r-pkg.org/badges/grand-total/cdcsis?color=brightgreen)
[![Codacy Badge](https://api.codacy.com/project/badge/Grade/645c7a47d9484abc80098d4426dd6a64)](https://app.codacy.com/app/Mamba413/cdcsis?utm_source=github.com&utm_medium=referral&utm_content=Mamba413/cdcsis&utm_campaign=Badge_Grade_Dashboard)

Introdution
----------
The fundamental problems for data mining and statistical/machine learning are:

- how to select the important features for ultra high dimensional dataset?
- whether a statistical/machine learning model is sufficient (i.e. does not need to include additional variables)?

CDC Statistics based statistical method provides solutions for these issues.

License
----------
GPL (>= 2)

Reference
----------
- Xueqin Wang, Wenliang Pan, Wenhao Hu, Yuan Tian & Heping Zhang (2015) Conditional Distance Correlation, Journal of the American Statistical Association, 110:512, 1726-1734, DOI: 10.1080/01621459.2014.993081
- Canhong Wen, Wenliang Pan, Mian Huang and Xueqin Wang (2018) Sure independence screening adjusted for confounding covariates with ultrahigh dimensional data, Statistica Sinica, 28 (2018), no. 1, 293--318, DOI:10.5705/ss.202014.0117