{"id":13666037,"url":"https://github.com/zmjones/edarf","last_synced_at":"2025-10-26T08:09:25.307Z","repository":{"id":20393593,"uuid":"23669422","full_name":"zmjones/edarf","owner":"zmjones","description":"exploratory data analysis using random forests","archived":false,"fork":false,"pushed_at":"2018-02-06T20:54:56.000Z","size":7256,"stargazers_count":68,"open_issues_count":2,"forks_count":11,"subscribers_count":11,"default_branch":"master","last_synced_at":"2025-10-24T02:18:00.749Z","etag":null,"topics":["exploratory-data-analysis","machine-learning","r","random-forest","rstats"],"latest_commit_sha":null,"homepage":null,"language":"R","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/zmjones.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null,"zenodo":null}},"created_at":"2014-09-04T16:03:18.000Z","updated_at":"2025-06-17T20:28:53.000Z","dependencies_parsed_at":"2022-07-31T18:38:07.946Z","dependency_job_id":null,"html_url":"https://github.com/zmjones/edarf","commit_stats":null,"previous_names":[],"tags_count":1,"template":false,"template_full_name":null,"purl":"pkg:github/zmjones/edarf","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/zmjones%2Fedarf","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/zmjones%2Fedarf/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/zmjones%2Fedarf/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/zmjones%2Fedarf/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/zmjones","download_url":"https://codeload.github.com/zmjones/edarf/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/zmjones%2Fedarf/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":281064210,"owners_count":26437785,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","status":"online","status_checked_at":"2025-10-26T02:00:06.575Z","response_time":61,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["exploratory-data-analysis","machine-learning","r","random-forest","rstats"],"created_at":"2024-08-02T06:00:56.130Z","updated_at":"2025-10-26T08:09:25.277Z","avatar_url":"https://github.com/zmjones.png","language":"R","funding_links":[],"categories":["R"],"sub_categories":[],"readme":"![](https://travis-ci.org/zmjones/edarf.svg) ![](http://www.r-pkg.org/badges/version/edarf)\n![](http://cranlogs.r-pkg.org/badges/edarf)\n[![DOI](https://zenodo.org/badge/23669422.svg)](https://zenodo.org/badge/latestdoi/23669422)\n[![status](http://joss.theoj.org/papers/d29df349c8450ef958c0fde5bf164371/status.svg)](http://joss.theoj.org/papers/d29df349c8450ef958c0fde5bf164371)\n\nFunctions useful for exploratory data analysis using random forests.\n\nThis package extends the functionality of random forests fit by [party](https://CRAN.R-project.org/package=party) (multivariate, regression, and classification), [randomForestSRC](https://CRAN.R-project.org/package=randomForestSRC) (regression and classification,), [randomForest](https://CRAN.R-project.org/package=randomForest) (regression and classification), and [ranger](https://cran.r-project.org/package=ranger) (classification and regression).\n\nThe subdirectory `pkg` contains the actual package. The package can be installed with [devtools](https://cran.r-project.org/package=devtools).\n\n```{r}\ndevtools::install_github(\"zmjones/edarf\", subdir = \"pkg\")\n```\n\nFunctionality includes:\n\n - `partial_dependence` which computes the expected prediction made by the random forest if it were marginalized to only depend on a subset of the features. `plot_pd` plots the results.\n - `variable_importance` which computes feature importance for arbitrary loss functions, aggregated across the training data or for individual observations. This may also be used for subsets of the feature space in order to detect interactions.\n - `extract_proximity` and `plot_prox` which computes or extracts proximity matrices and plots them using a biplot given a matrix of principal components of said matrix.\n \nIf you use the package for research, please cite it.\n\n\t@article{jones2016,\n\t  doi = {10.21105/joss.00092},\n\t  url = {http://dx.doi.org/10.21105/joss.00092},\n\t  year  = {2016},\n\t  month = {oct},\n\t  publisher = {The Open Journal},\n\t  volume = {1},\n\t  number = {6},\n\t  author = {Zachary M. Jones and Fridolin J. Linder},\n\t  title = {edarf: Exploratory Data Analysis using Random Forests},\n\t  journal = {The Journal of Open Source Software}\n\t}\n\nPull requests, bug reports, feature requests, etc. are welcome!\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fzmjones%2Fedarf","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fzmjones%2Fedarf","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fzmjones%2Fedarf/lists"}