{"id":19434436,"url":"https://github.com/likelet/cfdnamodelingcode","last_synced_at":"2026-06-13T03:31:31.646Z","repository":{"id":86650579,"uuid":"88004355","full_name":"likelet/cfDNAmodelingCode","owner":"likelet","description":"Rscripts for analysis cfDNA methylation level in multi cancers","archived":false,"fork":false,"pushed_at":"2018-04-04T13:37:12.000Z","size":22,"stargazers_count":6,"open_issues_count":0,"forks_count":0,"subscribers_count":1,"default_branch":"master","last_synced_at":"2025-11-19T03:22:16.361Z","etag":null,"topics":[],"latest_commit_sha":null,"homepage":null,"language":"R","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/likelet.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2017-04-12T03:11:55.000Z","updated_at":"2025-03-27T02:07:25.000Z","dependencies_parsed_at":null,"dependency_job_id":"230d44f4-2d9a-45e0-925b-81d6f4af4627","html_url":"https://github.com/likelet/cfDNAmodelingCode","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/likelet/cfDNAmodelingCode","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/likelet%2FcfDNAmodelingCode","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/likelet%2FcfDNAmodelingCode/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/likelet%2FcfDNAmodelingCode/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/likelet%2FcfDNAmodelingCode/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub
/owners/likelet","download_url":"https://codeload.github.com/likelet/cfDNAmodelingCode/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/likelet%2FcfDNAmodelingCode/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":34271500,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-05-26T15:22:16.424Z","status":"online","status_checked_at":"2026-06-13T02:00:06.617Z","response_time":62,"last_error":null,"robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":true,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":[],"created_at":"2024-11-10T14:46:25.085Z","updated_at":"2026-06-13T03:31:31.045Z","avatar_url":"https://github.com/likelet.png","language":"R","funding_links":[],"categories":[],"sub_categories":[],"readme":"# cfDNAmodelingCode\n\nR scripts for analyzing cfDNA methylation levels in multiple cancers.\nCurrently, only the code for the HCC analysis has been released. All analyses start from a well-imputed methylation data matrix produced by a sequencing platform.\n\n# To rerun analysis\n\nIf you are interested in performing a similar analysis on our data or your own dataset, please clone or pull the repo directly. \nHowever, it is worth noting that my code for screening methylation markers involves several random functions, which may result in a slightly different marker list when the same analysis is rerun. 
The random procedures include splitting the sample set into training and validation datasets, subsampling the training data for LASSO screening, and performing RandomForest variable selection. Therefore, to reproduce the analysis results, it is a good idea to store the workspaces as well as the files and lists generated in each separate analysis so that the results and variables stay reproducible. \nIf you insist on getting exactly the same analysis figures between two separate runs, here is some advice that may help:\n1. Avoid splitting the dataset with a random function. Instead, you may divide your dataset in a reasonable way by collection date, collection location, or other information. \n2. Increase the number of resampling iterations as far as computational resources permit.\n3. Avoid random functions in your analysis.\n    \n\n# SessionInfo in R env\n```r \nsessionInfo()\nR version 3.2.3 (2015-12-10)\nPlatform: x86_64-redhat-linux-gnu (64-bit)\nRunning under: CentOS Linux 7 (Core)\n\nlocale:\n [1] LC_CTYPE=zh_CN.UTF-8          LC_NUMERIC=C                  LC_TIME=zh_CN.UTF-8          \n [4] LC_COLLATE=zh_CN.UTF-8        LC_MONETARY=zh_CN.UTF-8       LC_MESSAGES=zh_CN.UTF-8      \n [7] LC_PAPER=zh_CN.UTF-8          LC_NAME=zh_CN.UTF-8           LC_ADDRESS=zh_CN.UTF-8       \n[10] LC_TELEPHONE=zh_CN.UTF-8      LC_MEASUREMENT=zh_CN.UTF-8    LC_IDENTIFICATION=zh_CN.UTF-8\n\nattached base packages:\n[1] grid      parallel  stats     graphics  grDevices utils     datasets  methods   base     \n\nother attached packages:\n [1] R2HTML_2.3.2           limma_3.26.9           risksetROC_1.0.4      \n [4] MASS_7.3-45            RColorBrewer_1.1-2     scales_0.4.1          \n [7] easyGgplot2_1.0.0.9000 devtools_1.12.0        ggthemes_3.3.0        \n[10] ggsci_2.0              pheatmap_1.0.8         varSelRF_0.7-5        \n[13] randomForest_4.6-12    papeR_1.0-1            xtable_1.8-2          \n[16] car_2.1-3              caret_6.0-73           glmnet_2.0-5          \n[19] 
foreach_1.4.3          pROC_1.8               mice_2.25             \n[22] Rcpp_0.12.8            rms_5.0-0              SparseM_1.74          \n[25] Hmisc_4.0-0            Formula_1.2-1          lattice_0.20-34       \n[28] survival_2.40-1        ggplot2_2.2.0.9000     ROCR_1.0-7            \n[31] gplots_3.0.1           Matrix_1.2-7.1        \n\nloaded via a namespace (and not attached):\n [1] splines_3.2.3       RWeka_0.4-29        gtools_3.5.0        assertthat_0.1     \n [5] stats4_3.2.3        latticeExtra_0.6-28 quantreg_5.29       chron_2.3-47       \n [9] digest_0.6.10       minqa_1.2.4         colorspace_1.3-1    sandwich_2.3-4     \n[13] htmltools_0.3.5     plyr_1.8.4          gmodels_2.16.2      mvtnorm_1.0-5      \n[17] gdata_2.17.0        lme4_1.1-12         MatrixModels_0.4-1  htmlTable_1.7      \n[21] tibble_1.2          mgcv_1.8-16         FSelector_0.21      TH.data_1.0-7      \n[25] withr_1.0.2         nnet_7.3-12         lazyeval_0.2.0      pbkrtest_0.4-6     \n[29] magrittr_1.5        memoise_1.0.0       polspline_1.1.12    nlme_3.1-128       \n[33] foreign_0.8-67      RWekajars_3.9.0-1   tools_3.2.3         data.table_1.9.6   \n[37] multcomp_1.4-6      stringr_1.1.0       munsell_0.4.3       cluster_2.0.5      \n[41] entropy_1.2.1       caTools_1.17.1      nloptr_1.0.4        iterators_1.0.8    \n[45] bitops_1.0-6        gtable_0.2.0        ModelMetrics_1.1.0  codetools_0.2-15   \n[49] reshape2_1.4.2      gridExtra_2.2.1     zoo_1.7-13          knitr_1.15.1       \n[53] KernSmooth_2.23-15  rJava_0.9-8         stringi_1.1.2       rpart_4.1-10       \n[57] acepack_1.4.1      \n```\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Flikelet%2Fcfdnamodelingcode","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Flikelet%2Fcfdnamodelingcode","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Flikelet%2Fcfdnamodelingcode/lists"}