{"id":15646405,"url":"https://github.com/danielhanchen/sciblox","last_synced_at":"2026-03-08T00:31:18.967Z","repository":{"id":127491953,"uuid":"97677759","full_name":"danielhanchen/sciblox","owner":"danielhanchen","description":"sciblox - Easier Data Science and Machine Learning","archived":false,"fork":false,"pushed_at":"2017-07-28T13:14:08.000Z","size":1442,"stargazers_count":50,"open_issues_count":1,"forks_count":1,"subscribers_count":3,"default_branch":"master","last_synced_at":"2025-09-19T08:34:16.914Z","etag":null,"topics":["boosting","data-analysis","data-mining","data-preprocessing","data-science","data-visualization","imputation","machine-learning","python","sklearn"],"latest_commit_sha":null,"homepage":"https://danielhanchen.github.io/","language":"HTML","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/danielhanchen.png","metadata":{"files":{"readme":"README.md","changelog":"CHANGES.txt","contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":"CODE_OF_CONDUCT.md","threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2017-07-19T05:51:40.000Z","updated_at":"2025-02-01T18:07:43.000Z","dependencies_parsed_at":null,"dependency_job_id":"e0a9cb39-e40b-43b9-93d8-bd189656a4d0","html_url":"https://github.com/danielhanchen/sciblox","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/danielhanchen/sciblox","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/danielhanchen%2Fsciblox","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/danielhanchen%2Fsciblox/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/danielhanchen%2Fsciblox/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/danielhanchen%2Fsciblox/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/danielhanchen","download_url":"https://codeload.github.com/danielhanchen/sciblox/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/danielhanchen%2Fsciblox/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":30238843,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-03-08T00:30:53.000Z","status":"ssl_error","status_checked_at":"2026-03-08T00:30:44.061Z","response_time":53,"last_error":"SSL_read: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["boosting","data-analysis","data-mining","data-preprocessing","data-science","data-visualization","imputation","machine-learning","python","sklearn"],"created_at":"2024-10-03T12:12:45.193Z","updated_at":"2026-03-08T00:31:18.937Z","avatar_url":"https://github.com/danielhanchen.png","language":"HTML","funding_links":[],"categories":[],"sub_categories":[],"readme":"# sciblox\nAn all in one Python3 Data Science Package. Easy visualisation, data mining, data preparation and machine learning.\n\nPlease check the Jupyter Notebook for instructions on how to use it.\nYou can also check sciblox out on https://danielhanchen.github.io/\n\nhttps://pypi.python.org/pypi/sciblox\n\nInstall:\n```sh\n[sudo] pip install sciblox\n```\nNOTE: If you intend to use remove linearly dependent rows or KNN,SVD impute:\n```sh\n[sudo] pip install fancyimpute sympy theano\n```\nIf fancyimpute fails: Please install C++ or MingW compiler\n\n\nWHAT'S NEW?\n1. FASTER (x10) BPCA fill\n2. Better analyser\n3. NEW modules - Machine Learning\n\nSome features explained include:\n\n1. MICE, BPCA missing data imputation with Random Forests, XGBoost and Linear Regression support\n2. Automatic Data Plotting\n3. Word extraction and frequency plots\n4. Sequential text processing\n5. CARET like processes including ZeroVarCheck, FreqRatios etc.\n6. Discretization and Continuisation\n7. Easy data structure changes like Hcat, Vcat, reversing etc.\n8. Easy CARET like Machine Learning modules\n9. Automatic Best Graphs Plotting\n\nIN CONSTRUCTION:\n1. Advanced text extraction methods\n2. Automatic Machine Learning methods\n\nFor easier calling:\n```python\nfrom sciblox import *\n%matplotlib notebook\n```\nIf you are using other methods, just copy paste sciblox.py into whatever Python3 main directory.\nThen call it same as top.\n\nSome screenshots:\n\n![Analysing](/img/Analyse.jpg?raw=true \"Auto analysing and 3d plots\")\n\n![Preprocessing](/img/Preprocess.jpg?raw=true \"CARET like Preprocess\")\n\n![Analytics](/img/Analytics.jpg?raw=true \"CARET like checking\")\n\n![Plotting](/img/Plot.jpg?raw=true \"Cool easy plots\")\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdanielhanchen%2Fsciblox","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fdanielhanchen%2Fsciblox","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fdanielhanchen%2Fsciblox/lists"}