Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Data Science
Data science is an inter-disciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge from structured and unstructured data. Data scientists perform data analysis and preparation, and their findings inform high-level decisions in many organizations.
- GitHub: https://github.com/topics/data-science
- Wikipedia: https://en.wikipedia.org/wiki/Data_science
- Related Topics: data-analysis, data-mining, machine-learning, big-data, data-visualization
- Aliases: datasciences, data-science-project, data-science-algorithm
- Last updated: 2024-11-18 00:06:52 UTC
https://github.com/hamelsmu/docker_tutorial
Code and helper scripts for article on Medium "How Docker Can Help You Become A More Effective Data Scientist"
data-science docker docker-tutorial medium medium-article
Last synced: 27 Oct 2024
https://github.com/younes-charfaoui/daily-coding-problem
A series of problems 💯 and solutions ✅ from the Daily Coding Problem 👨‍🎓 website.
airbnb amazon apple cisco coding-interviews coding-problems coursera data-science dropbox facebook google ibm linkedin microsoft mozila netflix nvidia python twitter youtube
Last synced: 27 Oct 2024
https://github.com/vanderschaarlab/hyperimpute
A framework for prototyping and benchmarking imputation methods
data-science imputation imputation-algorithm machine-learning machine-learning-prerequisites preprocessing-data python scikit-learn
Last synced: 13 Oct 2024
https://github.com/phillipdupuis/dtale-desktop
Build a data visualization dashboard with simple snippets of python code
data-analysis data-science data-visualization fastapi pandas python react typescript visualization
Last synced: 17 Nov 2024
https://github.com/salvatorera/tutorial
Tutorials on machine learning, artificial intelligence, data science with math explanation and reusable code (in python and R)
artificial-intelligence bioinformatics biology computer-vision convolutional-neural-networks data-science deep-learning graph image machine-learning natural-language-processing nlp python r streamlit streamlit-webapp tutorial tutorials vision-transformer
Last synced: 09 Oct 2024
https://github.com/curiousily/Machine-Learning-from-Scratch
Succinct Machine Learning algorithm implementations from scratch in Python, solving real-world problems (Notebooks and Book). Examples of Logistic Regression, Linear Regression, Decision Trees, K-means clustering, Sentiment Analysis, Recommender Systems, Neural Networks and Reinforcement Learning.
artificial-intelligence book classification data-science machine-learning machine-learning-algorithms neural-networks notebook recommender-systems regression reinforcement-learning sentiment-analysis
Last synced: 08 Aug 2024
https://github.com/jgoerner/beyond-jupyter
🐍💻📊 All material from the PyCon.DE 2018 Talk "Beyond Jupyter Notebooks - Building your own data science platform with Python & Docker" (incl. Slides, Video, Udemy MOOC & other References)
airflow apache apistar data-science docker docker-compose jupyter jupyter-notebook minio postgres superset
Last synced: 27 Oct 2024
https://github.com/risenw/datasist
A Python library for easy data analysis, visualization, exploration and modeling
data-analysis data-science data-visualization feature-engineering machine-learning python-3
Last synced: 14 Nov 2024
https://github.com/pyscaffold/pyscaffoldext-dsproject
💫 PyScaffold extension for data-science projects
data-science pyscaffold pyscaffold-extension python
Last synced: 15 Nov 2024
https://github.com/tvdboom/atom
Automated Tool for Optimized Modelling
automl dagshub data-exploration data-pipeline data-science interactive-visualizations machine-learning mlflow model-predictions modelling python scikit-learn shap visualization
Last synced: 29 Oct 2024
https://github.com/minerva-ml/open-solution-toxic-comments
Open solution to the Toxic Comment Classification Challenge
challenge competition data-science deep-learning ensemble-model kaggle kaggle-competition machine-learning neptune nlp pipeline prediction python python3
Last synced: 07 Aug 2024
https://github.com/heidelbergcement/hcrystalball
A library that unifies the API for most commonly used libraries and modeling techniques for time-series forecasting in the Python ecosystem.
cross-validation data-science fbprophet model-selection pmdarima sarimax sklearn sklearn-api sklearn-compatible sklearn-library sktime statsmodels tbats time-series time-series-forecasting transformer wrapper
Last synced: 10 Oct 2024
https://github.com/thebabylonai/babylog
A lightweight logger for machine learning teams to log images and predictions in production.
computer-vision cvops data-science logger logging-library machine-learning ml mlops python python3
Last synced: 14 Nov 2024
https://github.com/saezlab/decoupler-py
Python package to perform enrichment analysis from omics data.
bioinformatics data-science enrichment enrichment-analysis numba python single-cell spatial-transcriptomics transcriptomics
Last synced: 12 Nov 2024
https://github.com/oxinabox/datadeps.jl
reproducible data setup for reproducible science
data data-science open-science
Last synced: 18 Oct 2024
https://github.com/tariqdaouda/mariana
The Cutest Deep Learning Framework which is also a wonderful Declarative Language
artificial-intelligence artificial-neural-networks data-science deep-learning deep-learning-algorithms deep-learning-library deep-neural-networks deeplearning machine-learning machine-learning-algorithms machinelearning python theano
Last synced: 07 Nov 2024
https://github.com/googleapis/python-bigquery-dataframes
BigQuery DataFrames
bigquery data-science machine-learning python
Last synced: 12 Oct 2024
https://github.com/chanmenglin/pandasversusexcel
Introduction to Python data analysis; a getting-started guide for data analysts
charts data-analysis data-charts data-science data-science-learning data-view data-visualization excel histogram learn-pandas learn-python matplotlib pandas pandas-excel python
Last synced: 14 Nov 2024
https://github.com/tirthajyoti/Interactive_Machine_Learning
IPython widgets, interactive plots, interactive machine learning
analytics animation classification data-science interactive jupyter-notebook machine-learning python regression scikit-learn statistics supervised-learning
Last synced: 07 Aug 2024
https://github.com/safreita1/TIGER
Python toolbox to evaluate graph vulnerability and robustness (CIKM 2021)
adversarial-attacks attack cascading-failures data-mining data-science defense diffusion epidemics graph graph-attack graph-mining machine-learning netshield network-attack networks robustness simulation vulnerability
Last synced: 12 Nov 2024
https://github.com/tirthajyoti/interactive_machine_learning
IPython widgets, interactive plots, interactive machine learning
analytics animation classification data-science interactive jupyter-notebook machine-learning python regression scikit-learn statistics supervised-learning
Last synced: 01 Nov 2024
https://github.com/minusxai/minusx
MinusX is an AI Data Scientist for Analytics Apps you already use and love. Currently it supports Jupyter, Metabase, & Posthog.
artificial-intelligence data-analytics data-science jupyter metabase
Last synced: 11 Oct 2024
https://github.com/raptor-ml/raptor
Transform your pythonic research to an artifact that engineers can deploy easily.
ai-infra data-engineering data-science dataops feature-engineering feature-extraction feature-platform featurestore kubeflow kubernetes machine-learning ml mlops model-deployment production raptor raptor-ml reactive-ml
Last synced: 11 Oct 2024
https://github.com/whitews/flowkit
A Python toolkit for flow cytometry analysis supporting GatingML and FlowJo workspaces
cytometry data-science fcs fcs-files flow-cytometry flow-cytometry-analysis flowjo gatingml immunology python
Last synced: 11 Nov 2024
https://github.com/TheDatumOrg/TSB-UAD
An End-to-End Benchmark Suite for Univariate Time-Series Anomaly Detection
anomaly-detection anomaly-detection-algorithm benchmark data-mining data-science datasets python python3 time-series time-series-analysis
Last synced: 30 Oct 2024
https://github.com/emilhvitfeldt/r-text-data
List of textual data sources to be used for text mining in R
data-science nlp rstats text-analysis text-analytics-in-r text-mining tidytext
Last synced: 30 Oct 2024
https://github.com/jazzdotdev/jazz
The Scripting Engine that Combines Speed, Safety, and Simplicity
actix android chromeos crypto cryptography data-science database development-environment embeddable jazz jinja2 linux lua markdown rust scraping scripting web witness
Last synced: 05 Nov 2024
https://github.com/voila-dashboards/voici
Voici turns any Jupyter Notebook into a static web application
dashboards data-science emscripten jupyter jupyterlite voila-dashboard wasm
Last synced: 04 Sep 2024
https://github.com/EmilHvitfeldt/R-text-data
List of textual data sources to be used for text mining in R
data-science nlp rstats text-analysis text-analytics-in-r text-mining tidytext
Last synced: 05 Aug 2024
https://github.com/benmarwick/ctv-archaeology
CRAN Task View: Archaeological Science
archaeological-science archaeology cran-task data-science r task-view
Last synced: 16 Nov 2024
https://github.com/dongjunlee/quantified-self
Self-knowledge through numbers
chatbot data-science fitbit github kino machine-learning personal-assistant quantified-self rescuetime self-tracking slack-bot todoist toggl trello
Last synced: 08 Nov 2024
https://github.com/mybridge/learn-python
Python Top 45 Articles of 2017
algorithm data-science machine-learning python python3
Last synced: 07 Nov 2024
https://github.com/DongjunLee/quantified-self
Self-knowledge through numbers
chatbot data-science fitbit github kino machine-learning personal-assistant quantified-self rescuetime self-tracking slack-bot todoist toggl trello
Last synced: 06 Nov 2024
https://github.com/google/applied-machine-learning-intensive
Applied Machine Learning Intensive
data-science machine-learning python3 sklearn tensorflow tensorflow-examples tensorflow-tutorials
Last synced: 26 Sep 2024
https://rivasiker.github.io/ggHoriPlot/
A user-friendly, highly customizable R package for building horizon plots in ggplot2
data-science data-visualization ggplot2 horizon-plots r r-package
Last synced: 13 Nov 2024
https://github.com/rivasiker/ggHoriPlot
A user-friendly, highly customizable R package for building horizon plots in ggplot2
data-science data-visualization ggplot2 horizon-plots r r-package
Last synced: 12 Nov 2024
https://github.com/aws-samples/aws-ml-jp
A collection of notebooks and teaching materials for learning how to build, train, and deploy machine learning models with SageMaker
aws data-science deep-learning jupyter-notebook machine-learning mlops sagemaker
Last synced: 08 Nov 2024
https://github.com/arabacibahadir/sup-res
A great companion for finding key support and resistance levels on financial charts, cryptocurrencies.
algotrade analysis binance binance-api bitcoin cryptocurrency data-science finance pandas pinescript python stock telegram telegram-bot tradingview
Last synced: 27 Oct 2024
https://github.com/apache/incubator-liminal
Apache Liminal's goal is to operationalise the machine learning process, allowing data scientists to quickly transition from a successful experiment to an automated pipeline of model training, validation, deployment and inference in production. Liminal provides a Domain Specific Language to build ML workflows on top of Apache Airflow.
ai airflow big-data data-science machine-learning ml workflows
Last synced: 01 Oct 2024
https://github.com/ayush1997/YouTube-Like-predictor
YouTube Like Count Predictions using Machine Learning
data-analysis data-science machine-learning predictive-analysis random-forest visualization youtube-api
Last synced: 07 Aug 2024
https://github.com/egenn/rtemis
Advanced Machine Learning and Visualization
data-science data-visualization machine-learning machine-learning-library r rstats visualization
Last synced: 25 Oct 2024
https://github.com/kwmsmith/scipy-2017-cython-tutorial
Material for the SciPy 2017 Cython tutorial
c c-plus-plus cython data-science docker machine-learning notebook performance python
Last synced: 14 Oct 2024
https://github.com/dlab-berkeley/R-Fundamentals-Legacy
D-Lab's 12 hour introduction to R Fundamentals. Learn how to create variables and functions, manipulate data frames, make visualizations, use control flow structures, and more, using R in RStudio.
automation data-science data-visualization data-wrangling r
Last synced: 11 Nov 2024
https://github.com/ropensci/tarchetypes
Archetypes for targets and pipelines
data-science high-performance-computing peer-reviewed pipeline r r-package r-targetopia reproducibility rstats targets workflow
Last synced: 15 Nov 2024
https://github.com/durgeshsamariya/data-science-machine-learning-project-with-source-code
Data Science and Machine Learning projects with source code.
artificial-intelligence awesome awesome-list data-science data-science-projects machine-learning machine-learning-projects python
Last synced: 08 Nov 2024
https://github.com/jupyterhub/repo2docker-action
A GitHub action to build data science environment images with repo2docker and push them to registries.
actions binder data-science datascience docker jupyter jupyter-notebook repo2docker repo2docker-action
Last synced: 16 Nov 2024
https://github.com/h2oai/wave-apps
Sample AI Apps built with H2O Wave.
data-science h2oai hacktoberfest low-code machine-learning python3
Last synced: 06 Nov 2024
https://github.com/rk2900/drsa
Deep Recurrent Survival Analysis, an auto-regressive deep model for time-to-event data analysis with censorship handling. An implementation of our AAAI 2019 paper and a benchmark for several (Python) implemented survival analysis methods.
data-science deep-learning machine-learning survival-analysis
Last synced: 07 Nov 2024
https://github.com/hamelsmu/seq2seq_tutorial
Code For Medium Article "How To Create Data Products That Are Magical Using Sequence-to-Sequence Models"
data-science deep-learning deeplearning keras keras-tutorials machine-learning medium-article nlp-machine-learning rnn-encoder-decoder seq2seq-tutorial sequence-to-sequence
Last synced: 27 Oct 2024
https://github.com/dogoncouch/logdissect
CLI utility and Python module for analyzing log files and other data.
cli command-line data-analysis data-science forensic-analysis forensics json library log-analysis log-parser module parser parsing parsing-library python-library python-module python-modules security syslog
Last synced: 28 Oct 2024
https://github.com/hamelsmu/Seq2Seq_Tutorial
Code For Medium Article "How To Create Data Products That Are Magical Using Sequence-to-Sequence Models"
data-science deep-learning deeplearning keras keras-tutorials machine-learning medium-article nlp-machine-learning rnn-encoder-decoder seq2seq-tutorial sequence-to-sequence
Last synced: 29 Oct 2024
https://github.com/picnicml/doddle-model
🍰 doddle-model: machine learning in Scala.
breeze data-science doddle-model machine-learning scala
Last synced: 18 Nov 2024
https://github.com/mmbazel/springboard-datasciencetrack-student
Springboard Program: Data Science Career Track - NLP
capstone data-science data-wrangling datasciencedreamjob dsdj mikikobazeley nlp python springboard
Last synced: 18 Nov 2024
https://github.com/DataHaskell/dh-core
Functional data science
data-analysis data-mining data-science dataframes datahaskell datasets machine-learning numerical-methods
Last synced: 30 Oct 2024
https://github.com/rodrigo-arenas/portfolio
Personal website, Data scientist portfolio template
beginner-friendly create-react-app css3 data-science data-science-portfolio data-science-projects developer-portfolio good-first-issue javascript material-ui personal-website portfolio portfolio-template portfolio-website react react-portfolio reactjs up-for-grabs web-template
Last synced: 13 Nov 2024
https://github.com/neptune-ml/steppy
Lightweight Python library for fast and reproducible experimentation 🔬
data-science deep-learning image-processing machine-learning minimal-interface nlp open-source pipeline python python-library python3 reproducibility reproducible-research steppy steppy-library steppy-toolkit steps
Last synced: 28 Aug 2024
https://github.com/lynxkite/lynxkite
The complete graph data science platform
complex-networks data-science graph-algorithms graph-visualization hacktoberfest machine-learning
Last synced: 08 Nov 2024
https://github.com/minerva-ml/steppy
Lightweight Python library for fast and reproducible experimentation 🔬
data-science deep-learning image-processing machine-learning minimal-interface nlp open-source pipeline python python-library python3 reproducibility reproducible-research steppy steppy-library steppy-toolkit steps
Last synced: 30 Oct 2024
https://github.com/genular/pandora
PANDORA - Predictive Analytics aNd Data Oriented Research Applications 💻
bioinformatics biomarkers clinical-data clustering data-integration data-mining data-science data-visualization drug-discovery genomic-data-analysis machine-learning microbiome pandora predictive-analytics systems-biology transcriptomics tsne umap unsupervised-machine-learning
Last synced: 04 Nov 2024
https://github.com/gzuidhof/zarr.js
JavaScript implementation of Zarr
array data-science gehlenborglab javascript typescript zarr
Last synced: 13 Nov 2024
https://github.com/ray-project/xgboost_ray
Distributed XGBoost on Ray
dask data-science kaggle machine-learning modin xgboost
Last synced: 15 Nov 2024
https://github.com/yizhe-ang/k-means-explorable
An Explorable Explainer of K-Means Clustering
ai clustering data-science explorable explorable-explanations javascript machine-learning svelte
Last synced: 16 Nov 2024
https://github.com/ing-bank/probatus
Validation (like Recursive Feature Elimination for SHAP) of (multiclass) classifiers & regressors and data used to develop them.
binary-classifiers data-analysis data-science feature-elimination machine-learning multi-class-classification recursive-feature-elimination regressors shap statistics tree-model
Last synced: 15 Nov 2024
https://github.com/EnvironmentOntology/envo
A community-driven ontology for the representation of environments
data-management data-science earth-science ecoinformatics ecology environment esip obofoundry ontology planetary-science semantics sustainable-development-goals
Last synced: 04 Nov 2024
https://github.com/jacobgil/confidenceinterval
The long missing library for python confidence intervals
data-science machine-learning metrics statistics
Last synced: 12 Nov 2024
https://github.com/gtkcyber/griffon-vm
Griffon Data Science Virtual Machine
apache-drill apache-spark big-data data-science database elasticsearch hadoop jupyter-notebook mysql node-js python r ruby scala virtual-machine
Last synced: 12 Oct 2024
https://github.com/mlr-org/mlr3pipelines
Dataflow Programming for Machine Learning in R
bagging data-science dataflow-programming ensemble-learning machine-learning mlr3 pipelines preprocessing r r-package stacking
Last synced: 10 Oct 2024
https://github.com/doubleml/doubleml-for-r
DoubleML - Double Machine Learning in R
causal-inference data-science double-machine-learning econometrics machine-learning mlr3 r statistics
Last synced: 17 Nov 2024
https://github.com/morganjwilliams/pyrolite
A set of tools for getting the most from your geochemical data.
chemistry data-science geochemical-data geochemistry geoscience pyrolite ternary-diagrams
Last synced: 25 Oct 2024
https://github.com/kraina-ai/srai
Spatial Representations for Artificial Intelligence
artificial-intelligence data-science geo geospatial machine-learning python spatial spatial-analysis srai
Last synced: 17 Nov 2024
https://github.com/njtierney/rmd4sci
Rmarkdown for Scientists
book bookdown data-science r rmarkdown rstats science
Last synced: 27 Oct 2024
https://github.com/ahmedkhemiri95/PDFs-TextExtract
Multiple and Large PDF Documents Text Extraction.
data-science extract-text parser pdf pdf-document pdf-processing pdfminer pdfs pdfs-textextract pypdf2 python text-analytics
Last synced: 04 Nov 2024
https://github.com/ModelChimp/modelchimp
Experiment tracking for machine and deep learning projects
ai artificial-intelligence data-science deep-learning experiment machine-learning ml model-management platform tool
Last synced: 27 Oct 2024
https://github.com/machine-learning-apps/ml-template-azure
Template for getting started with automated ML Ops on Azure Machine Learning
aml azure azure-machine-learning data-science machine-learning machine-learning-lifecycle mlops
Last synced: 02 Nov 2024
https://github.com/safe-graph/UGFraud
An Unsupervised Graph-based Toolbox for Fraud Detection
anomaly-detection data-science fraud-detection fraud-prevention graph-algorithms machine-learning opensource outlier-detection security-tools spam-detection toolbox
Last synced: 16 Nov 2024
https://github.com/RamiKrispin/Introduction-to-Docker
(WIP) Getting started with Docker - An introduction to Docker with data science and engineering applications
data-engineering data-science docker dockerfile
Last synced: 25 Oct 2024
https://github.com/safe-graph/ugfraud
An Unsupervised Graph-based Toolbox for Fraud Detection
anomaly-detection data-science fraud-detection fraud-prevention graph-algorithms machine-learning opensource outlier-detection security-tools spam-detection toolbox
Last synced: 11 Nov 2024
https://github.com/suji04/normalizednerd
Code for the videos on my YouTube channel
data-science machine-learning python tutorial youtube
Last synced: 17 Nov 2024
https://github.com/csinva/hierarchical-dnn-interpretations
Using / reproducing ACD from the paper "Hierarchical interpretations for neural network predictions" 🧠 (ICLR 2019)
acd ai artificial-intelligence convolutional-neural-networks data-science deep-learning deep-neural-networks explainability explainable-ai feature-importance iclr interpretability interpretation jupyter-notebook machine-learning ml neural-network python pytorch statistics
Last synced: 14 Nov 2024
https://github.com/laura-rieger/deep-explanation-penalization
Code for using CDEP from the paper "Interpretations are useful: penalizing explanations to align neural networks with prior knowledge" https://arxiv.org/abs/1909.13584
ai artificial-intelligence cdep convolutional-neural-network data-science deep-learning explainability explainable-ai fairness fairness-ml feature-importance interpretability interpretable-deep-learning jupyter-notebook machine-learning ml neural-network python pytorch recurrent-neural-network
Last synced: 15 Nov 2024
https://github.com/mszell/introdatasci
Course materials for: Introduction to Data Science and Programming
course-materials crash-course data-science network-analysis pandas-python programming programming-courses python teaching-materials
Last synced: 12 Nov 2024
https://github.com/bcg-x-official/artkit
Automated prompt-based testing and evaluation of Gen AI applications
asyncio data-science gen-ai genai python red-teaming test-automation
Last synced: 15 Nov 2024
https://github.com/storopoli/ciencia-de-dados
Data Science course at UNINOVE
aprendizagem-de-maquina aprendizagem-profunda ciencia-de-dados data-science deeplearning machinelearning matplotlib pandas python pytorch scikit-learn tensorflow
Last synced: 17 Nov 2024
https://github.com/scitime/scitime
Training time estimation for scikit-learn algorithms
data-science machine-learning python scikit-learn timer
Last synced: 01 Nov 2024
https://github.com/tpoisot/scientificcomputingfortherestofus
Introduction to Scientific Computing 🦊
best-practices data-science educational-resources julia machine-learning reproducible-documents scientific-computing
Last synced: 13 Nov 2024
https://github.com/tpoisot/ScientificComputingForTheRestOfUs
Introduction to Scientific Computing 🦊
best-practices data-science educational-resources julia machine-learning reproducible-documents scientific-computing
Last synced: 13 Nov 2024
https://github.com/romanmichaelpaolucci/AI_Stock_Trading
Design pattern for critical stages in the development process of an AI Stock Trading Bot
artificial-intelligence data-science machine-learning neural-network python trading trading-algorithms trading-bot trading-strategies
Last synced: 07 Nov 2024
https://github.com/scrapinghub/python-simhash
An efficient simhash implementation for python
Last synced: 10 Nov 2024
https://github.com/aydinnyunus/wallet-tracker
Detect real scammers with Wallet-Tracker CLI from anywhere.
bitcoin blueteam btc cli dashboard data-science database docker docker-compose eth ethereum golang graph hacking neo4j neodash visualization websocket
Last synced: 11 Nov 2024
https://github.com/datacamp/viewflow
Viewflow is an Airflow-based framework that allows data scientists to create data models without writing Airflow code.
airflow apache-airflow data-engineering data-science packages python workflow
Last synced: 15 Nov 2024
https://github.com/adijo/data-science-prep
Problems from https://datascienceprep.com/
data-science data-science-interview datascience interview-prep machine-learning machine-learning-interview machinelearning probability statistics
Last synced: 08 Nov 2024
https://github.com/winvector/pyvtreat
vtreat is a data frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. Distributed under a BSD-3-Clause license.
data-science machine-learning pydata python
Last synced: 14 Nov 2024
https://github.com/autoviml/deep_autoviml
Build tensorflow keras model pipelines in a single line of code. Now with mlflow tracking. Created by Ram Seshadri. Collaborators welcome. Permission granted upon request.
autokeras automl data-science deep-learning gcp keras machine-learning mlflow mljar pycaret python tensorflow tensorflow2 tpot
Last synced: 10 Oct 2024
https://github.com/vkoul/Econ-Data-Science
Articles, journals, and videos related to Economics 📈 and Data Science 📊
casual-inference data-science econometrics economics economist machine-learning social-sciences
Last synced: 13 Nov 2024
https://github.com/jadianes/spark-r-notebooks
R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
big-data bigdata data-analysis data-science exploratory-data-analysis jupyter jupyter-notebook notebook r sparkr
Last synced: 09 Nov 2024
https://github.com/napjon/krisk
Statistical Interactive Visualization with pandas+Jupyter integration on top of Echarts.
dashboard data-science data-visualization echarts interactive-charts jupyter-notebook python
Last synced: 15 Nov 2024
https://github.com/CertifaiAI/classifai
🔥 One of the most comprehensive open-source data annotation platforms.
annotation annotation-tool big-data computervision data-annotation data-collection data-science deep-learning labelling machine-learning
Last synced: 17 Nov 2024
https://github.com/jakekandell/nba-predict
Predicts Daily NBA Games Using a Logistic Regression Model
data-science logistic-regression model nba nba-analytics nba-prediction nba-stats pandas prediction predictive-modeling python python3 scikit-learn
Last synced: 07 Nov 2024
https://github.com/ome/ngff
Next-generation file format (NGFF) specifications for storing bioimaging data in the cloud.
bioimaging cloud data-science file-formats spec
Last synced: 16 Nov 2024
https://github.com/yandexdataschool/roc_comparison
The fast version of DeLong's method for computing the covariance of unadjusted AUC.
Last synced: 06 Nov 2024