Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Data Science
Data science is an interdisciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge from structured and unstructured data. Data scientists perform data analysis and preparation, and their findings inform high-level decisions in many organizations.
- GitHub: https://github.com/topics/data-science
- Wikipedia: https://en.wikipedia.org/wiki/Data_science
- Related Topics: data-analysis, data-mining, machine-learning, big-data, data-visualization
- Aliases: datasciences, data-science-project, data-science-algorithm
- Last updated: 2024-07-29 13:36:33 UTC
https://github.com/anthony-wang/BestPractices
Things that you should (and should not) do in your Materials Informatics research.
best-practices common-pitfalls data-science example-code interactive-notebooks jupyter jupyter-notebooks machine-learning materials-informatics materials-science neural-networks python
Last synced: 02 Aug 2024
https://github.com/davendw49/k2
Code and datasets for paper "K2: A Foundation Language Model for Geoscience Knowledge Understanding and Utilization" in WSDM-2024
ai4science data-science geoai geoscience kg large-language-models llm
Last synced: 01 Aug 2024
https://github.com/minerva-ml/open-solution-toxic-comments
Open solution to the Toxic Comment Classification Challenge
challenge competition data-science deep-learning ensemble-model kaggle kaggle-competition machine-learning neptune nlp pipeline prediction python python3
Last synced: 07 Aug 2024
https://github.com/tirthajyoti/Interactive_Machine_Learning
IPython widgets, interactive plots, interactive machine learning
analytics animation classification data-science interactive jupyter-notebook machine-learning python regression scikit-learn statistics supervised-learning
Last synced: 07 Aug 2024
https://github.com/paddymul/buckaroo
Buckaroo - the data wrangling assistant for pandas. Quickly explore dataframes, and run pandas commands via a GUI. Works inside the jupyter notebook.
buckaroo data-science jupyter paddy pandas
Last synced: 03 Aug 2024
https://github.com/safreita1/TIGER
Python toolbox to evaluate graph vulnerability and robustness (CIKM 2021)
adversarial-attacks attack cascading-failures data-mining data-science defense diffusion epidemics graph graph-attack graph-mining machine-learning netshield network-attack networks robustness simulation vulnerability
Last synced: 02 Aug 2024
https://github.com/saezlab/decoupler-py
Python package to perform enrichment analysis from omics data.
bioinformatics data-science enrichment enrichment-analysis numba python single-cell spatial-transcriptomics transcriptomics
Last synced: 02 Aug 2024
https://github.com/jazzdotdev/jazz
The Scripting Engine that Combines Speed, Safety, and Simplicity
actix android chromeos crypto cryptography data-science database development-environment embeddable jazz jinja2 linux lua markdown rust scraping scripting web witness
Last synced: 01 Aug 2024
https://github.com/raptor-ml/raptor
Transform your pythonic research to an artifact that engineers can deploy easily.
ai-infra data-engineering data-science dataops feature-engineering feature-extraction feature-platform featurestore kubeflow kubernetes machine-learning ml mlops model-deployment production raptor raptor-ml reactive-ml
Last synced: 01 Aug 2024
https://github.com/EmilHvitfeldt/R-text-data
List of textual data sources to be used for text mining in R
data-science nlp rstats text-analysis text-analytics-in-r text-mining tidytext
Last synced: 05 Aug 2024
https://github.com/vanderschaarlab/hyperimpute
A framework for prototyping and benchmarking imputation methods
data-science imputation imputation-algorithm machine-learning machine-learning-prerequisites preprocessing-data python scikit-learn
Last synced: 02 Aug 2024
https://github.com/voila-dashboards/voici
Voici turns any Jupyter Notebook into a static web application
dashboards data-science emscripten jupyter jupyterlite voila-dashboard wasm
Last synced: 04 Sep 2024
https://github.com/DongjunLee/quantified-self
Self-knowledge through numbers
chatbot data-science fitbit github kino machine-learning personal-assistant quantified-self rescuetime self-tracking slack-bot todoist toggl trello
Last synced: 01 Aug 2024
https://github.com/aws-samples/aws-ml-jp
A collection of notebooks and teaching materials for learning how to build, train, and deploy machine learning models with SageMaker
aws data-science deep-learning jupyter-notebook machine-learning mlops sagemaker
Last synced: 01 Aug 2024
https://github.com/google/applied-machine-learning-intensive
Applied Machine Learning Intensive
data-science machine-learning python3 sklearn tensorflow tensorflow-examples tensorflow-tutorials
Last synced: 02 Aug 2024
https://github.com/ayush1997/YouTube-Like-predictor
YouTube Like Count Predictions using Machine Learning
data-analysis data-science machine-learning predictive-analysis random-forest visualization youtube-api
Last synced: 07 Aug 2024
https://github.com/dlab-berkeley/R-Fundamentals-Legacy
D-Lab's 12 hour introduction to R Fundamentals. Learn how to create variables and functions, manipulate data frames, make visualizations, use control flow structures, and more, using R in RStudio.
automation data-science data-visualization data-wrangling r
Last synced: 02 Aug 2024
https://github.com/jupyterhub/repo2docker-action
A GitHub action to build data science environment images with repo2docker and push them to registries.
actions binder data-science datascience docker jupyter jupyter-notebook repo2docker repo2docker-action
Last synced: 01 Aug 2024
https://github.com/picnicml/doddle-model
:cake: doddle-model: machine learning in Scala.
breeze data-science doddle-model machine-learning scala
Last synced: 04 Aug 2024
https://github.com/rivasiker/ggHoriPlot
A user-friendly, highly customizable R package for building horizon plots in ggplot2
data-science data-visualization ggplot2 horizon-plots r r-package
Last synced: 02 Aug 2024
https://github.com/hamelsmu/Seq2Seq_Tutorial
Code For Medium Article "How To Create Data Products That Are Magical Using Sequence-to-Sequence Models"
data-science deep-learning deeplearning keras keras-tutorials machine-learning medium-article nlp-machine-learning rnn-encoder-decoder seq2seq-tutorial sequence-to-sequence
Last synced: 31 Jul 2024
https://github.com/dogoncouch/logdissect
CLI utility and Python module for analyzing log files and other data.
cli command-line data-analysis data-science forensic-analysis forensics json library log-analysis log-parser module parser parsing parsing-library python-library python-module python-modules security syslog
Last synced: 31 Jul 2024
https://rivasiker.github.io/ggHoriPlot/
A user-friendly, highly customizable R package for building horizon plots in ggplot2
data-science data-visualization ggplot2 horizon-plots r r-package
Last synced: 02 Aug 2024
https://github.com/DataHaskell/dh-core
Functional data science
data-analysis data-mining data-science dataframes datahaskell datasets machine-learning numerical-methods
Last synced: 31 Jul 2024
https://github.com/neptune-ml/steppy
Lightweight Python library for fast and reproducible experimentation :microscope:
data-science deep-learning image-processing machine-learning minimal-interface nlp open-source pipeline python python-library python3 reproducibility reproducible-research steppy steppy-library steppy-toolkit steps
Last synced: 28 Aug 2024
https://github.com/lynxkite/lynxkite
The complete graph data science platform
complex-networks data-science graph-algorithms graph-visualization hacktoberfest machine-learning
Last synced: 01 Aug 2024
https://github.com/minerva-ml/steppy
Lightweight Python library for fast and reproducible experimentation :microscope:
data-science deep-learning image-processing machine-learning minimal-interface nlp open-source pipeline python python-library python3 reproducibility reproducible-research steppy steppy-library steppy-toolkit steps
Last synced: 31 Jul 2024
https://github.com/genular/pandora
PANDORA - Predictive Analytics aNd Data Oriented Research Applications :computer:
bioinformatics biomarkers clinical-data clustering data-integration data-mining data-science data-visualization drug-discovery genomic-data-analysis machine-learning microbiome pandora predictive-analytics systems-biology transcriptomics tsne umap unsupervised-machine-learning
Last synced: 01 Aug 2024
https://github.com/ray-project/xgboost_ray
Distributed XGBoost on Ray
dask data-science kaggle machine-learning modin xgboost
Last synced: 03 Aug 2024
https://github.com/yizhe-ang/k-means-explorable
An Explorable Explainer of K-Means Clustering
ai clustering data-science explorable explorable-explanations javascript machine-learning svelte
Last synced: 03 Aug 2024
https://github.com/gtkcyber/griffon-vm
Griffon Data Science Virtual Machine
apache-drill apache-spark big-data data-science database elasticsearch hadoop jupyter-notebook mysql node-js python r ruby scala virtual-machine
Last synced: 30 Jul 2024
https://github.com/mlr-org/mlr3pipelines
Dataflow Programming for Machine Learning in R
bagging data-science dataflow-programming ensemble-learning machine-learning mlr3 pipelines preprocessing r r-package stacking
Last synced: 13 Aug 2024
https://github.com/morganjwilliams/pyrolite
A set of tools for getting the most from your geochemical data.
chemistry data-science geochemical-data geochemistry geoscience pyrolite ternary-diagrams
Last synced: 30 Jul 2024
https://github.com/ahmedkhemiri95/PDFs-TextExtract
Multiple and Large PDF Documents Text Extraction.
data-science extract-text parser pdf pdf-document pdf-processing pdfminer pdfs pdfs-textextract pypdf2 python text-analytics
Last synced: 01 Aug 2024
https://github.com/RamiKrispin/Introduction-to-Docker
(WIP) Getting started with Docker - An introduction to Docker with data science and engineering applications
data-engineering data-science docker dockerfile
Last synced: 30 Jul 2024
https://github.com/njtierney/rmd4sci
Rmarkdown for Scientists
book bookdown data-science r rmarkdown rstats science
Last synced: 02 Aug 2024
https://github.com/EnvironmentOntology/envo
A community-driven ontology for the representation of environments
data-management data-science earth-science ecoinformatics ecology environment esip obofoundry ontology planetary-science semantics sustainable-development-goals
Last synced: 01 Aug 2024
https://github.com/ModelChimp/modelchimp
Experiment tracking for machine and deep learning projects
ai artificial-intelligence data-science deep-learning experiment machine-learning ml model-management platform tool
Last synced: 31 Jul 2024
https://github.com/safe-graph/UGFraud
An Unsupervised Graph-based Toolbox for Fraud Detection
anomaly-detection data-science fraud-detection fraud-prevention graph-algorithms machine-learning opensource outlier-detection security-tools spam-detection toolbox
Last synced: 03 Aug 2024
https://github.com/machine-learning-apps/ml-template-azure
Template for getting started with automated ML Ops on Azure Machine Learning
aml azure azure-machine-learning data-science machine-learning machine-learning-lifecycle mlops
Last synced: 01 Aug 2024
https://github.com/csinva/hierarchical-dnn-interpretations
Using / reproducing ACD from the paper "Hierarchical interpretations for neural network predictions" 🧠 (ICLR 2019)
acd ai artificial-intelligence convolutional-neural-networks data-science deep-learning deep-neural-networks explainability explainable-ai feature-importance iclr interpretability interpretation jupyter-notebook machine-learning ml neural-network python pytorch statistics
Last synced: 03 Aug 2024
https://github.com/laura-rieger/deep-explanation-penalization
Code for using CDEP from the paper "Interpretations are useful: penalizing explanations to align neural networks with prior knowledge" https://arxiv.org/abs/1909.13584
ai artificial-intelligence cdep convolutional-neural-network data-science deep-learning explainability explainable-ai fairness fairness-ml feature-importance interpretability interpretable-deep-learning jupyter-notebook machine-learning ml neural-network python pytorch recurrent-neural-network
Last synced: 03 Aug 2024
https://github.com/tpoisot/ScientificComputingForTheRestOfUs
Introduction to Scientific Computing 🦊
best-practices data-science educational-resources julia machine-learning reproducible-documents scientific-computing
Last synced: 02 Aug 2024
https://github.com/jacobgil/confidenceinterval
The long-missing library for Python confidence intervals
data-science machine-learning metrics statistics
Last synced: 01 Aug 2024
https://github.com/ropensci/tarchetypes
Archetypes for targets and pipelines
data-science high-performance-computing peer-reviewed pipeline r r-package r-targetopia reproducibility rstats targets workflow
Last synced: 05 Aug 2024
https://github.com/vkoul/Econ-Data-Science
Articles, journals, and videos related to Economics :chart_with_upwards_trend: and Data Science :bar_chart:
casual-inference data-science econometrics economics economist machine-learning social-sciences
Last synced: 02 Aug 2024
https://github.com/romanmichaelpaolucci/AI_Stock_Trading
Design pattern for critical stages in the development process of an AI Stock Trading Bot
artificial-intelligence data-science machine-learning neural-network python trading trading-algorithms trading-bot trading-strategies
Last synced: 01 Aug 2024
https://github.com/mszell/introdatasci
Course materials for: Introduction to Data Science and Programming
course-materials crash-course data-science network-analysis pandas-python programming programming-courses python teaching-materials
Last synced: 01 Aug 2024
https://github.com/napjon/krisk
Statistical Interactive Visualization with pandas+Jupyter integration on top of Echarts.
dashboard data-science data-visualization echarts interactive-charts jupyter-notebook python
Last synced: 01 Aug 2024
https://github.com/CertifaiAI/classifai
:fire: One of the most comprehensive open-source data annotation platforms.
annotation annotation-tool big-data computervision data-annotation data-collection data-science deep-learning labelling machine-learning
Last synced: 03 Aug 2024
https://github.com/WinVector/pyvtreat
vtreat is a data frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. Distributed under a BSD-3-Clause license.
data-science machine-learning pydata python
Last synced: 05 Aug 2024
https://github.com/LankyCyril/pyvenn
Python module for plotting Venn diagrams of 2..6 sets
data-science matplotlib matplotlib-venn venn venn-diagram venndiagram visualization
Last synced: 03 Aug 2024
https://github.com/medtagger/MedTagger
A collaborative framework for annotating medical datasets using crowdsourcing.
crowdsourcing data-science data-validation deep-learning labeling medical-imaging
Last synced: 03 Aug 2024
https://github.com/ColtAllen/btyd
Buy Till You Die and Customer Lifetime Value statistical models in Python.
bayesian buy-til-you-die customer-lifetime-value data-science python
Last synced: 02 Aug 2024
https://github.com/alexandervnikitin/tsgm
Generation and evaluation of synthetic time series datasets (also, augmentations, visualizations, a collection of popular datasets)
augmentations data-augmentation data-science datasets deep-learning generative-model keras machine-learning python synthetic-data synthetic-time-series tensorflow2 time-series vae
Last synced: 01 Aug 2024
https://github.com/streamnative/pulsar-spark
Spark Connector to read and write with Pulsar
apache-pulsar apache-spark batch-processing data-processing data-science flink spark spark-sql stream-processing structured-streaming
Last synced: 01 Aug 2024
https://github.com/innat/ML-Resource
A concise resource repository for machine learning
data-analysis data-science deep-learning kaggle machine-learning python spark
Last synced: 02 Aug 2024
https://github.com/NicholasMamo/multiplex-plot
Multiplex: visualizations that tell stories—A Python library to create and annotate beautiful network graph visualizations, text visualizations and more.
data-science data-visualisation graph-visualization graphs information-retrieval matplotlib natural-language-processing network-visualization python text-mining text-visualisation text-visualization visualisation visualizations viz vizualisation
Last synced: 07 Aug 2024
https://github.com/lawmurray/Birch
A probabilistic programming language that combines automatic differentiation, automatic marginalization, and automatic conditioning within Monte Carlo methods.
autodiff bayesian bayesian-inference bayesian-methods bayesian-statistics data-science machine-learning machine-learning-algorithms machine-learning-projects monte-carlo-methods monte-carlo-sampling probabilistic-programming-languages statistics
Last synced: 31 Jul 2024
https://github.com/formlio/forml
ForML - A development framework and MLOps platform for the lifecycle management of data science projects
ai data-science machine-learning ml mlops portability python reproducibility
Last synced: 03 Aug 2024
https://github.com/benthecoder/ml-blogs-that-are-worth-reading
Blogs on Machine Learning and Deep learning
ai artificial-intelligence data-science deep-learning machine-learning ml
Last synced: 01 Aug 2024
https://github.com/nicohlr/ipychart
The power of Chart.js with Python
charting-library chartjs charts data data-analysis data-science data-visualization ipywidgets javascript-es6 jupyter jupyter-notebook notebook python
Last synced: 01 Aug 2024
https://github.com/ome/ngff
Next-generation file format (NGFF) specifications for storing bioimaging data in the cloud.
bioimaging cloud data-science file-formats spec
Last synced: 03 Aug 2024
https://github.com/senderle/topic-modeling-tool
A point-and-click tool for creating and analyzing topic models produced by MALLET.
data-science digital-humanities mallet text-analytics topic-modeling
Last synced: 02 Aug 2024
https://github.com/dssg/MLforPublicPolicy
Class resources for CAPP 30254 (Machine Learning for Public Policy)
data-science machine-learning public-policy
Last synced: 31 Jul 2024
https://github.com/pbiecek/breakDown
Model-agnostic breakDown plots
data-science iml interpretability machine-learning visual-explanations xai
Last synced: 02 Aug 2024
https://github.com/pink-gorilla/notebook
Web-based Clojure notebook application and library.
clojure clojurescript codemirror data-science gorilla-notebook gorilla-repl pink-gorilla re-frame reagent vega
Last synced: 03 Aug 2024
https://github.com/nischalshrestha/Unravel
A fluent code explorer for R. 🔍
data-science datawrangling dplyr r rstats shiny tidyr tidyverse
Last synced: 13 Aug 2024
https://github.com/mc2-project/secure-xgboost
Secure collaborative training and inference for XGBoost.
collaborative-learning data-science enclave machine-learning privacy security xgboost
Last synced: 02 Aug 2024
https://github.com/georgian-io/pyoats
Quick and Easy Time Series Outlier Detection
anomaly anomaly-detection data-science deep-learning machine-learning time-series timeseries
Last synced: 31 Jul 2024
https://github.com/materialsproject/matbench
Matbench: Benchmarks for materials science property prediction
benchmark chemistry condensed-matter data-science machine-learning machine-learning-algorithms materials-science physics
Last synced: 02 Aug 2024
https://github.com/oracle/macest
Model Agnostic Confidence Estimator (MACEST) - A Python library for calibrating Machine Learning models' confidence scores
confidence-estimation data-science machine-learning python
Last synced: 03 Aug 2024
https://github.com/lettier/lda-topic-modeling
A PureScript, browser-based implementation of LDA topic modeling.
bayesian bulma bulma-css clustering data-science functional-programming gibbs-sampling latent-dirichlet-allocation lda machine-learning machine-learning-algorithms natural-language-processing nlp nlp-machine-learning purescript reactive reactive-programming text-mining thermite topic-modeling
Last synced: 02 Aug 2024
https://github.com/wyattowalsh/data-science-notes
Open-source project hosted at https://makeuseofdata.com to crowdsource a robust collection of notes related to data science (math, visualization, modeling, etc)
calculus classification compilation crowdsourcing data-science first-timers first-timers-only jupyter-book linear-algebra machine-learning modeling probability regression simulation statistics up-for-grabs visualization
Last synced: 03 Aug 2024
https://github.com/AlexIoannides/pymc-example-project
Example PyMC3 project for performing Bayesian data analysis using a probabilistic programming approach to machine learning.
bayesian-data-analysis bayesian-inference data-science machine-learning numpy pandas probabilistic-programming pymc3 python scikit-learn
Last synced: 07 Aug 2024
https://github.com/jay-johnson/sci-pype
A Machine Learning API with native redis caching and export + import using S3. Analyze entire datasets using an API for building, training, testing, analyzing, extracting, importing, and archiving. This repository can run from a docker container or from the repository.
data-science devops-for-data-science docker docker-compose ipython ipython-notebook jupyter jupyter-notebook jupyter-themes machine-learning machine-learning-api predictive python red10 redis s3 seaborn stock-price-prediction xgb xgboost
Last synced: 07 Aug 2024
https://github.com/sissa-data-science/DADApy
Distance-based Analysis of DAta-manifolds in python
data-analysis data-science density-based-clustering density-estimation intrinsic-dimension machine-learning manifolds python
Last synced: 02 Aug 2024
https://github.com/target/data-validator
A tool to validate data, built around Apache Spark.
data-science data-validation hacktoberfest
Last synced: 01 Aug 2024
https://github.com/tlverse/sl3
💪 🤔 Modern Super Learning with Machine Learning Pipelines
data-science ensemble-learning ensemble-model machine-learning model-selection r r-package regression stacking statistics
Last synced: 02 Aug 2024
https://github.com/PetoLau/TSrepr
TSrepr: R package for time series representations
data-analysis data-mining data-mining-algorithms data-science r r-package representation time-series time-series-analysis time-series-classification time-series-clustering time-series-data-mining time-series-representations
Last synced: 01 Aug 2024
https://github.com/nla-group/classix
Fast and explainable clustering in Python
algorithm classification clustering cython data-analysis data-mining data-science database dataset explainable-ml machine-learning python unsupervised-learning unsupervised-machine-learning visualization
Last synced: 03 Aug 2024
https://github.com/xiyanghu/OSDT
Optimal Sparse Decision Trees
accelerate acceleration-model algorithm algorithm-optimization data-mining data-science interpretable-ml machine-learning ml-system mlsys neurips python python3
Last synced: 31 Jul 2024
https://github.com/TexteaInc/funix
Building web apps without manually creating widgets
app-builder data-science frontend machine-learning
Last synced: 13 Aug 2024
https://github.com/wlandau/targets-tutorial
Short course on the targets R package
data-science make pipeline r r-package reproducibility reproducible-research rstats targets workflow
Last synced: 13 Aug 2024
https://github.com/IlyaGusev/tgcontest
Telegram Data Clustering contest solution by Mindful Squirrel
classification clustering cpp data-science document-similarity fasttext machine-learning nlp
Last synced: 01 Aug 2024
https://github.com/jkoutsikakis/pytorch-wrapper
Provides a systematic and extensible way to build, train, evaluate, and tune deep learning models using PyTorch.
data-science deep-learning machine-learning neural-network python pytorch pytorch-wrapper tensor
Last synced: 07 Aug 2024
https://github.com/mratsim/Arch-Data-Science
Archlinux PKGBUILDs for Data Science, Machine Learning, Deep Learning, NLP and Computer Vision
archlinux cuda cudnn data-science deep-learning lightgbm machine-learning mkl mxnet natural-language-processing natural-language-understanding nervana opencv package pandas pytorch scikit-learn spacy tensorflow xgboost
Last synced: 07 Aug 2024
https://github.com/GlobalMaksimum/sadedegel
A General Purpose NLP library for Turkish
acikhack2 ai artificial-intelligence bert binder corpus data-science deep-learning embeddings heroku machine-learning natural-language-processing neural-network neural-networks news-summarizer nlp python
Last synced: 02 Aug 2024
https://github.com/giswqs/manjaro-linux
Shell scripts for setting up Manjaro Linux for Python programming and deep learning
data-science deep-learning gis kde manjaro manjaro-linux notebook-jupyter python r remote-sensing shell-scripts tensorflow
Last synced: 06 Aug 2024
https://github.com/lyltj2010/DataMining
An open-source book on data mining
data-science datamining deeplearning machine-learning
Last synced: 28 Aug 2024
https://github.com/scottshambaugh/monaco
Quantify uncertainty and sensitivities in your computer models with an industry-grade Monte Carlo library.
data-science monaco monte-carlo python scientific-computing sensitivity-analysis simulation statistics uncertainty-analysis uncertainty-quantification
Last synced: 01 Aug 2024
https://github.com/markvanderloo/simputation
Making imputation easy
data-science imputation officialstatistics r rstats
Last synced: 02 Aug 2024
https://github.com/slai-labs/get-beam
Run GPU inference and training jobs on serverless infrastructure that scales with you.
artificial-intelligence cloud-computing cost-optimization data-science deep-learning distributed-computing gpu-acceleration gpu-computing hpc llm-serving llm-training machine-learning ml-infrastructure mlops python serverless serverless-architectures
Last synced: 01 Aug 2024
https://github.com/asavinov/prosto
Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby
business-intelligence data-preparation data-preprocessing data-processing data-science data-wrangling feature-engineering map-reduce olap pandas python spark workflow
Last synced: 01 Aug 2024
https://github.com/talegari/tidypandas
A grammar of data manipulation for pandas inspired by tidyverse
data-analysis data-science dataframe dataframe-library dplyr pandas python tidyverse
Last synced: 12 Aug 2024
https://github.com/tidypyverse/tidypandas
A grammar of data manipulation for pandas inspired by tidyverse
data-analysis data-science dataframe dataframe-library dplyr pandas python tidyverse
Last synced: 01 Aug 2024
https://github.com/firmai/business-analytics-and-mathematics-python-book
Advanced Business Analytics and Mathematics with Python (by @firmai)
analytics business data-analysis data-science mathematics python
Last synced: 04 Aug 2024
https://github.com/mkearney/tweetbotornot2
🔍🐦🤖 Detect Twitter Bots!
bot-detection bot-detector classification data-science machine-learning r r-package rstats rtweet twitter twitter-api twitter-bot-detection twitter-bots xgboost
Last synced: 05 Aug 2024
https://github.com/synthesized-io/fairlens
Identify bias and measure fairness of your data
bias data data-analysis data-science fairness pandas python statistics
Last synced: 03 Aug 2024