Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Data Science
Data science is an inter-disciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge from structured and unstructured data. Data scientists perform data analysis and preparation, and their findings inform high-level decisions in many organizations.
- GitHub: https://github.com/topics/data-science
- Wikipedia: https://en.wikipedia.org/wiki/Data_science
- Related Topics: data-analysis, data-mining, machine-learning, big-data, data-visualization,
- Aliases: datasciences, data-science-project, data-science-algorithm,
- Last updated: 2024-11-18 00:06:52 UTC
- JSON Representation
https://github.com/ashishpatel26/Amazing-Feature-Engineering
Feature engineering is the process of using domain knowledge to extract features from raw data via data mining techniques. These features can be used to improve the performance of machine learning algorithms. Feature engineering can be considered as applied machine learning itself.
data-analysis data-mining data-science data-scientists data-visualization deep-learning feature-engineering feature-extraction feature-scaling feature-selection features machine-learning scikit-learn
Last synced: 07 Nov 2024
https://github.com/SimonBlanke/Hyperactive
An optimization and data collection toolbox for convenient and fast prototyping of computationally expensive models.
automated-machine-learning bayesian-optimization data-science deep-learning feature-engineering hyperactive hyperparameter-optimization keras machine-learning model-selection neural-architecture-search optimization parallel-computing parameter-tuning python pytorch scikit-learn xgboost
Last synced: 05 Nov 2024
https://github.com/simonblanke/hyperactive
An optimization and data collection toolbox for convenient and fast prototyping of computationally expensive models.
automated-machine-learning bayesian-optimization data-science deep-learning feature-engineering hyperactive hyperparameter-optimization keras machine-learning model-selection neural-architecture-search optimization parallel-computing parameter-tuning python pytorch scikit-learn xgboost
Last synced: 13 Nov 2024
https://github.com/ericlagergren/decimal
A high-performance, arbitrary-precision, floating-point decimal library.
arbitrary-precision big-decimal data-science decimal dogs-of-instagram financial general-decimal-arithmetic money multi-precision
Last synced: 04 Aug 2024
https://github.com/microsoft/Reactors
🌱 Join a community of developers at Microsoft Reactor and connect with people, skills, and technology to build your career or personal learning. We offer free livestreams, on-demand content, and hybrid/in-person events daily around the world. Access our projects and code here.
ai azure cloud data data-science devops dotnet events iot live-streaming low-code meetup mixed-reality ml no-code nodejs personal-de python web
Last synced: 13 Nov 2024
https://github.com/openhackathons-org/gpubootcamp
This repository consists for gpu bootcamp material for HPC and AI
ai4hpc cuda data-science deep-learning deepstream gpu hpc machine-learning mpi openacc openmp rapidsai
Last synced: 30 Oct 2024
https://github.com/boxuancui/DataExplorer
Automate Data Exploration and Treatment
cran data-analysis data-exploration data-science eda r r-package rstats visualization
Last synced: 13 Aug 2024
https://github.com/scverse/anndata
Annotated data.
anndata bioinformatics data-science machine-learning scanpy scverse transcriptomics
Last synced: 06 Nov 2024
https://github.com/BCG-X-Official/facet
Human-explainable AI.
data-analytics data-science explainable-ai hyperparameter-tuning interpretability machine-learning model-selection python shap-vector-decomposition simulation statistics
Last synced: 15 Nov 2024
https://github.com/bcg-x-official/facet
Human-explainable AI.
data-analytics data-science explainable-ai hyperparameter-tuning interpretability machine-learning model-selection python shap-vector-decomposition simulation statistics
Last synced: 15 Nov 2024
https://github.com/vi3k6i5/GuidedLDA
semi supervised guided topic model with custom guidedLDA
data-science guided-topic-modeling guidedlda machine-learning seededlda topic-modeling
Last synced: 13 Nov 2024
https://github.com/jmschrei/apricot
apricot implements submodular optimization for the purpose of selecting subsets of massive data sets to train machine learning models quickly. See the documentation page: https://apricot-select.readthedocs.io/en/latest/index.html
data-science machine-learning python submodular-optimization submodularity
Last synced: 13 Nov 2024
https://github.com/business-science/modeltime
Modeltime unlocks time series forecast models and machine learning in one framework
arima data-science deep-learning ets forecasting machine-learning machine-learning-algorithms modeltime prophet r-package tbats tidymodeling tidymodels time time-series time-series-analysis timeseries timeseries-forecasting
Last synced: 15 Nov 2024
https://github.com/ing-bank/popmon
Monitor the stability of a Pandas or Spark dataframe ⚙︎
covariate-shift data-analysis data-distributions data-profiling data-science dataset-shifts drift-detection hacktoberfest ing-bank ipython jupyter mlops monitoring pandas population-monitoring python spark statistical-process-control statistical-tests statistics
Last synced: 12 Oct 2024
https://github.com/akanz1/klib
Easy to use Python library of customized functions for cleaning and analyzing data.
data-analysis data-cleaning data-preprocessing data-science data-visualization feature-selection klib python
Last synced: 15 Nov 2024
https://github.com/JuliaAcademy/DataScience
Data Science in Julia course for JuliaAcademy.com, taught by Huda Nassar
data-science julia juliaacademy learnjulia
Last synced: 27 Oct 2024
https://github.com/polyaxon/traceml
Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon.
dask data-exploration data-profiling data-quality data-quality-checks data-science data-visualization dataframes dataops explainable-ai matplotlib mlops pandas pandas-summary plotly pytorch spark statistics tensorflow tracking
Last synced: 11 Oct 2024
https://github.com/juliaacademy/datascience
Data Science in Julia course for JuliaAcademy.com, taught by Huda Nassar
data-science julia juliaacademy learnjulia
Last synced: 12 Oct 2024
https://github.com/hamelsmu/code_search
Code For Medium Article: "How To Create Natural Language Semantic Search for Arbitrary Objects With Deep Learning"
code-search data-science deep-learning fastai keras machine-learning machine-learning-on-source-code ml-on-code natural-language-processing nlp python pytorch search search-algorithm searching-algorithms semantic-search semantic-search-engine tensorflow tutorial
Last synced: 26 Oct 2024
https://github.com/plotly/dash.jl
Dash for Julia - A Julia interface to the Dash ecosystem for creating analytic web applications in Julia. No JavaScript required.
bioinformatics charting dash dashboard data-science data-visualization finance gui-framework julia modeling no-javascript no-vba plotly plotly-dash productivity react technical-computing web-app
Last synced: 12 Oct 2024
https://github.com/doubleml/doubleml-for-py
DoubleML - Double Machine Learning in Python
causal-inference data-science double-machine-learning econometrics machine-learning python scikit-learn statistics
Last synced: 14 Oct 2024
https://github.com/akfamily/aktools
AKTools is an elegant and simple HTTP API library for AKShare, built for AKSharers!
akshare asyncio data data-science fastapi openapi pydanti
Last synced: 11 Oct 2024
https://github.com/ottogroup/palladium
Framework for setting up predictive analytics services
data-science machine-learning scikit-learn
Last synced: 29 Oct 2024
https://github.com/InfuseAI/piperider
Code review for data in dbt
code-review continuous-integration data-exploration data-observability data-pipeline data-profiler data-profiling data-quality data-reliability data-science data-testing data-visualization dbt dbt-metrics eda exploratory-data-analysis pull-requests python reporting
Last synced: 09 Nov 2024
https://github.com/explosion/prodigy-recipes
🍳 Recipes for the Prodigy, our fully scriptable annotation tool
active-learning annotation annotation-tool artificial-intelligence computer-vision data-annotation data-science labeling-tool machine-learning machine-teaching natural-language-processing nlp prodigy spacy
Last synced: 07 Oct 2024
https://github.com/juliadatascience/juliadatascience
Book on Julia for Data Science
book data data-manipulation data-science data-visualization julia julia-language
Last synced: 13 Nov 2024
https://github.com/h2oai/mli-resources
H2O.ai Machine Learning Interpretability Resources
accountability data-mining data-science explainable-ml fairness fatml h2o iml interpretability interpretable-ai interpretable-machine-learning interpretable-ml jupyter-notebooks machine-learning machine-learning-interpretability mli python transparency xai xgboost
Last synced: 06 Nov 2024
https://github.com/frictionlessdata/specs
Technical specifications and guidelines for implementing Frictionless Data.
csv data-science json metadata schema validation
Last synced: 06 Nov 2024
https://github.com/a16z/nft-analyst-starter-pack
analytics data-science ethereum nfts python
Last synced: 16 Nov 2024
https://github.com/DoubleML/doubleml-for-py
DoubleML - Double Machine Learning in Python
causal-inference data-science double-machine-learning econometrics machine-learning python scikit-learn statistics
Last synced: 24 Aug 2024
https://github.com/rjurney/agile_data_code_2
Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
agile-data agile-data-science airflow amazon-ec2 amazon-web-services analytics apache-kafka apache-spark data data-science data-syndrome kafka machine-learning machine-learning-algorithms predictive-analytics python python-3 python3 spark vagrant
Last synced: 12 Oct 2024
https://github.com/serengil/chefboost
A Lightweight Decision Tree Framework supporting regular algorithms: ID3, C4.5, CART, CHAID and Regression Trees; some advanced techniques: Gradient Boosting, Random Forest and Adaboost w/categorical features support for Python
adaboost c45-trees cart categorical-features data-mining data-science decision-trees gbdt gbm gbrt gradient-boosting gradient-boosting-machine gradient-boosting-machines id3 kaggle machine-learning python random-forest regression-tree
Last synced: 12 Nov 2024
https://github.com/farukalamai/advanced-machine-learning-engineer-roadmap-2024
A Full Stack ML (Machine Learning) Roadmap involves learning the necessary skills and technologies to become proficient in all aspects of machine learning, including data collection and preprocessing, model development, deployment, and maintenance.
aws computer-vision data-analysis data-science data-visualization deep-learning git-github machine-learning machine-learning-roadmap mlops natural-language-processing neural-network nlp opencv pandas python pytorch statistics tensorflow yolo
Last synced: 14 Nov 2024
https://github.com/filippobovo/production-data-science
Production Data Science: a workflow for collaborative data science aimed at production
collaborative data-science production workflow
Last synced: 13 Nov 2024
https://github.com/rudeboybert/fivethirtyeight
R package of data and code behind the stories and interactives at FiveThirtyEight
cran data-science datajournalism fivethirtyeight r rpackage statistics
Last synced: 13 Nov 2024
https://github.com/FilippoBovo/production-data-science
Production Data Science: a workflow for collaborative data science aimed at production
collaborative data-science production workflow
Last synced: 12 Nov 2024
https://github.com/polyaxon/haupt
Lineage metadata API, artifacts streams, sandbox, API, and spaces for Polyaxon
bokeh data-processing data-profiling data-science data-visualization deep-learning jupyter lineage machine-learning matplotlib mlops models plotly python pytorch serving tensorflow tracking ui visualization
Last synced: 10 Oct 2024
https://github.com/JuliaDataScience/JuliaDataScience
Book on Julia for Data Science
book data data-manipulation data-science data-visualization julia julia-language
Last synced: 08 Aug 2024
https://github.com/ploomber/sklearn-evaluation
Machine learning model evaluation made easy: plots, tables, HTML reports, experiment tracking and Jupyter notebook analysis.
data-science deep-learning jupyter-notebook machine-learning pytorch scikit-learn sklearn tensorflow
Last synced: 13 Oct 2024
https://github.com/hurshd0/must-read-papers-for-ml
Collection of must read papers for Data Science, or Machine Learning / Deep Learning Engineer
convolutional-networks data-analysis data-science deep-learning exploratory-data-analysis generalized-additive-models machine-learning neural-networks papers recommender-system recurrent-neural-networks rnn-lstm
Last synced: 07 Nov 2024
https://github.com/pgalko/BambooAI
A lightweight library that leverages Language Models (LLMs) to enable natural language interactions, allowing you to source and converse with data.
ai ai-agents data-analysis data-science gemini groq llm mistral ollama openai-api pandas pinecone python vector-database
Last synced: 28 Oct 2024
https://github.com/pgalko/bambooai
A lightweight library that leverages Language Models (LLMs) to enable natural language interactions, allowing you to source and converse with data.
ai ai-agents data-analysis data-science gemini groq llm mistral ollama openai-api pandas pinecone python vector-database
Last synced: 10 Oct 2024
https://github.com/pykale/pykale
Knowledge-Aware machine LEarning (KALE): accessible machine learning from multiple sources for interdisciplinary research, part of the 🔥PyTorch ecosystem. ⭐ Star to support our work!
computer-vision data-science deep-learning domain-adaptation graph-analysis knowledge-aware-learning machine-learning medical-image-analysis meta-learning multimodal multimodal-learning python pytorch transfer-learning
Last synced: 11 Oct 2024
https://github.com/tlkh/ai-lab
All-in-one AI container for rapid prototyping
cuda data-science deep-learning docker jupyter nvidia pytorch tensorflow
Last synced: 14 Nov 2024
https://github.com/bodywork-ml/bodywork-core
ML pipeline orchestration and model deployments on Kubernetes.
batch cicd continuous-deployment data-science devops framework kubernetes machine-learning mlops orchestration pipeline python serving
Last synced: 09 Nov 2024
https://github.com/kevinschaich/pyspark-cheatsheet
🐍 Quick reference guide to common patterns & functions in PySpark.
cheat cheatsheet cheatsheets data data-science docs documentation guide guides pyspark pyspark-tutorial quickstart reference references spark spark-sql
Last synced: 31 Oct 2024
https://github.com/girder/girder
A data management platform for the web, developed by Kitware
data-analytics data-management data-science javascript kitware python resonant
Last synced: 04 Nov 2024
https://github.com/BlackHC/toma
Helps you write algorithms in PyTorch that adapt to the available (CUDA) memory
data-science gpu machine-learning python pytorch
Last synced: 15 Nov 2024
https://github.com/blackhc/toma
Helps you write algorithms in PyTorch that adapt to the available (CUDA) memory
data-science gpu machine-learning python pytorch
Last synced: 16 Nov 2024
https://github.com/kevintpeng/Learn-Something-Every-Day
📝 A compilation of everything that I learn; Computer Science, Software Development, Engineering, Math, and Coding in General. Read the rendered results here ->
algorithm aws blog computer-science course-materials data-engineering data-science education educational engineering learning math mathematics research software-engineering university unix waterloo
Last synced: 28 Oct 2024
https://github.com/jobream/List-of-Learning-Resources
This collection provides a list of educational resources for Software Engineers. Feel free to add your favorite resources as well and help others in their journey of learning.
competitive-programming computer-science data-science resources software-engineering web-development
Last synced: 12 Nov 2024
https://github.com/moabukar/Everything-Tech
A collection of online resources to help you on your Tech journey.
ansible aws azure backend data-engineering data-science devops docker frontend gcp kubernetes machine-learning networking python serverless software-engineering tech terraform
Last synced: 06 Nov 2024
https://github.com/plotly/dash-table
OBSOLETE: now part of https://github.com/plotly/dash
dash data-science data-visualization plotly plotly-dash python react table
Last synced: 05 Nov 2024
https://github.com/firmai/pandasvault
Advanced Pandas Vault — Utilities, Functions and Snippets (by @firmai).
data-science data-structures dataframe functions pandas python snippets table tips
Last synced: 15 Nov 2024
https://github.com/moabukar/everything-tech
A collection of online resources to help you on your Tech journey.
ansible aws azure backend data-engineering data-science devops docker frontend gcp kubernetes machine-learning networking python serverless software-engineering tech terraform
Last synced: 26 Sep 2024
https://github.com/lijqhs/deeplearning-notes
Notes for Deep Learning Specialization Courses led by Andrew Ng.
algorithms andrew-ng backpropagation bias-variance cnn coursera data-analysis data-science deep-learning deeplearning hyperparameter-optimization machine-learning neural-network notes overfitting sequence-models statistics summary tensorflow
Last synced: 07 Nov 2024
https://github.com/breck7/scroll
Scroll is a language for scientists of all ages. Scroll includes a command line app that builds static blogs, websites, CSVs, text files, and more.
blog cms csv data-science knowledge-base knowledge-graph markdown markup markup-language note-taking scroll static-site-generator tree-notation
Last synced: 08 Nov 2024
https://github.com/HHammond/PrettyPandas
A Pandas Styler class for making beautiful tables
data-analysis data-science pandas pandas-dataframe pandas-dataframes pandas-styler python reporting
Last synced: 10 Nov 2024
https://github.com/run-ai/genv
GPU environment and cluster management with LLM support
bash container-runtime containers data-science deep-learning docker gpu gpus jupyter-notebook jupyterlab-extension k8s kubernetes llm-inference llms nvidia-gpu ollama ray vscode vscode-extension zsh
Last synced: 10 Oct 2024
https://github.com/okfn-brasil/rosie
🤖 Python application responsible for Serenata de Amor's intelligence
artificial-intelligence data-science machine-learning
Last synced: 31 Oct 2024
https://github.com/ankonzoid/artificio
Deep Learning Computer Vision Algorithms for Real-World Use
ai applications artificial-intelligence auto-encoders computer-vision convolutional-neural-networks data-science deep-learning image-classification image-finder image-processing image-recognition image-retrieval machine-learning neural-networks object-recognition python recommender-system recommender-systems transfer-learning
Last synced: 15 Nov 2024
https://github.com/ClimbsRocks/machineJS
[UNMAINTAINED] Automated machine learning- just give it a data file! Check out the production-ready version of this project at ClimbsRocks/auto_ml
auto-ml automated-machine-learning automl data-science data-scientists javascript javascript-library kaggle machine-learning machine-learning-algorithms machine-learning-library ml numerai scikit-learn
Last synced: 07 Aug 2024
https://github.com/rebecca-vickery/data-science-learning-resources
A comprehensive list of free resources for learning data science
artificial-intelligence data data-science machine-learning python
Last synced: 11 Nov 2024
https://github.com/vmware/versatile-data-kit
One framework to develop, deploy and operate data workflows with Python and SQL.
analytics data data-engineer data-engineering data-engineering-pipeline data-lineage data-pipelines data-science data-structures data-warehouse database dataops elt etl pipeline python snowflake sql trino warehouse
Last synced: 13 Nov 2024
https://github.com/Niketkumardheeryan/ML-CaPsule
ML-capsule is a Project for beginners and experienced data science Enthusiasts who don't have a mentor or guidance and wish to learn Machine learning. Using our repo they can learn ML, DL, and many related technologies with different real-world projects and become Interview ready.
analytics data-analysis data-science data-visualization datascience deep-learning deep-neural-networks deployment flask heroku-deployment machine-learning python r statistics streamlit-webapp
Last synced: 13 Nov 2024
https://github.com/dcai-course/dcai-lab
Lab assignments for Introduction to Data-Centric AI, MIT IAP 2024 👩🏽💻
course data-centric-ai data-science deep-learning homework lab machine-learning
Last synced: 30 Oct 2024
https://github.com/climbsrocks/machinejs
[UNMAINTAINED] Automated machine learning- just give it a data file! Check out the production-ready version of this project at ClimbsRocks/auto_ml
auto-ml automated-machine-learning automl data-science data-scientists javascript javascript-library kaggle machine-learning machine-learning-algorithms machine-learning-library ml numerai scikit-learn
Last synced: 13 Nov 2024
https://github.com/DataScienceUB/introduction-datascience-python-book
Introduction to Data Science: A Python Approach to Concepts, Techniques and Applications
analytics data data-science datascience machine-learning python sentiment-analysis
Last synced: 07 Aug 2024
https://github.com/Chicago/food-inspections-evaluation
This repository contains the code to generate predictions of critical violations at food establishments in Chicago. It also contains the results of an evaluation of the effectiveness of those predictions.
cdph chicago data-science food-poisoning open-data open-science public-health
Last synced: 30 Oct 2024
https://github.com/jbn/zigzag
Python library for identifying the peaks and valleys of a time series.
data-science statistics technical-analysis
Last synced: 15 Nov 2024
https://github.com/sforaidl/genrl
A PyTorch reinforcement learning library for generalizable and reproducible algorithm implementations with an aim to improve accessibility in RL
algorithm-implementations benchmarking data-science deep-learning gym hacktoberfest machine-learning neural-network openai python pytorch reinforcement-learning reinforcement-learning-algorithms
Last synced: 30 Oct 2024
https://github.com/SforAiDl/genrl
A PyTorch reinforcement learning library for generalizable and reproducible algorithm implementations with an aim to improve accessibility in RL
algorithm-implementations benchmarking data-science deep-learning gym hacktoberfest machine-learning neural-network openai python pytorch reinforcement-learning reinforcement-learning-algorithms
Last synced: 12 Nov 2024
https://github.com/5agado/data-science-learning
Repository of code and resources related to different data science and machine learning topics. For learning, practice and teaching purposes.
data-science deep-learning jupyter-notebook learning-by-doing machine-learning statistics
Last synced: 08 Nov 2024
https://github.com/platonai/PulsarRPA
Automate webpages at scale, scrape web data completely and accurately with high performance, distributed RPA.
crawler data-mining data-science rpa scraper scraping web-automation web-crawler web-mining web-scraping web-sql
Last synced: 05 Nov 2024
https://github.com/ledell/user-machine-learning-tutorial
useR! 2016 Tutorial: Machine Learning Algorithmic Deep Dive http://user2016.org/tutorials/10.html
data-science deep-learning ensemble-learning gradient-boosting-machine machine-learning r random-forest tutorial
Last synced: 17 Nov 2024
https://github.com/mfarragher/obsidiantools
Obsidian tools - a Python package for analysing an Obsidian.md vault
data-science knowledge-management network-analysis note-taking obsidian-community obsidian-md python
Last synced: 14 Oct 2024
https://github.com/kunalj101/Data-Science-Hacks
Data Science Hacks consists of tips, tricks to help you become a better data scientist. Data science hacks are for all - beginner to advanced. Data science hacks consist of python, jupyter notebook, pandas hacks and so on.
computer-vision data data-analysis data-science data-visualization dataset hacks image-augmentation ipynb machine-learning nlp nlp-machine-learning numpy pandas pandas-dataframe pandas-python pandas-tutorial python python3 tips-and-tricks
Last synced: 13 Nov 2024
https://github.com/rio-labs/rio
WebApps in pure Python. No JavaScript, HTML and CSS needed
data-analysis data-science data-visualization deep-learning machine-learning python ui webapp
Last synced: 06 Nov 2024
https://github.com/ledell/useR-machine-learning-tutorial
useR! 2016 Tutorial: Machine Learning Algorithmic Deep Dive http://user2016.org/tutorials/10.html
data-science deep-learning ensemble-learning gradient-boosting-machine machine-learning r random-forest tutorial
Last synced: 07 Aug 2024
https://github.com/kunalj101/data-science-hacks
Data Science Hacks consists of tips, tricks to help you become a better data scientist. Data science hacks are for all - beginner to advanced. Data science hacks consist of python, jupyter notebook, pandas hacks and so on.
computer-vision data data-analysis data-science data-visualization dataset hacks image-augmentation ipynb machine-learning nlp nlp-machine-learning numpy pandas pandas-dataframe pandas-python pandas-tutorial python python3 tips-and-tricks
Last synced: 11 Oct 2024
https://github.com/capitalone/datacompy
Pandas and Spark DataFrame comparison for humans and more!
compare dask data data-science dataframes fugue numpy pandas polars pyspark python spark
Last synced: 11 Oct 2024
https://github.com/jasmcaus/ai-math-roadmap
Your no-nonsense guide to the Math used in Artificial Intelligence
ai ai-roadmap artificial-intelligence caer data-science deep-learning machine-learning mathematics neural-network roadmap
Last synced: 11 Nov 2024
https://github.com/youssefhosni/awesome-data-science-resoruces
A curated list of data science educational resources for essential data science skills
computer-science data-science deep-learning machine-learning statistics
Last synced: 07 Nov 2024
https://github.com/tobgu/qframe
Immutable data frame for Go
data-frame data-science dataframe go golang immutable
Last synced: 13 Nov 2024
https://github.com/airalcorn2/Michael-s-Data-Science-Curriculum
This is the companion curriculum to my guide to becoming a data scientist.
curriculum data-science machine-learning statistics
Last synced: 05 Aug 2024
https://github.com/airbnb/artificial-adversary
🗣️ Tool to generate adversarial text examples and test machine learning models against them
adversarial-examples black-box-attacks black-box-benchmarking classification data-mining data-science machine-learning metrics python python2 python3 spam spam-classification spam-detection spam-filtering text text-analysis text-classification text-mining text-processing
Last synced: 13 Nov 2024
https://github.com/EpistasisLab/scikit-rebate
A scikit-learn-compatible Python implementation of ReBATE, a suite of Relief-based feature selection algorithms for Machine Learning.
data-science feature-selection python
Last synced: 30 Oct 2024
https://github.com/epistasislab/scikit-rebate
A scikit-learn-compatible Python implementation of ReBATE, a suite of Relief-based feature selection algorithms for Machine Learning.
data-science feature-selection python
Last synced: 16 Nov 2024
https://github.com/aiguofer/gspread-pandas
A package to easily open an instance of a Google spreadsheet and interact with worksheets through Pandas DataFrames.
data data-analytics data-engineering data-science dataframes google google-sheets google-spreadsheets gspread pandas python sheets
Last synced: 13 Nov 2024
https://github.com/basedosdados/sdk
⚙️ Código de manutenção do datalake (metadados e pacotes de acesso) | 📖 Docs: https://basedosdados.github.io/mais/
bigquery dados-abertos data-science govtech hacktoberfest hacktoberfest2022 open-data python r sql transparencia
Last synced: 13 Nov 2024
https://github.com/rohan-paul/machinelearning-deeplearning-code-for-my-youtube-channel
The full collection of all codes for my Youtube Channel segregated as per topic.
computer-vision data-science data-science-portfolio datascience deep-learning deep-neural-networks machine-learning machine-learning-algorithms math neural-network python pytorch pytorch-implementation pytorch-tutorial statistics tensorflow tensorflow-examples tensorflow-tutorials tensorflow2 youtube
Last synced: 12 Nov 2024
https://github.com/basedosdados/mais
⚙️ Código de manutenção do datalake (metadados e pacotes de acesso) | 📖 Docs: https://basedosdados.github.io/mais/
bigquery dados-abertos data-science govtech hacktoberfest hacktoberfest2022 open-data python r sql transparencia
Last synced: 13 Oct 2024
https://github.com/terrytangyuan/distributed-ml-patterns
Distributed Machine Learning Patterns from Manning Publications by Yuan Tang https://bit.ly/2RKv8Zo
argo argo-workflows book cloud-computing cloud-native data-science devops distributed-machine-learning distributed-systems kubeflow kubernetes large-scale-machine-learning machine-learning machine-learning-pipelines manning-publications mlops python tensorflow
Last synced: 29 Oct 2024
https://github.com/plotly/dashR
Create data science and AI web apps in R
dash data-science data-visualization plotly plotly-dash python r react web-application
Last synced: 27 Oct 2024
https://github.com/yzkang/My-Data-Competition-Experience
本人多次机器学习与大数据竞赛Top5的经验总结,满满的干货,拿好不谢
automl catboost data-science deep-learning feature-engineering feature-selection gan hyperparameter-optimization kaggle-competition lightgbm machine-learning model-fusion model-selection python sql tianchi-competition xgboost
Last synced: 11 Nov 2024
https://github.com/DagsHub/fds
Fast Data Science, AKA fds, is a CLI for Data Scientists to version control data and code at once, by conveniently wrapping git and dvc
Last synced: 15 Nov 2024
https://github.com/finos/jupyterlab_templates
Support for jupyter notebook templates in jupyterlab
data-science dataviz jupyter jupyterlab jupyterlab-extension machine-learning notebook
Last synced: 07 Nov 2024
https://github.com/plotly/plotly_matlab
Plotly Graphing Library for MATLAB®
d3 d3js data-science data-visualization matlab plotly technical-computing webgl
Last synced: 14 Nov 2024
https://github.com/neptune-ai/open-solution-mapping-challenge
Open solution to the Mapping Challenge :earth_americas:
competition crowdai data-science data-science-learning deep-learning kaggle lightgbm machine-learning machine-learning-lab mapping-challenge neptune pipeline pipeline-framework python satellite-imagery unet unet-image-segmentation unet-pytorch
Last synced: 14 Nov 2024
https://github.com/wilsonrljr/sysidentpy
A Python Package For System Identification Using NARMAX Models
data-science dynamical-systems machine-learning narmax narx system-identification time-series
Last synced: 12 Nov 2024