Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Data Science
Data science is an inter-disciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge from structured and unstructured data. Data scientists perform data analysis and preparation, and their findings inform high-level decisions in many organizations.
- GitHub: https://github.com/topics/data-science
- Wikipedia: https://en.wikipedia.org/wiki/Data_science
- Related Topics: data-analysis, data-mining, machine-learning, big-data, data-visualization
- Aliases: datasciences, data-science-project, data-science-algorithm
- Last updated: 2024-11-10 00:06:58 UTC
https://github.com/milaan9/93_python_data_analytics_projects
This repository contains all the data analytics projects that I've worked on in Python.
breast-cancer-prediction cervical-cancer-prediction covid-19-prediction data-analytics-projects data-science english-french-tranlation ipython-notebook machine-learning machine-learning-projects poker-hand-predictor python4datascience python4everybody resume-selection stock-news-prediction tutor-milaan9
Last synced: 11 Oct 2024
https://github.com/ing-bank/popmon
Monitor the stability of a Pandas or Spark dataframe ⚙︎
covariate-shift data-analysis data-distributions data-profiling data-science dataset-shifts drift-detection hacktoberfest ing-bank ipython jupyter mlops monitoring pandas population-monitoring python spark statistical-process-control statistical-tests statistics
Last synced: 12 Oct 2024
https://github.com/polyaxon/traceml
Engine for ML/Data tracking, visualization, explainability, drift detection, and dashboards for Polyaxon.
dask data-exploration data-profiling data-quality data-quality-checks data-science data-visualization dataframes dataops explainable-ai matplotlib mlops pandas pandas-summary plotly pytorch spark statistics tensorflow tracking
Last synced: 11 Oct 2024
https://github.com/juliaacademy/datascience
Data Science in Julia course for JuliaAcademy.com, taught by Huda Nassar
data-science julia juliaacademy learnjulia
Last synced: 12 Oct 2024
https://github.com/JuliaAcademy/DataScience
Data Science in Julia course for JuliaAcademy.com, taught by Huda Nassar
data-science julia juliaacademy learnjulia
Last synced: 27 Oct 2024
https://github.com/hamelsmu/code_search
Code For Medium Article: "How To Create Natural Language Semantic Search for Arbitrary Objects With Deep Learning"
code-search data-science deep-learning fastai keras machine-learning machine-learning-on-source-code ml-on-code natural-language-processing nlp python pytorch search search-algorithm searching-algorithms semantic-search semantic-search-engine tensorflow tutorial
Last synced: 26 Oct 2024
https://github.com/plotly/dash.jl
Dash for Julia - A Julia interface to the Dash ecosystem for creating analytic web applications in Julia. No JavaScript required.
bioinformatics charting dash dashboard data-science data-visualization finance gui-framework julia modeling no-javascript no-vba plotly plotly-dash productivity react technical-computing web-app
Last synced: 12 Oct 2024
https://github.com/akanz1/klib
Easy to use Python library of customized functions for cleaning and analyzing data.
data-analysis data-cleaning data-preprocessing data-science data-visualization feature-selection klib python
Last synced: 03 Aug 2024
https://github.com/doubleml/doubleml-for-py
DoubleML - Double Machine Learning in Python
causal-inference data-science double-machine-learning econometrics machine-learning python scikit-learn statistics
Last synced: 14 Oct 2024
https://github.com/akfamily/aktools
AKTools is an elegant and simple HTTP API library for AKShare, built for AKSharers!
akshare asyncio data data-science fastapi openapi pydanti
Last synced: 11 Oct 2024
https://github.com/ottogroup/palladium
Framework for setting up predictive analytics services
data-science machine-learning scikit-learn
Last synced: 29 Oct 2024
https://github.com/InfuseAI/piperider
Code review for data in dbt
code-review continuous-integration data-exploration data-observability data-pipeline data-profiler data-profiling data-quality data-reliability data-science data-testing data-visualization dbt dbt-metrics eda exploratory-data-analysis pull-requests python reporting
Last synced: 09 Nov 2024
https://github.com/h2oai/mli-resources
H2O.ai Machine Learning Interpretability Resources
accountability data-mining data-science explainable-ml fairness fatml h2o iml interpretability interpretable-ai interpretable-machine-learning interpretable-ml jupyter-notebooks machine-learning machine-learning-interpretability mli python transparency xai xgboost
Last synced: 06 Nov 2024
https://github.com/explosion/prodigy-recipes
🍳 Recipes for Prodigy, our fully scriptable annotation tool
active-learning annotation annotation-tool artificial-intelligence computer-vision data-annotation data-science labeling-tool machine-learning machine-teaching natural-language-processing nlp prodigy spacy
Last synced: 07 Oct 2024
https://github.com/frictionlessdata/specs
Technical specifications and guidelines for implementing Frictionless Data.
csv data-science json metadata schema validation
Last synced: 06 Nov 2024
https://github.com/a16z/nft-analyst-starter-pack
analytics data-science ethereum nfts python
Last synced: 03 Aug 2024
https://github.com/DoubleML/doubleml-for-py
DoubleML - Double Machine Learning in Python
causal-inference data-science double-machine-learning econometrics machine-learning python scikit-learn statistics
Last synced: 24 Aug 2024
https://github.com/rjurney/agile_data_code_2
Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
agile-data agile-data-science airflow amazon-ec2 amazon-web-services analytics apache-kafka apache-spark data data-science data-syndrome kafka machine-learning machine-learning-algorithms predictive-analytics python python-3 python3 spark vagrant
Last synced: 12 Oct 2024
https://github.com/rudeboybert/fivethirtyeight
R package of data and code behind the stories and interactives at FiveThirtyEight
cran data-science datajournalism fivethirtyeight r rpackage statistics
Last synced: 30 Oct 2024
https://github.com/FilippoBovo/production-data-science
Production Data Science: a workflow for collaborative data science aimed at production
collaborative data-science production workflow
Last synced: 02 Aug 2024
https://github.com/polyaxon/haupt
Lineage metadata API, artifacts streams, sandbox, API, and spaces for Polyaxon
bokeh data-processing data-profiling data-science data-visualization deep-learning jupyter lineage machine-learning matplotlib mlops models plotly python pytorch serving tensorflow tracking ui visualization
Last synced: 10 Oct 2024
https://github.com/serengil/chefboost
A Lightweight Decision Tree Framework supporting regular algorithms: ID3, C4.5, CART, CHAID and Regression Trees; some advanced techniques: Gradient Boosting, Random Forest and Adaboost w/categorical features support for Python
adaboost c45-trees cart categorical-features data-mining data-science decision-trees gbdt gbm gbrt gradient-boosting gradient-boosting-machine gradient-boosting-machines id3 kaggle machine-learning python random-forest regression-tree
Last synced: 02 Aug 2024
https://github.com/juliadatascience/juliadatascience
Book on Julia for Data Science
book data data-manipulation data-science data-visualization julia julia-language
Last synced: 30 Oct 2024
https://github.com/JuliaDataScience/JuliaDataScience
Book on Julia for Data Science
book data data-manipulation data-science data-visualization julia julia-language
Last synced: 08 Aug 2024
https://github.com/ploomber/sklearn-evaluation
Machine learning model evaluation made easy: plots, tables, HTML reports, experiment tracking and Jupyter notebook analysis.
data-science deep-learning jupyter-notebook machine-learning pytorch scikit-learn sklearn tensorflow
Last synced: 13 Oct 2024
https://github.com/hurshd0/must-read-papers-for-ml
Collection of must-read papers for data scientists and machine learning / deep learning engineers
convolutional-networks data-analysis data-science deep-learning exploratory-data-analysis generalized-additive-models machine-learning neural-networks papers recommender-system recurrent-neural-networks rnn-lstm
Last synced: 07 Nov 2024
https://github.com/pgalko/BambooAI
A lightweight library that leverages Language Models (LLMs) to enable natural language interactions, allowing you to source and converse with data.
ai ai-agents data-analysis data-science gemini groq llm mistral ollama openai-api pandas pinecone python vector-database
Last synced: 28 Oct 2024
https://github.com/pgalko/bambooai
A lightweight library that leverages Language Models (LLMs) to enable natural language interactions, allowing you to source and converse with data.
ai ai-agents data-analysis data-science gemini groq llm mistral ollama openai-api pandas pinecone python vector-database
Last synced: 10 Oct 2024
https://github.com/pykale/pykale
Knowledge-Aware machine LEarning (KALE): accessible machine learning from multiple sources for interdisciplinary research, part of the 🔥PyTorch ecosystem. ⭐ Star to support our work!
computer-vision data-science deep-learning domain-adaptation graph-analysis knowledge-aware-learning machine-learning medical-image-analysis meta-learning multimodal multimodal-learning python pytorch transfer-learning
Last synced: 11 Oct 2024
https://github.com/tlkh/ai-lab
All-in-one AI container for rapid prototyping
cuda data-science deep-learning docker jupyter nvidia pytorch tensorflow
Last synced: 07 Nov 2024
https://github.com/bodywork-ml/bodywork-core
ML pipeline orchestration and model deployments on Kubernetes.
batch cicd continuous-deployment data-science devops framework kubernetes machine-learning mlops orchestration pipeline python serving
Last synced: 09 Nov 2024
https://github.com/kevinschaich/pyspark-cheatsheet
🐍 Quick reference guide to common patterns & functions in PySpark.
cheat cheatsheet cheatsheets data data-science docs documentation guide guides pyspark pyspark-tutorial quickstart reference references spark spark-sql
Last synced: 31 Oct 2024
https://github.com/girder/girder
A data management platform for the web, developed by Kitware
data-analytics data-management data-science javascript kitware python resonant
Last synced: 04 Nov 2024
https://github.com/kevintpeng/Learn-Something-Every-Day
📝 A compilation of everything that I learn; Computer Science, Software Development, Engineering, Math, and Coding in General. Read the rendered results here ->
algorithm aws blog computer-science course-materials data-engineering data-science education educational engineering learning math mathematics research software-engineering university unix waterloo
Last synced: 28 Oct 2024
https://github.com/blackhc/toma
Helps you write algorithms in PyTorch that adapt to the available (CUDA) memory
data-science gpu machine-learning python pytorch
Last synced: 09 Nov 2024
https://github.com/moabukar/Everything-Tech
A collection of online resources to help you on your Tech journey.
ansible aws azure backend data-engineering data-science devops docker frontend gcp kubernetes machine-learning networking python serverless software-engineering tech terraform
Last synced: 06 Nov 2024
https://github.com/plotly/dash-table
OBSOLETE: now part of https://github.com/plotly/dash
dash data-science data-visualization plotly plotly-dash python react table
Last synced: 05 Nov 2024
https://github.com/moabukar/everything-tech
A collection of online resources to help you on your Tech journey.
ansible aws azure backend data-engineering data-science devops docker frontend gcp kubernetes machine-learning networking python serverless software-engineering tech terraform
Last synced: 26 Sep 2024
https://github.com/lijqhs/deeplearning-notes
Notes for Deep Learning Specialization Courses led by Andrew Ng.
algorithms andrew-ng backpropagation bias-variance cnn coursera data-analysis data-science deep-learning deeplearning hyperparameter-optimization machine-learning neural-network notes overfitting sequence-models statistics summary tensorflow
Last synced: 07 Nov 2024
https://github.com/breck7/scroll
Scroll is a language for scientists of all ages. Scroll includes a command line app that builds static blogs, websites, CSVs, text files, and more.
blog cms csv data-science knowledge-base knowledge-graph markdown markup markup-language note-taking scroll static-site-generator tree-notation
Last synced: 08 Nov 2024
https://github.com/HHammond/PrettyPandas
A Pandas Styler class for making beautiful tables
data-analysis data-science pandas pandas-dataframe pandas-dataframes pandas-styler python reporting
Last synced: 10 Nov 2024
https://github.com/jobream/List-of-Learning-Resources
This collection provides a list of educational resources for Software Engineers. Feel free to add your favorite resources as well and help others in their journey of learning.
competitive-programming computer-science data-science resources software-engineering web-development
Last synced: 02 Aug 2024
https://github.com/run-ai/genv
GPU environment and cluster management with LLM support
bash container-runtime containers data-science deep-learning docker gpu gpus jupyter-notebook jupyterlab-extension k8s kubernetes llm-inference llms nvidia-gpu ollama ray vscode vscode-extension zsh
Last synced: 10 Oct 2024
https://github.com/okfn-brasil/rosie
🤖 Python application responsible for Serenata de Amor's intelligence
artificial-intelligence data-science machine-learning
Last synced: 31 Oct 2024
https://github.com/firmai/pandasvault
Advanced Pandas Vault — Utilities, Functions and Snippets (by @firmai).
data-science data-structures dataframe functions pandas python snippets table tips
Last synced: 03 Aug 2024
https://github.com/ClimbsRocks/machineJS
[UNMAINTAINED] Automated machine learning: just give it a data file! Check out the production-ready version of this project at ClimbsRocks/auto_ml
auto-ml automated-machine-learning automl data-science data-scientists javascript javascript-library kaggle machine-learning machine-learning-algorithms machine-learning-library ml numerai scikit-learn
Last synced: 07 Aug 2024
https://github.com/rebecca-vickery/data-science-learning-resources
A comprehensive list of free resources for learning data science
artificial-intelligence data data-science machine-learning python
Last synced: 02 Aug 2024
https://github.com/dcai-course/dcai-lab
Lab assignments for Introduction to Data-Centric AI, MIT IAP 2024 👩🏽‍💻
course data-centric-ai data-science deep-learning homework lab machine-learning
Last synced: 30 Oct 2024
https://github.com/vmware/versatile-data-kit
One framework to develop, deploy and operate data workflows with Python and SQL.
analytics data data-engineer data-engineering data-engineering-pipeline data-lineage data-pipelines data-science data-structures data-warehouse database dataops elt etl pipeline python snowflake sql trino warehouse
Last synced: 06 Nov 2024
https://github.com/climbsrocks/machinejs
[UNMAINTAINED] Automated machine learning: just give it a data file! Check out the production-ready version of this project at ClimbsRocks/auto_ml
auto-ml automated-machine-learning automl data-science data-scientists javascript javascript-library kaggle machine-learning machine-learning-algorithms machine-learning-library ml numerai scikit-learn
Last synced: 30 Oct 2024
https://github.com/DataScienceUB/introduction-datascience-python-book
Introduction to Data Science: A Python Approach to Concepts, Techniques and Applications
analytics data data-science datascience machine-learning python sentiment-analysis
Last synced: 07 Aug 2024
https://github.com/ankonzoid/artificio
Deep Learning Computer Vision Algorithms for Real-World Use
ai applications artificial-intelligence auto-encoders computer-vision convolutional-neural-networks data-science deep-learning image-classification image-finder image-processing image-recognition image-retrieval machine-learning neural-networks object-recognition python recommender-system recommender-systems transfer-learning
Last synced: 03 Aug 2024
https://github.com/Chicago/food-inspections-evaluation
This repository contains the code to generate predictions of critical violations at food establishments in Chicago. It also contains the results of an evaluation of the effectiveness of those predictions.
cdph chicago data-science food-poisoning open-data open-science public-health
Last synced: 30 Oct 2024
https://github.com/SforAiDl/genrl
A PyTorch reinforcement learning library for generalizable and reproducible algorithm implementations with an aim to improve accessibility in RL
algorithm-implementations benchmarking data-science deep-learning gym hacktoberfest machine-learning neural-network openai python pytorch reinforcement-learning reinforcement-learning-algorithms
Last synced: 02 Aug 2024
https://github.com/sforaidl/genrl
A PyTorch reinforcement learning library for generalizable and reproducible algorithm implementations with an aim to improve accessibility in RL
algorithm-implementations benchmarking data-science deep-learning gym hacktoberfest machine-learning neural-network openai python pytorch reinforcement-learning reinforcement-learning-algorithms
Last synced: 30 Oct 2024
https://github.com/jbn/zigzag
Python library for identifying the peaks and valleys of a time series.
data-science statistics technical-analysis
Last synced: 01 Nov 2024
https://github.com/platonai/PulsarRPA
Automate webpages at scale, scrape web data completely and accurately with high performance, distributed RPA.
crawler data-mining data-science rpa scraper scraping web-automation web-crawler web-mining web-scraping web-sql
Last synced: 05 Nov 2024
https://github.com/5agado/data-science-learning
Repository of code and resources related to different data science and machine learning topics. For learning, practice and teaching purposes.
data-science deep-learning jupyter-notebook learning-by-doing machine-learning statistics
Last synced: 08 Nov 2024
https://github.com/ledell/user-machine-learning-tutorial
useR! 2016 Tutorial: Machine Learning Algorithmic Deep Dive http://user2016.org/tutorials/10.html
data-science deep-learning ensemble-learning gradient-boosting-machine machine-learning r random-forest tutorial
Last synced: 26 Oct 2024
https://github.com/mfarragher/obsidiantools
Obsidian tools - a Python package for analysing an Obsidian.md vault
data-science knowledge-management network-analysis note-taking obsidian-community obsidian-md python
Last synced: 14 Oct 2024
https://github.com/rio-labs/rio
WebApps in pure Python. No JavaScript, HTML and CSS needed
data-analysis data-science data-visualization deep-learning machine-learning python ui webapp
Last synced: 06 Nov 2024
https://github.com/ledell/useR-machine-learning-tutorial
useR! 2016 Tutorial: Machine Learning Algorithmic Deep Dive http://user2016.org/tutorials/10.html
data-science deep-learning ensemble-learning gradient-boosting-machine machine-learning r random-forest tutorial
Last synced: 07 Aug 2024
https://github.com/capitalone/datacompy
Pandas and Spark DataFrame comparison for humans and more!
compare dask data data-science dataframes fugue numpy pandas polars pyspark python spark
Last synced: 11 Oct 2024
https://github.com/kunalj101/data-science-hacks
Data Science Hacks consists of tips and tricks to help you become a better data scientist. The hacks are for everyone, from beginner to advanced, and cover Python, Jupyter notebooks, pandas, and so on.
computer-vision data data-analysis data-science data-visualization dataset hacks image-augmentation ipynb machine-learning nlp nlp-machine-learning numpy pandas pandas-dataframe pandas-python pandas-tutorial python python3 tips-and-tricks
Last synced: 11 Oct 2024
https://github.com/youssefhosni/awesome-data-science-resoruces
A curated list of data science educational resources for essential data science skills
computer-science data-science deep-learning machine-learning statistics
Last synced: 07 Nov 2024
https://github.com/tobgu/qframe
Immutable data frame for Go
data-frame data-science dataframe go golang immutable
Last synced: 30 Oct 2024
https://github.com/airalcorn2/Michael-s-Data-Science-Curriculum
This is the companion curriculum to my guide to becoming a data scientist.
curriculum data-science machine-learning statistics
Last synced: 05 Aug 2024
https://github.com/airbnb/artificial-adversary
🗣️ Tool to generate adversarial text examples and test machine learning models against them
adversarial-examples black-box-attacks black-box-benchmarking classification data-mining data-science machine-learning metrics python python2 python3 spam spam-classification spam-detection spam-filtering text text-analysis text-classification text-mining text-processing
Last synced: 30 Oct 2024
https://github.com/EpistasisLab/scikit-rebate
A scikit-learn-compatible Python implementation of ReBATE, a suite of Relief-based feature selection algorithms for Machine Learning.
data-science feature-selection python
Last synced: 30 Oct 2024
https://github.com/adnanwahab/Simple-GPU
🦒 Functional WebGPU
bioinformatics data-science geospatial javascript regl robotics webgpu
Last synced: 03 Aug 2024
https://github.com/rohan-paul/machinelearning-deeplearning-code-for-my-youtube-channel
The full collection of code for my YouTube channel, organized by topic.
computer-vision data-science data-science-portfolio datascience deep-learning deep-neural-networks machine-learning machine-learning-algorithms math neural-network python pytorch pytorch-implementation pytorch-tutorial statistics tensorflow tensorflow-examples tensorflow-tutorials tensorflow2 youtube
Last synced: 10 Oct 2024
https://github.com/kunalj101/Data-Science-Hacks
Data Science Hacks consists of tips and tricks to help you become a better data scientist. The hacks are for everyone, from beginner to advanced, and cover Python, Jupyter notebooks, pandas, and so on.
computer-vision data data-analysis data-science data-visualization dataset hacks image-augmentation ipynb machine-learning nlp nlp-machine-learning numpy pandas pandas-dataframe pandas-python pandas-tutorial python python3 tips-and-tricks
Last synced: 02 Aug 2024
https://github.com/basedosdados/mais
⚙️ Maintenance code for the datalake (metadata and access packages) | 📖 Docs: https://basedosdados.github.io/mais/
bigquery dados-abertos data-science govtech hacktoberfest hacktoberfest2022 open-data python r sql transparencia
Last synced: 13 Oct 2024
https://github.com/aiguofer/gspread-pandas
A package to easily open an instance of a Google spreadsheet and interact with worksheets through Pandas DataFrames.
data data-analytics data-engineering data-science dataframes google google-sheets google-spreadsheets gspread pandas python sheets
Last synced: 30 Oct 2024
https://github.com/yzkang/My-Data-Competition-Experience
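A summary of my experience finishing in the Top 5 of multiple machine learning and big data competitions. Packed with practical tips; help yourself, no thanks needed.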
automl catboost data-science deep-learning feature-engineering feature-selection gan hyperparameter-optimization kaggle-competition lightgbm machine-learning model-fusion model-selection python sql tianchi-competition xgboost
Last synced: 02 Aug 2024
https://github.com/terrytangyuan/distributed-ml-patterns
Distributed Machine Learning Patterns from Manning Publications by Yuan Tang https://bit.ly/2RKv8Zo
argo argo-workflows book cloud-computing cloud-native data-science devops distributed-machine-learning distributed-systems kubeflow kubernetes large-scale-machine-learning machine-learning machine-learning-pipelines manning-publications mlops python tensorflow
Last synced: 29 Oct 2024
https://github.com/DagsHub/fds
Fast Data Science, AKA fds, is a CLI for Data Scientists to version control data and code at once, by conveniently wrapping git and dvc
Last synced: 03 Aug 2024
https://github.com/plotly/dashR
Create data science and AI web apps in R
dash data-science data-visualization plotly plotly-dash python r react web-application
Last synced: 27 Oct 2024
https://github.com/plotly/plotly_matlab
Plotly Graphing Library for MATLAB®
d3 d3js data-science data-visualization matlab plotly technical-computing webgl
Last synced: 31 Oct 2024
https://github.com/finos/jupyterlab_templates
Support for jupyter notebook templates in jupyterlab
data-science dataviz jupyter jupyterlab jupyterlab-extension machine-learning notebook
Last synced: 07 Nov 2024
https://github.com/InfuseAI/primehub
open-source MLOps platform
data-science distributed-systems docker jupyter jupyterhub keycloak kubernetes machine-learning primehub primehub-ce
Last synced: 09 Nov 2024
https://github.com/neptune-ai/open-solution-mapping-challenge
Open solution to the Mapping Challenge :earth_americas:
competition crowdai data-science data-science-learning deep-learning kaggle lightgbm machine-learning machine-learning-lab mapping-challenge neptune pipeline pipeline-framework python satellite-imagery unet unet-image-segmentation unet-pytorch
Last synced: 07 Nov 2024
https://github.com/liyangbit/PyDataLab
open source for wechat-official-account (ID: PyDataLab)
data-analysis data-mining data-science data-visualization machine-learning python wechat-official-account
Last synced: 10 Aug 2024
https://github.com/thoughtworks/mlops-platforms
Compare MLOps Platforms. Breakdowns of SageMaker, VertexAI, AzureML, Dataiku, Databricks, h2o, kubeflow, mlflow...
azureml data-science databricks dataiku datarobot google-ai-platform h2oai iguazio knime kubeflow machine-learning mlflow mlops pachyderm sagemaker seldon
Last synced: 02 Aug 2024
https://github.com/operatorai/modelstore
🏬 modelstore is a Python library that allows you to version, export, and save a machine learning model to your filesystem or a cloud storage provider.
data-science keras machine-learning mlops modelstore python-library pytorch s3-storage scikit-learn tensorflow transformer
Last synced: 13 Oct 2024
https://github.com/aunum/goro
A High-level Machine Learning Library for Go
data-science go golang machine-learning machinelearning
Last synced: 28 Oct 2024
https://github.com/adicherlavenkatasai/ml-workspace
Machine Learning (Beginners Hub): information (courses, books, cheat sheets, live sessions) related to machine learning, data science, and Python.
cheat-sheets convolutional-networks data-science deep-learning deep-neural-networks gans harvard-edx interview-questions machine-learning python
Last synced: 31 Oct 2024
https://github.com/solegalli/feature-engineering-for-machine-learning
Code repository for the online course Feature Engineering for Machine Learning
data-science feature-engineering feature-extraction machine-learning python
Last synced: 30 Oct 2024
https://github.com/aaronpenne/data_visualization
A collection of my data visualizations, mostly in Python.
data-science data-visualization python3 visualization
Last synced: 25 Oct 2024
https://github.com/xoolive/traffic
A toolbox for processing and analysing air traffic data
adsb air-traffic-data data-analytics data-science data-visualisation declarative-pipeline mode-s trajectory
Last synced: 30 Oct 2024
https://github.com/jkrumbiegel/chain.jl
A Julia package for piping a value through a series of transformation expressions using a more convenient syntax than Julia's native piping functionality.
data-analysis data-science julia julia-language julia-package macro pipeline
Last synced: 30 Oct 2024
https://github.com/maxhalford/xam
:dart: Personal data science and machine learning toolbox
data-science machine-learning preprocessing python stacking
Last synced: 31 Oct 2024
https://github.com/MaxHalford/xam
:dart: Personal data science and machine learning toolbox
data-science machine-learning preprocessing python stacking
Last synced: 03 Aug 2024
https://github.com/matrix-profile-foundation/matrixprofile
A Python 3 library making time series data mining tasks, utilizing matrix profile algorithms, accessible to everyone.
algorithms anomaly-detection clustering data-mining data-science hacktoberfest matrixprofile motif-discovery python python2 python3 segmentation time-series time-series-analysis
Last synced: 12 Oct 2024
https://github.com/predict-idlab/tsflex
Flexible time series feature extraction & processing
data-science feature-engineering feature-extraction multimodal multivariate pandas processing python time-series window-stride
Last synced: 02 Nov 2024
https://github.com/wilsonrljr/sysidentpy
A Python Package For System Identification Using NARMAX Models
data-science dynamical-systems machine-learning narmax narx system-identification time-series
Last synced: 02 Aug 2024
https://github.com/jkrumbiegel/Chain.jl
A Julia package for piping a value through a series of transformation expressions using a more convenient syntax than Julia's native piping functionality.
data-analysis data-science julia julia-language julia-package macro pipeline
Last synced: 04 Aug 2024
https://github.com/triestpa/cryptocurrency-analysis-python
Open-Source Tutorial For Analyzing and Visualizing Cryptocurrency Data
bitcoin cryptocurrency data-analysis data-science data-visualization ethereum jupyter-notebook plotly python tutorial
Last synced: 08 Nov 2024
https://github.com/aeturrell/skimpy
skimpy is a lightweight tool that provides summary statistics about variables in data frames within the console.
data-science eda exploratory-data-analysis pandas statistics summary-statistics
Last synced: 03 Aug 2024
https://github.com/BlackHC/toma
Helps you write algorithms in PyTorch that adapt to the available (CUDA) memory
data-science gpu machine-learning python pytorch
Last synced: 03 Aug 2024