Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Data Science
Data science is an inter-disciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge from structured and unstructured data. Data scientists perform data analysis and preparation, and their findings inform high-level decisions in many organizations.
- GitHub: https://github.com/topics/data-science
- Wikipedia: https://en.wikipedia.org/wiki/Data_science
- Related Topics: data-analysis, data-mining, machine-learning, big-data, data-visualization,
- Aliases: datasciences, data-science-project, data-science-algorithm,
- Last updated: 2024-07-29 13:36:33 UTC
- JSON Representation
https://github.com/mlr-org/mlr
Machine Learning in R
classification clustering cran data-science feature-selection hyperparameters-optimization imbalance-correction learners machine-learning mlr multilabel-classification predictive-modeling r r-package regression stacking statistics survival-analysis tuning tutorial
Last synced: 02 Aug 2024
https://github.com/unslothai/hyperlearn
2-2000x faster ML algos, 50% less memory usage, works on all hardware - new and old.
data-analysis data-science deep-learning econometrics gpu machine-learning neural-network optimization python pytorch regression-models research scikit-learn statistics statsmodels tensor
Last synced: 30 Jul 2024
https://github.com/tidyverse/tidyverse
Easily install and load packages from the tidyverse
Last synced: 31 Jul 2024
https://github.com/jadianes/spark-py-notebooks
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
big-data bigdata data-analysis data-science ipython ipython-notebook machine-learning mllib notebook pyspark python spark
Last synced: 07 Aug 2024
https://github.com/joaomilho/Enterprise
🦄 The Enterprise™ programming language
ajax artificial-intelligence cloud crypto data-science disruptive-technology docker enterprise enterprise-development enterprise-services enterprise-software growth jvm kubernetes language money progressive-web-app quantum redux
Last synced: 30 Jul 2024
https://github.com/keras-team/keras-contrib
Keras community contributions
data-science deep-learning keras machine-learning neural-networks tensorflow theano
Last synced: 31 Jul 2024
https://github.com/supabase/supabase-py
Python Client for Supabase. Query Postgres from Flask, Django, FastAPI. Python user authentication, security policies, edge functions, file storage, and realtime data streaming. Good first issue.
auth authentication authorization community data-science databases django fastapi flask good-first-issue machine-learning postgres postgresql python supabase
Last synced: 30 Jul 2024
https://github.com/kantord/just-dashboard
:bar_chart: :clipboard: Dashboards using YAML or JSON files
big-data business-intelligence chart csv d3 d3js dashboard data data-driven data-engineering data-science data-visualization gist github-gist json just-dashboard yaml
Last synced: 30 Jul 2024
https://github.com/alinebastos/dev-practice
Practice your skills with these ideas.
back-end backend challenge css css3 data-science development front-end front-end-development frontend frontend-practice frontend-skills game git hackathons hacktoberfest javascript practice vim
Last synced: 01 Aug 2024
https://github.com/nubank/fklearn
fklearn: Functional Machine Learning
data-analysis data-science machine-learning ml python
Last synced: 01 Aug 2024
https://github.com/enzoampil/fastquant
fastquant — Backtest and optimize your ML trading strategies with only 3 lines of code!
algotrading backtesting cryptocurrency data-science financial-data-science machine-learning quantitative-finance stocks trading-strategies
Last synced: 31 Jul 2024
https://github.com/AxeldeRomblay/MLBox
MLBox is a powerful Automated Machine Learning python library.
auto-ml automated-machine-learning automl classification data-science deep-learning distributed drift encoding kaggle keras lightgbm machine-learning optimization pipeline prediction preprocessing regression stacking xgboost
Last synced: 02 Aug 2024
https://github.com/h2oai/h2o-tutorials
Tutorials and training material for the H2O Machine Learning Platform
data-science deep-learning h2o machine-learning python r tutorial
Last synced: 30 Jul 2024
https://github.com/pretzelai/pretzelai
The modern replacement for Jupyter Notebooks
analytics artificial-intelligence business-intelligence businessintelligence dashboard data data-analysis data-analytics data-science data-visualization duckdb notebooks open-source prql reporting sql sql-editor sql-editor-online visualization wasm
Last synced: 31 Jul 2024
https://github.com/hi-primus/optimus
:truck: Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
big-data-cleaning bigdata cudf dask dask-cudf data-analysis data-cleaner data-cleaning data-cleansing data-exploration data-extraction data-preparation data-profiling data-science data-transformation data-wrangling machine-learning pyspark spark
Last synced: 31 Jul 2024
https://github.com/ironmussa/Optimus
:truck: Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
big-data-cleaning bigdata cudf dask dask-cudf data-analysis data-cleaner data-cleaning data-cleansing data-exploration data-extraction data-preparation data-profiling data-science data-transformation data-wrangling machine-learning pyspark spark
Last synced: 30 Jul 2024
https://github.com/CamDavidsonPilon/lifetimes
Lifetime value in Python
data-science python statistics
Last synced: 02 Aug 2024
https://github.com/sepandhaghighi/pycm
Multi-class confusion matrix library in Python
accuracy ai artificial-intelligence classification confusion-matrix data data-analysis data-mining data-science deep-learning deeplearning evaluation machine-learning mathematics matrix ml multiclass-classification neural-network statistical-analysis statistics
Last synced: 02 Aug 2024
https://github.com/MLReef/mlreef
The collaboration workspace for Machine Learning
artificial-intelligence data-science deep-learning deeplearning machine-learning machine-learning-algorithms mlops mlops-environment models mxnet pytorch reproducibility tensorflow
Last synced: 31 Jul 2024
https://github.com/DLTK/DLTK
Deep Learning Toolkit for Medical Image Analysis
cnn data-science deep-learning deep-neural-networks dltk dltk-model-zoo machine-learning medical medical-image-processing medical-imaging ml neural-network neural-networks neuroimaging python tensorflow
Last synced: 02 Aug 2024
https://github.com/demidovakatya/vvedenie-mashinnoe-obuchenie
:memo: Подборка ресурсов по машинному обучению
collections data-mining data-science deep-learning machine-learning mooc neural-networks nlp russian university
Last synced: 07 Aug 2024
https://github.com/eBay/tsv-utils
eBay's TSV Utilities: Command line tools for large, tabular data files. Filtering, statistics, sampling, joins and more.
cli command-line csv d data-mining data-science delimited-files dlang reservoir-sampling sampling shuffle statistics tabular-data tsv uniq
Last synced: 01 Aug 2024
https://github.com/capitalone/DataProfiler
What's in your data? Extract schema, statistics and entities from datasets
avro csv data-analysis data-labels data-science dataprofiling dataset gdpr graph-data machine-learning network-data nlp npi pandas pii privacy python security sensitive-data tabular-data
Last synced: 01 Aug 2024
https://github.com/capitalone/dataprofiler
What's in your data? Extract schema, statistics and entities from datasets
avro csv data-analysis data-labels data-science dataprofiling dataset gdpr graph-data machine-learning network-data nlp npi pandas pii privacy python security sensitive-data tabular-data
Last synced: 29 Jul 2024
https://github.com/code-kern-ai/refinery
The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact.
active-learning annotations artificial-intelligence data-centric-ai data-labeling data-science deep-learning human-in-the-loop labeling labeling-tool machine-learning natural-language-processing neural-search nlp python spacy supervised-learning text-annotation text-classification transformers
Last synced: 31 Jul 2024
https://github.com/khuyentran1401/Efficient_Python_tricks_and_tools_for_data_scientists
Efficient Python Tricks and Tools for Data Scientists
Last synced: 02 Aug 2024
https://github.com/google/uncertainty-baselines
High-quality implementations of standard and SOTA methods on a variety of tasks.
bayesian-methods data-science deep-learning machine-learning neural-networks probabilistic-programming statistics tensorflow
Last synced: 01 Aug 2024
https://modeloriented.github.io/DALEX/
moDel Agnostic Language for Exploration and eXplanation
black-box dalex data-science explainable-ai explainable-artificial-intelligence explainable-ml explanations explanatory-model-analysis fairness iml interpretability interpretable-machine-learning machine-learning model-visualization predictive-modeling responsible-ai responsible-ml xai
Last synced: 04 Aug 2024
https://github.com/ModelOriented/DALEX
moDel Agnostic Language for Exploration and eXplanation
black-box dalex data-science explainable-ai explainable-artificial-intelligence explainable-ml explanations explanatory-model-analysis fairness iml interpretability interpretable-machine-learning machine-learning model-visualization predictive-modeling responsible-ai responsible-ml xai
Last synced: 30 Jul 2024
https://github.com/sfirke/janitor
simple tools for data cleaning in R
data-analysis data-cleaning data-science dirty-data excel pivot-tables r spss tabulations tidyverse
Last synced: 30 Jul 2024
https://github.com/ebhy/budgetml
Deploy a ML inference service on a budget in less than 10 lines of code.
api data-science deployment fastapi inference machine-learning mlops
Last synced: 01 Aug 2024
https://github.com/csinva/imodels
Interpretable ML package 🔍 for concise, transparent, and accurate predictive modeling (sklearn-compatible).
ai artificial-intelligence bayesian-rule-list data-science explainable-ai explainable-ml imodels interpretability machine-learning ml optimal-classification-tree python rule-learning rulefit rules scikit-learn statistics supervised-learning
Last synced: 31 Jul 2024
https://github.com/safe-graph/graph-fraud-detection-papers
A curated list of graph-based fraud, anomaly, and outlier detection papers & resources
academic-publications anomaly-detection awsome-list data-mining data-science dataset deep-learning fraud-detection graph-algorithms graph-convolutional-networks graph-neural-networks machine-learning outlier-detection papers security spam-detection survey
Last synced: 02 Aug 2024
https://github.com/modelscope/data-juicer
A one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大语言模型提供更高质量、更丰富、更易”消化“的数据!
chinese data-analysis data-science data-visualization dataset gpt gpt-4 instruction-tuning large-language-models llama llava llm llms multi-modal nlp opendata pre-training pytorch sora streamlit
Last synced: 01 Aug 2024
https://github.com/mlrun/mlrun
MLRun is an open source MLOps platform for quickly building and managing continuous ML applications across their lifecycle. MLRun integrates into your development and CI/CD environment and automates the delivery of production data, ML pipelines, and online applications.
data-engineering data-science experiment-tracking kubernetes machine-learning mlops mlops-workflow model-serving python workflow
Last synced: 01 Aug 2024
https://github.com/ahmetozlu/tensorflow_object_counting_api
🚀 The TensorFlow Object Counting API is an open source framework built on top of TensorFlow and Keras that makes it easy to develop object counting systems!
computer-vision data-science deep-learning deep-neural-networks image-processing machine-learning object-counting object-counting-api object-detection object-detection-api object-detection-label object-detection-pipelines opencv pedestrian-counting shelf-management shelf-navigation tensorflow tensorflow-api tensorflow-object-detection-api vehicle-counting
Last synced: 01 Aug 2024
https://github.com/PatMartin/Dex
Dex : The Data Explorer -- A data visualization tool written in Java/Groovy/JavaFX capable of powerful ETL and publishing web visualizations.
d3 d3js data-analysis data-mining data-science data-visualization datavis datavisualization dataviz groovy java javafx visualization
Last synced: 02 Aug 2024
https://github.com/dagworks-inc/hamilton
Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage and metadata. Runs and scales everywhere python does.
dag data-analysis data-engineering data-science dataframe etl etl-framework etl-pipeline feature-engineering featurization hacktoberfest lineage llmops machine-learning mlops numpy orchestration pandas python software-engineering
Last synced: 01 Aug 2024
https://github.com/DAGWorks-Inc/hamilton
Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage and metadata. Runs and scales everywhere python does.
dag data-analysis data-engineering data-science dataframe etl etl-framework etl-pipeline feature-engineering featurization hacktoberfest lineage llmops machine-learning mlops numpy orchestration pandas python software-engineering
Last synced: 31 Jul 2024
https://github.com/GoogleCloudPlatform/data-science-on-gcp
Source code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017
cloud-computing data-analysis data-engineering data-pipeline data-processing data-science data-visualization machine-learning
Last synced: 07 Aug 2024
https://github.com/nok/sklearn-porter
Transpile trained scikit-learn estimators to C, Java, JavaScript and others.
data-science machine-learning scikit-learn sklearn
Last synced: 31 Jul 2024
https://github.com/reiinakano/xcessiv
A web-based application for quick, scalable, and automated hyperparameter tuning and stacked ensembling in Python.
automated-machine-learning data-science ensemble-learning hyperparameter-optimization machine-learning scikit-learn stacked-ensembles
Last synced: 03 Aug 2024
https://github.com/kotartemiy/pygooglenews
If Google News had a Python library
data-science google news python rss
Last synced: 31 Jul 2024
https://github.com/gboeing/ppde642
USC urban data science course series with Python and Jupyter
cities city-government coding course course-materials data-science jupyter network-analysis python spatial-analysis statistics syllabus transport transportation urban-analytics urban-data-science urban-informatics urban-planning urbanism usc
Last synced: 31 Jul 2024
https://github.com/bytewax/bytewax
Python Stream Processing
data-engineering data-processing data-science dataflow machine-learning python rust stream-processing streaming-data
Last synced: 02 Aug 2024
https://github.com/MiteshPuthran/Speech-Emotion-Analyzer
The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)
audio-files data-science deep-learning deep-neural-networks emotion emotion-recognition keras natural-language-processing natural-language-understanding neural-network python3 speech speech-emotion-recognition speech-recognition voice
Last synced: 31 Jul 2024
https://github.com/microsoft/responsible-ai-toolbox
Responsible AI Toolbox is a suite of tools providing model and data exploration and assessment user interfaces and libraries that enable a better understanding of AI systems. These interfaces and libraries empower developers and stakeholders of AI systems to develop and monitor AI more responsibly, and take better data-driven actions.
data-analysis data-science data-visualization error-analysis explainability explainable-ai explainable-ml fairness fairness-ai fairness-ml interpretability jupyter machine-learning machinelearning ml responsible-ai ui visualization widget widgets
Last synced: 01 Aug 2024
https://github.com/alan-turing-institute/CleverCSV
CleverCSV is a Python package for handling messy CSV files. It provides a drop-in replacement for the builtin CSV module with improved dialect detection, and comes with a handy command line application for working with CSV files.
csv csv-converter csv-export csv-files csv-format csv-import csv-parser csv-parsing csv-reader csv-reading data-analysis data-mining data-science datascience machine-learning python python-library python3
Last synced: 31 Jul 2024
https://github.com/mandiant/ThreatPursuit-VM
Threat Pursuit Virtual Machine (VM): A fully customizable, open-sourced Windows-based distribution focused on threat intelligence analysis and hunting designed for intel and malware analysts as well as threat hunters to get up and running quickly.
analytics cyber data-science fireeye intelligence intelligence-analysis malware mandiant threat threathunting threatintelligence virtual-machine
Last synced: 04 Aug 2024
https://github.com/jrfiedler/causal_inference_python_code
Python code for part 2 of the book Causal Inference: What If, by Miguel Hernán and James Robins
causal-inference causality data-science python
Last synced: 31 Jul 2024
https://github.com/business-science/free_r_tips
Free R-Tips is a FREE Newsletter provided by Business Science. It comes with bite-sized code tutorials every week.
data-science newsletter tips tips-and-tricks
Last synced: 31 Jul 2024
https://github.com/devAmoghS/Machine-Learning-with-Python
Small scale machine learning projects to understand the core concepts . Give a Star 🌟If it helps you. BONUS: Interview Bank coming up..!
beginner-friendly data-science deep-learning exercises machine-learning practice-project python python-3 scikit-learn
Last synced: 07 Aug 2024
https://github.com/scikit-learn-contrib/MAPIE
A scikit-learn-compatible module for estimating prediction intervals.
classification confidence-intervals conformal-prediction data-science python regression sklearn
Last synced: 02 Aug 2024
https://github.com/annoviko/pyclustering
pyclustering is a Python, C++ data mining library.
algorithms c-plus-plus clustering data-mining data-science machine-learning neural-networks oscillatory-networks python python3
Last synced: 30 Jul 2024
https://github.com/rocketlaunchr/dataframe-go
DataFrames for Go: For statistics, machine-learning, and data manipulation/exploration
data-science dataframe dataframes go golang machine-learning pandas pandas-dataframe python statistics
Last synced: 30 Jul 2024
https://github.com/crazyhottommy/getting-started-with-genomics-tools-and-resources
Unix, R and python tools for genomics and data science
bioinformatics cancer-genomics data-science
Last synced: 02 Aug 2024
https://github.com/DeepWisdom/AutoDL
Automated Deep Learning without ANY human intervention. 1'st Solution for AutoDL challenge@NeurIPS.
ai artificial-intelligence autodl autodl-challenge automated-machine-learning automl big-data data-science deeplearning feature-engineering full-automl lightgbm machine-learning model-selection multi-label nas python pytorch resnet tensorflow
Last synced: 03 Aug 2024
https://github.com/man-group/ArcticDB
ArcticDB is a high performance, serverless DataFrame database built for the Python Data Science ecosystem.
big-data data data-analysis data-science database dataframe pandas quantitative-analysis quantitative-finance quantitative-trading
Last synced: 30 Jul 2024
https://github.com/man-group/arcticdb
ArcticDB is a high performance, serverless DataFrame database built for the Python Data Science ecosystem.
big-data data data-analysis data-science database dataframe pandas quantitative-analysis quantitative-finance quantitative-trading
Last synced: 31 Jul 2024
https://github.com/opendatadiscovery/odd-platform
First open-source data discovery and observability platform. We make a life for data practitioners easy so you can focus on your business.
alerting bigdata data-catalog data-discovery data-engineering data-exploration data-governance data-lineage data-observability data-pipelines data-platform data-profiling data-quality data-science datacatalog lineage metadata metadata-management observability oss
Last synced: 31 Jul 2024
https://github.com/kyleskom/NBA-Machine-Learning-Sports-Betting
NBA sports betting using machine learning
ai data-science deep-learning gambling keras machine-learning nba nba-analytics nba-prediction neural-network python sports sports-analytics sports-betting sports-data tensorflow
Last synced: 31 Jul 2024
https://github.com/qri-io/qri
you're invited to a data party!
data-science dataset golang hacktoberfest hacktoberfest2021 ipfs opendata p2p qri service trust web3
Last synced: 31 Jul 2024
https://github.com/deepfence/FlowMeter
⭐ ⭐ Use ML to classify flows and packets as benign or malicious. ⭐ ⭐
awesome data-science data-science-projects forensics-tools hacktoberfest infosectools machine-learning machine-learning-projects machinelearning machinelearningproject network-analysis network-security packet-analyser pcap security security-tools tcpdump-like
Last synced: 01 Aug 2024
https://github.com/Shujian2015/FreeML
A List of Data Science/Machine Learning Resources (Mostly Free)
data-science deep-learning machine-learning natural-language-processing
Last synced: 02 Aug 2024
https://github.com/sb-ai-lab/LightAutoML
Fast and customizable framework for automatic ML model creation (AutoML)
automated-machine-learning automatic-machine-learning automl automl-algorithms binary-classification data-science kaggle lama machine-learning multiclass-classification nlp python regression
Last synced: 03 Aug 2024
https://github.com/JuliaStats/Distributions.jl
A Julia package for probability distributions and associated functions.
data-science julia probability-distributions statistics
Last synced: 03 Aug 2024
https://github.com/moj-analytical-services/splink
Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends
data-matching data-science deduplicate-data deduplication duckdb em-algorithm entity-resolution fuzzy-matching record-linkage spark uk-gov-data-science
Last synced: 31 Jul 2024
https://github.com/novak-99/MLPP
A library created to revitalize C++ as a machine learning front end. Per aspera ad astra.
cpp data-science deep-learning machine-learning
Last synced: 31 Jul 2024
https://github.com/makcedward/nlp
:memo: This repository recorded my NLP journey.
ai data-science deep-learning machine-learning nlp
Last synced: 30 Jul 2024
https://github.com/logicalclocks/hopsworks
Hopsworks - Data-Intensive AI platform with a Feature Store
aws azure data-science feature-engineering feature-management feature-store gcp governance hopsworks kserve machine-learning ml mlops model-serving pyspark python serverless
Last synced: 31 Jul 2024
https://github.com/xorbitsai/xorbits
Scalable Python DS & ML, in an API compatible & lightning fast way.
data-science distributed-systems lightgbm machine-learning ml numpy pandas python scalable xgboost
Last synced: 31 Jul 2024
https://github.com/skrub-data/skrub
Prepping tables for machine learning
data data-analysis data-cleaning data-preparation data-preprocessing data-science data-wrangling dirty-data machine-learning
Last synced: 31 Jul 2024
https://github.com/ScottfreeLLC/AlphaPy
Python AutoML for Trading Systems and Sports Betting
backtesting classification cryptocurrency data-science deep-learning iex keras machine-learning pandas portfolio predictive-analytics python regression scikit-learn sports stocks time-series-analysis trading trading-platform trading-strategies
Last synced: 02 Aug 2024
https://github.com/areed1192/sigma_coding_youtube
This is a collection of all the code that can be found on my YouTube channel Sigma Coding.
data-science google-maps-api m-language mlanguage office-applications outlook-vba power-bi power-query powerpoint-vba python python-tutorials python-windows vba vba-excel win32 win32com word-vba yelp-fusion-api
Last synced: 02 Aug 2024
https://github.com/rhiever/datacleaner
A Python tool that automatically cleans data sets and readies them for analysis.
automation data-science machine-learning python
Last synced: 30 Jul 2024
https://github.com/okfn-brasil/querido-diario
📰 Diários oficiais brasileiros acessíveis a todos | 📰 Brazilian government gazettes, accessible to everyone.
artificial-intelligence civic-tech data-science governments-gazettes govtech hacktoberfest hacktoberfest2023 machine-learning open-data politics scraping spider
Last synced: 30 Jul 2024
https://github.com/nfstream/nfstream
NFStream: a Flexible Network Data Analysis Framework.
artificial-intelligence cybersecurity data-analysis data-mining data-science dataset-generation deep-packet-inspection machine-learning ndpi netflow network-analysis network-monitoring network-security packet-analyser packet-capture pcap python traffic-analysis traffic-classification
Last synced: 01 Aug 2024
https://github.com/mrkn/pycall.rb
Calling Python functions from the Ruby language
data-science pycall python ruby rubydatascience rubyml
Last synced: 31 Jul 2024
https://github.com/pixiedust/pixiedust
Python Helper library for Jupyter Notebooks
data-science jupyter-notebook pixiedust python python-notebook scala-notebooks spark visualization
Last synced: 31 Jul 2024
https://github.com/squaredtechnologies/vizly-notebook
AI-powered Jupyter Notebook — use local AI to generate and edit code cells, automatically fix errors, and chat with your data
ai analysis analytics data-science jupyter jupyter-notebook jupyter-notebooks jupyterhub jupyterlab ollama python react reactjs
Last synced: 12 Aug 2024
https://github.com/TeoMeWhy/teomerefs
Guia de referências técnicas para carreira em dados
data data-science machine-learning python
Last synced: 31 Jul 2024
https://github.com/squaredtechnologies/thread
AI-powered Jupyter Notebook — use local AI to generate and edit code cells, automatically fix errors, and chat with your data
ai analysis analytics data-science jupyter jupyter-notebook jupyter-notebooks jupyterhub jupyterlab ollama python react reactjs
Last synced: 01 Aug 2024
https://github.com/daochenzha/data-centric-AI
A curated, but incomplete, list of data-centric AI resources.
ai artificial-intelligence data-centric data-centric-ai data-centric-machine-learning data-curation data-engineering data-quality data-science machine-learning
Last synced: 31 Jul 2024
https://github.com/sintel-dev/Orion
A machine learning library for detecting anomalies in signals.
anomaly-detection benchmarking data-science deep-learning generative-adversarial-network machine-learning orion signals time-series unsupervised-learning
Last synced: 31 Jul 2024
https://github.com/predict-idlab/plotly-resampler
Visualize large time series data with plotly.py
data-analysis data-science data-visualization plotly plotly-dash python time-series visualization
Last synced: 01 Aug 2024
https://github.com/dssg/hitchhikers-guide
The Hitchhiker's Guide to Data Science for Social Good
data-science dssg machine-learning training tutorial-exercises
Last synced: 02 Aug 2024
https://github.com/elixir-explorer/explorer
Series (one-dimensional) and dataframes (two-dimensional) for fast and elegant data exploration in Elixir
data-science dataframes elixir rust
Last synced: 01 Aug 2024
https://github.com/hystax/optscale
FinOps and MLOps platform to run ML/AI and regular cloud workloads with optimal performance and cost.
aws azure cloud cloud-cost cloud-cost-intelligence cost-optimization data-science databricks devops experiment-tracking finops gcp kubernetes ml mlflow mlops paas-instrumentation paas-profiling s3-optimization
Last synced: 01 Aug 2024
https://github.com/Mybridge/machine-learning-open-source
Monthly Series - Machine Learning Top 10 Open Source Projects
ai algorithm artificial-intelligence data-science machine-learning neural-network
Last synced: 31 Jul 2024
https://github.com/towardsai/tutorials
AI-related tutorials. Access any of them for free → https://towardsai.net/editorial
collaborative-filtering data-science deep-learning google-colab linear-algebra machine-learning math mathematics monte-carlo-simulation neural-networks nlp programming python python-tutorial recommendation-system sentiment-analysis tutorial
Last synced: 02 Aug 2024
https://github.com/cleanlab/cleanvision
Automatically find issues in image datasets and practice data-centric computer vision.
computer-vision data-centric-ai data-exploration data-profiling data-quality data-science data-validation deep-learning exploratory-data-analysis image-analysis image-classification image-generation image-quality image-segmentation
Last synced: 30 Jul 2024
https://github.com/sematic-ai/sematic
An open-source ML pipeline development platform
ai data-science machine-learning ml ml-ops ml-pipeline ml-pipelines mlops pipeline python python3
Last synced: 03 Aug 2024
https://github.com/maxpumperla/deep_learning_and_the_game_of_go
Code and other material for the book "Deep Learning and the Game of Go"
alphago alphago-zero data-science deep-learning game-of-go games machine-learning neural-networks python
Last synced: 01 Aug 2024
https://github.com/grailbio/reflow
A language and runtime for distributed, incremental data processing in the cloud
analysis-pipeline aws bioinformatics-pipeline cloud-computing data-science golang language runtime scientific-computing
Last synced: 31 Jul 2024
https://github.com/caserec/Datasets-for-Recommender-Systems
This is a repository of a topic-centric public data sources in high quality for Recommender Systems (RS)
data-science database datasets public-data recommender-systems
Last synced: 08 Aug 2024
https://github.com/ipython-books/cookbook-2nd
IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018
computing data-analysis data-mining data-science data-visualization ipython jupyter jupyter-notebook machine-learning numerical-computation python visualization
Last synced: 03 Aug 2024
https://github.com/iamaziz/PyDataset
Instant access to many datasets in Python.
Last synced: 07 Aug 2024
https://github.com/davidadsp/generative_deep_learning_2nd_edition
The official code repository for the second edition of the O'Reilly book Generative Deep Learning: Teaching Machines to Paint, Write, Compose and Play.
chatgpt dalle2 data-science deep-learning diffusion-models generative-adversarial-network gpt-3 machine-learning python stable-diffusion tensorflow
Last synced: 02 Aug 2024
https://github.com/LongOnly/Quantitative-Notebooks
Educational notebooks on quantitative finance, algorithmic trading, financial modelling and investment strategy
algorithmic-trading algotrading asset-allocation asset-management asset-pricing data-analysis data-science financial-analysis jupyter machine-learning notebook pairs-trading python quantitative-finance quantitative-trading stock-trading trading-algorithms trading-strategies
Last synced: 01 Aug 2024