Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/shahinrostami/chord
Engaging visualisations, made easy.
data-science data-visualization plotting python visualization
Last synced: 11 Jul 2024
![](https://github.com/shahinrostami.png)
https://github.com/pachyderm/pachyderm
Data-Centric Pipelines and Data Versioning
analytics big-data containers data-analysis data-science distributed-systems docker go kubernetes pachyderm
Last synced: 11 Jul 2024
![](https://github.com/pachyderm.png)
https://github.com/nelson-lang/nelson
The Nelson Programming Language
cpp17 data-science data-structures interpreter mathematical-functions matlab matrix-functions nelson octave programming-language scientific-computing scilab
Last synced: 11 Jul 2024
![](https://github.com/nelson-lang.png)
https://github.com/ploomber/sklearn-evaluation
Machine learning model evaluation made easy: plots, tables, HTML reports, experiment tracking and Jupyter notebook analysis.
data-science deep-learning jupyter-notebook machine-learning pytorch scikit-learn sklearn tensorflow
Last synced: 11 Jul 2024
![](https://github.com/ploomber.png)
https://github.com/jamesqo/gun-violence-data
A comprehensive, accessible database that contains records of over 260k US gun violence incidents from January 2013 to March 2018.
data-science gun-violence-archive machine-learning statistics
Last synced: 11 Jul 2024
![](https://github.com/jamesqo.png)
https://github.com/Ibotta/sk-dist
Distributed scikit-learn meta-estimators in PySpark
data-science machine-learning ml scikit-learn spark
Last synced: 11 Jul 2024
![](https://github.com/Ibotta.png)
https://github.com/prathimacode-hub/Awesome_Python_Scripts
🚀 Curated collection of Awesome Python Scripts which will make you go wow. Dive into this world of 360+ scripts. Feel free to contribute. Show your support by starring ✨ this repository.
algorithms algorithms-datastructures beginner-friendly contributions contributions-welcome data-science data-structures education hacktoberfest hacktoberfest2022 learn open-source practice project python python-script python-scripts python3 search
Last synced: 11 Jul 2024
![](https://github.com/prathimacode-hub.png)
https://github.com/nipy/nipype
Workflows and interfaces for neuroimaging packages
big-data brain-imaging brainweb data-science dataflow dataflow-programming neuroimaging python workflow-engine
Last synced: 11 Jul 2024
![](https://github.com/nipy.png)
https://github.com/deepgraph/deepgraph
Analyze Data with Pandas-based Networks. Documentation:
data-analysis data-mining data-science data-structures data-visualization graph-database graph-theory graphs graphviz interfacing iterative-methods multilayer-networks network network-analysis network-visualization networkx pandas parallel partitioning
Last synced: 10 Jul 2024
![](https://github.com/deepgraph.png)
https://github.com/jasmcaus/caer
High-performance Vision library in Python. Scale your research, not boilerplate.
ai artificial-intelligence augmentation caer computer-vision cuda data-science deep-learning gpu image-classification image-processing image-segmentation machine-learning neural-network opencv python segmentation type-checking video-processing vision
Last synced: 10 Jul 2024
![](https://github.com/jasmcaus.png)
https://github.com/krassowski/jupyter-helpers
A collection of helpers for Jupyter/IPython
data-science jupyter jupyter-lab jupyter-notebook jupyter-widget jupyterlab jupyterlab-extension
Last synced: 10 Jul 2024
![](https://github.com/krassowski.png)
https://github.com/mmkim1210/GeneticsMakie.jl
🧬High-performance genetics- and genomics-related data visualization using Makie.jl
bioinformatics cairomakie colocalization data-science fine-mapping genetics genomics gwas julia julia-language linkage locuszoom makie multi-ethnic multivariate openmendel phewas qtl v2f visualization
Last synced: 10 Jul 2024
![](https://github.com/mmkim1210.png)
https://github.com/plantinformatics/pretzel
Javascript full-stack framework for Big Data visualisation and analysis
big-data bioinformatics data-science data-visualization ember emberjs express expressjs javascript open-source
Last synced: 10 Jul 2024
![](https://github.com/plantinformatics.png)
https://github.com/nicolaskruchten/jupyter_pivottablejs
Drag’n’drop Pivot Tables and Charts for Jupyter/IPython Notebook, care of PivotTable.js
data-analysis data-science interactive jupyter-notebook pivot-chart pivot-tables
Last synced: 10 Jul 2024
![](https://github.com/nicolaskruchten.png)
https://github.com/ml-tooling/ml-hub
🧰 Multi-user development platform for machine learning teams. Simple to set up within minutes.
data-science docker jupyter jupyterhub machine-learning python
Last synced: 10 Jul 2024
![](https://github.com/ml-tooling.png)
https://github.com/Kotlin/kandy
Kotlin plotting library.
data-science graphics jupyter-notebooks kotlin plot
Last synced: 10 Jul 2024
![](https://github.com/Kotlin.png)
https://github.com/LearnDataSci/articles
A repository for the source code, notebooks, data, files, and other assets used in the data science and machine learning articles on LearnDataSci
data-analysis data-science data-visualization machine-learning machine-learning-algorithms machinelearning python
Last synced: 10 Jul 2024
![](https://github.com/LearnDataSci.png)
https://github.com/coalio/Assistant
A data science library providing flexible dataframes for Lua 5.1+
data-analysis data-science data-structures dataframe lua
Last synced: 10 Jul 2024
![](https://github.com/coalio.png)
https://github.com/Kotlin/dataframe
Structured data processing in Kotlin
data-analysis data-science dataframe kotlin
Last synced: 10 Jul 2024
![](https://github.com/Kotlin.png)
https://github.com/maxhumber/redframes
General Purpose Data Manipulation Library
Last synced: 10 Jul 2024
![](https://github.com/maxhumber.png)
https://github.com/tidypyverse/tidypandas
A grammar of data manipulation for pandas inspired by tidyverse
data-analysis data-science dataframe dataframe-library dplyr pandas python tidyverse
Last synced: 10 Jul 2024
![](https://github.com/tidypyverse.png)
https://github.com/HanXinzi-AI/awesome-python-machine-learning-resources
A collection of awesome machine learning and deep learning Python libraries and tools.
auto-ml awesome awesome-list cv data-analysis data-mining data-science data-visualization deep-learning fintech machine-learning machine-learning-algorithms nlp pytorch recommender-system sklearn tensorflow text-mining time-series
Last synced: 10 Jul 2024
![](https://github.com/HanXinzi-AI.png)
https://github.com/asavinov/prosto
Prosto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby
business-intelligence data-preparation data-preprocessing data-processing data-science data-wrangling feature-engineering map-reduce olap pandas python spark workflow
Last synced: 10 Jul 2024
![](https://github.com/asavinov.png)
https://github.com/metarank/metarank
A low code Machine Learning personalized ranking service for articles, listings, search results, recommendations that boosts user engagement. A friendly Learn-to-Rank engine
automl data-engineering data-science deep-learning feature-engineering feature-extraction kubernetes machine-learning neural-networks personalization ranking scala search
Last synced: 09 Jul 2024
![](https://github.com/metarank.png)
https://github.com/machine-learning-apps/Issue-Label-Bot
Code For The Issue Label Bot, an App that automatically labels issues using machine learning, available on the GitHub Marketplace. This is also code for the blog article: "How to automate tasks on GitHub with machine learning for fun and profit"
bigquery bootstrap data-science deep-learning end-to-end-application flask gcp-cloud gharchive github-api-v3 github-app keras kubernetes machine-learning machine-learning-tutorials nlp production-machine-learning tensorflow
Last synced: 09 Jul 2024
![](https://github.com/machine-learning-apps.png)
https://github.com/cjroth/chronist
Long-term analysis of emotion, age, and sentiment using Lifeslice and text records.
data data-analysis data-science data-visualization dataset dataviz emotion emotion-analytics es6 javascript matplotlib pandas photoanalysis python sentiment sentiment-analysis
Last synced: 09 Jul 2024
![](https://github.com/cjroth.png)
https://github.com/natnew/Awesome-Data-Science
Carefully curated list of awesome data science resources.
ai awesome awesome-list data data-science deep-learning explainable-ai interoperability large-scale-machine-learning machine-learning machine-learning-operations ml-operations responsible-ai
Last synced: 09 Jul 2024
![](https://github.com/natnew.png)
https://github.com/h2oai/nitro
Create apps 10x quicker, without Javascript/HTML/CSS.
app apps data-analysis data-science developer-tools devtools graphics h2o-nitro low-code python ui ui-components user-interface web-application webapp widget-library widgets
Last synced: 09 Jul 2024
![](https://github.com/h2oai.png)
https://github.com/jazzdotdev/jazz
The Scripting Engine that Combines Speed, Safety, and Simplicity
actix android chromeos crypto cryptography data-science database development-environment embeddable jazz jinja2 linux lua markdown rust scraping scripting web witness
Last synced: 09 Jul 2024
![](https://github.com/jazzdotdev.png)
https://github.com/nfstream/nfstream
NFStream: a Flexible Network Data Analysis Framework.
artificial-intelligence cybersecurity data-analysis data-mining data-science dataset-generation deep-packet-inspection machine-learning ndpi netflow network-analysis network-monitoring network-security packet-analyser packet-capture pcap python traffic-analysis traffic-classification
Last synced: 09 Jul 2024
![](https://github.com/nfstream.png)
https://github.com/pykale/pykale
Knowledge-Aware machine LEarning (KALE): accessible machine learning from multiple sources for interdisciplinary research, part of the 🔥PyTorch ecosystem. ⭐ Star to support our work!
computer-vision data-science deep-learning domain-adaptation graph-analysis knowledge-aware-learning machine-learning medical-image-analysis meta-learning multimodal multimodal-learning python pytorch transfer-learning
Last synced: 09 Jul 2024
![](https://github.com/pykale.png)
https://github.com/scottshambaugh/monaco
Quantify uncertainty and sensitivities in your computer models with an industry-grade Monte Carlo library.
data-science monaco monte-carlo python scientific-computing sensitivity-analysis simulation statistics uncertainty-analysis uncertainty-quantification
Last synced: 09 Jul 2024
![](https://github.com/scottshambaugh.png)
https://github.com/kennethleungty/Failed-ML
Compilation of high-profile real-world examples of failed machine learning projects
ai artificial-intelligence classification computer-vision data-engineering data-quality data-science deep-learning failed-data-science failed-machine-learning failed-ml fml forecasting machine-learning ml natural-language-processing production recsys regression
Last synced: 09 Jul 2024
![](https://github.com/kennethleungty.png)
https://github.com/mszell/introdatasci
Course materials for: Introduction to Data Science and Programming
course-materials crash-course data-science network-analysis pandas-python programming programming-courses python teaching-materials
Last synced: 08 Jul 2024
![](https://github.com/mszell.png)
https://github.com/ZackAkil/friendlier-data-labelling
Code resources for generating a google form for labelling data.
data-science google google-apps-script google-forms google-sheets machine-learning
Last synced: 08 Jul 2024
![](https://github.com/ZackAkil.png)
https://github.com/plotly/dash-table
OBSOLETE: now part of https://github.com/plotly/dash
dash data-science data-visualization plotly plotly-dash python react table
Last synced: 08 Jul 2024
![](https://github.com/plotly.png)
https://github.com/blockchain-etl/awesome-bigquery-views
Useful SQL queries for Blockchain ETL datasets in BigQuery.
blockchain-analytics crypto cryptocurrency data-analytics data-engineering data-science gcp google-cloud google-cloud-platform on-chain-analysis web3
Last synced: 08 Jul 2024
![](https://github.com/blockchain-etl.png)
https://github.com/piquette/qtrn
A CLI tool to streamline financial markets data analysis 🔧
cli data data-science finance go golang options quotes scraper stock stock-analysis stock-market
Last synced: 07 Jul 2024
![](https://github.com/piquette.png)
https://aymara.github.io/lima/
The Libre Multilingual Analyzer, a Natural Language Processing (NLP) C++ toolkit.
ai cpp data-science deep-learning entity-extraction free-software information-extraction linux machine-learning multilingual named-entity-recognition natural-language-processing neural-network nlp nlp-library powerful python relation-extraction tokenization windows
Last synced: 07 Jul 2024
![](https://github.com/aymara.png)
https://github.com/voxel51/voxelgpt
AI assistant that can query visual datasets, search the FiftyOne docs, and answer general computer vision questions
artificial-intelligence chatgpt computer-vision data-science deep-learning fiftyone langchain llm machine-learning openai python
Last synced: 07 Jul 2024
![](https://github.com/voxel51.png)
https://matheusfacure.github.io/python-causality-handbook/
Causal Inference for the Brave and True. A light-hearted yet rigorous approach to learning about impact estimation and causality.
causal-inference causality data-science econometrics harmless-econometrics impact-estimation python
Last synced: 06 Jul 2024
![](https://github.com/matheusfacure.png)
https://github.com/matheusfacure/python-causality-handbook
Causal Inference for the Brave and True. A light-hearted yet rigorous approach to learning about impact estimation and causality.
causal-inference causality data-science econometrics harmless-econometrics impact-estimation python
Last synced: 06 Jul 2024
![](https://github.com/matheusfacure.png)
https://github.com/somdeep/Statball
Statball - Football soccer stats analyser from top 5 european leagues with data obtained by web scraping from Fbref and Statsbomb
csharp data-science data-scraping data-viz dotnet dotnet-core fbref football football-analytics football-data scouting-data scraping soccer soccer-analytics soccer-data statsbomb tableau visualizations
Last synced: 06 Jul 2024
![](https://github.com/somdeep.png)
https://github.com/magesh-technovator/awesome-ai-applications
A comprehensive survey of business use cases of AI that help companies thrive in the digital economy
ai ai-applications analytics artificial-intelligence bussiness-intelligence computer-vision data-science deep-learning machine-learning natural-language-processing startup
Last synced: 06 Jul 2024
![](https://github.com/magesh-technovator.png)
https://github.com/Azure/AzureDSVM
AzureDSVM is an R package that provides a convenient harness for Azure DSVM, remote execution of scalable and elastic data science work, and monitoring of on-demand resource consumption.
azure data-science data-science-virtual-machine r
Last synced: 06 Jul 2024
![](https://github.com/Azure.png)
https://gdsl-ul.github.io/san/
Spatial Modelling for Data Scientists
book cross-validation data-science geographically-weighted-regression maps moran-i multilevel-models r r-spatial spatial-analysis spatial-econometrics
Last synced: 05 Jul 2024
![](https://github.com/GDSL-UL.png)
https://github.com/jmari/iPharo
Pharo Smalltalk kernel for Jupyter
data-science jupyter-notebook pharo pharo-smalltalk smalltalk
Last synced: 05 Jul 2024
![](https://github.com/jmari.png)
https://github.com/antonycourtney/tad
A desktop application for viewing and analyzing tabular data
csv data-analysis data-science database desktop-application duckdb parquet-viewer pivot-tables pivots tabular-data
Last synced: 05 Jul 2024
![](https://github.com/antonycourtney.png)
https://github.com/pydoit/doit
task management & automation tool
build-automation build-system build-tool data-pipeline data-science hacktoberfest python task-runner workflow workflow-automation workflow-management
Last synced: 05 Jul 2024
![](https://github.com/pydoit.png)
https://github.com/jgoerner/data-science-stack-cookiecutter
🐳📊🤓 Cookiecutter template to launch an awesome dockerized Data Science toolstack (incl. Jupyter, Superset, Postgres, Minio, AirFlow & API Star)
airflow apistar cookiecutter data-science docker docker-image jupyter minio postgres python superset
Last synced: 05 Jul 2024
![](https://github.com/jgoerner.png)
https://github.com/olavolav/uniplot
Lightweight plotting to the terminal. 4x resolution via Unicode.
data-analysis data-science plot python
Last synced: 05 Jul 2024
![](https://github.com/olavolav.png)
https://github.com/TeoMeWhy/teomerefs
Technical reference guide for a career in data
data data-science machine-learning python
Last synced: 05 Jul 2024
![](https://github.com/TeoMeWhy.png)
https://github.com/RamiKrispin/Introduction-to-Docker
(WIP) Getting started with Docker - An introduction to Docker with data science and engineering applications
data-engineering data-science docker dockerfile
Last synced: 04 Jul 2024
![](https://github.com/RamiKrispin.png)
https://github.com/tomasonjo/blogs
Jupyter notebooks that support my graph data science blog posts at https://bratanic-tomaz.medium.com/
data-science graph graph-algorithms neo4j
Last synced: 04 Jul 2024
![](https://github.com/tomasonjo.png)
https://github.com/okfn-brasil/rosie
🤖 Python application responsible for Serenata de Amor's intelligence
artificial-intelligence data-science machine-learning
Last synced: 04 Jul 2024
![](https://github.com/okfn-brasil.png)
https://github.com/okfn-brasil/whistleblower
🚨A Twitter bot for publicly reporting suspicions found by Rosie, Serenata de Amor's AI
data-science facebook-messenger-bot machine-learning twitter-bot
Last synced: 04 Jul 2024
![](https://github.com/okfn-brasil.png)
https://github.com/louisfb01/start-machine-learning
A complete guide to start and improve in machine learning (ML), artificial intelligence (AI) in 2024 without ANY background in the field and stay up-to-date with the latest news and state-of-the-art techniques!
artificial-intelligence cheat-sheets course coursera coursera-machine-learning data-science deep-learning learn-to-code learning learning-python linear-algebra machine-learning neural-networks practice probability-statistics read-articles tutorial tutorials youtube youtube-playlist
Last synced: 04 Jul 2024
![](https://github.com/louisfb01.png)
https://github.com/ErdemOzgen/Data-Engineering-Roadmap
Roadmap for Data Engineering
awesome awesome-list awesome-resources ci-cd cloud data-science database dataengineering datapipeline datapreprocessing datawarehouse deep-learning development devops guidelines interview machine-learning mlops roadmap
Last synced: 04 Jul 2024
![](https://github.com/ErdemOzgen.png)
https://github.com/akfamily/aktools
AKTools is an elegant and simple HTTP API library for AKShare, built for AKSharers!
akshare asyncio data data-science fastapi openapi pydanti
Last synced: 04 Jul 2024
![](https://github.com/akfamily.png)
https://github.com/h1st-ai/h1st
Power Tools for AI Engineers With Deadlines
automl autonomous-vehicles avionics cold-start collaboration cybersecurity data-science datascience-environment energy-optimization ensemble-machine-learning explainability hacktoberfest home-automation human-in-the-loop industrial-iot predictive-maintenance time-series trustworthy-datascience
Last synced: 04 Jul 2024
![](https://github.com/h1st-ai.png)
https://github.com/codelibs/docker-fione
Docker for Fione
ai automl data-science machine-learning
Last synced: 03 Jul 2024
![](https://github.com/codelibs.png)
https://github.com/biolab/orange3
🍊 📊 💡 Orange: Interactive data analysis
classification clustering data-mining data-science data-visualization decision-trees machine-learning numpy orange orange3 pandas plotting python random-forest regression scikit-learn scipy visual-programming visualization
Last synced: 03 Jul 2024
![](https://github.com/biolab.png)
https://github.com/launchflow/buildflow
BuildFlow is an open source framework for building large scale systems using Python. All you need to do is describe where your input is coming from and where your output should be written, and BuildFlow handles the rest. No configuration outside of the code is required.
batch data-science pipeline python streaming
Last synced: 03 Jul 2024
![](https://github.com/launchflow.png)
https://github.com/Himscipy/bnn_hvd
Distributed Training of Bayesian Neural Networks at Scale
bayesian-networks computer-vision data-science distributed-computing horovod machine-learning mnist tensorflow tensorflow-probability uncertainty-quantification variational-inference
Last synced: 03 Jul 2024
![](https://github.com/Himscipy.png)
https://github.com/pymupdf/PyMuPDF
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
data-science epub extract-data font mupdf ocr pdf pdf-documents pymupdf python table-extraction tesseract text-processing text-shaping xps
Last synced: 03 Jul 2024
![](https://github.com/pymupdf.png)
https://github.com/MLMI2-CSSI/foundry
Simplifying the discovery and usage of machine-learning ready datasets in materials science and chemistry
chemistry data-science datasets machine-learning materials-science
Last synced: 03 Jul 2024
![](https://github.com/MLMI2-CSSI.png)
https://github.com/Technion-Kishony-lab/quibbler
Your data - interactive!
data-analysis data-science data-visualization declarative graphics gui interactive jupyter matplotlib python widgets
Last synced: 03 Jul 2024
![](https://github.com/Technion-Kishony-lab.png)
https://github.com/run-house/runhouse
The fastest way to iterate and deploy AI workloads on your own infra. Unobtrusive, debuggable, PyTorch-like APIs.
api artificial-intelligence aws azure collaboration data-science deployment distributed fastapi gcp infrastructure machine-learning middleware observability python pytorch ray sagemaker serverless
Last synced: 03 Jul 2024
![](https://github.com/run-house.png)
https://github.com/Materials-Data-Science-and-Informatics/awesome-fair
A curated list of awesome stuff around the FAIR principles for (scientific) data, i.e. that data is findable, accessible, interoperable, and reusable.
ai-ready awesome awesome-list data-provenance data-science digital-objects fair fair-digital-objects fair-principles fdo interoperability metadata metadata-information metadata-management metadata-standard provenance research-data research-data-management
Last synced: 03 Jul 2024
![](https://github.com/Materials-Data-Science-and-Informatics.png)
https://github.com/kdr-aus/ogma
Scripting language focused on processing tabular data.
data-science language rust scripting-language table-data
Last synced: 03 Jul 2024
![](https://github.com/kdr-aus.png)
https://github.com/hamelsmu/Seq2Seq_Tutorial
Code For Medium Article "How To Create Data Products That Are Magical Using Sequence-to-Sequence Models"
data-science deep-learning deeplearning keras keras-tutorials machine-learning medium-article nlp-machine-learning rnn-encoder-decoder seq2seq-tutorial sequence-to-sequence
Last synced: 02 Jul 2024
![](https://github.com/hamelsmu.png)
https://github.com/mandiant/ThreatPursuit-VM
Threat Pursuit Virtual Machine (VM): A fully customizable, open-sourced Windows-based distribution focused on threat intelligence analysis and hunting designed for intel and malware analysts as well as threat hunters to get up and running quickly.
analytics cyber data-science fireeye intelligence intelligence-analysis malware mandiant threat threathunting threatintelligence virtual-machine
Last synced: 02 Jul 2024
![](https://github.com/mandiant.png)
https://github.com/neonwatty/machine_learning_refined
Notes, examples, and Python demos for the 2nd edition of the textbook "Machine Learning Refined" (published by Cambridge University Press).
artificial-intelligence autograd collab data-science deep-learning jax jupyter-notebook lecture-notes machine-learning machine-learning-algorithms mathematical-optimization neural-network numpy python slides
Last synced: 02 Jul 2024
![](https://github.com/neonwatty.png)
https://github.com/allenai/allennlp
An open-source NLP research library, built on PyTorch.
data-science deep-learning natural-language-processing nlp python pytorch
Last synced: 02 Jul 2024
![](https://github.com/allenai.png)
https://github.com/safe-graph/UGFraud
An Unsupervised Graph-based Toolbox for Fraud Detection
anomaly-detection data-science fraud-detection fraud-prevention graph-algorithms machine-learning opensource outlier-detection security-tools spam-detection toolbox
Last synced: 02 Jul 2024
![](https://github.com/safe-graph.png)
https://github.com/kennethleungty/Text-to-Audio-with-Bark
Exploring Bark, the Open-Source Text-to-Audio Generative Model
ai artificial-intelligence bark data-science deep-learning gen-ai generative-ai machine-learning prompt-engineering speech text-prompt text-to-audio text-to-music text-to-sound text-to-speech
Last synced: 02 Jul 2024
![](https://github.com/kennethleungty.png)
https://github.com/cleanlab/cleanvision
Automatically find issues in image datasets and practice data-centric computer vision.
computer-vision data-centric-ai data-exploration data-profiling data-quality data-science data-validation deep-learning exploratory-data-analysis image-analysis image-classification image-generation image-quality image-segmentation
Last synced: 01 Jul 2024
![](https://github.com/cleanlab.png)
https://github.com/PKU-DAIR/Hetu
A high-performance distributed deep learning system targeting large-scale and automated distributed training.
artificial-intelligence autograd data-science deep-learning deep-neural-networks distributed-systems distributed-training embeddings gpu high-dimensional machine-learning python state-of-the-art
Last synced: 01 Jul 2024
![](https://github.com/PKU-DAIR.png)
https://github.com/pablofrommars/fsharp-notebook
Data Science Notebook for F# interactive
data-science data-visualization fsharp vscode-extension
Last synced: 01 Jul 2024
![](https://github.com/pablofrommars.png)
https://github.com/jgoerner/beyond-jupyter
🐍💻📊 All material from the PyCon.DE 2018 Talk "Beyond Jupyter Notebooks - Building your own data science platform with Python & Docker" (incl. Slides, Video, Udemy MOOC & other References)
airflow apache apistar data-science docker docker-compose jupyter jupyter-notebook minio postgres superset
Last synced: 01 Jul 2024
![](https://github.com/jgoerner.png)
https://github.com/gramian/hapod
HAPOD - Hierarchical Approximate Proper Orthogonal Decomposition
data-driven data-reduction data-science datascience dimension-reduction distributed-memory high-performance-computing hpc limited-memory mapreduce mapreduce-algorithm model-order-reduction model-reduction pca pod proper-orthogonal-decomposition svd unsupervised-learning
Last synced: 01 Jul 2024
![](https://github.com/gramian.png)
https://github.com/sharatsawhney/character_segmentation
A detailed Research project on Character-Segmentation using Neural Networks!
data-science deep-learning deep-neural-networks keras keras-layer keras-models keras-neural-networks matplotlib neural-network numpy opencv-python
Last synced: 30 Jun 2024
![](https://github.com/sharatsawhney.png)
https://github.com/rhiever/datacleaner
A Python tool that automatically cleans data sets and readies them for analysis.
automation data-science machine-learning python
Last synced: 30 Jun 2024
![](https://github.com/rhiever.png)
https://github.com/makcedward/nlp
📝 This repository records my NLP journey.
ai data-science deep-learning machine-learning nlp
Last synced: 30 Jun 2024
![](https://github.com/makcedward.png)
https://github.com/ak-coram/cl-duckdb
Common Lisp CFFI wrapper around the DuckDB C API
c-bindings common-lisp data-science duckdb lisp olap parquet sql
Last synced: 30 Jun 2024
![](https://github.com/ak-coram.png)
https://ibm-cds-labs.github.io/pixiedust
Python Helper library for Jupyter Notebooks
data-science jupyter-notebook pixiedust python python-notebook scala-notebooks spark visualization
Last synced: 30 Jun 2024
![](https://github.com/pixiedust.png)
https://github.com/Esri/arcgis-python-api
Documentation and samples for ArcGIS API for Python
arcgis data-science gis jupyter jupyterlab-extension mapping python spatial-data spatial-data-analysis
Last synced: 30 Jun 2024
![](https://github.com/Esri.png)
https://github.com/mlrun/mlrun
MLRun is an open source MLOps platform for quickly building and managing continuous ML applications across their lifecycle. MLRun integrates into your development and CI/CD environment and automates the delivery of production data, ML pipelines, and online applications.
data-engineering data-science experiment-tracking kubernetes machine-learning mlops mlops-workflow model-serving python workflow
Last synced: 29 Jun 2024
![](https://github.com/mlrun.png)
https://github.com/NannyML/nannyml
nannyml: post-deployment data science in python
data-analysis data-drift data-science deep-learning jupyter-notebook machine-learning machinelearning ml mlops model-monitoring monitoring performance-estimation performance-monitoring postdeploymentdatascience python visualization
Last synced: 29 Jun 2024
![](https://github.com/NannyML.png)
https://github.com/ploomber/soorgeon
Convert monolithic Jupyter notebooks 📙 into maintainable Ploomber pipelines. 📊
data-engineering data-science jupyter jupyter-notebooks machine-learning mlops workflow
Last synced: 29 Jun 2024
![](https://github.com/ploomber.png)
https://github.com/ploomber/soopervisor
☁️ Export Ploomber pipelines to Kubernetes (Argo), Airflow, AWS Batch, SLURM, and Kubeflow.
airflow argo argo-workflows aws data-science kubeflow kubeflow-pipelines kubernetes machine-learning slurm workflow
Last synced: 29 Jun 2024
![](https://github.com/ploomber.png)
https://github.com/aporia-ai/mlnotify
🔔 No need to keep checking your training - just one import line and you'll know the second it's done.
data-science deep-learning deeplearning machine-learning machinelearning machinelearning-python ml notification notifications opensource python python3 tool tools
Last synced: 29 Jun 2024
![](https://github.com/aporia-ai.png)
https://github.com/a3data/hermione
ML made simple
data-science hermione machine-learning python
Last synced: 29 Jun 2024
![](https://github.com/A3Data.png)
https://github.com/datacarpentry/semester-biology
Forkable teaching materials for course on working with data in R
biology data-carpentry data-science r spatial-data sql teaching-materials
Last synced: 29 Jun 2024
![](https://github.com/datacarpentry.png)
https://github.com/iterative/mlem
🐶 A tool to package, serve, and deploy any ML model on any platform. Archived to be resurrected one day🤞
cli data-science deployment developer-tools git machine-learning mlem model-registry python
Last synced: 29 Jun 2024
![](https://github.com/iterative.png)
https://github.com/xiyanghu/OSDT
Optimal Sparse Decision Trees
accelerate acceleration-model algorithm algorithm-optimization data-mining data-science interpretable-ml machine-learning ml-system mlsys neurips python python3
Last synced: 29 Jun 2024
![](https://github.com/xiyanghu.png)
https://github.com/MLReef/mlreef
The collaboration workspace for Machine Learning
artificial-intelligence data-science deep-learning deeplearning machine-learning machine-learning-algorithms mlops mlops-environment models mxnet pytorch reproducibility tensorflow
Last synced: 29 Jun 2024
![](https://github.com/MLReef.png)
https://github.com/carloocchiena/the_statistics_handbook
The Statistics Handbook open source repository
data-science latex mathematics statistics
Last synced: 29 Jun 2024
![](https://github.com/carloocchiena.png)
https://kevinheavey.github.io/modern-polars/
Code and data for the Modern Polars book
data-analytics data-engineering data-science dataengineering pandas polars python
Last synced: 29 Jun 2024
![](https://github.com/kevinheavey.png)
https://github.com/alinebastos/dev-practice
Practice your skills with these ideas.
back-end backend challenge css css3 data-science development front-end front-end-development frontend frontend-practice frontend-skills game git hackathons hacktoberfest javascript practice vim
Last synced: 29 Jun 2024
![](https://github.com/alinebastos.png)