Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Data Science
Data science is an inter-disciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge from structured and unstructured data. Data scientists perform data analysis and preparation, and their findings inform high-level decisions in many organizations.
- GitHub: https://github.com/topics/data-science
- Wikipedia: https://en.wikipedia.org/wiki/Data_science
- Related Topics: data-analysis, data-mining, machine-learning, big-data, data-visualization,
- Aliases: datasciences, data-science-project, data-science-algorithm,
- Last updated: 2024-12-25 00:06:44 UTC
- JSON Representation
https://github.com/sintel-dev/orion
A machine learning library for detecting anomalies in signals.
anomaly-detection benchmarking data-science deep-learning generative-adversarial-network machine-learning orion signals time-series unsupervised-learning
Last synced: 25 Dec 2024
https://github.com/pixiedust/pixiedust
Python Helper library for Jupyter Notebooks
data-science jupyter-notebook pixiedust python python-notebook scala-notebooks spark visualization
Last synced: 20 Dec 2024
https://github.com/aeon-toolkit/aeon
A toolkit for machine learning from time series
data-mining data-science machine-learning scikit-learn time-series time-series-analysis time-series-anomaly-detection time-series-classification time-series-clustering time-series-regression time-series-segmentation
Last synced: 24 Dec 2024
https://ibm-cds-labs.github.io/pixiedust
Python Helper library for Jupyter Notebooks
data-science jupyter-notebook pixiedust python python-notebook scala-notebooks spark visualization
Last synced: 04 Oct 2024
https://github.com/cleanlab/cleanvision
Automatically find issues in image datasets and practice data-centric computer vision.
computer-vision data-centric-ai data-exploration data-profiling data-quality data-science data-validation deep-learning exploratory-data-analysis image-analysis image-classification image-generation image-quality image-segmentation
Last synced: 26 Oct 2024
https://github.com/LongOnly/Quantitative-Notebooks
Educational notebooks on quantitative finance, algorithmic trading, financial modelling and investment strategy
algorithmic-trading algotrading asset-allocation asset-management asset-pricing data-analysis data-science financial-analysis jupyter machine-learning notebook pairs-trading python quantitative-finance quantitative-trading stock-trading trading-algorithms trading-strategies
Last synced: 01 Nov 2024
https://github.com/longonly/quantitative-notebooks
Educational notebooks on quantitative finance, algorithmic trading, financial modelling and investment strategy
algorithmic-trading algotrading asset-allocation asset-management asset-pricing data-analysis data-science financial-analysis jupyter machine-learning notebook pairs-trading python quantitative-finance quantitative-trading stock-trading trading-algorithms trading-strategies
Last synced: 27 Sep 2024
https://github.com/sintel-dev/Orion
A machine learning library for detecting anomalies in signals.
anomaly-detection benchmarking data-science deep-learning generative-adversarial-network machine-learning orion signals time-series unsupervised-learning
Last synced: 30 Oct 2024
https://github.com/maxpumperla/deep_learning_and_the_game_of_go
Code and other material for the book "Deep Learning and the Game of Go"
alphago alphago-zero data-science deep-learning game-of-go games machine-learning neural-networks python
Last synced: 26 Dec 2024
https://github.com/dssg/hitchhikers-guide
The Hitchhiker's Guide to Data Science for Social Good
data-science dssg machine-learning training tutorial-exercises
Last synced: 12 Nov 2024
https://github.com/dataquestio/project-walkthroughs
Data science, machine learning, and web development project code for https://www.youtube.com/c/Dataquestio .
data-science machine-learning pandas python
Last synced: 21 Dec 2024
https://github.com/run-house/runhouse
Dispatch and distribute your ML training to "serverless" clusters in Python, like PyTorch for ML infra. Iterable, debuggable, multi-cloud/on-prem, identical across research and production.
api artificial-intelligence aws azure collaboration data-science deployment distributed fastapi gcp infrastructure machine-learning middleware observability python pytorch ray sagemaker serverless
Last synced: 25 Dec 2024
https://github.com/towardsai/tutorials
AI-related tutorials. Access any of them for free → https://towardsai.net/editorial
collaborative-filtering data-science deep-learning google-colab linear-algebra machine-learning math mathematics monte-carlo-simulation neural-networks nlp programming python python-tutorial recommendation-system sentiment-analysis tutorial
Last synced: 13 Nov 2024
https://github.com/mybridge/machine-learning-open-source
Monthly Series - Machine Learning Top 10 Open Source Projects
ai algorithm artificial-intelligence data-science machine-learning neural-network
Last synced: 07 Nov 2024
https://github.com/Mybridge/machine-learning-open-source
Monthly Series - Machine Learning Top 10 Open Source Projects
ai algorithm artificial-intelligence data-science machine-learning neural-network
Last synced: 28 Oct 2024
https://github.com/sematic-ai/sematic
An open-source ML pipeline development platform
ai data-science machine-learning ml ml-ops ml-pipeline ml-pipelines mlops pipeline python python3
Last synced: 15 Nov 2024
https://github.com/zama-ai/concrete-ml
Concrete ML: Privacy Preserving ML framework using Fully Homomorphic Encryption (FHE), built on top of Concrete, with bindings to traditional ML frameworks.
data-science fhe fully-homomorphic-encryption homomorphic-encryption machine-learning ppml privacy python scikit-learn tfhe torch
Last synced: 25 Dec 2024
https://github.com/WenjieDu/PyPOTS
A Python toolkit/library for reality-centric machine/deep learning and data mining on partially-observed time series, including SOTA neural network models for scientific analysis tasks of imputation/classification/clustering/forecasting/anomaly detection/cleaning on incomplete industrial (irregularly-sampled) multivariate TS with NaN missing values
classification clustering data-mining data-science deep-learning forecasting healthcare imputation incomplete industrial interpolation machine-learning missing-values missingness neural-network partially-observed-time-series pytorch science-research time-series time-series-analysis
Last synced: 02 Nov 2024
https://github.com/tidyverse/datascience-box
Data Science Course in a Box
data-science education r rstats teaching
Last synced: 26 Dec 2024
https://github.com/grailbio/reflow
A language and runtime for distributed, incremental data processing in the cloud
analysis-pipeline aws bioinformatics-pipeline cloud-computing data-science golang language runtime scientific-computing
Last synced: 26 Oct 2024
https://github.com/caserec/Datasets-for-Recommender-Systems
This is a repository of a topic-centric public data sources in high quality for Recommender Systems (RS)
data-science database datasets public-data recommender-systems
Last synced: 28 Nov 2024
https://github.com/zinggai/zingg
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
analytics analytics-engineering data-science data-transformation data-transformations dataengineering datalake dataquality dedupe deduplication entity-resolution etl fuzzy-matching fuzzymatch identity identity-resolution masterdata ml modern-data-stack spark
Last synced: 25 Dec 2024
https://github.com/ipython-books/cookbook-2nd
IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018
computing data-analysis data-mining data-science data-visualization ipython jupyter jupyter-notebook machine-learning numerical-computation python visualization
Last synced: 23 Dec 2024
https://github.com/chiphuyen/just-pandas-things
An ongoing list of pandas quirks
data-science machine-learning pandas pandas-dataframe pandas-tutorial python
Last synced: 25 Dec 2024
https://github.com/mlr-org/mlr3
mlr3: Machine Learning in R - next generation
classification data-science machine-learning mlr3 r r-package regression
Last synced: 25 Dec 2024
https://github.com/dataprofessor/code
Compilation of R and Python programming codes on the Data Professor YouTube channel.
data-professor data-science data-science-python dataprofessor datascience exploratory-data-analysis machine-learning machinelearning pandas python python-data-science r scikit-learn scikit-learn-python shiny streamlit
Last synced: 26 Dec 2024
https://github.com/ramiawar/dataline
Chat with your data - AI data analysis and visualization on CSV, Postgres, MySQL, Snowflake, SQLite...
ai chart data-science data-visualization llm sql
Last synced: 20 Dec 2024
https://github.com/egbertbouman/youtube-comment-downloader
Simple script for downloading Youtube comments without using the Youtube API
data-science data-scraper python youtube youtube-comments
Last synced: 25 Dec 2024
https://github.com/ropensci/targets
Function-oriented Make-like declarative workflows for R
data-science high-performance-computing make peer-reviewed pipeline r r-package r-targetopia reproducibility reproducible-research rstats targets workflow
Last synced: 24 Dec 2024
https://github.com/iamaziz/pydataset
Instant access to many datasets in Python.
Last synced: 21 Dec 2024
https://github.com/iamaziz/PyDataset
Instant access to many datasets in Python.
Last synced: 27 Nov 2024
https://github.com/nivu/ai_all_resources
A curated list of Best Artificial Intelligence Resources
artificial-intelligence convolutional-neural-networks data-science decision-trees deep-learning gan kmeans knn machine-learning mathematics neural-networks python random-forest regression reinforcement-learning rnn statistics statquest support-vector-machine tensorflow
Last synced: 02 Nov 2024
https://github.com/fraunhoferportugal/tsfel
An intuitive library to extract features from time series.
classification colab-notebook data-science feature-engineering feature-extraction time-series
Last synced: 26 Oct 2024
https://github.com/probcomp/bayeslite
BayesDB on SQLite. A Bayesian database table for querying the probable implications of data as easily as SQL databases query the data itself.
automatic-data-modeling data-science databases machine-learning probabilistic-programming
Last synced: 24 Dec 2024
https://github.com/youssefhosni/practical-machine-learning
Practical machine learning notebook & articles covers the machine learning end to end life cycle.
Last synced: 23 Dec 2024
https://github.com/RamiAwar/dataline
Chat with your data - AI data analysis and visualization on CSV, Postgres, MySQL, Snowflake, SQLite...
ai chart data-science data-visualization llm sql
Last synced: 30 Nov 2024
https://github.com/norskregnesentral/skweak
skweak: A software toolkit for weak supervision applied to NLP tasks
data-science distant-supervision natural-language-processing nlp-library nlp-machine-learning python spacy training-data weak-supervision
Last synced: 26 Dec 2024
https://github.com/NorskRegnesentral/skweak
skweak: A software toolkit for weak supervision applied to NLP tasks
data-science distant-supervision natural-language-processing nlp-library nlp-machine-learning python spacy training-data weak-supervision
Last synced: 26 Oct 2024
https://github.com/firmai/data-science-career
Career Resources for Data Science, Machine Learning, Big Data and Business Analytics Career Repository
analytics big-data business-analytics business-intelligence career data-science machine-learning resources
Last synced: 27 Nov 2024
https://github.com/youssefHosni/Practical-Machine-Learning
Practical machine learning notebook & articles covers the machine learning end to end life cycle.
Last synced: 27 Oct 2024
https://github.com/epsilla-cloud/vectordb
Epsilla is a high performance Vector Database Management System. Try out hosted Epsilla at https://cloud.epsilla.com/
ai chatgpt data data-science database embeddings embeddings-similarity infrastructure llms machine-learning neural-network neural-search rag retrieval search-engine vector-database vector-search
Last synced: 26 Dec 2024
https://github.com/mentatinnovations/datastream.io
An open-source framework for real-time anomaly detection using Python, ElasticSearch and Kibana
alerts anomaly anomaly-detection anomalydetection anomalydiscovery bokeh-dashboard dashboard data-science data-stream datascience dataset dsio elasticsearch iot jupyter kibana machinelearning python sklearn timeseries
Last synced: 25 Dec 2024
https://github.com/sberbank-ai-lab/LightAutoML
LAMA - automatic model creation framework
automated-machine-learning automl blackbox classification data-science ensembling feature-engineering gradient-boosting kaggle lama linear-model model-selection multiclass nlp parameter-tuning pipeline pytorch regression stacking whitebox
Last synced: 27 Nov 2024
https://github.com/tirthajyoti/stats-maths-with-python
General statistics, mathematical programming, and numerical/scientific computing scripts and notebooks in Python
analytics anova bayesian-statistics clustering data-science hypothesis-testing inferential-statistics machine-learning mathematical-programming mathematics matplotlib normal-distribution numerical-analysis numpy pandas probability python scipy statistics statsmodels
Last synced: 24 Dec 2024
https://github.com/MentatInnovations/datastream.io
An open-source framework for real-time anomaly detection using Python, ElasticSearch and Kibana
alerts anomaly anomaly-detection anomalydetection anomalydiscovery bokeh-dashboard dashboard data-science data-stream datascience dataset dsio elasticsearch iot jupyter kibana machinelearning python sklearn timeseries
Last synced: 26 Oct 2024
https://github.com/ahmetozlu/vehicle_counting_tensorflow
:oncoming_automobile: "MORE THAN VEHICLE COUNTING!" This project provides prediction for speed, color and size of the vehicles with TensorFlow Object Counting API.
color-recognition computer-vision data-science deep-learning deep-neural-networks detection image-processing machine-learning object-detection object-detection-label opencv prediction python speed-prediction tensorflow tensorflow-object-detection-api vehicle-counting vehicle-detection vehicle-detection-and-tracking vehicle-tracking
Last synced: 23 Dec 2024
https://github.com/opengeos/streamlit-geospatial
A multi-page streamlit app for geospatial
data-science datascience dataviz geopython geospatial housing-data housing-market huggingface mapping open-source python real-estate streamlit streamlit-webapp
Last synced: 27 Dec 2024
https://github.com/google/lightweight_mmm
LightweightMMM 🦇 is a lightweight Bayesian Marketing Mix Modeling (MMM) library that allows users to easily train MMMs and obtain channel attribution information.
bayesian data-science econometrics marketing-science mmm
Last synced: 13 Nov 2024
https://github.com/webartifex/intro-to-python
An intro to Python & programming for wanna-be data scientists
data-science introduction-to-programming jupyter python tutorial
Last synced: 17 Nov 2024
https://github.com/tirthajyoti/Stats-Maths-with-Python
General statistics, mathematical programming, and numerical/scientific computing scripts and notebooks in Python
analytics anova bayesian-statistics clustering data-science hypothesis-testing inferential-statistics machine-learning mathematical-programming mathematics matplotlib normal-distribution numerical-analysis numpy pandas probability python scipy statistics statsmodels
Last synced: 12 Nov 2024
https://github.com/joaquinamatrodrigo/skforecast
Time series forecasting with scikit-learn models
arima autoregressive-forecasting backtesting-forecasters data-science direct-forecasting exogenous-predictors forecasting lightgbm machine-learning multi-series-forecasting multi-step-forecasting multiple-time-series-forecasting probabilistic-forecasting python quantile-forecasting sarimax scikit-learn time-series weighted-time-series-forecasting xgboost
Last synced: 10 Oct 2024
https://github.com/shenweichen/coursera
Quiz & Assignment of Coursera
computer-vision coursera data-science data-structures deep-learning machine-learning natural-language-processing reinforcement-learning
Last synced: 23 Dec 2024
https://github.com/shenweichen/Coursera
Quiz & Assignment of Coursera
computer-vision coursera data-science data-structures deep-learning machine-learning natural-language-processing reinforcement-learning
Last synced: 27 Nov 2024
https://github.com/turicas/rows
A common, beautiful interface to tabular data, no matter the format
convert-data csv data data-science excel hacktoberfest python table tabular-data xls xlsx
Last synced: 26 Dec 2024
https://github.com/stitchfix/hamilton
A scalable general purpose micro-framework for defining dataflows. THIS REPOSITORY HAS BEEN MOVED TO www.github.com/dagworks-inc/hamilton
dag data-engineering data-platform data-science dataframe etl etl-framework etl-pipeline feature-engineering featurization hamilton hamiltonian machine-learning numpy pandas python software-engineering stitch-fix
Last synced: 26 Sep 2024
https://github.com/osgeo/grass
GRASS GIS - free and open-source geospatial processing engine
arrays data-science earth-observation geospatial geospatial-analysis gis grass-gis hacktoberfest image-processing jupyter machine-learning open-science parallel-computing python raster remote-sensing science spatial timeseries-analysis vector
Last synced: 26 Dec 2024
https://github.com/empathy87/the-elements-of-statistical-learning-python-notebooks
A series of Python Jupyter notebooks that help you better understand "The Elements of Statistical Learning" book
data-analysis data-science machine-learning python sklearn statistical-learning tensorflow tutorials
Last synced: 25 Dec 2024
https://github.com/GoogleCloudPlatform/DataflowJavaSDK
Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
big-data data-analysis data-mining data-processing data-science google-cloud-dataflow
Last synced: 12 Nov 2024
https://github.com/empathy87/The-Elements-of-Statistical-Learning-Python-Notebooks
A series of Python Jupyter notebooks that help you better understand "The Elements of Statistical Learning" book
data-analysis data-science machine-learning python sklearn statistical-learning tensorflow tutorials
Last synced: 12 Nov 2024
https://github.com/bansalkanav/ultimate-data-science-toolkit---from-python-basics-to-generativeai
aws cnn data-analysis data-science deep-learning-algorithms flask machine-learning mlflow mlops mongodb pandas-python prefect python3 search-engine sklearn-library sql statistics streamlit tutorial-code visualization
Last synced: 27 Dec 2024
https://github.com/bansalkanav/Ultimate-Data-Science-Toolkit---From-Python-Basics-to-GenerativeAI
aws cnn data-analysis data-science deep-learning-algorithms flask machine-learning mlflow mlops mongodb pandas-python prefect python3 search-engine sklearn-library sql statistics streamlit tutorial-code visualization
Last synced: 08 Nov 2024
https://github.com/fmind/mlops-python-package
Kickstart your MLOps initiative with a flexible, robust, and productive Python package.
automation data-pipelines data-science machine-learning mlflow mlops pandera pydantic python
Last synced: 20 Dec 2024
https://github.com/OSGeo/grass
GRASS GIS - free and open-source geospatial processing engine
arrays data-science earth-observation geospatial geospatial-analysis gis grass-gis hacktoberfest image-processing jupyter machine-learning open-science parallel-computing python raster remote-sensing science spatial timeseries-analysis vector
Last synced: 05 Nov 2024
https://github.com/microsoft/RD-Agent
Research and development (R&D) is crucial for the enhancement of industrial productivity, especially in the AI era, where the core aspects of R&D are mainly focused on data and models. We are committed to automate these high-value generic R&D processes through our open source R&D automation tool RD-Agent, which let AI drive data-driven AI.
agent ai automation data-mining data-science development llm research
Last synced: 10 Oct 2024
https://github.com/dswah/pyGAM
[HELP REQUESTED] Generalized Additive Models in Python
data-science gams interpretable-machine-learning machine-learning python scientific-computing
Last synced: 30 Oct 2024
https://github.com/d0r1h/ML-University
Machine Learning Open Source University
artificial-intelligence awsome awsome-list computer-science course data-science deep-learning free learning machine-learning mathematics natural-language-processing neural-network open-source reinforcement-learning university
Last synced: 15 Nov 2024
https://github.com/Kotlin/dataframe
Structured data processing in Kotlin
data-analysis data-science dataframe kotlin
Last synced: 07 Nov 2024
https://github.com/aloctavodia/statistical-rethinking-with-python-and-pymc3
Python/PyMC3 port of the examples in " Statistical Rethinking A Bayesian Course with Examples in R and Stan" by Richard McElreath
bayesian-data-analysis data-science pymc python statistics
Last synced: 25 Dec 2024
https://github.com/hazyresearch/meerkat
Creative interactive views of any dataset.
data-science foundation-models machine-learning ml pandas
Last synced: 22 Dec 2024
https://github.com/aloctavodia/Statistical-Rethinking-with-Python-and-PyMC3
Python/PyMC3 port of the examples in " Statistical Rethinking A Bayesian Course with Examples in R and Stan" by Richard McElreath
bayesian-data-analysis data-science pymc python statistics
Last synced: 27 Nov 2024
https://github.com/HazyResearch/meerkat
Creative interactive views of any dataset.
data-science foundation-models machine-learning ml pandas
Last synced: 29 Oct 2024
https://github.com/aerdem4/lofo-importance
Leave One Feature Out Importance
data-science explainable-ai feature-importance feature-selection machine-learning
Last synced: 09 Nov 2024
https://github.com/chawlaavi/daily-dose-of-data-science
A collection of code snippets from the publication Daily Dose of Data Science on Substack: http://www.dailydoseofds.com/
data-analysis data-science data-science-tips data-visualization jupyter jupyter-notebook jupyter-tips matplotlib matplotlib-tips numpy pandas pandas-tips python python-tips sklearn
Last synced: 24 Dec 2024
https://github.com/amodinho/datacamp-python-data-science-track
All the slides, accompanying code and exercises all stored in this repo. 🎈
bokeh data-science datacamp datacamp-course datacamp-exercises datacamp-machine-learning datacamp-projects datacamp-python datacamp-solutions-python datascience machinelearning natural-language-processing neural-network neural-networks nlp pandas python scikit-learn tokenization
Last synced: 22 Dec 2024
https://github.com/elki-project/elki
ELKI Data Mining Toolkit
anomalydetection cluster-analysis clustering data-analysis data-mining data-mining-algorithms data-science distance-functions index indexing java machine-learning outlier-detection outliers time-series visualization
Last synced: 25 Dec 2024
https://github.com/latitude-dev/latitude
Developer-first embedded analytics
analytics business-intelligence dashboard data data-analysis data-analytics data-app data-engineering data-science data-visualization duckdb embedded-analytics exploratory-data-analysis javascript-framework open-source react self-hosted sql svelte tailwindcss
Last synced: 07 Sep 2024
https://github.com/the-black-knight-01/Data-Science-Competitions
Goal of this repo is to provide the solutions of all Data Science Competitions(Kaggle, Data Hack, Machine Hack, Driven Data etc...).
analytics-vidhya competition-code competitive-data-science-github data-science data-science-competition data-science-competitions datahack-competition kaggle kaggle-competition kaggle-competition-for-beginners kaggle-competition-solutions kaggle-solutions-github kaggle-winning-solutions-github machine-learning machinehack-competition xgboost
Last synced: 29 Nov 2024
https://github.com/the-black-knight-01/data-science-competitions
Goal of this repo is to provide the solutions of all Data Science Competitions(Kaggle, Data Hack, Machine Hack, Driven Data etc...).
analytics-vidhya competition-code competitive-data-science-github data-science data-science-competition data-science-competitions datahack-competition kaggle kaggle-competition kaggle-competition-for-beginners kaggle-competition-solutions kaggle-solutions-github kaggle-winning-solutions-github machine-learning machinehack-competition xgboost
Last synced: 13 Nov 2024
https://github.com/nannyml/the-little-book-of-ml-metrics
The book every data scientist needs on their desk.
book classification-metrics clustering-metrics computer-vision-metrics data-science machine-learning machine-learning-evaluation machine-learning-metrics nlp-metrics python ranking-metrics regression-metrics
Last synced: 20 Dec 2024
https://github.com/thealgorithms/jupyter
The repository contains script and notebook related to Statistics, Machine learning, Neural network, Deep learning, NLP, Numerical methods, and Automation.
algorithms data-science data-structures deep-learning hacktoberfest machine-learning neural-network
Last synced: 21 Dec 2024
https://github.com/TheAlgorithms/Jupyter
The repository contains script and notebook related to Statistics, Machine learning, Neural network, Deep learning, NLP, Numerical methods, and Automation.
algorithms data-science data-structures deep-learning hacktoberfest machine-learning neural-network
Last synced: 13 Nov 2024
https://github.com/h1st-ai/h1st
Power Tools for AI Engineers With Deadlines
automl autonomous-vehicles avionics cold-start collaboration cybersecurity data-science datascience-environment energy-optimization ensemble-machine-learning explainability hacktoberfest home-automation human-in-the-loop industrial-iot predictive-maintenance time-series trustworthy-datascience
Last synced: 25 Oct 2024
https://github.com/AmoDinho/datacamp-python-data-science-track
All the slides, accompanying code and exercises all stored in this repo. 🎈
bokeh data-science datacamp datacamp-course datacamp-exercises datacamp-machine-learning datacamp-projects datacamp-python datacamp-solutions-python datascience machinelearning natural-language-processing neural-network neural-networks nlp pandas python scikit-learn tokenization
Last synced: 29 Oct 2024
https://github.com/agnostiqhq/covalent
Pythonic tool for orchestrating machine-learning/high performance/quantum-computing workflows in heterogeneous compute environments.
covalent data-pipeline data-science deep-learning hacktoberfest hpc hpc-applications machine-learning machinelearning machinelearning-python orchestration parallelization pipelines python quantum quantum-computing quantum-machine-learning workflow workflow-automation workflow-management
Last synced: 22 Dec 2024
https://github.com/kuwala-io/kuwala
Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bring state-of-the-art data engineering tools you love, such as Airbyte, dbt, or Great Expectations together in one intuitive interface built with React Flow. In addition we provide third-party data into data science models and products with a focus on geospatial data. Currently, the following data connectors are available worldwide: a) High-resolution demographics data b) Point of Interests from Open Street Map c) Google Popular Times
admin-boundaries data data-integration data-science dbt elt google-trends jupyter kuwala no-code open-data open-source population postgres pyspark python react react-flow scraping spatial-analysis
Last synced: 01 Nov 2024
https://github.com/yoshoku/rumale
Rumale is a machine learning library in Ruby
artificial-intelligence data-analysis data-science machine-learning ml ruby rubyml
Last synced: 23 Dec 2024
https://github.com/jasmcaus/caer
High-performance Vision library in Python. Scale your research, not boilerplate.
ai artificial-intelligence augmentation caer computer-vision cuda data-science deep-learning gpu image-classification image-processing image-segmentation machine-learning neural-network opencv python segmentation type-checking video-processing vision
Last synced: 20 Dec 2024
https://github.com/chrisvoncsefalvay/learn-julia-the-hard-way
Learn Julia the hard way!
data-science hpc julia julia-language julialang language learning learning-by-doing learning-julia scientific-computing statistics technical-computing
Last synced: 22 Dec 2024
https://github.com/compdemocracy/polis
:milky_way: Open Source AI for large scale open ended feedback
civic-tech data-science deliberative-democracy participatory-democracy
Last synced: 01 Nov 2024
https://github.com/scrapinghub/python-crfsuite
A python binding for crfsuite
Last synced: 22 Dec 2024
https://github.com/jwkvam/bowtie
:bowtie: Create a dashboard with python!
ant-design antd dashboard data-science flask interactive jupyter plotly python react socket-io visualization webapp
Last synced: 22 Dec 2024
https://github.com/AgnostiqHQ/covalent
Pythonic tool for orchestrating machine-learning/high performance/quantum-computing workflows in heterogeneous compute environments.
covalent data-pipeline data-science deep-learning hacktoberfest hpc hpc-applications machine-learning machinelearning machinelearning-python orchestration parallelization pipelines python quantum quantum-computing quantum-machine-learning workflow workflow-automation workflow-management
Last synced: 01 Nov 2024
https://github.com/abhayspawar/featexp
Feature exploration for supervised learning
data-exploration data-science feature-engineering machine-learning visualization
Last synced: 27 Nov 2024
https://github.com/miguelgfierro/ai_projects
AI projects
analytics artificial-intelligence big-data code-examples data-science deep-learning examples machine-learning neural-networks programming-exercise
Last synced: 23 Dec 2024
https://github.com/JosephLai241/URS
Universal Reddit Scraper - A comprehensive Reddit scraping/archival command-line tool.
archiving command-line comments csv data-analysis data-science json livestream osint-tool praw pyo3 python reddit reddit-scraper redditor rust scraper subreddit trees wordcloud
Last synced: 28 Oct 2024
https://github.com/moderndive/ModernDive_book
Statistical Inference via Data Science: A ModernDive into R and the Tidyverse
bootstrap-method confidence-intervals data-science data-visualization data-wrangling dplyr ggplot2 hypothesis-testing infer moderndive permutation-test r regression regression-models rstats rstudio statistical-inference tidy tidyverse
Last synced: 27 Oct 2024
https://github.com/vivekpa/introneuralnetworks
Introducing neural networks to predict stock prices
algorithmic-trading data-science finance guide keras-tensorflow lstm-neural-networks machine-learning mlp-networks neural-network prediction prediction-mod python quantitative-finance regression-models stock-price-prediction stock-prices trading trading-strategies tutorial yahoo-finance
Last synced: 21 Dec 2024
https://github.com/glue-viz/glue
Linked Data Visualizations Across Multiple Files
data-science linked-data python visualization
Last synced: 21 Dec 2024
https://github.com/nipy/nipype
Workflows and interfaces for neuroimaging packages
big-data brain-imaging brainweb data-science dataflow dataflow-programming neuroimaging python workflow-engine
Last synced: 25 Nov 2024
https://github.com/williamfalcon/test-tube
Python library to easily log experiments and parallelize hyperparameter search for neural networks
caffe caffe2 chainer data-science deep-learning grid-search hyperparameter-optimization keras machine-learning neural-networks pytorch random-search tensorflow
Last synced: 25 Dec 2024