Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

https://github.com/supabase-community/supabase-py

Python Client for Supabase. Query Postgres from Flask, Django, FastAPI. Python user authentication, security policies, edge functions, file storage, and realtime data streaming. Good first issue.

auth authentication authorization community data-science databases django fastapi flask good-first-issue machine-learning postgres postgresql python supabase

Last synced: 24 Jun 2024

https://github.com/caserec/Datasets-for-Recommender-Systems

A repository of high-quality, topic-centric public data sources for Recommender Systems (RS)

data-science database datasets public-data recommender-systems

Last synced: 23 Jun 2024

https://github.com/PPshrimpGo/BDCI2018-ChinauUicom-1st-solution

First-place solution to the China Unicom challenge at BDCI 2018

competition data-science

Last synced: 23 Jun 2024

https://github.com/aikho/awesome-feature-engineering

A curated list of resources dedicated to Feature Engineering Techniques for Machine Learning

ai data-science feature-engineering feature-extraction machine-learning

Last synced: 22 Jun 2024

https://github.com/Shujian2015/FreeML

A List of Data Science/Machine Learning Resources (Mostly Free)

data-science deep-learning machine-learning natural-language-processing

Last synced: 22 Jun 2024

https://github.com/mukeshmithrakumar/Book_List

Python, Machine Learning, Deep Learning and Data Science Books

algorithms books data-science deep-learning free machine-learning python

Last synced: 22 Jun 2024

https://github.com/nickslevine/zebras

Data analysis library for JavaScript built with Ramda

data-analysis data-science functional-programming javascript pandas ramda

Last synced: 22 Jun 2024

https://github.com/business-science/free_r_tips

Free R-Tips is a FREE Newsletter provided by Business Science. It comes with bite-sized code tutorials every week.

data-science newsletter tips tips-and-tricks

Last synced: 22 Jun 2024

https://github.com/gbganalyst/API-in-R-and-Python

This API tutorial will teach you how to fetch data from an external source using HTTP requests and parse the data into a usable format

api data-science python quarto rprogramming

Last synced: 22 Jun 2024

https://github.com/Azure/azure-data-labs

Terraform templates to deploy Azure Data resources

analytics azure blueprints data data-science github github-actions labs terraform

Last synced: 21 Jun 2024

https://github.com/Inist-CNRS/lodex

Linked Open Data EXperiment

data-science data-structures datavisualization mongo nodejs

Last synced: 21 Jun 2024

https://unit8co.github.io/darts/

A Python library for user-friendly forecasting and anomaly detection on time series.

anomaly-detection data-science deep-learning forecasting machine-learning python time-series

Last synced: 21 Jun 2024

https://github.com/devsgnr/breadroll

breadroll 🥟 is a simple, lightweight library for data processing operations written in TypeScript and powered by Bun.

bun csv csv-parser data-engineering data-science data-transformation eda exploratory-data-analysis tsv tsv-parser

Last synced: 21 Jun 2024

https://rasbt.github.io/mlxtend/

A library of extension and helper modules for Python's data analysis and machine learning libraries.

association-rules data-mining data-science machine-learning python supervised-learning unsupervised-learning

Last synced: 21 Jun 2024

https://github.com/tellery/tellery

Tellery lets you build metrics using SQL and bring them to your team. As easy as using a document. As powerful as a data modeling tool.

analytics bigquery business-intelligence collaboration dashboard data-analytics data-modeling data-science data-visualization database dbt notebook self-hosted sql

Last synced: 21 Jun 2024

https://github.com/iterative/awesome-iterative-projects

A list of projects relying on Iterative.AI tools to achieve awesomeness

awesome awesome-dvc awesome-list awesome-lists data-science deep-learning dvc example hacktoberfest machine-learning

Last synced: 20 Jun 2024

https://github.com/CognonicLabs/awesome-AI-kubernetes

❄️ 🐳 Awesome tools and libs for AI, Deep Learning, Machine Learning, Computer Vision, Data Science, Data Analytics and Cognitive Computing that are baked in the oven to be Native on Kubernetes and Docker with Python, R, Scala, Java, C#, Go, Julia, C++, etc.

ai analytics big-data cognitive-science data-science docker kubeflow kubernetes kubernetes-ai kubernetes-analytics kubernetes-data-science kubernetes-ml ml pachyderm python-ml scala seldon-core spark spark-kubernetes spark-ml

Last synced: 20 Jun 2024

https://github.com/GoogleCloudPlatform/DataflowJavaSDK

Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.

big-data data-analysis data-mining data-processing data-science google-cloud-dataflow

Last synced: 20 Jun 2024

https://github.com/thoughtworks/mlops-platforms

Compare MLOps Platforms. Breakdowns of SageMaker, VertexAI, AzureML, Dataiku, Databricks, h2o, kubeflow, mlflow...

azureml data-science databricks dataiku datarobot google-ai-platform h2oai iguazio knime kubeflow machine-learning mlflow mlops pachyderm sagemaker seldon

Last synced: 20 Jun 2024

https://github.com/shlizee/NeuroAI

NeuroAI-UW seminar, a regular weekly seminar for the UW community, organized by NeuroAI Shlizerman Lab.

ai cvpr data-science deep-learning eccv icml neural-networks neurips neuroscience-methods recurrent-neural-networks sfn

Last synced: 20 Jun 2024

https://github.com/braph-software/BRAPH-2

BRAPH 2.0 is a comprehensive software package for the analysis and visualization of brain connectivity data, offering flexible customization, rich visualization capabilities, and a platform for collaboration in neuroscience research.

biomedical-engineering brain-connectivity-analysis brain-research computational-neuroscience connectomics data-analysis data-science data-visualization deep-learning graph-theory machine-learning matlab network-analysis neuroimaging neuroscience open-source reproducible-research research-tools scientific-software toolbox

Last synced: 20 Jun 2024

https://github.com/jrfiedler/causal_inference_python_code

Python code for part 2 of the book Causal Inference: What If, by Miguel Hernán and James Robins

causal-inference causality data-science python

Last synced: 20 Jun 2024

https://github.com/M4t1ss/parallel-corpora-tools

Tools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks.

cleaning corpora corpus-tools data-processing data-science filtering language language-processing machine machine-translation natural-language natural-language-processing neural neural-machine-translation nlp nmt translation

Last synced: 20 Jun 2024

https://github.com/red-data-tools/unicode_plot.rb

Plot your data by Unicode characters

data-science data-visualization ruby

Last synced: 19 Jun 2024

https://github.com/squaredtechnologies/thread

An AI-powered Python notebook built in React — generate and edit code cells, automatically fix errors, and chat with your code

ai analysis analytics data-science jupyter jupyter-notebook jupyter-notebooks jupyterhub jupyterlab ollama python react reactjs

Last synced: 19 Jun 2024

https://github.com/empathy87/The-Elements-of-Statistical-Learning-Python-Notebooks

A series of Python Jupyter notebooks that help you better understand "The Elements of Statistical Learning" book

data-analysis data-science machine-learning python sklearn statistical-learning tensorflow tutorials

Last synced: 18 Jun 2024

https://github.com/hades217/awesome-ai

A curated list of artificial intelligence resources (Courses, Tools, App, Open Source Project)

artificial-intelligence chatbot data-science deep-learning machine-learning neural-network reinforcement-learning voice-assistant

Last synced: 18 Jun 2024

https://github.com/AidanCooper/shap-analysis-guide

How to Interpret SHAP Analyses: A Non-Technical Guide

data-science machine-learning shap tutorial

Last synced: 18 Jun 2024

https://github.com/souzatharsis/open-quant-live-book

An open source, hands-on and fully reproducible book in quantitative finance, data science and econophysics. Join us and help Make Wall Street Great Again!

algo-trading altdata data-science econophysics financial-analysis financial-markets machine-learning open-source quantitative-finance

Last synced: 17 Jun 2024

https://github.com/justmarkham/trump-lies

Tutorial: Web scraping in Python with Beautiful Soup

beautiful-soup data-science dataset pandas python requests tutorial web-scraping

Last synced: 17 Jun 2024

https://github.com/frictionlessdata/specs

Technical specifications and guidelines for implementing Frictionless Data.

csv data-science json metadata schema validation

Last synced: 17 Jun 2024

https://github.com/compdemocracy/polis

🌌 Open Source AI for large-scale, open-ended feedback

civic-tech data-science deliberative-democracy participatory-democracy

Last synced: 17 Jun 2024

https://github.com/girder/girder

A data management platform for the web, developed by Kitware

data-analytics data-management data-science javascript kitware python resonant

Last synced: 17 Jun 2024

https://github.com/rjbergerud/open-source-for-common-good

A list I'm keeping of active open source projects that serve a social or environmental goal.

citizen-science civic-tech community data-science humanity non-profit social social-impact sustainability

Last synced: 17 Jun 2024

https://github.com/rcdilorenzo/ecce

ML Prediction of Bible Topics and Passages (Python / React)

data-science fastapi fully-connected-network interactive-visualizations keras-tensorflow reactjs

Last synced: 17 Jun 2024

https://github.com/nteract/bookstore

📚 Notebook storage and publishing workflows for the masses

data-science notebook nteract scheduling storage versioned-buckets

Last synced: 17 Jun 2024

https://github.com/nuclio/nuclio-jupyter

Nuclio Function Automation for Python and Jupyter

data-science jupyter kubernetes nuclio python

Last synced: 17 Jun 2024

https://github.com/firmai/pandapy

PandaPy has the speed of NumPy and the usability of Pandas, 10x to 50x faster (by @firmai)

algorithmic-trading arrays data-science data-structures finance machine-learning numpy pandas structured-data

Last synced: 17 Jun 2024

https://github.com/ropensci/drake

An R-focused pipeline toolkit for reproducibility and high-performance computing

data-science drake high-performance-computing makefile peer-reviewed pipeline r r-package reproducibility reproducible-research ropensci rstats workflow

Last synced: 17 Jun 2024

https://github.com/SOCR/SOCRAT

A Dynamic Web Toolbox for Interactive Data Processing, Analysis, and Visualization

data-analysis data-science data-visualization socr statistics visual-analytics visualization

Last synced: 17 Jun 2024

https://github.com/datacleaner/DataCleaner

The premier open source Data Quality solution

data data-analysis data-science database datacleaner dataquality desktop etl mdm profiling

Last synced: 17 Jun 2024

https://github.com/graphia-app/graphia

A visualisation tool for the creation and analysis of graphs

analysis data data-analysis data-science data-visualization graphs interpretation networks visualisation visualization

Last synced: 17 Jun 2024

https://github.com/quadratichq/quadratic

Quadratic | Data Science Spreadsheet with Python & SQL

data data-analysis data-engineering data-science etl python quadratic spreadsheet sql wasm webgl

Last synced: 17 Jun 2024

https://github.com/dataplane-app/dataplane

Dataplane is an Airflow-inspired unified data platform with additional data mesh and RPA capabilities to automate, schedule and design data pipelines and workflows. Dataplane is written in Golang with a React front end.

airflow data data-analysis data-engineering data-integration data-pipelines data-science dataplane datawarehouse etl finance golang kubernetes pipelines robotics-process-automation rpa scheduler workflow workflow-automation workflows

Last synced: 17 Jun 2024

https://github.com/ClimbsRocks/machineJS

[UNMAINTAINED] Automated machine learning: just give it a data file! Check out the production-ready version of this project at ClimbsRocks/auto_ml

auto-ml automated-machine-learning automl data-science data-scientists javascript javascript-library kaggle machine-learning machine-learning-algorithms machine-learning-library ml numerai scikit-learn

Last synced: 17 Jun 2024

https://github.com/RemiRigal/DatasetExplorer

A web tool for local dataset browsing and processing, developed using the Flask + Angular stack.

ai angular data-processing data-science data-visualization dataset dataset-analysis docker docker-compose flask web-application

Last synced: 16 Jun 2024

https://github.com/khuyentran1401/Efficient_Python_tricks_and_tools_for_data_scientists

Efficient Python Tricks and Tools for Data Scientists

data-science python python3

Last synced: 16 Jun 2024

https://github.com/innat/ML-Resource

A concise resource repository for machine learning

data-analysis data-science deep-learning kaggle machine-learning python spark

Last synced: 16 Jun 2024

https://github.com/rio-labs/rio

WebApps in pure Python. No JavaScript, HTML and CSS needed

data-analysis data-science data-visualization deep-learning machine-learning python ui webapp

Last synced: 16 Jun 2024

https://github.com/ideos/gloe

Gloe (pronounced /ɡloʊ/, like “glow”) is a general-purpose library made to help developers create, maintain, document, and test both operational and flow-oriented code.

clean-code data-science flow functional-programming machine-learning python typing

Last synced: 15 Jun 2024