Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Data Science

Data science is an inter-disciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge from structured and unstructured data. Data scientists perform data analysis and preparation, and their findings inform high-level decisions in many organizations.

https://github.com/outerbounds/metaflowbot

Slack bot for monitoring your Metaflow flows!

data-science metaflow ml mlops slack slack-bot

Last synced: 10 Nov 2024

https://github.com/mainakrepositor/data-analysis

Different types of data analytics projects : EDA, PDA, DDA, TSA and much more.....

data-analysis data-science deeplearning machine-learning-algorithms neural-networks time-series-analysis tsa

Last synced: 12 Nov 2024

https://github.com/datapane/examples

Datapane Examples

data-science datapane jupyter python

Last synced: 09 Aug 2024

https://github.com/raybellwaves/cfanalytics

Downloading, analyzing and visualizing CrossFit data

crossfit crossfit-games data-frames data-science python

Last synced: 08 Nov 2024

https://github.com/klaus78/data-science-flashcards

A large collection of challenges on Data Science and Machine Learning.

data-science hacktoberfest jekyll-website machine-learning python

Last synced: 11 Oct 2024

https://github.com/0x0be/scrapeadvisor

A user-friendly python-based GUI which provides sentiment analysis of users' reviews toward a specific TripAdvisor facility

data-mining data-science python3 r scraping sentiment-analysis sentiment-classification text-mining tripadvisor tripadvisor-scraper web-scraping

Last synced: 04 Nov 2024

https://github.com/thomasnield/bayes_user_input_prediction

Demonstration of using Naive Bayes to predict user inputs with Kotlin 1.2 std-lib

bayes bayes-classifier data-science kotlin

Last synced: 30 Oct 2024

https://github.com/arthurpaulino/miraiml

MiraiML: asynchronous, autonomous and continuous Machine Learning in Python

data-science hyperparameter-optimization machine-learning python

Last synced: 08 Nov 2024

https://github.com/SOM-Research/DescribeML

DescribeML is a Visual Studio Code language plug-in to describe machine-learning datasets in a structured format. Build better data describing the composition, provenance and social concerns of your dataset.

data-science dataset-generation datasets describeml langium machine-learning model-driven modeling open-data open-datasets visual-studio-code vscode

Last synced: 10 Oct 2024

https://github.com/Azure/aml-run

GitHub Action that allows you to submit a run to your Azure Machine Learning Workspace.

aml azure azure-machine-learning data-science machine-learning mlops

Last synced: 13 Aug 2024

https://github.com/amadeusitgroup/cpmml

cPMML is C++ library for scoring machine learning models serialized with the Predictive Model Markup Language (PMML)

ai data-science machine-learning ml model-deployment model-scoring pmml

Last synced: 10 Nov 2024

https://github.com/theengineeringworld/python-data-science

Python Data Science has all the data sets and jupyter notebook files for the Youtube course at http://youtube.com/theengineeringworld under the name of " Python Data Science Course ".

data data-analysis data-mining data-science data-visualization jupyter-notebook jupyter-notebooks machine-learning python python27

Last synced: 12 Oct 2024

https://github.com/computationalcore/introduction-to-python

A very useful collection of Jupyter Notebooks, which aims to introduce the Python programming language.

data-analysis data-science fundamental google-colab jupyter-notebook jupyter-notebooks numpy pandas python python-language python-programming python3

Last synced: 10 Nov 2024

https://github.com/azure/aml-run

GitHub Action that allows you to submit a run to your Azure Machine Learning Workspace.

aml azure azure-machine-learning data-science machine-learning mlops

Last synced: 07 Oct 2024

https://github.com/denadai2/google_street_view_deep_neural

Deep Neural Network model to predict security perception from Google Street View images. Model based on AlexNet CNNs

computational-social-science computer-vision data-science deep-learning urban-planning urban-science

Last synced: 27 Oct 2024

https://github.com/mkcor/advanced-pandas

Pandas is a powerful tool for data exploration and analysis (including timeseries).

data-analysis data-science labeled-data notebooks python3 teaching-materials

Last synced: 16 Oct 2024

https://github.com/dayyass/graph-based-clustering

Graph-Based Clustering using connected components and spanning trees.

clustering data-science graph graph-algorithms hacktoberfest machine-learning python sklearn

Last synced: 07 Nov 2024

https://github.com/rasbt/hbind

Calculates hydrogen-bond interaction tables for protein-small molecule complexes, based on protein PDB and protonated ligand MOL2 structure input. Raschka et al. (2018) J. Computer-Aided Molec. Design

bioinformatics computational-biology data-science hydrogen-bonds protein-ligand-interfaces

Last synced: 22 Oct 2024

https://github.com/nneji123/credit-card-fraud-detection

Credit Card Fraud Detection App built with Streamlit, FastAPI and Docker.

credit-card data-science deployment docker docker-compose fastapi fraud-detection machine-learning streamlit

Last synced: 13 Nov 2024

https://github.com/mpds-io/mpds-api

Tutorials, notebooks, issue tracker, and website on the MPDS API: the data retrieval interface for the Materials Platform for Data Science

calphad crystal-structure crystallography data-science materials materials-informatics materials-platform materials-science mpds-api mpds-platform phase-diagram phase-diagrams

Last synced: 06 Nov 2024

https://github.com/soodoku/data-science

Lecture Slides for Introduction to Data Science

data-science statistical-learning

Last synced: 25 Oct 2024

https://github.com/rjbergerud/open-source-for-common-good

A list I'm keeping of active open source projects that serve a social or environmental goal.

citizen-science civic-tech community data-science humanity non-profit social social-impact sustainability

Last synced: 13 Nov 2024

https://github.com/staircase-dev/piso

Pandas Interval Set Operations: providing methods for set operations, analytics, lookups and joins on pandas' Interval, IntervalArray and IntervalIndex

data-analysis data-science data-structures interval interval-arithmetic interval-set pandas set set-operations set-theory

Last synced: 16 Nov 2024

https://github.com/hugoblox/theme-markdown-slides

🎙 在 Markdown 中创建漂亮的演示文稿。Write, share, and present your slides using the open, future-proof Markdown standard

blogdown data-science hugo hugo-learn-theme hugo-theme jupyter latex-math lms markdown markdown-slides mermaid obsidian obsidian-publish r reveal-js rstudio slides slideshow-maker static-site-generator theme

Last synced: 09 Nov 2024

https://github.com/github/mlops

Use GitHub to facilitate automation, collaboration and reproducibility in your machine learning workflows.

actions cicd data-science devops-tools machine-learning mlops pages primer primer-design

Last synced: 29 Sep 2024

https://github.com/goplus/pandas

Flexible and powerful data analysis / manipulation library for Go+, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

data-analysis data-science data-tech go golang gop goplus pandas scientific-computing

Last synced: 12 Nov 2024

https://github.com/microsoft/autobrewml

With AutoBrewML Framework the time it takes to get production-ready ML models with great ease and efficiency highly accelerates.

anomaly-detection azure-automl cleansing-data data-science datavisualization machine-learning microsoft nlp-machine-learning responsible-ml sampling-strategies text-analysis text-classification text-summarization

Last synced: 08 Nov 2024

https://github.com/dhaitz/data-science-links

A curated list of links to great data science articles, videos, ...

agile ai artificial-intelligence career-advice data-science data-scientists machine-learning

Last synced: 11 Nov 2024

https://github.com/lourd/react-google-sheet

Pulling data from Google Sheets with React components

api-client data-science google-sheets javascript react spreadsheets

Last synced: 14 Oct 2024

https://github.com/incubated-geek-cc/Text-To-Speech-App

A Fusion of OCR Technology (Tesseract.js) & Web Speech API. Standalone, portable and works offline.

data-science javascript machine-learning ocr ocr-recognition tesseract tesseract-ocr tesseract-ocr-api tesseractjs webapp

Last synced: 08 Nov 2024

https://github.com/florents-tselai/pandas-sets

Set-oriented Operations in Pandas

data-science pandas set-operations sets

Last synced: 31 Oct 2024

https://github.com/bgroenks96/normalizing-flows

Implementations of normalizing flows using python and tensorflow

data-science machine-learning machine-learning-algorithms normalizing-flows

Last synced: 28 Oct 2024

https://github.com/anitagraser/eda-protocol-movement-data

Step-by-step exploratory movement data analysis protocol in a Jupyter notebook

data-quality-assessment data-science exploratory-data-analysis movement-data

Last synced: 10 Nov 2024

https://github.com/tomasonjo/bitcoin-to-neo4jdash

Project that listens to bitcoin websocket API for new transactions and stores them to Neo4j to be analyzed

bitcoin dashboard data data-science graph graphdatabase neo4j python websocket

Last synced: 22 Oct 2024

https://github.com/incubated-geek-cc/text-to-speech-app

A Fusion of OCR Technology (Tesseract.js) & Web Speech API. Standalone, portable and works offline.

data-science javascript machine-learning ocr ocr-recognition tesseract tesseract-ocr tesseract-ocr-api tesseractjs webapp

Last synced: 15 Nov 2024

https://github.com/luiscib3r/solar-rad-forecasting

In these notebooks the entire research and implementation process carried out for the construction of various machine learning models based on neural networks that are capable of predicting levels of solar radiation is captured given a set of historical data taken by meteorological stations.

convolutional-neural-networks data-science deep-learning forecasting machine-learning rnn rnn-tensorflow

Last synced: 05 Nov 2024

https://github.com/hemansnation/python-for-data-professionals

This course is designed to get a good grip on python programming, logic building, solving algorithm-based questions, data structures, understanding of data analytics, working with pandas, professional practices, and API building.

data-analytics data-professionals data-science exploratory-data-analysis logic-programming machine-learning pandas python

Last synced: 08 Nov 2024

https://github.com/medoidai/skrobot

skrobot is a Python module for designing, running and tracking Machine Learning experiments / tasks. It is built on top of scikit-learn framework.

artificial-intelligence data-science feature-engineering feature-selection hyperparameter-tuning machine-learning model-evaluation model-selection model-training model-tuning modelling predictive-modelling python scikit-learn

Last synced: 27 Oct 2024

https://github.com/aatmunbaxi/orgroamtools

Helper library for data analysis of org-roam collections

data-science emacs exploratory-data-analysis library org-roam personal-knowledge-management python

Last synced: 12 Oct 2024

https://github.com/humburg/reportmd

Create multi-page HTML reports in R

data-science r rmarkdown rstudio

Last synced: 27 Oct 2024

https://github.com/isala404/speculo

Realtime face detection and recognition using deep learning

data-science face-recognition faces footages opencv python3 reactjs speculo surveillance tensorflow typescript

Last synced: 15 Oct 2024

https://github.com/facultyai/boltzmannclean

Fill missing values in Pandas DataFrames using Restricted Boltzmann Machines

data-cleaning data-science dataframe pandas restricted-boltzmann-machine

Last synced: 08 Nov 2024

https://github.com/jameslamb/talks

Conference talks, meetup talks, and misc. writing

conference-talk data-science machine-learning open-source presentations python r

Last synced: 28 Oct 2024

https://github.com/RConsortium/r-collaboration

Open Collaboration, Data Registry, and Use Cases Developed by the R Community

data-analysis-in-r data-analytics data-science r

Last synced: 08 Aug 2024

https://github.com/mainakrepositor/covid19-india-bcr

A bar chart race demonstrating the start and trends of COVID-19 in India

barchartrace covid-19 data-science data-visualization dataanalysisandmlusingpython visualization

Last synced: 12 Nov 2024

https://github.com/nuhmanpk/webtrench

A powerful and easy-to-use web scrapper for collecting data from the web. Supports scraping of images, text, videos, meta data, and more. Ideal for machine learning and deep learning engineers. Download and extract data with just one line of code

audio-datasets data data-collection data-science dataset-generation deep-learning image-data-generator machine-learning python scarper text-datasets

Last synced: 28 Oct 2024

https://github.com/paulosalem/gpt3-poc-tutorial-with-braindump

A demo application to support my tutorial on building applications with GPT-3.

data-science gpt gpt-3 natural-language-understanding openai proof-of-concept

Last synced: 12 Nov 2024

https://github.com/solegalli/feature-selection-in-machine-learning-book

Code repository for the book feature selection in machine learning

data-science feature-selection machine-learning python

Last synced: 02 Nov 2024

https://github.com/chalmerlowe/machine_learning

A gentle introduction to machine learning: data handling, linear regression, naive bayes, clustering

data data-science linear-regression machine-learning nearest-neighbors python scikit-learn

Last synced: 12 Oct 2024

https://github.com/mainakrepositor/brs

Recommend books using Machine Learning Techniques

data-science python-3

Last synced: 12 Nov 2024

https://github.com/azure/azure-data-labs

Terraform templates to deploy Azure Data resources

analytics azure blueprints data data-science github github-actions labs terraform

Last synced: 07 Oct 2024

https://github.com/datalab-platform/datalab

Open-source Platform for Scientific and Technical Data Processing and Visualization

data-science data-visualization image-processing opencv python scientific-computing scikit-image scipy signal-processing visualization

Last synced: 11 Oct 2024

https://github.com/sanjinkurelic/casebasedreasoning

Find missing values in data set using Euclid distance, normalization and calculating information value, weight of evidence

case-based-reasoning csv data-science influence information-value machine-learning numpy pandas python3 weight-of-evidence

Last synced: 06 Nov 2024

https://github.com/brunorosilva/todoist-analytics

Just a simple app for weekly and monthly reviewing of tasks in todoist.

analytics dashboard data-science streamlit todoist

Last synced: 13 Aug 2024

https://github.com/koonimaru/omniplot

Statistical analysis, clustering and visualinzing scientific data with hassle free

data-science matplotlib numpy pandas python

Last synced: 08 Nov 2024

https://github.com/Azure/azure-data-labs

Terraform templates to deploy Azure Data resources

analytics azure blueprints data data-science github github-actions labs terraform

Last synced: 13 Nov 2024

https://github.com/climopy-dev/climopy

🌍🌏🌎 A succinct toolset for analyzing climate data. This project is a work-in-progress.

climate-analysis climate-science data-science python xarray xarray-accessor

Last synced: 08 Aug 2024