Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/mGalarnyk/datasciencecoursera
Data Science Repo and blog for John Hopkins Coursera Courses. Please let me know if you have any questions.
data-science jhu-coursera john-hopkins-coursera python r stanford
Last synced: 08 Apr 2024
![](https://github.com/mGalarnyk.png)
https://github.com/The-AI-Summer/learn-deep-learning
AI Summer's complete catalog of articles
computer-vision data-science deep-learning deep-neural-networks machine-learning natural-language-processing
Last synced: 08 Apr 2024
![](https://github.com/The-AI-Summer.png)
https://github.com/anthony-wang/BestPractices
Things that you should (and should not) do in your Materials Informatics research.
best-practices common-pitfalls data-science example-code interactive-notebooks jupyter jupyter-notebooks machine-learning materials-informatics materials-science neural-networks python
Last synced: 08 Apr 2024
![](https://github.com/anthony-wang.png)
https://github.com/imgcook/datacook
Machine Learning and Data Analysis in JavaScript.
data-science feature-engineering javascript machine-learning
Last synced: 08 Apr 2024
![](https://github.com/imgcook.png)
https://great-northern-diver.github.io/loon/
A Toolkit for Interactive Statistical Data Visualization
data-analysis data-science data-visualization exploratory-analysis exploratory-data-analysis high-dimensional-data interactive-graphics interactive-visualizations loon python statistical-analysis statistical-graphics statistics tcl-extension tk
Last synced: 08 Apr 2024
![](https://github.com/great-northern-diver.png)
https://drivendata.github.io/cookiecutter-data-science/
A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
ai cookiecutter cookiecutter-data-science cookiecutter-template data-science machine-learning
Last synced: 08 Apr 2024
![](https://github.com/drivendata.png)
https://github.com/DARIAH-DE/Topics
A Python library for topic modeling and visualization
data-science digital-humanities lda machine-learning natural-language-processing python3 text-mining topic-modeling
Last synced: 08 Apr 2024
![](https://github.com/DARIAH-DE.png)
https://github.com/senderle/topic-modeling-tool
A point-and-click tool for creating and analyzing topic models produced by MALLET.
data-science digital-humanities mallet text-analytics topic-modeling
Last synced: 08 Apr 2024
![](https://github.com/senderle.png)
https://github.com/lettier/lda-topic-modeling
A PureScript, browser-based implementation of LDA topic modeling.
bayesian bulma bulma-css clustering data-science functional-programming gibbs-sampling latent-dirichlet-allocation lda machine-learning machine-learning-algorithms natural-language-processing nlp nlp-machine-learning purescript reactive reactive-programming text-mining thermite topic-modeling
Last synced: 08 Apr 2024
![](https://github.com/lettier.png)
https://github.com/ipython-books/cookbook-2nd-code
Code of the IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018 [read-only repository]
computing data-analysis data-mining data-science data-visualization ipython jupyter jupyter-notebook machine-learning numerical-computation python visualization
Last synced: 08 Apr 2024
![](https://github.com/ipython-books.png)
https://github.com/HoloClean/holoclean
A Machine Learning System for Data Enrichment.
data-enrichment data-science inference-engine machine-learning pytorch
Last synced: 08 Apr 2024
![](https://github.com/HoloClean.png)
https://github.com/jobream/List-of-Learning-Resources
This collection provides a list of educational resources for Software Engineers. Feel free to add your favorite resources as well and help others in their journey of learning.
competitive-programming computer-science data-science resources software-engineering web-development
Last synced: 08 Apr 2024
![](https://github.com/jobream.png)
https://github.com/alenrajsp/tcxreader
tcxreader is a reader / parser for Garmin’s TCX file format. It also works well with missing data!
data-mining data-science python sports-analytics tcx tcx-parser
Last synced: 08 Apr 2024
![](https://github.com/alenrajsp.png)
https://github.com/firefly-cpp/tcx-test-files
A collection of the sports activity (tcx) test files
data-mining data-science garmin-connect tcx-files tcx-parser
Last synced: 08 Apr 2024
![](https://github.com/firefly-cpp.png)
https://codeformunich.github.io/radlquartier/
Command-line tool to prepare and extract bike sharing data. Plus example implementations of visualizations and a example website.
data-science data-visualization munich open-data visualization
Last synced: 08 Apr 2024
![](https://github.com/codeformunich.png)
https://github.com/0x0be/scrapeadvisor
A user-friendly python-based GUI which provides sentiment analysis of users' reviews toward a specific TripAdvisor facility
data-mining data-science python3 r scraping sentiment-analysis sentiment-classification text-mining tripadvisor tripadvisor-scraper web-scraping
Last synced: 07 Apr 2024
![](https://github.com/0x0be.png)
https://github.com/benedekrozemberczki/datasets
A repository of pretty cool datasets that I collected for network science and machine learning research.
benchmark community-detection data-science dataset deepwalk dimensionality-reduction gcn gnn graph-convolution graph-embedding graph-neural-network graph2vec link-prediction machine-learning network-analysis network-embedding network-science node-classification node-embedding node2vec
Last synced: 07 Apr 2024
![](https://github.com/benedekrozemberczki.png)
https://github.com/davendw49/k2
Code and datasets for paper "K2: A Foundation Language Model for Geoscience Knowledge Understanding and Utilization" in WSDM-2024
ai4science data-science geoai geoscience kg large-language-models llm
Last synced: 07 Apr 2024
![](https://github.com/davendw49.png)
https://github.com/rasbt/python-machine-learning-book
The "Python Machine Learning (1st edition)" book code repository and info resource
data-mining data-science logistic-regression machine-learning machine-learning-algorithms neural-network python scikit-learn
Last synced: 07 Apr 2024
![](https://github.com/rasbt.png)
https://github.com/Smat26/Roman-Urdu-Dataset
Compilation of Manually Tagged Roman Urdu Dataset (Urdu written in Latin/Roman Script), along with other helpful Roman Urdu NLP resources
data-science dataset hindi hindi-language natural-language-processing nlp urdu urdu-language urdu-nlp
Last synced: 06 Apr 2024
![](https://github.com/Smat26.png)
https://github.com/zhoudaxia233/pyalpha
A process mining tool written in Python3
alpha-miner data-science petri-net process-mining
Last synced: 06 Apr 2024
![](https://github.com/zhoudaxia233.png)
https://github.com/Mybridge/python-articles
Monthly Series - Top 10 Python Articles
data-science data-visualization django flask python python3
Last synced: 05 Apr 2024
![](https://github.com/Mybridge.png)
https://github.com/Mybridge/machine-learning-open-source
Monthly Series - Machine Learning Top 10 Open Source Projects
ai algorithm artificial-intelligence data-science machine-learning neural-network
Last synced: 05 Apr 2024
![](https://github.com/Mybridge.png)
https://github.com/opengeos/streamlit-geospatial
A multi-page streamlit app for geospatial
data-science datascience dataviz geopython geospatial housing-data housing-market huggingface mapping open-source python real-estate streamlit streamlit-webapp
Last synced: 05 Apr 2024
![](https://github.com/opengeos.png)
https://github.com/pyxelr/recommendations-for-engineers
All of my recommendations for aspiring engineers in a single place, coming from various areas of interest.
awesome awesome-list cybersecurity data-science lists machine-learning macos mlops pyxelr-setup resources windows
Last synced: 05 Apr 2024
![](https://github.com/pyxelr.png)
https://github.com/rballester/tntorch
Tensor Network Learning with PyTorch
cp-decomposition data-science learning pytorch tensor-decomposition tensor-networks tensor-train tensors tucker-decomposition
Last synced: 05 Apr 2024
![](https://github.com/rballester.png)
https://github.com/vmware/versatile-data-kit
One framework to develop, deploy and operate data workflows with Python and SQL.
analytics data data-engineer data-engineering data-engineering-pipeline data-lineage data-pipelines data-science data-structures data-warehouse database dataops elt etl pipeline python snowflake sql trino warehouse
Last synced: 05 Apr 2024
![](https://github.com/vmware.png)
https://github.com/youssefHosni/Practical-Machine-Learning
Practical machine learning notebook & articles covers the machine learning end to end life cycle.
Last synced: 05 Apr 2024
![](https://github.com/youssefHosni.png)
https://github.com/Ph055a/OSINT_Collection
Maintained collection of OSINT related resources. (All Free & Actionable)
court-search data-science dataset infosec investigation journalism osint research search
Last synced: 05 Apr 2024
![](https://github.com/Ph055a.png)
https://github.com/BahramJannesar/IranAgricultureDataAnalysis
Data Analysis on Iran FAO datasets
agriculture agriculture-organization data-analysis data-science data-visualization datasets fao food food-security iran iranian population
Last synced: 05 Apr 2024
![](https://github.com/BahramJannesar.png)
https://github.com/jeroenjanssens/python-polars-the-definitive-guide
Scripts and datasets for the O'Reilly book Python Polars: The Definitive Guide
data-science oreilly oreilly-books polars python
Last synced: 04 Apr 2024
![](https://github.com/jeroenjanssens.png)
https://github.com/Lackoftactics/facebook_data_analyzer
Analyze facebook copy of your data with ruby language. Download zip file from facebook and get info about friends ranking by message, vocabulary, contacts, friends added statistics and more
conversation data-science data-visualization english-language facebook facebook-data facebook-data-analyzer ruby ruby-gem scraping script statistics
Last synced: 04 Apr 2024
![](https://github.com/Lackoftactics.png)
https://github.com/deepfence/FlowMeter
⭐ ⭐ Use ML to classify flows and packets as benign or malicious. ⭐ ⭐
awesome data-science data-science-projects forensics-tools hacktoberfest infosectools machine-learning machine-learning-projects machinelearning machinelearningproject network-analysis network-security packet-analyser pcap security security-tools tcpdump-like
Last synced: 04 Apr 2024
![](https://github.com/deepfence.png)
https://github.com/google/starthinker
Reference framework for building data workflows provided by Google. Accelerates authentication, logging, scheduling, and deployment of solutions using GCP. To borrow a tagline.. "The framework for professionals with deadlines."
airflow app-engine automation bigquery cloud-functions cm360 colab-notebook data-science django dv360 google-ads google-analytics logger python scheduler ui workflows
Last synced: 03 Apr 2024
![](https://github.com/google.png)
https://github.com/chiphuyen/python-is-cool
Cool Python features for machine learning that I used to be too afraid to use. Will be updated as I have more time / learn more.
advanced-python data-science machine-learning python-tutorials python3
Last synced: 03 Apr 2024
![](https://github.com/chiphuyen.png)
https://github.com/Foundations-of-Applied-Mathematics/Labs
Labs for the Foundations of Applied Mathematics curriculum.
algorithms applied-mathematics applied-mathematics-curriculum computational-mathematics curriculum data-science linear-algebra python
Last synced: 03 Apr 2024
![](https://github.com/Foundations-of-Applied-Mathematics.png)
https://github.com/pzivich/zEpid
Epidemiology analysis package
aipw data-science epidemiology epidemiology-analysis g-computation g-estimation g-formula incidence-rate-ratio inverse-probability-weights ipw odds-ratio risk-difference risk-ratio targeted-maximum-likelihood tmle
Last synced: 03 Apr 2024
![](https://github.com/pzivich.png)
https://github.com/n3mo/data-science
Data science tooling for Racket
data-science racket sentiment-analysis statistics text-processing
Last synced: 02 Apr 2024
![](https://github.com/n3mo.png)
https://github.com/dlab-berkeley/Python-Fundamentals-Legacy
D-Lab's 12 hour introduction to Python. Learn how to create variables and functions, use control flow structures, use libraries, import data, and more, using Python and Jupyter Notebooks.
data-science introduction-to-python jupyter python
Last synced: 02 Apr 2024
![](https://github.com/dlab-berkeley.png)
https://github.com/dlab-berkeley/Python-Data-Wrangling-Legacy
D-Lab's 3 hour introduction to data wrangling in Python. Learn how to import and manipulate dataframes using pandas in Python.
Last synced: 02 Apr 2024
![](https://github.com/dlab-berkeley.png)
https://github.com/scilab/scilab
Read only copy of https://gitlab.com/scilab/scilab
data-science data-structures graphical-functions mathematical-functions scientific-computing system-modeling
Last synced: 02 Apr 2024
![](https://github.com/scilab.png)
https://github.com/jananiarunachalam/Data-Science-Portfolio
Data Science Projects Repository
analytics api data-cleaning data-science data-visualization databases deep-learning excel machine-learning numpy pandas plotly predictive-modeling python3 r r-programming sql
Last synced: 01 Apr 2024
![](https://github.com/jananiarunachalam.png)
https://github.com/GDSL-UL/san
Spatial Modelling for Data Scientists
book cross-validation data-science geographically-weighted-regression maps moran-i multilevel-models r r-spatial spatial-analysis spatial-econometrics
Last synced: 01 Apr 2024
![](https://github.com/GDSL-UL.png)
https://github.com/MindSetLib/Insolver
Low code machine learning library, specified for insurance tasks: prepare data, build model, implement into production.
auto-ml automated-machine-learning automl bayesian-optimization data-science elyra elyra-community feature-engineering hyperparameter-optimization insurance insurance-claims insurance-company insurance-scoring insurance-team low-code machine-learning
Last synced: 01 Apr 2024
![](https://github.com/MindSetLib.png)
https://github.com/Azure/DataScienceVM
Tools and Docs on the Azure Data Science Virtual Machine (http://aka.ms/dsvm)
ai azure big-data data-analysis data-science deep-learning dsvm machine-learning ml python r sqlserver
Last synced: 01 Apr 2024
![](https://github.com/Azure.png)
https://github.com/dataprofessor/code
Compilation of R and Python programming codes on the Data Professor YouTube channel.
data-professor data-science data-science-python dataprofessor datascience exploratory-data-analysis machine-learning machinelearning pandas python python-data-science r scikit-learn scikit-learn-python shiny streamlit
Last synced: 01 Apr 2024
![](https://github.com/dataprofessor.png)
https://github.com/Azure/azureml-examples
Official community-driven Azure Machine Learning examples, tested with GitHub Actions.
azure azure-machine-learning azureml data-science ml
Last synced: 01 Apr 2024
![](https://github.com/Azure.png)
https://github.com/DeutscheAktuarvereinigung/Data_Science_Challenge_2020_Betrugserkennung
In this notebook we take a look at a relevant project that is frequently encountered by insurers: Fraud Detection. For this purpose we use a car data set from a public source and will show the necessary steps to establish an automated fraud detection.
actuarial-modeling betrugserkennung challenge data-science datasciencechallenge fraud-detection frauddetection
Last synced: 01 Apr 2024
![](https://github.com/DeutscheAktuarvereinigung.png)
https://github.com/adityakamble49/loss-ratio-prediction
Predicting Loss Ratios for Auto Insurance Portfolios - ITCS 6100 Big Data Analytics for Competitive Advantage
big-data big-data-analytics data-science insurance jupyter-notebook politics python
Last synced: 01 Apr 2024
![](https://github.com/adityakamble49.png)
https://github.com/aliosmankaya/DataScienceProjects
This repository for my Data Science, Machine Learning and Deep Learning projects. I want to share my work on this areas.
breast-cancer-wisconsin chest-xray classification data-science deep-learning imagedatagenerator insurance kaggle keras machine-learning matplotlib-pyplot numpy pandas python regression seaborn tensorflow tensorflow2 titanic-kaggle
Last synced: 01 Apr 2024
![](https://github.com/aliosmankaya.png)
https://github.com/areed1192/sigma_coding_youtube
This is a collection of all the code that can be found on my YouTube channel Sigma Coding.
data-science google-maps-api m-language mlanguage office-applications outlook-vba power-bi power-query powerpoint-vba python python-tutorials python-windows vba vba-excel win32 win32com word-vba yelp-fusion-api
Last synced: 01 Apr 2024
![](https://github.com/areed1192.png)
https://github.com/aws-samples/cloud-experiments
Open innovation with 60 minute cloud experiments on AWS
amazon-athena amazon-comprehend amazon-rekognition amazon-s3 amazon-sagemaker aws-cloud aws-glue data-science machine-learning notebooks
Last synced: 01 Apr 2024
![](https://github.com/aws-samples.png)
https://github.com/rasgointelligence/RasgoQL
Write python locally, execute SQL in your data warehouse
data-analysis data-science pandas python sql
Last synced: 01 Apr 2024
![](https://github.com/rasgointelligence.png)
https://github.com/woz-u/DS-Student-Resources
Data Science Student Companion Notebooks and Data Lake
data-analysis data-science data-visualization machine-learning nosql python r sql statistics
Last synced: 01 Apr 2024
![](https://github.com/woz-u.png)
https://github.com/Yu-Group/covid19-severity-prediction
Extensive and accessible COVID-19 data + forecasting for counties and hospitals. 📈
coronavirus coronavirus-tracking county-health-data county-level covid-19 covid-19-data covid-19-data-analysis data-analysis data-science epidemic-model forecasting outbreak outbreak-severity python3 response4life risk-assessment risk-modelling statistics ventilator visualization
Last synced: 01 Apr 2024
![](https://github.com/Yu-Group.png)
https://github.com/ActuariesInstitute/cookbook
Data and analytics cookbook for actuaries
actuarial analytics data-science hacktoberfest
Last synced: 01 Apr 2024
![](https://github.com/ActuariesInstitute.png)
https://github.com/JuliaDataScience/JuliaDataScience
Book on Julia for Data Science
book data data-manipulation data-science data-visualization julia julia-language
Last synced: 01 Apr 2024
![](https://github.com/JuliaDataScience.png)
https://github.com/plotly/dashR
Create data science and AI web apps in R
dash data-science data-visualization plotly plotly-dash python r react web-application
Last synced: 01 Apr 2024
![](https://github.com/plotly.png)
https://github.com/vchagas69/vchagas69.github.io
A Portuguese Actuary
actuarial-science data-science
Last synced: 01 Apr 2024
![](https://github.com/vchagas69.png)
https://github.com/InseeFrLab/onyxia
🔬 Web app to simplify data science environment setup on Kubernetes
bluehats data-science datalab helm insee kubernetes onyxia
Last synced: 01 Apr 2024
![](https://github.com/InseeFrLab.png)
https://github.com/spsanderson/steveondata
Repository for R and SQL tips and tricks for @steveondata every Friday
ai blog data data-science machinelearning-r ml r sql time-series tipoftheday
Last synced: 01 Apr 2024
![](https://github.com/spsanderson.png)
https://github.com/rivasiker/ggHoriPlot
A user-friendly, highly customizable R package for building horizon plots in ggplot2
data-science data-visualization ggplot2 horizon-plots r r-package
Last synced: 01 Apr 2024
![](https://github.com/rivasiker.png)
https://github.com/glm-tools/pyglmnet
Python implementation of elastic-net regularized generalized linear models
data-science elastic-net glm lasso machine-learning python
Last synced: 01 Apr 2024
![](https://github.com/glm-tools.png)
https://github.com/sn3fru/datascience_course
Curso de Data Science em Português
artificial-intelligence brasil curso dados data data-analysis data-science data-science-learning dataset deep-learning machine-learning python
Last synced: 01 Apr 2024
![](https://github.com/sn3fru.png)
https://github.com/iesahin/xvc
A robust (🐢) and fast (🐇) MLOps tool for managing data and pipelines in Rust (🦀)
command-line-tool data data-engineering data-pipelines data-science devops machine-learning machine-learning-engineering mlops rust
Last synced: 01 Apr 2024
![](https://github.com/iesahin.png)
https://github.com/Visualize-ML/Book3_Elements-of-Mathematics
Book_3_《数学要素》 | 鸢尾花书:从加减乘除到机器学习;上架;欢迎继续纠错,纠错多的同学还会有赠书!
data-science linear-algebra machine-learning mathematics matrix
Last synced: 01 Apr 2024
![](https://github.com/Visualize-ML.png)
https://github.com/wyfunique/DBSim
The codebase for DBSim
data-science database in-database in-database-analytics query-optimizer sql-parser sql-query
Last synced: 01 Apr 2024
![](https://github.com/wyfunique.png)
https://github.com/dwhitena/gophernet
A simple from-scratch neural net written in Go
artificial-intelligence data-science go golang machine-learning neural-network
Last synced: 01 Apr 2024
![](https://github.com/dwhitena.png)
https://github.com/njtierney/rmd4sci
Rmarkdown for Scientists
book bookdown data-science r rmarkdown rstats science
Last synced: 31 Mar 2024
![](https://github.com/njtierney.png)
https://github.com/briatte/dsr
Introduction to Data Science with R (Sciences Po, Paris, 2023)
course data-analysis data-science data-visualization r statistics
Last synced: 31 Mar 2024
![](https://github.com/briatte.png)
https://github.com/markvanderloo/simputation
Making imputation easy
data-science imputation officialstatistics r rstats
Last synced: 31 Mar 2024
![](https://github.com/markvanderloo.png)
https://github.com/jrnold/r4ds-exercise-solutions
Exercise solutions to "R for Data Science"
bookdown data-science dplyr exercise-solutions ggplot2 r r4ds rmarkdown tidyr tidyverse
Last synced: 31 Mar 2024
![](https://github.com/jrnold.png)
https://github.com/pbiecek/breakDown
Model Agnostics breakDown plots
data-science iml interpretability machine-learning visual-explanations xai
Last synced: 31 Mar 2024
![](https://github.com/pbiecek.png)
https://github.com/bradleyboehmke/data-science-learning-resources
A collection of data science and machine learning resources that I've found helpful (I only post what I've read!)
Last synced: 31 Mar 2024
![](https://github.com/bradleyboehmke.png)
https://github.com/rebecca-vickery/data-science-learning-resources
A comprehensive list of free resources for learning data science
artificial-intelligence data data-science machine-learning python
Last synced: 31 Mar 2024
![](https://github.com/rebecca-vickery.png)
https://github.com/tlverse/sl3
💪 🤔 Modern Super Learning with Machine Learning Pipelines
data-science ensemble-learning ensemble-model machine-learning model-selection r r-package regression stacking statistics
Last synced: 31 Mar 2024
![](https://github.com/tlverse.png)
https://github.com/PetoLau/petolau.github.io
Blog about time series data mining in R.
artificial-intelligence blog data-analysis data-mining data-science data-visualization forecasting machine-learning r time-series time-series-analysis time-series-clustering time-series-data-mining time-series-forecasting time-series-prediction
Last synced: 31 Mar 2024
![](https://github.com/PetoLau.png)
https://github.com/SkBlaz/autobot
An autoML for explainable text classification.
automl automl-algorithms automl-experiments classification data-mining data-science distributed-computing ensemble-learning evolutionary-algorithms machine-learning multimodal-learning natural-language-processing nlp python representation-learning sparse-matrices text-classification transfer-learning transformers-models
Last synced: 31 Mar 2024
![](https://github.com/SkBlaz.png)
https://github.com/DataCanvasIO/Cooka
A lightweight and visual AutoML system
automated-feature-engineering automated-machine-learning automl data-science deep-learning hyperparameter-optimization machine-learning neural-network
Last synced: 31 Mar 2024
![](https://github.com/DataCanvasIO.png)
https://github.com/DeepWisdom/AutoDL
Automated Deep Learning without ANY human intervention. 1'st Solution for AutoDL challenge@NeurIPS.
ai artificial-intelligence autodl autodl-challenge automated-machine-learning automl big-data data-science deeplearning feature-engineering full-automl lightgbm machine-learning model-selection multi-label nas python pytorch resnet tensorflow
Last synced: 31 Mar 2024
![](https://github.com/DeepWisdom.png)
https://hdi-project.github.io/ATM/
Auto Tune Models - A multi-tenant, multi-data system for automated machine learning (model selection and tuning).
automl data-science distributed-computing hyperparameter-optimization machine-learning
Last synced: 31 Mar 2024
![](https://github.com/HDI-Project.png)
https://github.com/PKU-DAIR/mindware
An efficient open-source AutoML system for automating machine learning lifecycle, including feature engineering, neural architecture search, and hyper-parameter tuning.
automl-algorithms automl-pipeline bayesian-optimization blackbox-optimization data-science deep-learning distributed-systems ensemble-learning hyper-parameter-optimization knobs-tuning machine-learning meta-learning neural-architecture-search python
Last synced: 31 Mar 2024
![](https://github.com/PKU-DAIR.png)
https://github.com/swoop-inc/spark-alchemy
Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive
data-engineering data-science scala spark
Last synced: 31 Mar 2024
![](https://github.com/swoop-inc.png)
https://github.com/benjaminmbrown/real-time-data-viz-d3-crossfilter-websocket-tutorial
Tutorial on real-time data visualization. Python websocket server & d3.js + crossfilter.js frontend
crossfilter d3 d3js data-science data-visualization dcjs tutorial websockets
Last synced: 31 Mar 2024
![](https://github.com/benjaminmbrown.png)
https://github.com/capitalone/datacompy
Pandas and Spark DataFrame comparison for humans and more!
compare dask data data-science dataframes fugue numpy pandas polars pyspark python spark
Last synced: 31 Mar 2024
![](https://github.com/capitalone.png)
https://github.com/great-expectations/great_expectations_action
A GitHub Action that makes it easy to use Great Expectations to validate your data pipelines in your CI workflows.
actions continuous-integration data-integrity data-quality data-science mlops
Last synced: 31 Mar 2024
![](https://github.com/great-expectations.png)
https://github.com/gdsbook/book
This book serves as an introduction to a whole new way of thinking systematically about geographic data, using geographical analysis and computation to unlock new insights hidden within data.
data-analysis-python data-science geographic-data geographical-information-system spatial-analysis spatial-data-analysis spatial-statistics statistics
Last synced: 31 Mar 2024
![](https://github.com/gdsbook.png)
https://github.com/aws/amazon-redshift-python-driver
Redshift Python Connector. It supports Python Database API Specification v2.0.
amazon-redshift aws-redshift data-analysis data-science
Last synced: 30 Mar 2024
![](https://github.com/aws.png)
https://github.com/RDeconomist/RDeconomist.github.io
RapidCharts - a site for teaching and demonstrating Data Science and Visualisation techniques
data data-science data-visualization economics politics sports
Last synced: 30 Mar 2024
![](https://github.com/RDeconomist.png)
https://github.com/uclatommy/tweetfeels
Real-time sentiment analysis in Python using twitter's streaming api
data-mining data-science python-3-6 sentiment-analysis twitter
Last synced: 30 Mar 2024
![](https://github.com/uclatommy.png)
https://github.com/awesomecosmos/MS-Data-Science
Repository for my MS in Data Science at Pace University.
data-science masters-degree pace-university
Last synced: 30 Mar 2024
![](https://github.com/awesomecosmos.png)
https://github.com/benthecoder/ml-blogs-that-are-worth-reading
Blogs on Machine Learning and Deep learning
ai artificial-intelligence data-science deep-learning machine-learning ml
Last synced: 29 Mar 2024
![](https://github.com/benthecoder.png)
https://github.com/benthecoder/yt-channels-DS-AI-ML-CS
A comprehensive list of 180+ YouTube Channels for Data Science, Data Engineering, Machine Learning, Deep learning, Computer Science, programming, software engineering, etc.
ai artificial-intelligence awesome awesome-list coding data data-analysis data-engineering data-science deep-learning machine-learning math ml programming python resources software-engineering statistics web-development youtube
Last synced: 29 Mar 2024
![](https://github.com/benthecoder.png)
https://github.com/dogoncouch/logdissect
CLI utility and Python module for analyzing log files and other data.
cli command-line data-analysis data-science forensic-analysis forensics json library log-analysis log-parser module parser parsing parsing-library python-library python-module python-modules security syslog
Last synced: 28 Mar 2024
![](https://github.com/dogoncouch.png)
https://github.com/d5555/TagEditor
🏖TagEditor - Annotation tool for spaCy
annotation annotation-tool coreference-resolution data-science labeling-tool machine-learning named-entities named-entity-recognition natural-language-processing neural-networks neuralcoref nlp spacy spacy-visualizer tagging-tool text-annotation text-tagging training-data
Last synced: 28 Mar 2024
![](https://github.com/d5555.png)
https://github.com/shenwei356/awesome
Awesome resources on Bioinformatics, data science, machine learning, programming language (Python, Golang, R, Perl) and miscellaneous stuff.
awesome data-science git golang linux perl programing-language python
Last synced: 28 Mar 2024
![](https://github.com/shenwei356.png)
https://github.com/ottogroup/palladium
Framework for setting up predictive analytics services
data-science machine-learning scikit-learn
Last synced: 28 Mar 2024
![](https://github.com/ottogroup.png)
https://github.com/incubated-geek-cc/Text-To-Speech-App
A Fusion of OCR Technology (Tesseract.js) & Web Speech API. Standalone, portable and works offline.
data-science javascript machine-learning ocr ocr-recognition tesseract tesseract-ocr tesseract-ocr-api tesseractjs webapp
Last synced: 28 Mar 2024
![](https://github.com/incubated-geek-cc.png)
https://github.com/modelscope/data-juicer
A one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大语言模型提供更高质量、更丰富、更易”消化“的数据!
chinese data-analysis data-science data-visualization dataset gpt gpt-4 instruction-tuning large-language-models llama llava llm llms multi-modal nlp opendata pre-training pytorch sora streamlit
Last synced: 28 Mar 2024
![](https://github.com/modelscope.png)
https://github.com/arcelien/pba
Efficient Learning of Augmentation Policy Schedules
artificial-intelligence augmentation automated-machine-learning automl convolutional-neural-networks data-augmentation data-science deep-learning image-classification machine-learning python tensorflow
Last synced: 28 Mar 2024
![](https://github.com/arcelien.png)