Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Data Science
Data science is an inter-disciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge from structured and unstructured data. Data scientists perform data analysis and preparation, and their findings inform high-level decisions in many organizations.
- GitHub: https://github.com/topics/data-science
- Wikipedia: https://en.wikipedia.org/wiki/Data_science
- Related Topics: data-analysis, data-mining, machine-learning, big-data, data-visualization
- Aliases: datasciences, data-science-project, data-science-algorithm
- Last updated: 2024-11-19 00:06:52 UTC
https://github.com/nishkarshraj/automation-using-shell-scripts
Development Automation using Shell Scripting.
anacron at automation automation-framework backup bash-script cron crontab data-science data-structures development linux scenarios scheduler shell shell-scripts sorting-algorithms
Last synced: 16 Nov 2024
https://github.com/shenxiangzhuang/pythondataanalysis
The data and code used in my book.
data-science python3 webcrawler
Last synced: 14 Nov 2024
https://github.com/fneum/data-science-for-esm
data-science energy energy-data energy-system-modelling
Last synced: 14 Nov 2024
https://github.com/shenxiangzhuang/PythonDataAnalysis
The data and code used in my book.
data-science python3 webcrawler
Last synced: 30 Oct 2024
https://github.com/empower-ai/sql-agent
AI agent that helps you do data analytics with natural language.
analytics bigquery chatgpt chatgpt-bot data data-analytics data-science mysql postgresql slack slack-bot slackbot
Last synced: 14 Nov 2024
https://github.com/jbramburger/DataDrivenDynSyst
Scripts and notebooks to accompany the book Data-Driven Methods for Dynamic Systems
autoencoder autoencoder-neural-network autoencoders conservation-laws data-science dynamic-mode-decomposition dynamical-systems extended-dynamic-mode-decomposition forecasting kernel-methods machine-learning neural-network physics-informed-learning physics-informed-neural-networks poincare-map sindy sindy-algorithm stroboscopic-map time-delay universal-approximation-theorem
Last synced: 12 Nov 2024
https://github.com/devsgnr/breadroll
breadroll 🥟 is a simple lightweight library for data processing operations written in Typescript and powered by Bun.
bun csv csv-parser data-engineering data-science data-transformation eda exploratory-data-analysis tsv tsv-parser
Last synced: 17 Aug 2024
https://github.com/argilla-io/biome-text
Custom Natural Language Processing with big and small models 🌲🌱
allennlp data-science natural-language-processing nlp pytorch
Last synced: 30 Sep 2024
https://github.com/hsbc/tslumen
A library for Time Series EDA (exploratory data analysis)
analysis data-analysis data-science data-visualization eda exploratory-data-analysis exploratory-data-visualizations pandas profiling python time-series time-series-analysis time-series-eda time-series-profiling timeseries timeseries-analysis timeseries-eda
Last synced: 14 Nov 2024
https://github.com/ahmedfgad/arithmeticencodingpython
Data Compression using Arithmetic Encoding in Python
arithmetic-coding data-compression data-science entropy-coding lossless-compression-algorithm python
Last synced: 17 Nov 2024
https://github.com/frjnn/bhtsne
Parallel Barnes-Hut t-SNE implementation written in Rust.
barnes-hut bhtsne data-science data-visualization dimensionality-reduction machine-learning rust similarity-measures
Last synced: 14 Nov 2024
https://github.com/localcascadeensemble/lce
Random Forest or XGBoost? It is Time to Explore LCE
classification data-science machine-learning python regression scikit-learn-api
Last synced: 31 Oct 2024
https://github.com/visual-layer/visuallayer
Simplify Your Visual Data Ops. Find and visualize issues with your computer vision datasets such as duplicates, anomalies, data leakage, mislabels and others.
cleaning computer computer-vision data data-science dataset datasets-preparation generative machine-learning python vision
Last synced: 16 Nov 2024
https://github.com/jianzhnie/autotabular
Automatic machine learning for tabular data. ⚡🔥⚡
automl catboost data-science deep-learning feature-engineering hpo lightgbm machine-learning pytorch-lightning scikit-learn structured-data tabular-data xgboost
Last synced: 27 Oct 2024
https://github.com/cloud-cv/evalai-starters
How to create a challenge on EvalAI?
agent ai cv data-science data-science-competition environments evalai get-started getting-started ml reinforcement-learning rl
Last synced: 16 Nov 2024
https://github.com/aiwithqasim/Free-Artificial-Intelligence-Resources
Welcome to this open source repository of free artificial intelligence resources. Benefit from the free resources mentioned here, and kindly star and fork the repository so that it reaches more people and everyone can take advantage of it.
ai article artificial-intelligence artificial-neural-networks blog data-science datascientist deep-learning freeresources hacktoberfest hecktoberfest2021 jobs machine-learning machine-learning-algorithms natural-language-processing nlp project python3 youtube
Last synced: 02 Nov 2024
https://github.com/charmve/paperweeklyai
📚「@MaiweiAI」Studying papers in the fields of computer vision, NLP, and machine learning algorithms every week.
advanced applied-machine-learning computer-vision data-mining data-science deep-learning machine-learning machine-learning-algorithms nlp paper-with-code papers study-papers tutorials
Last synced: 28 Oct 2024
https://github.com/jianzhnie/AutoTabular
Automatic machine learning for tabular data. ⚡🔥⚡
automl catboost data-science deep-learning feature-engineering hpo lightgbm machine-learning pytorch-lightning scikit-learn structured-data tabular-data xgboost
Last synced: 05 Aug 2024
https://github.com/gitonthescene/csv-reconcile
A reconciliation service for OpenRefine serving data from a given CSV file.
Last synced: 06 Nov 2024
https://github.com/montanaz0r/bayesian-statistics-the-fun-way
Solutions and workflow for the Bayesian Statistics The Fun Way book in Python
bayesian-data-analysis bayesian-statistics data-science jupyter-notebook numpy pandas probability python scipy statistics
Last synced: 07 Nov 2024
https://github.com/brubinstein/diffpriv
Easy differential privacy in R
data-science differential-privacy diffpriv machine-learning r r-package statistics
Last synced: 13 Nov 2024
https://github.com/tpvasconcelos/ridgeplot
Beautiful ridgeline plots in Python
data-analysis data-science data-visualization distplot ggridges graphing joyplot plot plotly plotting python ridgeline visualization
Last synced: 05 Nov 2024
https://github.com/tomasonjo/graphs-network-science
Accompanying repository for my book about Graph Data Science
algorithms data-science graph graph-algorithms machine-learning
Last synced: 09 Nov 2024
https://github.com/LaihoE/did-it-spill
Check if you have training samples in your test set
computer-vision data-science deep-learning pytorch semantic-similarity time-series
Last synced: 12 Nov 2024
https://github.com/tirendazacademy/awesome-data-science-resources
Resources about data science, machine learning, deep learning, data engineering, and SQL.
ai artificial-intelligence data-analysis data-engineering data-science dataengineering datascience deep-learning deeplearning machine-learning machinelearning machinelearning-python sql
Last synced: 08 Nov 2024
https://github.com/anna-geller/prefect-deployment-patterns
Code examples showing flow deployment to various types of infrastructure
automation aws data data-engineering data-engineering-infrastructure data-engineering-pipeline data-engineering-team data-products data-science dataflow dataflow-ops orchestration pipeline prefect python serverless serverless-framework
Last synced: 28 Oct 2024
https://github.com/apple/ml-symphony
Symphony: Interactive Data Widgets (CHI 2022)
computational-notebooks data-science data-visualization machine-learning
Last synced: 07 Oct 2024
https://github.com/ndleah/ibm-data-analyst-professional
Capstone projects of the IBM Data Analyst Professional
analyzing-data data-analysis data-analyst data-manipulation data-science data-visualization data-visualizations ibm-datascience-certification pandas python
Last synced: 13 Nov 2024
https://github.com/provectus/sak-kubeflow
🚀 Deploy Kubeflow on AWS EKS with Terraform 🤖
ai argocd artificial-intelligence automation aws cluster data-science deep-learning devops eks gitops iac infrastructure infrastructure-as-code kubeflow machine-learning ml open-source terraform
Last synced: 08 Nov 2024
https://github.com/omarsar/mri-analysis-pytorch
MRI analysis using PyTorch and MedicalTorch
data-science deep-learning health healthcare medicine neural-network pytorch
Last synced: 28 Oct 2024
https://github.com/vatshayan/final-year-disease-prediction-project
Final-year project: a disease prediction system built with machine learning, with code and documents.
btech btech-project btechfinalyear btechproject college-project data-science disease disease-prediction final final-project final-year-project finalyearproject finalyearprojects machine-learning machine-learning-algorithms machinelearning prediction python sem8
Last synced: 28 Oct 2024
https://github.com/dayyass/text-classification-baseline
Pipeline for fast building text classification TF-IDF + LogReg baselines.
baseline classification data-science fast hacktoberfest logistic-regression machine-learning natural-language-processing nlp python text text-classification tf-idf
Last synced: 07 Nov 2024
https://github.com/bcg-x-official/sklearndf
DataFrame support for scikit-learn.
cross-validation data-science feature-traceability hyper-parameter-tuning machine-learning model-selection pandas-dataframe python
Last synced: 15 Nov 2024
https://github.com/bnosac/crfsuite
Labelling Sequential Data in Natural Language Processing with R - using CRFsuite
chunking conditional-random-fields crf crfsuite data-science intent-classification natural-language-processing ner nlp r r-package
Last synced: 11 Nov 2024
https://github.com/DARIAH-DE/Topics
A Python library for topic modeling and visualization
data-science digital-humanities lda machine-learning natural-language-processing python3 text-mining topic-modeling
Last synced: 13 Nov 2024
https://github.com/renumics/sliceguard
A library for detecting problematic data segments in structured and unstructured data with a few lines of code.
data-analysis data-cleaning data-curation data-exploration data-science data-visualization deep-learning eda exploratory-data-analysis machine-learning python visualization
Last synced: 27 Oct 2024
https://github.com/polyaxon/hypertune
A library for performing hyperparameter optimization
data-science deep-learning hyperparameter-optimization hyperparameter-tuning machine-learning mlops numpy scikit-learn workflow
Last synced: 10 Oct 2024
https://github.com/dayyass/qaner
Unofficial implementation of QaNER: Prompting Question Answering Models for Few-shot Named Entity Recognition.
data-science machine-learning named-entity-recognition natural-language-processing ner nlp python python3 question-answering
Last synced: 07 Nov 2024
https://github.com/ahammadmejbah/artificial-intelligence-important-documents-collections
AI technology is significant because it allows software to do human functions—understanding, reasoning, planning, communication, and perception—increasingly effectively, efficiently, and affordably.
ai algorithms big-data computer-science computer-vision data-analyst data-engineering data-mining data-science deep-learning machine-learning mathematics python
Last synced: 11 Nov 2024
https://github.com/Desbordante/desbordante-core
Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows running data cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.
anomaly-detection correlations data-analytics data-cleaning data-cleansing data-engineering data-exploration data-mining data-mining-algorithms data-preprocessing data-profiling data-science data-wrangling exploratory-data-analysis feature-engineering feature-extraction feature-selection knowledge-discovery spreadsheets tabular-data
Last synced: 04 Nov 2024
https://github.com/tcsvn/activity-assistant
Activity Assistant provides a platform for logging, evaluating and predicting Activities of Daily Living for Home Assistant.
activities-of-daily-living activity-assistant adls data-mining data-science django django-rest-framework home-assistant home-assistant-addons home-automation homeassistant human-activity-recognition machine-learning smart-home smarthome visualization
Last synced: 06 Nov 2024
https://github.com/dask-contrib/dask-awkward
Native Dask collection for awkward arrays, and the library to use it.
columnar-format dask data-analysis data-science data-structure jagged-array python ragged-array
Last synced: 18 Nov 2024
https://github.com/hannansatopay/roughviz
A Python visualization library for creating sketchy/hand-drawn styled charts.
charts data-science hacktoberfest jupyter-notebook python-visualization roughviz vizualisation
Last synced: 08 Nov 2024
https://github.com/oneoffcoder/books
A collection of online books for data science, computer science and coding!
books coder computer-science data-science docker java python r scikit-learn scratch software software-development software-engineering spark sphinx tutorials
Last synced: 05 Nov 2024
https://jaeyk.github.io/comp_thinking_social_science/
Computational Thinking for Social Scientists book project
computational-social-science data-science digital-humanities machine-learning python r social-sciences visualization web-scraping
Last synced: 27 Oct 2024
https://github.com/maxent-ai/zeroshot_topics
Topic Inference with Zeroshot models
bert data-science huggingface hypernymy-extraction keybert keyword-extraction knowledge-graph labelled-data labelling linguistics machine-learning nli nlp taxonomy text text-classification transformers weak-supervision weakly-supervised-learning zeroshot-learning
Last synced: 07 Nov 2024
https://github.com/mitre/menelaus
Online and batch-based concept and data drift detection algorithms to monitor and maintain ML performance.
concept-drift data-drift data-science drift-detection machine-learning statistics
Last synced: 09 Nov 2024
https://github.com/tirthajyoti/covid-19-analysis
Analysis with Covid-19 data
analytics coronavirus covid-19 covid-data covid19-data data-science epidemiology machine-learning modeling numpy object-oriented-programming pandemic python visualization
Last synced: 09 Nov 2024
https://github.com/puzzlelib/puzzlelib
Deep Learning framework with NVIDIA & AMD support
data-science deep-learning deep-neural-networks gpu library machine-learning ml neural-network numpy python tensor
Last synced: 11 Oct 2024
https://github.com/alexioannides/ml-workflow-automation
Python Machine Learning (ML) project that demonstrates the archetypal ML workflow within a Jupyter notebook, with automated model deployment as a RESTful service on Kubernetes.
classification data-science flask helm jupyter-notebook kaggle kubernetes machine-learning mlops numpy pandas python rest-api sklearn
Last synced: 28 Oct 2024
https://github.com/thoughtspile/hippotable
Lightweight data analysis in your browser
csv dashboard data-analysis data-science javascript table visualization
Last synced: 15 Nov 2024
https://github.com/cihat/datastructure
📌🔎📝 Data Structures (BMU221) and documentation for all courses. Notes and examples for the data structures course and all other lessons. Data Structures with Java.
bilgisayar-muhendisligi computer-science data-science data-structure data-structure-blogs data-structures data-structures-and-algorithms documentation turkce-dokumantasyon veri-bilimi veri-yapilari
Last synced: 06 Nov 2024
https://github.com/seandavi/sars2pack
An R package with over 50 highly cited, ready-to-use, up-to-date COVID-19 pandemic data resources
biomedical-data coronavirus coronavirus-tracking covid-19 data-science data-visualization datascience datasets epidemics epidemiology geospatial public-health rstats rstats-package
Last synced: 05 Nov 2024
https://github.com/devparihar5/complete-data-science-roadmap
Complete Roadmap For Data Science
ai big-data data-analysis data-engineering data-science deep-learning machine-learning mathematics natural-language-processing neural-network python r-programming roadmap statistical-analysis statistics
Last synced: 09 Nov 2024
https://github.com/randyzwitch/streamlit-embedcode
Streamlit component for embedding code snippets such as GitHub gists, CodePen snippets, Gitlab snippets, etc.
data-analysis data-science data-visualization python streamlit streamlit-component
Last synced: 11 Oct 2024
https://github.com/bytehub-ai/bytehub
ByteHub: making feature stores simple
bytehub-cloud dask data-engineering data-science feature-engineering feature-store featurestore forecasting machine-learning machinelearning machinelearning-python pandas timeseries
Last synced: 17 Nov 2024
https://github.com/rosetta-ai/rosetta_recsys2019
The 4th Place Solution to the 2019 ACM Recsys Challenge by Team RosettaAI
artificial-intelligence boosting-tree data-mining data-science deep-learning hotel-recommender lightgbm machine-learning mean-reciprocal-rank neural-network python ranking recommender-system session-based-recommendation-system trivago xgboost
Last synced: 08 Aug 2024
https://github.com/ashishpatel26/datascienv
datascienv is a package that helps you set up your environment in a single line of code with all dependencies. It also includes pyforest, which imports all required ML libraries in a single line.
catboost data-science data-science-env datascienv imbalanced-data lightgbm matplotlib numpy pandas pycaret scikit-learn seaborn tensorflow2 xgboost
Last synced: 10 Oct 2024
https://github.com/tf-encrypted/moose
Secure distributed dataflow framework for encrypted machine learning and data processing
cryptography data-science distributed-computing machine-learning privacy secure-computation
Last synced: 06 Nov 2024
https://github.com/terryyz/pyarmadillo
PyArmadillo: an alternative approach to linear algebra in Python
armadillo-library calculations data-science linear-algebra machine-learning
Last synced: 14 Oct 2024
https://github.com/gesiscss/css_methods_python
A full course of self-explanatory and freely available materials on CSS methods
data-science jupyter-notebook python
Last synced: 16 Nov 2024
https://github.com/wlandau/targets-minimal
A minimal example data analysis project with the targets R package
data-science high-performance-computing pipeline r reproducibility reproducible-research rstats statistics targets workflow
Last synced: 27 Oct 2024
https://github.com/mratsim/mckinsey-smartcities-traffic-prediction
Adventure into using multi attention recurrent neural networks for time-series (city traffic) for the 2017-11-18 McKinsey IronMan (24h non-stop) prediction challenge
data-science deep-learning keras machine-learning neural-networks tensorflow time-series
Last synced: 22 Oct 2024
https://github.com/ermshaua/time-series-segmentation-benchmark
This repository contains the time series segmentation benchmark (TSSB).
change-point change-point-detection data-mining data-science machine-learning python research science segmentation time-series time-series-analysis time-series-data-mining time-series-segmentation unsupervised-learning
Last synced: 13 Nov 2024
https://github.com/astrazeneca/judgyprophet
Forecasting for knowable future events using Bayesian informative priors (forecasting with judgmental-adjustment).
ai bayesian data-science forecasting machine-learning python statistics
Last synced: 18 Nov 2024
https://github.com/dmey/synthia
📈 🐍 Multidimensional synthetic data generation with Copula and fPCA models in Python
augmentation climate copula data-augmentation data-generation data-generator data-modelling data-science dependency-analysis dependency-modeling finance fpca functional-data machine-learning oversampling principal-component-analysis statistics synthetic-data weather xarray
Last synced: 12 Nov 2024
https://github.com/jmwoloso/pychattr
Python Channel Attribution (pychattr) - A Python implementation of the excellent R ChannelAttribution library
channel-attribution data-analysis data-science machine-learning python python-channel-attribution rpy2 wrapper
Last synced: 13 Nov 2024
https://github.com/benedekrozemberczki/pdn
The official PyTorch implementation of "Pathfinder Discovery Networks for Neural Message Passing" (WebConf '21)
bert cheminformatics data-science deep-learning deepwalk gcn gnn gpt-3 graph-classification graph-neural-network graph2vec message-passing molecules multiplex network-science neural-message-passing node-classification pathfinder pytorch transformer
Last synced: 14 Nov 2024
https://github.com/tirthajyoti/julia-data-science
Data science and numerical computing with Julia
artificial-intelligence data-science dataframe deep-learning julia julia-language linear-algebra machine-learning numerical-analysis scientific-computing statistics
Last synced: 22 Oct 2024
https://github.com/AstraZeneca/judgyprophet
Forecasting for knowable future events using Bayesian informative priors (forecasting with judgmental-adjustment).
ai bayesian data-science forecasting machine-learning python statistics
Last synced: 26 Sep 2024
https://github.com/rvanasa/pandas-gpt
Power up your data science workflow with ChatGPT.
chatgpt data-cleaning data-engineering data-science data-visualization jupyter-notebook low-code matplotlib numpy openai pandas productivity scipy seaborn
Last synced: 11 Oct 2024
https://github.com/mine-cetinkaya-rundel/teach-r-online
Materials for the Teaching statistics and data science online workshops in July 2020
data-science education rstats statistics
Last synced: 05 Nov 2024
https://github.com/noahgift/data-engineering-and-dataops
Duke MIDS: Data Engineering and DataOps Course
book cloud course data data-science dataengineering dataops duke mlops software-engineering
Last synced: 13 Nov 2024
https://github.com/wlandau/drake-examples
Example workflows for the drake R package
data-science drake high-performance-computing makefile pipeline r reproducibility reproducible-research ropensci rstats workflow
Last synced: 27 Oct 2024
https://github.com/yusufcinarci/data-science-projects
This repo contains beginner-to-upper-level projects in the field of data science. I will host the projects I do in this field in this repo every day, in the hope that they will be useful to those who, like me, are interested in data science and are just starting out...
data-analysis data-science data-science-projects jupyter jupyter-notebook python
Last synced: 07 Nov 2024
https://github.com/stevecondylios/priceR
Economics and Pricing in R
cran data-science econometrics economics finance modeling r-programming statistics
Last synced: 13 Aug 2024
https://github.com/soda-inria/hazardous
Competing Risks and Survival Analysis
competing-risks data-science gradient-boosting machine-learning survival-analysis
Last synced: 06 Nov 2024
https://github.com/stevecondylios/pricer
Economics and Pricing in R
cran data-science econometrics economics finance modeling r-programming statistics
Last synced: 12 Nov 2024
https://github.com/benedekrozemberczki/spatiotemporal_datasets
Spatiotemporal datasets collected for network science, deep learning and general machine learning research.
analytics benchmark data-science dataset deep-learning deepwalk epidemiology gcn gnn machine-learning node2vec pytorch pytorch-geometric spatial-analysis spatial-data spatial-data-analysis time-series time-series-analysis vector-autoregression
Last synced: 14 Nov 2024
https://github.com/mainakrepositor/datasets
A bunch of some 200 datasets. You can call it mini-kaggle :)
csv data data-science database datasets image-files mini-kaggle ml nlp-machine-learning tsv
Last synced: 12 Nov 2024
https://github.com/ropensci-books/drake
The user manual for the drake R package
data-science drake high-performance-computing makefile pipeline r reproducibility reproducible-research ropensci rstats workflow
Last synced: 13 Nov 2024
https://github.com/tgsmith61591/skoot
A package for data science practitioners. This library implements a number of helpful, common data transformations with a scikit-learn friendly interface in an effort to expedite the modeling process.
data-science imbalanced-data machine-learning pandas python scikit-learn skutil
Last synced: 07 Nov 2024
https://github.com/tlverse/tlverse-handbook
🎯 📕 Targeted Learning in R: A Causal Data Science Handbook
biostatistics causal-data-science causal-inference causal-machine-learning data-science machine-learning statistics targeted-learning tlverse
Last synced: 05 Aug 2024
https://github.com/zayedrais/documentsearchengine
Document Search Engine project with TF-IDF and Google's Universal Sentence Encoder model
data-science deep-learning document-search document-similarity juypter machine-learning python python-text-analysis semantic-search semantic-search-engine tensorflow tensorflow-models tensorflow-tutorials text-analysis text-search text-semantic-similarity tfidf tfidf-text-analysis tfidf-vectorizer universal-sentence-encoder
Last synced: 12 Nov 2024
https://github.com/likejazz/jupyter-notebooks
This repo contains Jupyter Notebooks, miscellaneous stuff.
data-science decision-tree deep-learning jupyter-notebook keras machine-learning nlp pytorch random-forest statistics tensorflow
Last synced: 29 Oct 2024
https://github.com/Scitator/catalyst-examples
Examples
computer-vision data-science deep-learning deep-neural-networks deep-reinforcement-learning machine-learning python pytorch
Last synced: 07 Aug 2024
https://github.com/kaggler-tv/kaggler-tv-schedule
Kaggler TV
data-science kaggle kaggler-tv machine-learning-competitions youtube-channel
Last synced: 11 Oct 2024
https://github.com/junpenglao/planet_sakaar_data_science
A colourful collection of codes and notebooks, like Planet Sakaar
bayesian-inference data-science pymc3
Last synced: 02 Nov 2024
https://github.com/tomhanika/conexp-clj
A General-Purpose Tool for Formal Concept Analysis
clojure closure-systems conceptual-knowledge data data-analysis data-science formal-concept-analysis lattice order order-theory
Last synced: 16 Nov 2024
https://github.com/alinski29/stonks.jl
Julia library for standardizing financial data retrieval and storage from multiple APIs.
data data-mining data-science dataframe finance julia trading trading-algorithms
Last synced: 02 Nov 2024
https://github.com/meteostat/weather-stations
A list of public weather stations everyone can edit and share.
climate data-science json meteostat weather weather-stations
Last synced: 08 Aug 2024
https://github.com/scrapinghub/aduana
Frontera backend to guide a crawl using PageRank, HITS or other ranking algorithms based on the link structure of the web graph, even when making big crawls (one billion pages).
Last synced: 10 Nov 2024
https://github.com/ActivitySim/populationsim
An Open Platform for Population Synthesis
activitysim bsd-3-clause data-science microsimulation population-synthesis python
Last synced: 27 Oct 2024
https://github.com/codait/presentations
Talks & Workshops by the CODAIT team
data-science deep-learning fairness-ai fairness-ml machine-learning open-source presentations
Last synced: 09 Nov 2024
https://github.com/mikeizbicki/cmc-csci046
CMC's Data Structures and Algorithms Course Materials
cmc computer-science course data-science python3
Last synced: 16 Nov 2024
https://github.com/pmuens/lab
Research Environment to play around with Algorithms and Data (Structures)
algorithms artificial-intelligence artificial-neural-networks data-science deep-learning jupyter jupyter-notebook machine-learning machine-learning-algorithms
Last synced: 17 Oct 2024
https://ddotta.github.io/cookbook-rpolars/
Cookbook to provide solutions to common tasks and problems in using Polars with R
benchmark cookbook data-engineering data-science datatable dplyr polars r tidyr
Last synced: 18 Nov 2024
https://github.com/dc-aichara/DS-ML-Public
Python Scripts and Jupyter Notebooks
bayesian-optimization beautifulsoup bitcoin catboost dash dashboard data-analysis data-mining data-science data-visualisation hyperparameter-tuning hyperparameters-optimization lightgbm machine-learning news plotly python telegram web-scraping xgboost
Last synced: 15 Nov 2024
https://github.com/realpython/web-dev-for-data-scientists
data-science flask python webdevelopment
Last synced: 17 Nov 2024
https://github.com/svilupp/awesome-generative-ai-meets-julia-language
Comprehensive guide to generative AI projects and resources in Julia.
awesome awesome-list data-science generative-ai julia
Last synced: 28 Oct 2024