Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Data Science
Data science is an inter-disciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge from structured and unstructured data. Data scientists perform data analysis and preparation, and their findings inform high-level decisions in many organizations.
- GitHub: https://github.com/topics/data-science
- Wikipedia: https://en.wikipedia.org/wiki/Data_science
- Related Topics: data-analysis, data-mining, machine-learning, big-data, data-visualization
- Aliases: datasciences, data-science-project, data-science-algorithm
- Last updated: 2024-11-18 00:06:52 UTC
https://github.com/mvlearn/mvlearn
Python package for multi-view machine learning
data-science machine-learning multiview-learning python
Last synced: 12 Nov 2024
https://github.com/Laurae2/Laurae
Advanced High Performance Data Science Toolbox for R by Laurae
data-science laurae machine-learning r supervised-learning xgboost
Last synced: 07 Aug 2024
https://github.com/blobcity/python-for-data-science
A collection of Jupyter Notebooks for learning Python for Data Science.
data-science jupyter jupyter-notebook jupyter-notebooks learn-python python
Last synced: 13 Nov 2024
https://github.com/PecanProject/pecan
The Predictive Ecosystem Analyzer (PEcAn) is an integrated ecological bioinformatics toolbox.
bayesian cyberinfrastructure data-assimilation data-science ecosystem-model ecosystem-science forecasting meta-analysis national-science-foundation pecan plants r
Last synced: 14 Nov 2024
https://github.com/danaugrs/go-tsne
t-Distributed Stochastic Neighbor Embedding (t-SNE) in Go
3d data-science dimensionality-reduction go machine-learning tsne unsupervised-learning visualization
Last synced: 12 Nov 2024
https://github.com/h2oai/nitro
Create apps 10x quicker, without JavaScript/HTML/CSS.
app apps data-analysis data-science developer-tools devtools graphics h2o-nitro low-code python ui ui-components user-interface web-application webapp widget-library widgets
Last synced: 13 Nov 2024
https://github.com/benedekrozemberczki/danmf
A sparsity aware implementation of "Deep Autoencoder-like Nonnegative Matrix Factorization for Community Detection" (CIKM 2018).
autoencoder cikm clustering community-detection coordinate-descent danmf data-science deep-learning deepwalk dimensionality-reduction embedding gemsec machine-learning mnmf nmf node-embedding node2vec sklearn unsupervised-learning word2vec
Last synced: 10 Oct 2024
https://github.com/benedekrozemberczki/DANMF
A sparsity aware implementation of "Deep Autoencoder-like Nonnegative Matrix Factorization for Community Detection" (CIKM 2018).
autoencoder cikm clustering community-detection coordinate-descent danmf data-science deep-learning deepwalk dimensionality-reduction embedding gemsec machine-learning mnmf nmf node-embedding node2vec sklearn unsupervised-learning word2vec
Last synced: 30 Oct 2024
https://github.com/storieswithsiva/Data-Science-Resources
👨🏽‍🏫 Learn what data science is and why it's important in today's world. Are you interested in data science? 🔋
artificial-intelligence artificial-neural-networks data data-analysis data-analytics data-mining data-science data-science-resource data-science-resources data-scientist data-scientists data-visualization data-world datascience dataset learning learning-kit machine-learning python repository
Last synced: 07 Nov 2024
https://lge-arc-advancedai.github.io/auptimizer/
An automatic ML model optimization tool.
automated-machine-learning automl data-engineering data-science deep-learning hpo hyperparameter-optimization hyperparameter-tuning machine-learning neural-networks
Last synced: 02 Nov 2024
https://github.com/nteract/bookstore
📚 Notebook storage and publishing workflows for the masses
data-science notebook nteract scheduling storage versioned-buckets
Last synced: 16 Nov 2024
https://github.com/ivnvxd/pyquest
Python everything Cheatsheet and a Journey to the land of Python programming
algorithms architecture cheatsheet concurrency data-science data-structures data-types database fundamentals jupyter-notebook learn oop python standard-library tutorial web-development
Last synced: 06 Nov 2024
https://github.com/build-on-aws/cloud-clubs-learner-library
A library for learners! Whether or not you're a part of AWS Cloud Clubs, take a look in this library for free, open, leveled content for students 18+ worldwide
ai aws containers data-analytics data-science databases iot kubernetes ml mobile-development security serverless web web-development
Last synced: 30 Oct 2024
https://github.com/analysiscenter/batchflow
BatchFlow helps you conveniently work with random or sequential batches of your data and define data processing and machine learning workflows even for datasets that do not fit into memory.
data-science machine-learning pipeline pipeline-framework python python3 workflow workflow-engine
Last synced: 02 Nov 2024
https://github.com/ideonate/cdsdashboards
JupyterHub extension for ContainDS Dashboards
bokeh data-science jupyter jupyterhub panel plotly-dash rshiny streamlit visualization
Last synced: 13 Oct 2024
https://github.com/LGE-ARC-AdvancedAI/auptimizer
An automatic ML model optimization tool.
automated-machine-learning automl data-engineering data-science deep-learning hpo hyperparameter-optimization hyperparameter-tuning machine-learning neural-networks
Last synced: 15 Nov 2024
https://github.com/flyteorg/flytekit
Extensible Python SDK for developing Flyte tasks and workflows. Simple to get started with, easy to learn, and highly extensible.
automation data data-science extensible flyte flyte-tasks hacktoberfest mlops pypi python sdk spark workflows
Last synced: 11 Oct 2024
https://github.com/paddymul/buckaroo
Buckaroo - the data wrangling assistant for pandas. Quickly explore dataframes and run pandas commands via a GUI. Works inside the Jupyter notebook.
buckaroo data-science jupyter paddy pandas
Last synced: 29 Oct 2024
https://github.com/agilescientific/striplog
Lithology and stratigraphic logs for wells or outcrop.
data-mining data-science geology petrophysics sedimentology swung-stack
Last synced: 25 Oct 2024
https://github.com/yogeshhk/TeachingDataScience
Course notes for Data Science related topics, prepared in LaTeX
course-materials data-science deep-learning jupyter-notebooks latex machine-learning natural-language-processing open-source python
Last synced: 30 Oct 2024
https://github.com/Esri/awesome-arcgis-developer
A curated list of resources to help you with ArcGIS development, APIs, SDKs, tools, and location services
arcgis arcgis-apis awesome awesome-list data-science developer developer-experience developer-tools developers gis location-intelligence location-services mapping productivity samples spatial-analysis web-development web-mapping
Last synced: 07 Aug 2024
https://github.com/aws/amazon-redshift-python-driver
Redshift Python Connector. It supports Python Database API Specification v2.0.
amazon-redshift aws-redshift data-analysis data-science
Last synced: 07 Oct 2024
https://github.com/rhenanbartels/hrv
A Python package for heart rate variability analysis
data-science hacktoberfest hrv python signal-processing
Last synced: 13 Nov 2024
https://github.com/AtomScott/SportsLabKit
A Python package for turning sports video into CSV files
computer-vision data-science football multi-object-tracking multiobject-tracking python soccer sports sports-analytics tracking
Last synced: 10 Nov 2024
https://github.com/slowkow/harmonypy
🎼 Integrate multiple high-dimensional datasets with fuzzy k-means and locally linear adjustments.
bioinformatics data-integration data-science single-cell-analysis
Last synced: 15 Oct 2024
https://github.com/jthomasmock/gtextras
A Collection of Helper Functions for the gt Package.
data-science data-visualization datascience ggplot2 gt plots r rstats sparkline sparkline-graphs sparklines tables
Last synced: 14 Nov 2024
https://github.com/coqui-ai/trainer
🐸 - A general purpose model trainer, as flexible as it gets
ai data-science deep-learning machine-learning pytorch
Last synced: 15 Nov 2024
https://github.com/ActivitySim/activitysim
An Open Platform for Activity-Based Travel Modeling
activitysim bsd-3-clause data-science microsimulation python travel-modeling
Last synced: 27 Oct 2024
https://github.com/Toloka/crowd-kit
Control the quality of your labeled data with the Python tools you already know.
aggregations annotation crowd crowdsourcing data-mining data-science labeling python quality-control toloka truth-inference
Last synced: 30 Oct 2024
https://github.com/sktime/skpro
A unified framework for tabular probabilistic regression and probability distributions in Python
ai data-science framework machine-learning prediction probabilistic-models probability-distributions python regression sklearn
Last synced: 10 Oct 2024
https://github.com/launchflow/buildflow
BuildFlow is an open source framework for building large scale systems using Python. All you need to do is describe where your input is coming from and where your output should be written, and BuildFlow handles the rest. No configuration outside of the code is required.
batch data-science pipeline python streaming
Last synced: 06 Aug 2024
https://github.com/neo4j/graph-data-science-client
A Python client for the Neo4j Graph Data Science (GDS) library
algorithms data-science graph graph-algorithms graph-data-science graph-database graph-machine-learning machine-learning neo4j python python3
Last synced: 07 Oct 2024
https://github.com/dair-ai/dair-ai.github.io
Home of DAIR.AI
ai data-science education machine-learning nlp
Last synced: 10 Nov 2024
https://github.com/kevinheavey/modern-polars
Code and data for the Modern Polars book
data-analytics data-engineering data-science dataengineering pandas polars python
Last synced: 15 Nov 2024
https://github.com/jthomasmock/gtExtras
A Collection of Helper Functions for the gt Package.
data-science data-visualization datascience ggplot2 gt plots r rstats sparkline sparkline-graphs sparklines tables
Last synced: 13 Aug 2024
https://github.com/explosion/jupyterlab-prodigy
🧬 A JupyterLab extension for annotating data with Prodigy
active-learning annotation annotation-tool artificial-intelligence computer-vision data-annotation data-science jupyter jupyterlab labeling-tool machine-learning machine-teaching natural-language-processing nlp prodigy spacy
Last synced: 07 Oct 2024
https://github.com/microsoft/finnts
Microsoft Finance Time Series Forecasting Framework (FinnTS) is a forecasting package that utilizes cutting-edge time series forecasting and parallelization on the cloud to produce accurate forecasts for financial data.
business data-science feature-selection finance finnts forecasting machine-learning microsoft r r-package rstats time-series
Last synced: 13 Nov 2024
https://github.com/trainingbypackt/data-science-for-marketing-analytics
Achieve your marketing goals with the data analytics power of Python
data-science data-visualization matplotlib numpy pandas python seaborn
Last synced: 14 Nov 2024
https://github.com/epistasislab/tpot2
A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.
adsp ag066833 aiml alzheimer alzheimers automated-machine-learning automation automl data-science feature-engineering gradient-boosting hyperparameter-optimization lm010098 machine-learning model-selection nia parameter-tuning python random-forest scikit-learn
Last synced: 16 Nov 2024
https://github.com/rapidsai/node
GPU-accelerated data science and visualization in Node.js
cuda data-science data-visualization gpgpu gpu nodejs
Last synced: 13 Nov 2024
https://github.com/tonybeltramelli/Deep-Spying
Spying using Smartwatch and Deep Learning
data-science deep-learning neural-networks privacy recurrent-neural-networks security wearable-devices
Last synced: 07 Aug 2024
https://github.com/plotly/dash-oil-and-gas-demo
Dash Demo App - New York Oil and Gas
dash data-science data-visualization energy plotly python technical-computing
Last synced: 05 Aug 2024
https://github.com/nabeel-oz/qlik-py-tools
Data Science algorithms for Qlik implemented as a Python Server Side Extension (SSE).
advanced-analytics advanced-analytics-integration analytics clustering data-science deep-learning facebook-prophet fbprophet forecasting hdbscan keras machine-learning predictive-analytics python qlik qlik-oss qlik-sense scikit-learn server-side-extension sklearn
Last synced: 09 Oct 2024
https://github.com/gyrdym/ml_algo
Machine learning algorithms in the Dart programming language
algorithm batch-gradient-descent classifier dart dartlang data-science hyperparameters lasso-regression linear-regression logistic-regression machine-learning machine-learning-algorithms mini-batch-gradient-descent regression sgd softmax softmax-algorithm softmax-classifier softmax-regression stochastic-gradient-descent
Last synced: 12 Nov 2024
https://github.com/blue-season/pywarm
A cleaner way to build neural networks for PyTorch.
clean-code data-science deep-learning keras machine-learning neural-network neural-networks python3 pytorch
Last synced: 14 Nov 2024
https://github.com/seg/2016-ml-contest
Machine learning contest - October 2016 TLE
contest data-science fun geophysics geoscience machine-learning
Last synced: 07 Aug 2024
https://github.com/d5555/tageditor
🏖TagEditor - Annotation tool for spaCy
annotation annotation-tool coreference-resolution data-science labeling-tool machine-learning named-entities named-entity-recognition natural-language-processing neural-networks neuralcoref nlp spacy spacy-visualizer tagging-tool text-annotation text-tagging training-data
Last synced: 14 Oct 2024
https://github.com/TMiguelT/PandasSchema
A validation library for Pandas data frames using user-friendly schemas
data-science pandas schema validation
Last synced: 07 Aug 2024
https://github.com/multimeric/PandasSchema
A validation library for Pandas data frames using user-friendly schemas
data-science pandas schema validation
Last synced: 10 Nov 2024
https://github.com/multimeric/pandasschema
A validation library for Pandas data frames using user-friendly schemas
data-science pandas schema validation
Last synced: 17 Nov 2024
https://github.com/drakearch/kaggle-courses
Kaggle courses and tutorials to get you started in the Data Science world.
data-science deep-learning machine-learning pandas python
Last synced: 08 Nov 2024
https://github.com/robmarkcole/HASS-data-detective
Explore and analyse your Home Assistant data
data data-science home home-assistant home-automation
Last synced: 05 Nov 2024
https://github.com/vatshayan/final-year-project-cryptographic-technique-for-communication-system
Top B.tech/M.tech Final Year Project "Design and Analysis of Cryptographic Technique for Communication System" with Project Code, Report, PPT, Synopsis, IEEE Research Paper and HD Video Explanation
algorithms btech-project cipher-algorithms ciphers college-project college-projects computer-science-project cryptography cryptography-algorithms cryptography-tools cse-project data-science final-year-project final-year-projects finalyearproject ieee machine-learning mtech-project python research-paper
Last synced: 27 Oct 2024
https://github.com/robmarkcole/hass-data-detective
Explore and analyse your Home Assistant data
data data-science home home-assistant home-automation
Last synced: 13 Nov 2024
https://github.com/juliuskunze/jaxnet
Concise deep learning for JAX
data-science deep-learning jax machine-learning neural-networks python
Last synced: 17 Nov 2024
https://github.com/swoop-inc/spark-alchemy
Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive
data-engineering data-science scala spark
Last synced: 12 Oct 2024
https://github.com/stocknear/backend
Backend of stocknear - Open Source Stock Analysis
data data-science fastapi fastify finance javascript machine-learning nodejs pocketbase python redis
Last synced: 13 Nov 2024
https://github.com/nshiab/simple-data-analysis.js
Easy-to-use and high-performance JavaScript library for data analysis.
data data-analysis data-science duckdb javascript nodejs typescript
Last synced: 12 Aug 2024
https://github.com/tirendazacademy/pandas-tutorial
Jupyter Notebooks and Data Sets for Pandas Library
data data-analysis data-preprocessing data-science machine-learning pandas pandas-dataframe pandas-datareader pandas-library pandas-python pandas-series pandas-tricks-for-data-manipulation pandas-tutorial python
Last synced: 15 Nov 2024
https://github.com/nshiab/simple-data-analysis
Easy-to-use and high-performance JavaScript library for data analysis.
data data-analysis data-science duckdb javascript nodejs typescript
Last synced: 28 Oct 2024
https://github.com/ptyadana/data-science-and-machine-learning-projects-dojo
Collections of data science, machine learning, and data visualization projects using pandas, sklearn, matplotlib, TensorFlow 2, Keras, and various ML algorithms such as random forest classifiers and boosting.
boosting-algorithms data-analysis data-science data-visualization deep-learning keras machine-learning machine-learning-algorithms natural-language-processing pandas probability-statistics scikit-learn seaborn tensorflow
Last synced: 15 Nov 2024
https://github.com/d5555/TagEditor
🏖TagEditor - Annotation tool for spaCy
annotation annotation-tool coreference-resolution data-science labeling-tool machine-learning named-entities named-entity-recognition natural-language-processing neural-networks neuralcoref nlp spacy spacy-visualizer tagging-tool text-annotation text-tagging training-data
Last synced: 18 Nov 2024
https://github.com/cyb3r-monk/rita-j
Implementation of RITA (Real Intelligence Threat Analytics) in Jupyter Notebook with improved scoring algorithm.
cybersecurity data-science dfir jupyter-notebook threat-hunting
Last synced: 15 Nov 2024
https://github.com/eurostat/gridviz
A package for visualizing gridded data 🌐
cartography csv d3 data data-analysis data-science data-visualization datascience geospatial gis gridded-statistics grids gridviz map map-making mapping mapping-tools maps visualization webgl
Last synced: 18 Nov 2024
https://github.com/coqui-ai/Trainer
🐸 - A general purpose model trainer, as flexible as it gets
ai data-science deep-learning machine-learning pytorch
Last synced: 07 Aug 2024
https://kevinheavey.github.io/modern-polars/
Code and data for the Modern Polars book
data-analytics data-engineering data-science dataengineering pandas polars python
Last synced: 04 Aug 2024
https://github.com/kevin-hanselman/dud
A lightweight CLI tool for versioning data alongside source code and building data pipelines.
data-engineering data-pipelines data-science dataset dvcs machine-learning mlops
Last synced: 26 Oct 2024
https://github.com/SETL-Framework/setl
A simple Spark-powered ETL framework that just works 🍺
big-data data-analysis data-engineering data-science data-transformation dataset etl etl-pipeline framework machine-learning modularization pipeline scala setl spark
Last synced: 08 Nov 2024
https://github.com/setl-framework/setl
A simple Spark-powered ETL framework that just works 🍺
big-data data-analysis data-engineering data-science data-transformation dataset etl etl-pipeline framework machine-learning modularization pipeline scala setl spark
Last synced: 12 Oct 2024
https://github.com/giswqs/geebook
Earth Engine and Geemap: Geospatial Data Science with Python
data-science dataviz earth-engine geemap geopython geospatial google-earth-engine ipyleaflet ipywidgets jupyter mapping python
Last synced: 15 Nov 2024
https://github.com/azure/datasciencevm
Tools and Docs on the Azure Data Science Virtual Machine (http://aka.ms/dsvm)
ai azure big-data data-analysis data-science deep-learning dsvm machine-learning ml python r sqlserver
Last synced: 07 Oct 2024
https://github.com/Azure/DataScienceVM
Tools and Docs on the Azure Data Science Virtual Machine (http://aka.ms/dsvm)
ai azure big-data data-analysis data-science deep-learning dsvm machine-learning ml python r sqlserver
Last synced: 08 Aug 2024
https://github.com/sicara/sicarator
Instant Setup & Best Quality for Data Projects!
data-science generator machine-learning python
Last synced: 14 Nov 2024
https://github.com/oracle-samples/oci-data-science-ai-samples
This repo contains a series of tutorials and code examples highlighting different features of the OCI Data Science and AI services, along with a release vehicle for experimental programs.
ai conda data-science data-science-notebooks deep-learning jupyter-notebook machine-learning oci oracle-cloud-infrastructure python
Last synced: 13 Nov 2024
https://github.com/capeprivacy/cape-dataframes
Privacy transformations on Spark and Pandas dataframes backed by a simple policy language.
collaboration data-science hacktoberfest machine-learning pandas policy privacy python spark
Last synced: 14 Nov 2024
https://github.com/Oxen-AI/Oxen
Oxen.ai's core Rust library, server, and CLI
artificial-intelligence data-science database machine-learning version-control
Last synced: 17 Aug 2024
https://github.com/fedora-infra/fedmsg
Federated Messaging with ZeroMQ
data-science fedora-project message-bus python zeromq
Last synced: 20 Aug 2024
https://github.com/kdr-aus/ogma
Scripting language focused on processing tabular data.
data-science language rust scripting-language table-data
Last synced: 30 Oct 2024
https://github.com/brendanhasz/probflow
A Python package for building Bayesian models with TensorFlow or PyTorch
bayesian bayesian-inference bayesian-methods bayesian-neural-networks bayesian-statistics data-science machine-learning python pytorch statistics tensorflow
Last synced: 13 Oct 2024
https://github.com/curiousily/machine-learning-from-scratch
Succinct Machine Learning algorithm implementations from scratch in Python, solving real-world problems (Notebooks and Book). Examples of Logistic Regression, Linear Regression, Decision Trees, K-means clustering, Sentiment Analysis, Recommender Systems, Neural Networks and Reinforcement Learning.
artificial-intelligence book classification data-science machine-learning machine-learning-algorithms neural-networks notebook recommender-systems regression reinforcement-learning sentiment-analysis
Last synced: 18 Nov 2024
https://github.com/learnbyexample/py_resources
Collection of Python learning resources
curated-list data-science learning machine-learning python resources scientific-computing
Last synced: 13 Nov 2024
https://github.com/denizyuret/autograd.jl
Julia port of the Python autograd package.
autograd automatic-differentiation data-science deep-learning knet machine-learning neural-networks
Last synced: 15 Oct 2024
https://learnbyexample.github.io/py_resources/
Collection of Python learning resources
curated-list data-science learning machine-learning python resources scientific-computing
Last synced: 02 Nov 2024
https://github.com/youssefhosni/my-medium-articles-friendly-links
Friendly links to all of my Medium articles
data-science deep-learning machine-learning python
Last synced: 07 Nov 2024
https://github.com/maxheld83/ghactions
GitHub actions for R and accompanying R package
cicd continous-delivery continous-integration data-science devops github github-actions rstats setup
Last synced: 31 Oct 2024
https://github.com/tirthajyoti/ds-with-pysimplegui
Data science and machine learning GUI programs and desktop apps built with the PySimpleGUI package
analytics application artificial-intelligence data-science desktop-app gui machine-learning python windows
Last synced: 15 Nov 2024
https://github.com/rsokl/learning_python
Source material for Python Like You Mean It
data-science educational numpy numpy-tutorial python python-tutorial textbook tutorial
Last synced: 17 Nov 2024
https://github.com/dlab-berkeley/Python-Fundamentals-Legacy
D-Lab's 12-hour introduction to Python. Learn how to create variables and functions, use control flow structures, use libraries, import data, and more, using Python and Jupyter Notebooks.
data-science introduction-to-python jupyter python
Last synced: 11 Nov 2024
https://github.com/pydatablog/python-for-data-science
A blog for data analytics using data science technologies
Last synced: 15 Nov 2024
https://github.com/hugohadfield/kalmangrad
Automated, smooth, Nth-order derivatives of non-uniformly sampled time series data
data-science derivatives kalman-filter signal-processing smoothing
Last synced: 23 Oct 2024
https://github.com/Automunge/AutoMunge
Tabular feature encoding pipelines for machine learning with options for string parsing, missing data infill, and stochastic perturbations.
Last synced: 27 Oct 2024
https://github.com/apachecn/ds-ai-tech-notes
📖 [Translation] Data science and artificial intelligence technical notes
ai data-science matplotlib notes numpy python sklearn
Last synced: 10 Oct 2024
https://github.com/ahammadmejbah/machine-learning-book-collections
Machine learning is the study and development of data-driven methods that improve task performance. It is a subfield of AI.
data-science deep-learning machine-learning
Last synced: 11 Nov 2024
https://github.com/anthony-wang/BestPractices
Things that you should (and should not) do in your Materials Informatics research.
best-practices common-pitfalls data-science example-code interactive-notebooks jupyter jupyter-notebooks machine-learning materials-informatics materials-science neural-networks python
Last synced: 13 Nov 2024
https://github.com/google/starthinker
Reference framework for building data workflows provided by Google. Accelerates authentication, logging, scheduling, and deployment of solutions using GCP. To borrow a tagline: "The framework for professionals with deadlines."
airflow app-engine automation bigquery cloud-functions cm360 colab-notebook data-science django dv360 google-ads google-analytics logger python scheduler ui workflows
Last synced: 29 Sep 2024
https://github.com/lamastex/scalable-data-science
Scalable Data Science: course sets on big data using Apache Spark on Databricks, with their mathematical, statistical, and computational foundations using SageMath.
apache-spark data-science databricks scala
Last synced: 12 Oct 2024
https://github.com/robb/rbbjson
Flexible JSON traversal for rapid prototyping.
data-science json jsonpath prototyping swift
Last synced: 27 Oct 2024
https://github.com/unnati-xyz/scalable-data-science-platform
Content for architecting a data science platform for products using Luigi, Spark & Flask.
data-engineer data-pipeline data-science luigi machine-learning rest-api spark
Last synced: 07 Aug 2024
https://github.com/davendw49/k2
Code and datasets for the paper "K2: A Foundation Language Model for Geoscience Knowledge Understanding and Utilization" (WSDM 2024)
ai4science data-science geoai geoscience kg large-language-models llm
Last synced: 02 Nov 2024
https://github.com/solegalli/machine-learning-imbalanced-data
Code repository for the online course Machine Learning with Imbalanced Data
data-science imbalanced-classification imbalanced-data imbalanced-learning machine-learning python
Last synced: 13 Nov 2024