Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-01-03 00:07:32 UTC
- JSON Representation
https://github.com/visactor/vtable
VTable is not just a high-performance multidimensional data analysis table, but also a grid artist that creates art between rows and columns.
canvas-table data-analysis data-visualization database datagrid grid javascript-table javescript list-table list-tree online-excel pivot-chart pivot-grid pivot-tables react-table sparklines spreadsheet tree-table visualization vue-table
Last synced: 28 Dec 2025
https://github.com/weijie-chen/linear-algebra-with-python
Lecture Notes for Linear Algebra Featuring Python. This series of lecture notes will walk you through all the must-know concepts that set the foundation of data science or advanced quantitative skillsets. Suitable for statistician/econometrician, quantitative analysts, data scientists and etc. to quickly refresh the linear algebra with the assistance of Python computation and visualization.
computational-science data-analysis data-science data-visualization diagonalization eigenvalues eigenvectors gram-schmidt jupyter linear-algebra linear-transformations mathematics matrix matrix-calculations multivariate-normal-distribution null-space python singular-value-decomposition symmetric-matrices vector-space
Last synced: 15 May 2025
https://github.com/Kanaries/graphic-walker
An open source alternative to Tableau. Embeddable visual analytic
bi data data-analysis data-mining data-visualization eda k6s kanaries low-code pivot-table react tableau tableau-alternative typescript vega vega-lite visualization
Last synced: 14 Mar 2025
https://github.com/secretflow/secretflow
A unified framework for privacy-preserving data analysis and machine learning
confidential-computing data-analysis differential-privacy federated-learning homomorphic-encryption machine-learning privacy-preserving private-set-intersection secure-multiparty-computation split-learning trusted-execution-environment
Last synced: 04 Apr 2025
https://krzjoa.github.io/awesome-python-data-science/
Probably the best curated list of data science software in Python.
awesome awesome-list awesome-python data-analysis data-science data-visualization deep-learning machine-learning python scikit-learn statistics
Last synced: 17 Oct 2025
https://github.com/unslothai/hyperlearn
2-2000x faster ML algos, 50% less memory usage, works on all hardware - new and old.
data-analysis data-science deep-learning econometrics gpu machine-learning neural-network optimization python pytorch regression-models research scikit-learn statistics statsmodels tensor
Last synced: 04 Oct 2025
https://github.com/weijie-chen/Linear-Algebra-With-Python
Lecture Notes for Linear Algebra Featuring Python. This series of lecture notes will walk you through all the must-know concepts that set the foundation of data science or advanced quantitative skillsets. Suitable for statistician/econometrician, quantitative analysts, data scientists and etc. to quickly refresh the linear algebra with the assistance of Python computation and visualization.
computational-science data-analysis data-science data-visualization diagonalization eigenvalues eigenvectors gram-schmidt jupyter linear-algebra linear-transformations mathematics matrix matrix-calculations multivariate-normal-distribution null-space python singular-value-decomposition symmetric-matrices vector-space
Last synced: 27 Mar 2025
https://github.com/tabixio/tabix
Tabix.io UI
bi business-intelligence businessintelligence clickhouse dashboard data-analysis data-visualization sql-query tabix
Last synced: 15 May 2025
https://github.com/lana-k/sqliteviz
Instant offline SQL-powered data visualisation in your browser
charting csv data-analysis pivot pivot-table plotly plotting sql sqlite visualization
Last synced: 28 Dec 2025
https://github.com/justmarkham/pandas-videos
Jupyter notebook and datasets from the pandas video series
data-analysis data-cleaning data-science jupyter-notebook pandas python tutorial
Last synced: 15 May 2025
https://github.com/running-elephant/datart
Datart is a next generation Data Visualization Open Platform
analytics bi business-analytics business-intelligence chart d3 dashboard data-analysis data-analytics data-engineering data-visualization data-viz datart davinci display echarts react report sql-editor typescript
Last synced: 14 May 2025
https://github.com/elementary-data/elementary
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
analytics-engineer bigquery data-analysis data-governance data-lineage data-observability data-pipeline data-pipelines data-reliability data-warehouse dataops dbt dbt-artifacts dbt-packages lineage redshift snowflake
Last synced: 12 May 2025
https://github.com/nannyml/nannyml
nannyml: post-deployment data science in python
data-analysis data-drift data-science deep-learning jupyter-notebook machine-learning machinelearning ml mlops model-monitoring monitoring performance-estimation performance-monitoring postdeploymentdatascience python visualization
Last synced: 14 May 2025
https://github.com/NannyML/nannyml
nannyml: post-deployment data science in python
data-analysis data-drift data-science deep-learning jupyter-notebook machine-learning machinelearning ml mlops model-monitoring monitoring performance-estimation performance-monitoring postdeploymentdatascience python visualization
Last synced: 05 May 2025
https://github.com/rilldata/rill
Rill is a tool for effortlessly transforming data sets into powerful, opinionated dashboards using SQL. BI-as-code.
bi business-analytics csv data data-analysis data-visualization dataviz duckdb gcs golang parquet parquet-tools parquet-viewer s3 sql sql-editor svelte sveltejs sveltekit
Last synced: 13 May 2025
https://github.com/pymc-devs/pymc-resources
PyMC educational resources
bayesian-inference bayesian-statistics data-analysis data-science
Last synced: 14 May 2025
https://github.com/chris1610/pbpython
Code, Notebooks and Examples from Practical Business Python
data-analysis data-visualization datascience pandas python scikit-learn
Last synced: 15 May 2025
https://github.com/pretzelai/pretzelai
The modern replacement for Jupyter Notebooks
analytics artificial-intelligence business-intelligence businessintelligence dashboard data data-analysis data-analytics data-science data-visualization duckdb notebooks open-source prql reporting sql sql-editor sql-editor-online visualization wasm
Last synced: 14 May 2025
https://github.com/vizzuhq/vizzu-lib
Library for animated data visualizations and data stories.
animation chart charting charting-library charts dashboard data-analysis data-visualization datavisualization dataviz graph graphs javascript javascript-library plotting storytelling
Last synced: 13 May 2025
https://github.com/tiledb-inc/tiledb
The Universal Storage Engine
arrays data-analysis data-science dataframes dense-data hdfs s3 s3-storage scientific-computing sparse-arrays sparse-data storage-engine tiledb
Last synced: 13 May 2025
https://github.com/rilldata/rill-developer
Rill is a tool for effortlessly transforming data sets into powerful, opinionated dashboards using SQL. BI-as-code.
bi business-analytics csv data data-analysis data-visualization dataviz duckdb gcs golang parquet parquet-tools parquet-viewer s3 sql sql-editor svelte sveltejs sveltekit
Last synced: 08 Mar 2025
https://github.com/DAGWorks-Inc/hamilton
Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage/tracing and metadata. Runs and scales everywhere python does.
dag data-analysis data-engineering data-science dataframe etl etl-framework etl-pipeline feature-engineering hacktoberfest lineage llmops machine-learning mlops orchestration pandas python rag software-engineering
Last synced: 26 Mar 2025
https://github.com/man-group/arcticdb
ArcticDB is a high performance, serverless DataFrame database built for the Python Data Science ecosystem.
big-data data data-analysis data-science database dataframe pandas quantitative-analysis quantitative-finance quantitative-trading
Last synced: 13 May 2025
https://github.com/TileDB-Inc/TileDB
The Universal Storage Engine
arrays data-analysis data-science dataframes dense-data hdfs s3 s3-storage scientific-computing sparse-arrays sparse-data storage-engine tiledb
Last synced: 28 Mar 2025
https://github.com/h2oai/datatable
A Python package for manipulating 2-dimensional tabular data structures
data-analysis data-structure ftrl performance python
Last synced: 13 May 2025
https://github.com/cannylab/tsne-cuda
GPU Accelerated t-SNE for CUDA with Python bindings
barnes-hut barnes-hut-tsne cuda data-analysis data-visualization fit-tsne gpu mnist multithreading python tsne tsne-algorithm tsne-cuda
Last synced: 14 May 2025
https://github.com/CannyLab/tsne-cuda
GPU Accelerated t-SNE for CUDA with Python bindings
barnes-hut barnes-hut-tsne cuda data-analysis data-visualization fit-tsne gpu mnist multithreading python tsne tsne-algorithm tsne-cuda
Last synced: 15 Mar 2025
https://github.com/apachecn/python_data_analysis_and_mining_action
《python数据分析与挖掘实战》的代码笔记
data-analysis data-science python3 readingnotes
Last synced: 08 Apr 2025
https://github.com/deepnote/deepnote
Deepnote is a drop-in replacement for Jupyter with an AI-first design, sleek UI, new blocks, and native data integrations. Use Python, R, and SQL locally in your favorite IDE, then scale to Deepnote cloud for real-time collaboration, Deepnote agent, and deployable data apps. https://deepnote.com/
artificial-intelligence data data-analysis data-science data-visualization deepnote eda jupyter jupyterhub jupyterlab machine-learning notebooks python r sql
Last synced: 18 Nov 2025
https://github.com/404notf0und/AI-for-Security-Learning
安全场景、基于AI的安全算法和安全数据分析业界实践
data-analysis data-mining machine-learning security
Last synced: 27 Apr 2025
https://github.com/404notf0und/ai-for-security-learning
安全场景、基于AI的安全算法和安全数据分析业界实践
data-analysis data-mining machine-learning security
Last synced: 25 Mar 2025
https://github.com/jadianes/spark-py-notebooks
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
big-data bigdata data-analysis data-science ipython ipython-notebook machine-learning mllib notebook pyspark python spark
Last synced: 15 May 2025
https://github.com/justmarkham/dat8
General Assembly's 2015 Data Science course in Washington, DC
clustering course data-analysis data-cleaning data-science data-visualization decision-trees ensemble-learning jupyter-notebook linear-regression logistic-regression machine-learning model-evaluation naive-bayes natural-language-processing pandas python regular-expressions scikit-learn web-scraping
Last synced: 15 May 2025
https://github.com/re-data/re-data
re_data - fix data issues before your users & CEO would discover them 😊
data-analysis data-monitoring data-observability data-quality data-quality-checks data-quality-monitoring data-reliability data-testing dataquality dbt dbt-packages open-source-tooling
Last synced: 14 May 2025
https://github.com/datageartech/datagear
DataGear数据可视化分析平台,自由制作任何您想要的数据看板
bi business-intelligence chart data-analysis data-analytics data-visualization echarts
Last synced: 14 May 2025
https://github.com/microsoft/responsible-ai-toolbox
Responsible AI Toolbox is a suite of tools providing model and data exploration and assessment user interfaces and libraries that enable a better understanding of AI systems. These interfaces and libraries empower developers and stakeholders of AI systems to develop and monitor AI more responsibly, and take better data-driven actions.
data-analysis data-science data-visualization error-analysis explainability explainable-ai explainable-ml fairness fairness-ai fairness-ml interpretability jupyter machine-learning machinelearning ml responsible-ai ui visualization widget widgets
Last synced: 13 May 2025
https://github.com/ecmadao/hacknical
Hacknical, hacker & technical. A website for GitHub user to make a better resume.
contribute-languages contributions data-analysis github github-analysis github-commits github-contributions reac react resume resume-template
Last synced: 08 Apr 2025
https://github.com/nubank/fklearn
fklearn: Functional Machine Learning
data-analysis data-science machine-learning ml python
Last synced: 13 May 2025
https://github.com/hi-primus/optimus
:truck: Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
big-data-cleaning bigdata cudf dask dask-cudf data-analysis data-cleaner data-cleaning data-cleansing data-exploration data-extraction data-preparation data-profiling data-science data-transformation data-wrangling machine-learning pyspark spark
Last synced: 14 May 2025
https://github.com/DataBrewery/cubes
[NOT MAINTAINED] Light-weight Python OLAP framework for multi-dimensional data analysis
cube data data-analysis data-warehouse multidimensional-analysis olap sql
Last synced: 26 Mar 2025
https://github.com/man-group/ArcticDB
ArcticDB is a high performance, serverless DataFrame database built for the Python Data Science ecosystem.
big-data data data-analysis data-science database dataframe pandas quantitative-analysis quantitative-finance quantitative-trading
Last synced: 12 Mar 2025
https://github.com/sepandhaghighi/pycm
Multi-class confusion matrix library in Python
accuracy ai artificial-intelligence classification confusion-matrix data data-analysis data-mining data-science deep-learning deeplearning evaluation machine-learning mathematics matrix ml multiclass-classification neural-network statistical-analysis statistics
Last synced: 13 May 2025
https://github.com/capitalone/dataprofiler
What's in your data? Extract schema, statistics and entities from datasets
avro csv data-analysis data-labels data-science dataprofiling dataset gdpr graph-data machine-learning network-data nlp npi pandas pii privacy python security sensitive-data tabular-data
Last synced: 14 May 2025
https://github.com/capitalone/DataProfiler
What's in your data? Extract schema, statistics and entities from datasets
avro csv data-analysis data-labels data-science dataprofiling dataset gdpr graph-data machine-learning network-data nlp npi pandas pii privacy python security sensitive-data tabular-data
Last synced: 02 Apr 2025
https://github.com/Litlyx/litlyx
Powerful Analytics Solution. Setup in 30 seconds. Display all your data on a Simple, AI-powered dashboard. Fully self-hostable and GDPR compliant. Alternative to Google Analytics, MixPanel, Plausible, Umami & Matomo.
ai analytics angular charts data data-analysis data-visualization javascript metrics nextjs nodejs nuxt open-source react statistics typescript vue website
Last synced: 25 Aug 2025
https://github.com/ptyadana/sql-data-analysis-and-visualization-projects
SQL data analysis & visualization projects using MySQL, PostgreSQL, SQLite, Tableau, Apache Spark and pySpark.
apache-spark challenges data-analysis digital-music-store exercises mysql mysql-database mysql-notes mysqlworkbench pgadmin postgres postgresql pyspark python sql sql-data-analysis sql-queries sqlite tableau
Last synced: 16 May 2025
https://github.com/sfirke/janitor
simple tools for data cleaning in R
data-analysis data-cleaning data-science dirty-data excel pivot-tables r spss tabulations tidyverse
Last synced: 13 May 2025
https://github.com/googlecloudplatform/data-science-on-gcp
Source code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017
cloud-computing data-analysis data-engineering data-pipeline data-processing data-science data-visualization machine-learning
Last synced: 14 Apr 2025
https://github.com/skrub-data/skrub
Machine learning with dataframes
data data-analysis data-cleaning data-preparation data-preprocessing data-science data-wrangling dataframe dataframes dirty-data machine-learning
Last synced: 13 May 2025
https://github.com/data-forge/data-forge-ts
The JavaScript data transformation and analysis toolkit inspired by Pandas and LINQ.
csv data data-analysis data-cleaning data-cleansing data-forge data-management data-manipulation data-munging data-visualization data-wrangling javascript json linq nodejs pandas visualization
Last synced: 13 May 2025
https://github.com/patmartin/dex
Dex : The Data Explorer -- A data visualization tool written in Java/Groovy/JavaFX capable of powerful ETL and publishing web visualizations.
d3 d3js data-analysis data-mining data-science data-visualization datavis datavisualization dataviz groovy java javafx visualization
Last synced: 16 May 2025
https://github.com/PatMartin/Dex
Dex : The Data Explorer -- A data visualization tool written in Java/Groovy/JavaFX capable of powerful ETL and publishing web visualizations.
d3 d3js data-analysis data-mining data-science data-visualization datavis datavisualization dataviz groovy java javafx visualization
Last synced: 04 May 2025
https://github.com/GoogleCloudPlatform/data-science-on-gcp
Source code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017
cloud-computing data-analysis data-engineering data-pipeline data-processing data-science data-visualization machine-learning
Last synced: 19 Jul 2025
https://github.com/dongsuo/vue-data-board
A Data Analysis Board in Vue.
bi big-data-analytics business-intelligence data-analysis data-analysis-board data-visualization databoard drag echarts element-ui no-code visualization vue
Last synced: 27 Mar 2025
https://github.com/singer-io/getting-started
This repository is a getting started guide to Singer.
data-analysis etl etl-framework python singer
Last synced: 14 May 2025
https://github.com/starpig1129/datagen
DATAGEN: AI-driven multi-agent research assistant automating hypothesis generation, data analysis, and report writing. Now expanding into crypto market intelligence. Learn more: https://datagen.digital/.
agent ai ai-data-analysis artificial-intelligence code-generation data-analysis data-analytics data-science langchain langgraph large-language-model large-language-models llm multiagent-systems python
Last synced: 14 May 2025
https://github.com/alan-turing-institute/clevercsv
CleverCSV is a Python package for handling messy CSV files. It provides a drop-in replacement for the builtin CSV module with improved dialect detection, and comes with a handy command line application for working with CSV files.
csv csv-converter csv-export csv-files csv-format csv-import csv-parser csv-parsing csv-reader csv-reading data-analysis data-mining data-science datascience machine-learning python python-library python3
Last synced: 13 May 2025
https://github.com/starpig1129/AI-Data-Analysis-MultiAgent
DATAGEN: AI-driven multi-agent research assistant automating hypothesis generation, data analysis, and report writing. Now expanding into crypto market intelligence. Learn more: https://datagen.digital/.
agent ai ai-data-analysis artificial-intelligence code-generation data-analysis data-analytics data-science langchain langgraph large-language-model large-language-models llm multiagent-systems python
Last synced: 02 May 2025
https://github.com/uxlfoundation/scikit-learn-intelex
Extension for Scikit-learn is a seamless way to speed up your Scikit-learn application
ai-inference ai-machine-learning ai-training analytics big-data data-analysis gpu machine-learning machine-learning-algorithms oneapi python scikit-learn swrepo
Last synced: 11 Dec 2025
https://github.com/alan-turing-institute/CleverCSV
CleverCSV is a Python package for handling messy CSV files. It provides a drop-in replacement for the builtin CSV module with improved dialect detection, and comes with a handy command line application for working with CSV files.
csv csv-converter csv-export csv-files csv-format csv-import csv-parser csv-parsing csv-reader csv-reading data-analysis data-mining data-science datascience machine-learning python python-library python3
Last synced: 26 Mar 2025
https://github.com/VisActor/VTable
VTable is not just a high-performance multidimensional data analysis table, but also a grid artist that creates art between rows and columns.
canvas-table data-analysis data-visualization database datagrid grid javascript-table javescript list-table list-tree online-excel pivot-chart pivot-grid pivot-tables sparklines spreadsheet table tree-chart tree-table visualization
Last synced: 06 Aug 2025
https://github.com/starpig1129/DATAGEN
DATAGEN: AI-driven multi-agent research assistant automating hypothesis generation, data analysis, and report writing. Now expanding into crypto market intelligence. Learn more: https://datagen.digital/.
agent ai ai-data-analysis artificial-intelligence code-generation data-analysis data-analytics data-science langchain langgraph large-language-model large-language-models llm multiagent-systems python
Last synced: 17 Nov 2025
https://github.com/machow/siuba
Python library for using dplyr like syntax with pandas and SQL
data-analysis dplyr pandas python sql
Last synced: 15 May 2025
https://github.com/nfstream/nfstream
NFStream: a Flexible Network Data Analysis Framework.
artificial-intelligence cybersecurity data-analysis data-mining data-science dataset-generation deep-packet-inspection machine-learning ndpi netflow network-analysis network-monitoring network-security packet-analyser packet-capture pcap python traffic-analysis traffic-classification
Last synced: 14 May 2025
https://github.com/litlyx/litlyx
Powerful Analytics Solution. Setup in 30 seconds. Display all your data on a Simple, AI-powered dashboard. Fully self-hostable and GDPR compliant. Alternative to Google Analytics, MixPanel, Plausible, Umami & Matomo.
ai analytics angular charts data data-analysis data-visualization javascript metrics nextjs nodejs nuxt open-source react statistics typescript vue website
Last synced: 14 May 2025
https://github.com/predict-idlab/plotly-resampler
Visualize large time series data with plotly.py
data-analysis data-science data-visualization plotly plotly-dash python time-series visualization
Last synced: 13 May 2025
https://github.com/apachecn/pyda-2e-zh
:book: [译] 利用 Python 进行数据分析 · 第 2 版
book data-analysis numpy pandas pyda python
Last synced: 12 Apr 2025
https://github.com/LongOnly/Quantitative-Notebooks
Educational notebooks on quantitative finance, algorithmic trading, financial modelling and investment strategy
algorithmic-trading algotrading asset-allocation asset-management asset-pricing data-analysis data-science financial-analysis jupyter machine-learning notebook pairs-trading python quantitative-finance quantitative-trading stock-trading trading-algorithms trading-strategies
Last synced: 30 Mar 2025
https://github.com/ruc-datalab/deepanalyze
DeepAnalyze is the first agentic LLM for autonomous data science.
agent agentic agentic-ai ai ai-scientist chatbot chatgpt data data-analysis data-engineering data-science data-visualization database gpt llama llm qwen science structured-data vllm
Last synced: 11 Nov 2025
https://github.com/comet-ml/kangas
🦘 Explore multimedia datasets at scale
data-analysis data-exploration dataframe datagrid machine-learning
Last synced: 14 May 2025
https://github.com/longonly/quantitative-notebooks
Educational notebooks on quantitative finance, algorithmic trading, financial modelling and investment strategy
algorithmic-trading algotrading asset-allocation asset-management asset-pricing data-analysis data-science financial-analysis jupyter machine-learning notebook pairs-trading python quantitative-finance quantitative-trading stock-trading trading-algorithms trading-strategies
Last synced: 02 Oct 2025
https://github.com/xinglie/report-designer
⚡打印设计、可视化、标签打印、编辑器、设计器、数据分析、报表设计、组件化、表单设计、h5页面、调查问卷、pdf生成、流程图、试卷、SVG、图形元素、物联网、标签纸
cloud-print data-analysis data-visualization editor h5-creator h5-editor h5-maker iot-demo layouts-and-renderings online-design online-printing printer snapshot visiual-editor xinglie
Last synced: 16 Mar 2025
https://github.com/d4t4x/data-selfie
Data Selfie - a browser extension to track yourself on Facebook and analyze your data.
chrome-extension data-analysis data-dashboard firefox-addon privacy
Last synced: 12 Apr 2025
https://github.com/databricks/lilac
Curate better data for LLMs
artificial-intelligence data-analysis dataset-analysis unstructured-data
Last synced: 10 Mar 2025
https://github.com/markwk/qs_ledger
Quantified Self Personal Data Aggregator and Data Analysis
apple-health data-analysis data-visualization fitbit habitica instapaper kindle kindle-highlights lastfm personal-data pocket quantified-self rescuetime self-tracking strava todoist toggl
Last synced: 03 Apr 2025
https://github.com/hurshd0/must-read-papers-for-ml
Collection of must read papers for Data Science, or Machine Learning / Deep Learning Engineer
convolutional-networks data-analysis data-science deep-learning exploratory-data-analysis generalized-additive-models machine-learning neural-networks papers recommender-system recurrent-neural-networks rnn-lstm
Last synced: 10 Apr 2025
https://github.com/ipython-books/cookbook-2nd
IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018
computing data-analysis data-mining data-science data-visualization ipython jupyter jupyter-notebook machine-learning numerical-computation python visualization
Last synced: 16 May 2025
https://github.com/bruin-data/bruin
Build data pipelines with SQL and Python, ingest data from different sources, add quality checks, and build end-to-end flows.
analytics bigquery data-analysis data-ingestion data-modeling data-pipelines data-platform data-transformation python snowflake sql
Last synced: 02 Jan 2026
https://github.com/kotlin/dataframe
Structured data processing in Kotlin
data-analysis data-science dataframe kotlin
Last synced: 04 Jul 2025
https://github.com/latitude-dev/latitude
Developer-first embedded analytics
analytics business-intelligence dashboard data data-analysis data-analytics data-app data-engineering data-science data-visualization duckdb embedded-analytics exploratory-data-analysis javascript-framework open-source react self-hosted sql svelte tailwindcss
Last synced: 09 Nov 2025
https://github.com/apache/cloudberry
One advanced and mature open-source MPP (Massively Parallel Processing) database. Open source alternative to Greenplum Database.
ai big-data c cloudberry data-analysis data-warehouse database distributed-database greenplum mpp olap postgres postgresql sql
Last synced: 14 May 2025
https://github.com/bansalkanav/ultimate-data-science-toolkit---from-python-basics-to-generativeai
aws cnn data-analysis data-science deep-learning-algorithms flask machine-learning mlflow mlops mongodb pandas-python prefect python3 search-engine sklearn-library sql statistics streamlit tutorial-code visualization
Last synced: 15 May 2025
https://github.com/bansalkanav/Ultimate-Data-Science-Toolkit---From-Python-Basics-to-GenerativeAI
aws cnn data-analysis data-science deep-learning-algorithms flask machine-learning mlflow mlops mongodb pandas-python prefect python3 search-engine sklearn-library sql statistics streamlit tutorial-code visualization
Last synced: 16 Apr 2025
https://github.com/visualpython/visualpython
GUI-based Python code generator for data science, extension to Jupyter Lab, Jupyter Notebook and Google Colab.
bigdata chrome-extension code-generator data-analysis jupyter-lab-extension jupyter-notebook-extension jupyterlab-extension pandas python visual-coding
Last synced: 15 May 2025
https://github.com/chawlaavi/daily-dose-of-data-science
A collection of code snippets from the publication Daily Dose of Data Science on Substack: http://www.dailydoseofds.com/
data-analysis data-science data-science-tips data-visualization jupyter jupyter-notebook jupyter-tips matplotlib matplotlib-tips numpy pandas pandas-tips python python-tips sklearn
Last synced: 04 Apr 2025
https://github.com/scikit-hep/awkward
Manipulate JSON-like data with NumPy-like idioms.
apache-arrow cern-root columnar-format data-analysis jagged-array json numba numpy pandas python ragged-array rdataframe scikit-hep
Last synced: 14 May 2025
https://github.com/empathy87/the-elements-of-statistical-learning-python-notebooks
A series of Python Jupyter notebooks that help you better understand "The Elements of Statistical Learning" book
data-analysis data-science machine-learning python sklearn statistical-learning tensorflow tutorials
Last synced: 13 Apr 2025
https://github.com/Kotlin/dataframe
Structured data processing in Kotlin
data-analysis data-science dataframe kotlin
Last synced: 11 Apr 2025
https://github.com/GoogleCloudPlatform/DataflowJavaSDK
Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
big-data data-analysis data-mining data-processing data-science google-cloud-dataflow
Last synced: 01 May 2025
https://github.com/googlecloudplatform/dataflowjavasdk
Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
big-data data-analysis data-mining data-processing data-science google-cloud-dataflow
Last synced: 03 Oct 2025
https://github.com/empathy87/The-Elements-of-Statistical-Learning-Python-Notebooks
A series of Python Jupyter notebooks that help you better understand "The Elements of Statistical Learning" book
data-analysis data-science machine-learning python sklearn statistical-learning tensorflow tutorials
Last synced: 01 May 2025
https://github.com/androz2091/discord-data-package-explorer
🌀 What's really in your Discord Data package?
data-analysis discord discord-data-package statistics
Last synced: 16 May 2025
https://github.com/yoshoku/rumale
Rumale is a machine learning library in Ruby
artificial-intelligence data-analysis data-science machine-learning ml ruby rubyml
Last synced: 29 Apr 2025
https://github.com/Androz2091/discord-data-package-explorer
🌀 What's really in your Discord Data package?
data-analysis discord discord-data-package statistics
Last synced: 31 Mar 2025
https://github.com/ChawlaAvi/Daily-Dose-of-Data-Science
A collection of code snippets from the publication Daily Dose of Data Science on Substack: http://www.dailydoseofds.com/
data-analysis data-science data-science-tips data-visualization jupyter jupyter-notebook jupyter-tips matplotlib matplotlib-tips numpy pandas pandas-tips python python-tips sklearn
Last synced: 04 Oct 2025
https://github.com/mrankitgupta/Data-Analyst-Roadmap
I am sharing my Journey of 66DaysofData into Data Analytics by participating in Ken Jee's #66daysofdata challenge
ankit ankit-gupta ankitgupta data-analysis data-analytics data-science data-structures data-visualization excel mongodb mysql pandas powerbi python sql sql-server tableau
Last synced: 07 Sep 2025
https://github.com/elki-project/elki
ELKI Data Mining Toolkit
anomalydetection cluster-analysis clustering data-analysis data-mining data-mining-algorithms data-science distance-functions index indexing java machine-learning outlier-detection outliers time-series visualization
Last synced: 14 May 2025
https://github.com/xiaopujun/light-chaser
light chaser is a lightweight data visualization designer tool
blueprints data-analysis data-visualization draggable javascript typescript web-editor
Last synced: 16 May 2025
https://github.com/mrankitgupta/data-analyst-roadmap
I am sharing my Journey of 66DaysofData into Data Analytics by participating in Ken Jee's #66daysofdata challenge
ankit ankit-gupta ankitgupta data-analysis data-analytics data-science data-structures data-visualization excel mongodb mysql pandas powerbi python sql sql-server tableau
Last synced: 13 Apr 2025