Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-30 00:07:38 UTC
- JSON Representation
https://github.com/kyle-wannacott/DataCamp-Projects
DataCamp project solutions.
data-analysis data-mining data-science datacamp-projects machine-learning python r
Last synced: 13 Oct 2025
https://github.com/viper373/baidutieba
爬取百度贴吧(指定吧名、起始页数/重点页数、日志输出)
baidutieba-crawler bert data-analysis deep-learning python spider
Last synced: 30 Mar 2025
https://github.com/scailfin/flowserv-core
Reproducible and Reusable Data Analysis Workflow Server
benchmarks data-analysis reproducibility reusability workflows
Last synced: 14 Jan 2026
https://github.com/jupyterphysscilab/documentation
Documentation for the Jupyter Physical Science Lab Suite of Packages
analog-to-digital-converter data-acquisition data-analysis education jupyter-notebooks pandas physical-sciences plotting python raspberry-pi
Last synced: 22 Jan 2026
https://github.com/pizofreude/data-career-navigator
An interactive dashboard providing deep insights into career opportunities for data-related roles, utilizing a comprehensive dataset sourced from LinkedIn. Features include analysis of experience levels, salaries, key skills, job locations, and industry trends, aiding job seekers and professionals in exploring and identifying optimal career paths.
codeinplace data-analysis data-visualization standford-university
Last synced: 13 Mar 2026
https://github.com/hayesall/babybear
🐼 It's like pandas, but tiny.
data-analysis data-analysis-python data-science dataframe python teaching teaching-tool
Last synced: 31 May 2026
https://github.com/quantumudit/alteryx-weekly-challenges
This repository contains Alteryx solutions to the weekly challenges published in Alteryx Community
alteryx alteryx-workflow data-analysis data-science data-transformation data-visualization etl
Last synced: 27 Jan 2026
https://github.com/gher-uliege/liege-colloquium-on-ocean-dynamics
Python tools and latex files for the Colloquium
data-analysis data-assimilation numerical-simulations ocean-modelling oceanography remote-sensing submesoscale turbulence
Last synced: 14 Oct 2025
https://github.com/mustafah/dream-my-plots
Create visual plots in Python with the help of text prompting popular LLMs through langchain
ai artificial-intelligence automation data-analysis data-visualization langchain llms machine-learning plotting python
Last synced: 13 Apr 2026
https://github.com/parisaroozgarian/ibm-data-analyst-professional-certificate
The IBM Data Analyst Professional Certificate, consisting of 9 courses, equips with essential skills in Excel, SQL, Python, data visualization, and analysis techniques
big-data business-analysis business-communication communication data-analysis data-management data-structures data-visualization databases general-statistics human-resources planning python-programming spreedsheet sql
Last synced: 27 Jan 2026
https://github.com/haloapping/malas-ngetik-clf
Saya malas ngetik, makanya saya buat aja template proyek kompetisi Kaggle 😜. Template ini khusus untuk kasus klasifikasi.
data-analysis exploratory-data-analysis feature-engineering kaggle kaggle-competition machine-learning python3 scikit-learn
Last synced: 12 Apr 2026
https://github.com/camille-maslin/simulfcimage
🔍 SimulFCImage: A professional multispectral image processing application developed for ImViA Laboratory.
academic-project computer-vision data-analysis gui-application image-processing image-viewer multispectral-images pyqt5 python scientific-visualization spectral-analysis
Last synced: 05 Feb 2026
https://github.com/yeisonmontoya1815/machine-learning_prediction_can_inflation
we aim to predict trends in the Canadian market basket using sentiment analysis techniques. Sentiment analysis involves analyzing text data to determine the sentiment expressed, whether positive, negative, or neutral.
algorithms-and-data-structures data data-analysis data-science data-visualization feature-engineering machine-learning matplotlib-pyplot numerical-analysis numpy pandas pipelines python sklearn structured-data super unsupervised-learning
Last synced: 05 Feb 2026
https://github.com/victor-lis/regression-ai-model-practice
ai data-analysis python regression-model
Last synced: 01 Apr 2025
https://github.com/martincastroalvarez/django-data-analytics
Data Analytics, PnL, LTV & retention analysis with Django
analytics beautifulsoup4 d3 d3js data-analysis django ltv rest-api visualization
Last synced: 06 May 2026
https://github.com/code-jl/nfl-point-kicker-data-scraper
A Python-based web scraping toolkit that extracts and processes NFL kicking statistics from Pro-Football-Reference. This project automates the collection of comprehensive game data, with a particular focus on field goal attempts and environmental conditions.
automation beautifulsoup csv data-analysis data-collection field-goals football-statistics kicking-stats nfl python selenium sports-analysis statistics weather-data web-scraping
Last synced: 06 Sep 2025
https://github.com/gjbex/python-dashboards
Repository that contains material for training sessions on creating dashboards using Python.
dash dashboard data-analysis data-exploration data-science data-visualization panel python streamlit training training-materials visualization
Last synced: 13 Jul 2025
https://github.com/jonperk318/machine-learning-analysis-of-hyperspectral-data
Using Non-negative Matrix Factorization (NMF) and Variational Autoencoder (VAE) machine learning architectures to analyze spatial and spectral features of hyperspectral cathodoluminescence (CL) spectroscopy images taken from hybrid inorganic-organic perovskite material
data-analysis data-science deep-neural-networks explained-variance hybrid-perovskite hyperspectral-image-classification machine-learning matplotlib nmf non-negative-matrix-factorization python pytorch scikit-learn semi-supervised-learning signal-processing solar-energy spectroscopy unsupervised-learning vae variational-autoencoder
Last synced: 28 Jan 2026
https://github.com/dcs-training/bayesian-statistics
Materials for the CDCS Introduction to Bayesian Statistics course. Go to the readme file
bayesian-statistics data-analysis r statistics
Last synced: 05 Feb 2026
https://github.com/maciekmalachowski/crypto-charts-site
📊Application that returns financial data for selected cryptocurrency.
binance-api data-analysis jupyter-notebook matplotlib mplfinance numpy pandas python python-binance
Last synced: 12 Apr 2026
https://github.com/luminati-io/Amazon-dataset-samples
A sample dataset of over 1,000 Amazon product listings, extracted using the Bright Data API, perfect for competitive analysis, market trends, and eCommerce insights.
amazon api data-analysis data-science dataset ecommerce products web-scraping
Last synced: 09 Apr 2025
https://github.com/aravind-selvam/covid_dashboard
With Covid death and vaccine data. I have created a dashboard.
covid-19 data-analysis data-science data-visualization tableau tableau-public visualization
Last synced: 08 Mar 2026
https://github.com/asifdotexe/timeseriesanalysis
This repository serves as a central hub for all of my projects related to time series analysis. Here, you'll find a collection of projects, code samples, and resources that explore various aspects of time series data and its analysis.
data-analysis feature-engineering jupyter-notebook pandas python time-series-analysis visualization
Last synced: 14 Apr 2026
https://github.com/olow304/goboard
Python Data Analysis Dashboard using Public Dataset, Django
dashboard dashboard-templates data-analysis data-science django jupyter-notebook machine-learning python sklearn
Last synced: 11 Apr 2026
https://github.com/jhrcook/100daysofpython
100 days, at least 1 hour a day, of learning the Python programming language.
100-days-of-code 100daysofcode continued-learning data-analysis data-science decision-trees deep-learning keras keras-tensorflow machine-learning neural-network neural-networks plots python python3 scikit-learn tensorflow
Last synced: 09 Apr 2025
https://github.com/a-r-j/npview
CLI tools for quickly inspecting CSV/TSV & NumPy (.npy) array files
cli csv data-analysis inspector npy numpy python tsv
Last synced: 18 Jan 2026
https://github.com/rvalla/chessevolution
Some code to analyze my chess games using the Lichess API.
chess data-analysis lichess lichess-api python
Last synced: 23 Oct 2025
https://github.com/trybnetic/tu7-acceleration-sleep-wake-classification
Supporting material for the paper ''Discrimination of sleep and wake periods from a hip-worn raw acceleration sensor using recurrent neural networks''
accelerometer accelerometry actigraphy data-analysis sensors sleep
Last synced: 01 Jun 2026
https://github.com/luizassimoes/fitness-report
Create a personalized Fitness Wrapped report using your Apple Health data with this Streamlit application. Generate comprehensive, detailed summaries of your annual fitness activities, providing valuable insights into your year-long progress and achievements.
data-analysis data-visualization python streamlit
Last synced: 12 Feb 2026
https://github.com/nunesma/Health-analytics
Data analysis focusing on health problems
data-analysis epidemiology-analysis health-analytics python r-programming
Last synced: 30 Jul 2025
https://github.com/kevinschoon/qviz
QViz Interactive Plotting
data-analysis data-visualization go gonum qframe yaegi
Last synced: 01 Jun 2026
https://github.com/mljar/enrichment
Data enrichment with AI for pandas DataFrame
data-analysis data-enrichment data-science openai pandas
Last synced: 01 Jul 2025
https://github.com/c0deta1ker/MatBaseX
MatBase provides access to an extensive database of material parameters, inelastic mean free paths (IMFP), photoionization binding energies, cross sections, and asymmetry parameters. Additionally, MatBase includes a suite of functions for users to load, process, model and fit their own data, making it an indispensable tool in the field.
cross-sections crystal-structure crystallography data-analysis data-fitting database electron imfp imfp-calculator-matlab material material-database matlab matlab-application matlab-gui matlab-toolbox pes-modelling photoelectron-spectroscopy photoionization simulation xps
Last synced: 23 Jul 2025
https://github.com/saltiola7/data-analysis-portfolio
Data engineering & analysis portfolio, which showcases my use of Python & SQL
airflow airtable-block anaconda automation back4app chatgpt csv-parser data-analysis data-engineering docker-compose gcp graphql-api jupyter-notebook nosql prefect python rest-api sql streamlit web-scraping
Last synced: 21 Jan 2026
https://github.com/jpcadena/onemetric-plus
OneMetric+ project for analytical tool on demand forecast and outlier detection
black-formatter data-analysis data-analytics data-science data-visualization demand-forecasting isort machine-learning matplotlib mypy numpy outlier-detection pandas pre-commit-hook pydantic python ruff scikit-learn seaborn solid-principles
Last synced: 10 Apr 2026
https://github.com/dcs-training/r-qgisintegratingspatialanalysis
This was an intermediate course of three sessions with a focus on developing skills in data visualisation, analysis and integration using both R studio and QGIS. Go to the readme file
data-analysis data-visualisation data-wrangling gis qgis r spatial-analysis
Last synced: 28 Jan 2026
https://github.com/elfgk/ogretmenanalizantalya
OgretmenAnalizAntalya
analysis data-analysis data-science data-visualization ogretmenanaliz
Last synced: 08 Feb 2026
https://github.com/thennen/py-ivtools
A package for reproducible measurement and analysis of current-voltage characteristics of electronic devices.
current-voltage data-analysis data-visualization electrical-engineering emerging-technology instrumentation measurements
Last synced: 23 Jan 2026
https://github.com/lafayettegabe/g2m-insight-for-cab-investment-firm
📊 Exploratory Data Analysis (EDA) on multiple datasets related to the cab industry in the US, to provide actionable insights and recommendations to a private firm looking to invest in the market. The analysis includes data cleaning, transformation, visualization, and hypothesis testing.
big-data data data-analysis data-science data-visualization eda gotomarket
Last synced: 13 Jun 2025
https://github.com/DCS-training/intromachinelearning
This course is aimed at providing an introduction to machine learning for those with some beginner level python/Rstudio skills. Go to the readme file
data-analysis data-wrangling machine-learning python statistics
Last synced: 25 Apr 2025
https://github.com/alieymsxxn/sql_project_data_job_analysis
This project explores top-paying jobs, in-demand skills, and where high demand meets high salary in data analytics.
data-analysis postgresql sql sqlite
Last synced: 16 Apr 2025
https://github.com/roland045/road_quality_measurement_analysis
Novel road quality measurement system for cost effective pavement monitoring, ML-based
azure data-analysis data-engineering data-science machine-learning mlops model-deployment python sql unsupervised-learning
Last synced: 24 Jan 2026
https://github.com/iamfoysal/data-analysis
This repository contains various examples and exercises to help learn data science using Python.
data-analysis data-science database jupyter-notebook python3
Last synced: 10 Feb 2026
https://github.com/mrjxtr/tokyo_airbnb_analysis_project
Full project case study and analysis to show potential opportunities to start an AirBnb business in Tokyo, Japan.
data-analysis data-cleaning data-science data-visualization pandas python3
Last synced: 24 Feb 2026
https://github.com/anil951/early-detection-of-mental-health
This project develops a predictive model to identify early signs of mental health issues in adolescents using social media activity, school performance, health records, and an AI chatbot. It analyzes emotional tone, academic changes, and health data, offering personalized recommendations and resources for mental wellness.
data-analysis deep-learning early-detection lstm mental-health sentiment-analysis social-media
Last synced: 28 Jan 2026
https://github.com/wiseaidev/corona-virus-data-analysis-modeling-and-visualization
Data analysis of covid-19 and SEIRD model implementation.
coronavirus coronavirus-tracking covid-19 data-analysis data-analysis-python data-visualization folium-maps modeling-dynamic-systems numpy ploty population python3 science science-research seird-model seird-simulator simulation
Last synced: 14 Apr 2025
https://github.com/narius2030/hive-datawarehouse-analysis
Implement a Hive data warehouse to store meaningful data, apply Machine Learning like Clustering or Regression for dealing with business problems
apache-hadoop apache-hive data-analysis etl-pipeline hiveql machine-learning statistics
Last synced: 01 Apr 2025
https://github.com/cosmoduende/r-arduino
Interoperability Data-IoT: How to send and receive data and take control of your Arduino, from R. How to establish interoperability between R and Arduino (Data and IoT) using a data flow between the two
arduino arduino-data arduino-dataflow arduino-serial arduino-serial-data arduino-serial-led arduino-uno data-analysis data-arduino data-cleaning data-iot data-visualization interoperability iot-rstudio r-analytics r-data-visualization r-iot rstudio-arduino serial-read serialport
Last synced: 24 Apr 2026
https://github.com/leechristophermurray/parquetframe
Unlocking the power of Parquets
data data-analysis dataframe entity-framework etl graph interactive python rust workflow worklow zanzibar
Last synced: 28 May 2026
https://github.com/aditiiprasad/whatsstat
A fun and insightful WhatsApp chat analyzer that turns your conversations into beautiful stats, juicy graphs, and quirky insights.
chat-analyzer data-analysis data-visualization nlp streamlit text-processing whatsapp
Last synced: 02 Sep 2025
https://github.com/adriens/endoflife-date-snapshots
Daily consolidated and enriched snapshots of endoflife.date
apache-parquet csv csv-export data-analysis data-science database datavisualization dataviz duckdb duckdb-database end-of-life endoflife eol jupyter-notebook kaggle kaggle-notebook olap python release-policy release-schedule
Last synced: 11 Apr 2026
https://github.com/jshinm/web-scrapper
Web Scrapper used to extract NeuroData github repo stats
Last synced: 04 Apr 2025
https://github.com/walidbosso/r_data_mining
Extract knowledge from a data using different techniques, including Association Rules Hierarchical Agglomerative Clustering (HAC) K-means Clustering Decision Trees
association-rule-mining association-rules clustering data-analysis data-mining data-science data-visualization decision-tree-classifier decision-trees exportation extract-data hac hierarchical-clustering k-means k-means-clustering k-means-r r-programming r-studio
Last synced: 23 Mar 2025
https://github.com/goggle/dataisbeautiful
Some data analysis Jupyter notebooks, mainly indented for submissions on the subreddit /r/dataisbeautiful.
data-analysis data-visualisation jupyter-notebook notebook reddit
Last synced: 16 May 2025
https://github.com/martial2023/bank-performance-analysis
Analyse de données bancaires du Berka Dataset (1993-1998) pour calculer et visualiser des KPI clés
dashboard data-analysis data-visualization nextjs pandas plotly-express pymongo python recharts-js sqlalchemy
Last synced: 26 Aug 2025
https://github.com/grburgess/gbm_kitty
Database, reduce, and analyze GBM data without having to know anything. Curiosity killed the catalog.
3ml catalogue data-analysis fermi-science grbs pipelines
Last synced: 29 Jun 2025
https://github.com/ziaeemehr/itng_nest
Nest Simulator quick guides and examples, adding new model using NESTML
computational-neuroscience data-analysis nest-simulator neuroscience
Last synced: 30 Aug 2025
https://github.com/vishrut-b/end-to-end-data-analytics-with-python-and-sql
This project involves the data cleaning and SQL-based analytics of a retail orders dataset using Python and SQL. It focuses on preprocessing data, followed by detailed analytics to extract insights on sales trends and product performance.
data-analysis python retail sql sql-server sqlalchemy
Last synced: 07 Feb 2026
https://github.com/realkarthiknair/data-science-notes
Data science notes and programs
data-analysis data-science data-visualization
Last synced: 27 Aug 2025
https://github.com/giatraskon/clustering-countries-socioeconomic-health-analysis
Exploration and analysis of socio-economic and health data from 167 countries using MATLAB. Application of clustering algorithms to identify development patterns, visualize disparities, and understand global trends.
calinski-harabasz-index clustering country-data data-analysis data-visualization davies-bouldin-index elbow-method feature-selection health-indicators human-development-index k-means-clustering k-median-clustering k-medoids-clustering machine-learning matlab pca pearson-correlation silhouette-score socio-economic-indicators unsupervised-learning
Last synced: 29 Jan 2026
https://github.com/virajbhutada/cliquebait-digital-marketing-analysis-using-sql
This GitHub repository contains the CliqueBait Digital Marketing Analysis project, utilizing SQL for comprehensive analysis of marketing campaigns, user engagement, product performance, and website interactions within the Clique Bait food app. The project offers actionable insights for optimizing marketing strategies in competitive landscape.
campaign-website data-analysis data-extraction data-science digital-marketing food-store microsoft-excel mysql product-performance sql sql-database sql-project user-engagement website-analytics
Last synced: 27 Feb 2025
https://github.com/yusufcinarci/covid-19-data-analysis-visualization
The first project of our data visualization studies is the COVID-19 data analysis project. In this project, we analyzed the data of the COVID-19 pandemic, which started in the first month of 2020 and still continues to affect the world, on the basis of countries. You can find the brief details of the project we realized in 3 stages in the readme file. We have tried to explain the details of the project step by step below. We wish you healthy days.
covid-19-data-visualization data-analysis data-science data-visualization
Last synced: 22 Jul 2025
https://github.com/ivangrigorov/neutrino-search-engine
Creating Java search engine both for HTML or document type of files
data data-analysis data-knowledge information-extraction information-retrieval java-language search-engine
Last synced: 31 Mar 2025
https://github.com/virajbhutada/music-recommendation-system
This project is designed to provide personalized music recommendations for relaxation and meditation. Leveraging ML and data analysis, the system suggests tracks based on user preferences such as tempo, energy, and genre. Join us in enhancing music discovery through advanced algorithms and community-driven contributions.
data-analysis data-science-projects data-visualization eda html machine-learning ml-algortihms model-deployment model-evaluation music-recommendation-system nlp pivot-table principal-component-analysis python python-library similarity-matrix spotify-data streamlit-web user-experience
Last synced: 24 Jan 2026
https://github.com/aad99bxp/whatsapp-chat-analyzer
A project intended for Business Owners / Managers to analyze Whatsapp chats between their customer care executives and their customers.
data-analysis heroku-deployment python3
Last synced: 15 Mar 2025
https://github.com/gxelab/tutorials
Tutorials of frequently used software packages and libraries in the lab
bioinformatics data-analysis evolution genetics genomics julia python3 r-language statistics visualization
Last synced: 18 Jan 2026
https://github.com/shlokashah/student-depression-and-suicide-rate-prediction
https://shlokashah.github.io/Student-Depression-And-Suicide-Rate-Prediction/
data-analysis data-visualization machine-learning student suicide-rate-prediction
Last synced: 19 Nov 2025
https://github.com/avinesh-masih/data-analytics-assignment
Complete PW Skills Data Analytics Assignments: This repository contains all PW Skills Data Analytics assignments, covering topics like Python, SQL, Statistics, Data Visualization, and more. It includes well-structured solutions with notebooks and queries, ideal for learners seeking clarity and hands-on practice.
ai api data-analysis data-science data-visualization eda flask jupyter-notebook machine-learning matplotlib numpy pandas pw pw-assignment pw-skills-assignment pwskills python seaborn sql statistics
Last synced: 13 Jun 2025
https://github.com/akshat0427/spotify_history
code to find out some insights in spotify streaming data (work in progress)
data-analysis data-visualization
Last synced: 04 Feb 2026
https://github.com/cbg-ethz/scdna-pipe
Python data analysis pipeline for single cell copy number event history reconstruction
bioinformatics bioinformatics-pipeline data-analysis genomics python snakemake snakemake-workflows workflow
Last synced: 05 Jan 2026
https://github.com/johnsesana/eda-liquor-sales
Exploratory Data Analysis on Public Datasets
data-analysis data-visualization sql tableau-dashboards
Last synced: 09 Mar 2026
https://github.com/draym/covid19tracker
Coronavirus COVID-19 dashboard to track global cases
covid-19 covid19-tracker dashboard data-analysis
Last synced: 07 Jan 2026
https://github.com/timzatko/fifa-19-dataset-machine-learning
Player's value prediction and game position classification on FIFA 19 dataset.
data-analysis fifa19 machine-learning scikit-learn
Last synced: 04 May 2026
https://github.com/juicedata/juicefs-deeplearning-tutorials
Deep Learning and Data Analytics Techniques with the help of JuiceFS.
data-analysis deep-learning filesystem juicefs machine-learning
Last synced: 07 Jul 2025
https://github.com/felixcharotte/ibm_datascience_capstone
In this project, we predicted if the SpaceX Falcon 9 first stage will land successfully by following the data science methodology. We also summarized the results for the business stakeholders.
analysis data-analysis data-science data-visualization databases folium jupyter-notebook machine-learning machine-learning-alrgorithms matplotlib pandas plotly plotly-dash python scikit-learn scipy seaborn sql
Last synced: 26 Jul 2025
https://github.com/sn2606/global-temperature-time-series
Time series analysis is performed on the Berkeley Earth Surface Temperature dataset.
arima arima-forecasting arima-model climate-change data-analysis data-visualization forecasting-model global-temperature series-analysis singular-spectrum-analysis time-series time-series-analysis time-series-forecasting
Last synced: 21 May 2026
https://github.com/jakebrehm/demesstify
📱Demystifies your messages and allows for easy analysis and visualization of conversations.
data-analysis data-science imessage messages messaging nlp pandas python sentiment-analysis visualization wordcloud
Last synced: 13 Apr 2025
https://github.com/tameronline/tameronline
Showcasing Projects on Data Analysis, Programming, and AI — Developed Using Python and Modern Frameworks
data-analysis deep-learning flask machine-learning numpy pandas python3 sql web-development
Last synced: 11 Jun 2025
https://github.com/mohd-faizy/07p_tumor-diagnosis-exploratory-data-analysis-on-breast-cancer-wisconsin-dataset
Tumor Diagnosis: Exploratory Data Analysis With Seaborn
data-analysis data-visualization eda exploratory-data-analysis knn-classification pca-analysis python random-forest random-forest-classifier statistics support-vector-machines tumor-detection visualization
Last synced: 17 May 2026
https://github.com/spacebody/mcm-icm-2018-problem-c
The source code of MCM/ICM 2018 Problem C
Last synced: 13 Apr 2025
https://github.com/hongbo-wei/global-status-of-cc-security-certification
Data visualization of CC Security Certification using VUE, Django, and MySQL.
big-date common-criteria data-analysis data-visualisation data-visualization
Last synced: 07 Jul 2025
https://github.com/armanx200/diabetes_model
🚀 A machine learning model predicting diabetes with logistic regression, feature scaling, and VIF analysis. 📊🩺
arman-kianian classification data-analysis data-science data-visualization feature-engineering healthcare logistic-regression machine-learning model-evaluation predictive-modeling python scaling scikit-learn statistical-analysis statsmodels
Last synced: 20 May 2026
https://github.com/ocramz/record-encode
Generic encoding of record types
categorical-data categorical-features data-analysis data-mining data-science generic-programming machine-learning one-hot-encode preprocessing
Last synced: 14 Apr 2025
https://github.com/yash22222/data-analysis-with-python
This repository provides a practical introduction to data acquisition and analysis using Pandas. It covers loading datasets, exploring data, manipulating data, and gaining insights through statistical summaries. Ideal for beginners, it offers code examples and explanations to enhance your data manipulation skills using Pandas for Python.
binning data data-acquisition data-analysis data-binning data-cleaning data-formatting data-integration data-normalization data-preprocessing data-science data-transformation data-wrangling dataframe description numpy pandas pandas-dataframe python python3
Last synced: 09 Apr 2026
https://github.com/frikishaan/browsing-history-analysis
This is a data analysis of my browsing history for the last 7 months.
browsing-history data-analysis jupyter-notebook python
Last synced: 18 May 2026
https://github.com/leonism/customer-predictive-analysis
Explore this repository, a comprehensive resource offering an in-depth guide to conducting customer predictive analysis using cutting-edge machine learning techniques, all within the intuitive framework of Dataiku.
data-analysis data-model data-science data-visualization dataiku machine-learning predictive-modeling
Last synced: 28 Mar 2025
https://github.com/dcs-training/from-spss-to-r-how-to-make-your-statistical-analysis-reproducible
Comfortable/aware of how to run your stats in SPSS? Curious to learn how to run them in R? You've come to the right place. Go to the readme file
data-analysis data-visualisation data-wrangling good-practices-digital-research r rmarkdown spss statistics
Last synced: 25 Jan 2026
https://github.com/gallillio/data_science-data_visualizer_tool
## About Supervised ML Helper is a Python application that streamlines exploratory data analysis (EDA) and preprocessing for supervised machine learning. Featuring a user-friendly Tkinter interface, it enables users to load CSV files, visualize data, and perform essential transformations, making data preparation accessible for all skill levels.
data-analysis data-science data-visualization matplotlib numpy pandas seaborn sklearn
Last synced: 17 Feb 2026
https://github.com/vvipjain/ecommerce-sales-analysis
Ecommerce Sales Analysis
data-analysis pandas pandas-dataframe python sql sqlalchemy
Last synced: 16 Apr 2026
https://github.com/lykmapipo/scala-spark-product-sales-analysis
Scala application to process, and analyze product sales using Spark
anomaly-detection apache-spark apache-spark-sql customer-segmentation data-analysis data-processing lykmapipo market-basket-analysis product-sales product-sales-analysis rolling-average running-total sbt scala summary-statistics time-series-analysis
Last synced: 18 May 2026
https://github.com/mljar/mercury-notebook-apps
Amazing apps build from Python notebooks with Mercury
data-analysis data-science data-visualization jupyter jupyter-notebook jupyterlab mljar python
Last synced: 21 May 2026
https://github.com/aalkiyumi/senior-design-project
Web scraper for collecting product and review data from e-commerce sites using Scraping Bee, AWS, Selenium, and Pandas. Focuses on cost-effective solutions, user-friendly interfaces, and efficient data extraction and analysis.
aws cs5001 data-analysis data-extraction data-processing data-storage e-commerce-analytics e-commerce-data pandas product-reviews review-sentiment-analysis scraping-bee selenium senior-design-project uc uc2026 university-of-cincinnati web-crawlers web-scraping
Last synced: 18 Feb 2026
https://github.com/cherylisabella/statistics--caret
Training Regression and Classification Models using caret
data data-analysis data-mining data-science datascience dataset r statistics
Last synced: 24 Jun 2025
https://github.com/bala-ceg/digital-payment-index
This project aims to develop an index for the digital transactions of India
collaborate data-analysis fintech hacktoberfest machine-learning statistics
Last synced: 20 Jun 2025
https://github.com/stimulsoft/samples-dashboards.js-for-html
JavaScript samples for Dashboards.JS data visualization tool for HTML and native JavaScript applications
analytics automation components dashboard-application dashboard-designer dashboard-viewer data-analysis embedded html5 indicators javascript js json-database native-javascript onepage panels pivot-tables simple-dashboard transformation website
Last synced: 20 Oct 2025
https://github.com/patex1987/temperature-calibration
Notebook for sensor calibration evaluation
calibration data-analysis jupyter-notebook sensor
Last synced: 20 Jun 2025
https://github.com/cosmoduende/r-marvel-vs-dc
DC Comics vs Marvel Comics - Exploratory Data Analysis and Data Visualization with R. Who has the smartest, strongest, fastest, or most powerful hero or villain? How to answer this and more questions with R
comics data-analysis data-analysis-r data-analytics data-visualization dataviz dc-characters dc-comics eda exploratory-analysis exploratory-data-analysis exploratory-data-visualizations marvel-characters marvel-comics marvel-vs-dc shdb superherodb superheroes superheros
Last synced: 11 Apr 2025