Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-27 00:07:21 UTC
https://github.com/sebastianofazzino/ibm-data-science-professional-certificate
In this repository I've stored exercises and projects I've been working on while attending the IBM Data Science Professional Certificate, using Python and its libraries.
data-analysis data-mining data-science data-structures data-visualization database machine-learning matplotlib numpy pandas python regression seaborn sql
Last synced: 09 Apr 2026
https://github.com/namratha2301/starbucks_global_presence
Exploring the global presence of Starbucks.
business-analysis data-analysis data-science data-visualization matplotlib pandas pycountry
Last synced: 19 May 2026
https://github.com/davecumin/ancir_next
analysis chronobiology circadian d3 data data-analysis data-visualization svelte timeseries
Last synced: 18 May 2026
https://github.com/lucycatherine/healthinsuranceproject
This repository contains a machine learning project that analyzes the factors influencing health insurance charges, such as age, smoking status, and medical conditions.
data-analysis data-science data-visualization jupyter-notebook machine-learning python
Last synced: 18 May 2026
https://github.com/maheera421/pandas
Implementation of essential Pandas functions.
data-analysis data-manipulation pandas-dataframes pandas-datareader pandas-python
Last synced: 17 Jul 2025
https://github.com/gui-sitton/y.music
In this project I compared the musical preferences of the citizens of Springfield and Shelbyville. I examined real Y.Music data to test hypotheses and compare the behavior of users in these two cities.
data data-analysis data-analysis-python data-science data-visualization python
Last synced: 18 May 2026
https://github.com/niniola-creator/niniola-creator
This is a repository that I have created to show my skills, share my projects and track my progress in my data science/web development journey.
bootstrap5 css3 data-analysis data-science data-visualization database html5 javascipt javascript matplotlib pandas powerbi python spreadsheets sql
Last synced: 07 Apr 2026
https://github.com/amr-yasser226/interactive-sales-analytics-dashboard
An interactive web-based dashboard for visualizing multinational electronics sales data. This project for the DSAI 203 course integrates a Python/Flask backend with an amCharts frontend to provide dynamic insights into product revenues, sales distribution, and employee statistics across different countries.
am5charts amcharts business-intelligence css dashboard data-analysis data-analytics data-visualization flask html javascript python sqlalchemy sqlite web-application
Last synced: 13 Apr 2026
https://github.com/namratagulati/tweets_analysis
This repository focuses on sentiment analysis of Twitter data using Python, Natural Language Processing (NLP), and the Natural Language Toolkit (NLTK). The goal is to extract valuable insights from social media discussions, such as word frequency, hashtag trends, and sentiment patterns.
analysis data-analysis natural-language-processing nlp-machine-learning nltk-corpus nltk-python sentiment-analysis twitter-sentiment-analysis
Last synced: 07 Aug 2025
https://github.com/priyadarshinijain/air-quality-data-analysis-and-visualization
🌍 Air Quality Data Analysis and Visualization
data-analysis jupyter-notebook python visualization
Last synced: 06 Feb 2026
https://github.com/gagan8605/zepto_sql_analysis
This project explores and analyzes the inventory data of Zepto, a rapidly growing 10-minute grocery delivery platform in India. The dataset contains more than 3,000 SKUs across key product categories such as Fruits & Vegetables, Dairy, Beverages, Packaged Foods, and more. The analysis was performed using PostgreSQL, covering both data cleaning and bus
cleaning-data data-analysis database-management postgresql sql
Last synced: 16 Jul 2025
https://github.com/progati00/marketing-mix-modeling-mmm-for-marketing-budget-optimization
A Marketing Mix Modeling (MMM) project using Python to analyze channel performance, calculate ROI, and simulate marketing budget changes for better business decisions. Includes a trained Linear Regression model, ROI analytics, and a Flask API for revenue prediction.
api budget-optimization data data-analysis data-science ecommerce eda flask jupyter-notebook linear-regression machine-learning marketing-analytics marketing-mix-modeling python roi-analysis vscode
Last synced: 14 Apr 2026
https://github.com/shrutiijoshi/airbnb-listing-reviews
Airbnb is an online marketplace that connects people who want to rent out their homes with travelers seeking accommodations.
data-analysis matplotlib-pyplot pandas-python python seaborn
Last synced: 17 May 2026
https://github.com/v41bh4vr4jput/data-analysis-with-python
This repository is a comprehensive collection of data analysis projects and tutorials using Python's most powerful libraries: NumPy, Pandas, Seaborn, and Matplotlib. It is designed to help you explore, clean, visualize, and analyze data efficiently.
api data data-analysis data-visualization matplotlib numpy pandas python sakila-db seaborn
Last synced: 09 Apr 2026
https://github.com/aakashjhawar/twitter-sentiment-analysis
Sentiment analysis of tweets to detect negative tweets.
bagofwords data-analysis data-science doc2vec logistic-regression machine-learning nlp-machine-learning nltk regex sentiment-analyser sentiment-analysis support-vector-machine textblob tf-idf-features twitter twitter-sentiment-analysis word2vec xgboost
Last synced: 29 Mar 2025
https://github.com/cosmoduende/r-ggcats
StrangeR things: Adding… Cats? to your plots in R. How to analyze and visualize data with the help of funny cats using the "ggcat" package.
data-analysis data-analytics data-science data-visualisation data-visualization data-viz dataviz ggcats r-language r-library r-package r-programming r-scripts r-studio rstats rstudio
Last synced: 22 Jul 2025
https://github.com/gmasson/datadash
DataDash is a JavaScript and CSS library for building interactive dashboards that visualize dynamic data on web pages.
dashboard dashboard-application dashboards data-analysis data-science data-visualization javascript
Last synced: 08 Aug 2025
https://github.com/a19xys/dm-csgo_analysis
Analysis to address the most important aspects of the knowledge discovery process from data.
data-analysis data-mining data-science dataset jupyter-notebook python
Last synced: 18 May 2026
https://github.com/datalopes1/bankabc_churn
This project carries out exploratory data analysis (EDA) focused on churn analysis, using the Bank Customer Churn Dataset, which is available on Kaggle and was provided by Gaurav Topre.
churn-analysis data-analysis data-science eda python
Last synced: 18 May 2026
https://github.com/viper373/lol-dataanalytics
Tencent Games: comprehensive analysis and prediction of League of Legends tournament data for 2020, 2021, and 2022.
crawler-python data-analysis jupyter-notebook lol python spider
Last synced: 15 Jul 2025
https://github.com/kwonnayeon/medium-post-projects
Code & projects from my Medium posts
data-analysis data-science data-visualization medium-articles python r-language sql
Last synced: 18 May 2026
https://github.com/vishnu-vamshii/layoffs-data-analysis-in-sql
This project focuses on the cleaning and exploratory analysis of a dataset containing layoff information. It includes data deduplication, standardization of columns, handling null and blank values, and analyzing layoffs by company, industry, country, and date. Various SQL queries are used to explore trends and patterns in layoffs over time.
Last synced: 15 Jul 2025
https://github.com/hosseinkarimi128/zed-one
An AI-powered assistant that analyzes CSV data using natural language queries to generate pandas code and visualizations.
ai-data-analysis automated-pandas automated-pandas-queries csv data-analysis fastapi langchain machine-learning matplotlib nlp openai pandas restful-api summarization visualization-tools
Last synced: 07 Apr 2026
https://github.com/jarrarshahid/nutrition-calculator
Simple Python app to calculate the nutritional content of everyday meals.
data-analysis health json jupyter-notebook logic-programming python
Last synced: 15 Jul 2025
https://github.com/sourceduty/data_metrics
📈 Analyzing, sorting and visualizing data.
data data-analysis data-metrics data-sci data-science data-science-projects data-sorting data-visualization database dataset metrics sorting statistics visualization
Last synced: 08 Aug 2025
https://github.com/1adityakadam/carnegie-classifications-ancestry-grid
A concise, interactive tool for exploring the historical lineage of U.S. higher education institutions using Carnegie Classification data from 1973–2021.
dash data-analysis html javascript pandas python
Last synced: 25 Jun 2025
https://github.com/miusarname2/proyectos-final-analitica-de-datos
Welcome to the repository where the magic of data analytics comes to life! This is the result of our effort and creativity in the subject of data analysis at the Universidad Cooperativa de Colombia (UCC). Here we keep everything we did to analyse data, draw cool conclusions and solve the workshop we were given. 🎯📊
data-analysis data-science data-visualization pip python
Last synced: 15 Jul 2025
https://github.com/andersoncrs/analisis-de-texto-tweets
In this project I explore text analysis of tweets to uncover trends, opinions, and relevant topics on social networks. Using natural language processing tools, I turn large volumes of messages into clear and visually appealing information.
data-analysis data-visualization eda text-mining
Last synced: 21 Jul 2025
https://github.com/artemzarubin/xml-document-processor
XML processing tool using the Strategy design pattern.
csharp data-analysis data-transformation design-patterns strategy xml
Last synced: 21 Jul 2025
https://github.com/BingyanStudio/github-analyzer
A sharp critique of everything you have been writing on GitHub.
data-analysis github llm reports selfhosted typescript
Last synced: 12 May 2025
https://github.com/1adityakadam/Carnegie_classifications_website
A comprehensive data analytics platform analyzing 50+ years of U.S. higher education trends through interactive visualizations and historical institution tracking.
css data-analysis html javascript python ui-design web-development
Last synced: 25 Jun 2025
https://github.com/nadamarei/data-analyzer
The Qualitative Data Analysis Tool is a powerful Streamlit application designed for researchers to analyze word frequencies in corporate documents. This tool processes PDF reports, identifies target words and their contextually relevant synonyms, and generates comprehensive reports with document statistics, summary analysis, and per-file breakdowns
data-analysis data-visualization python-3 streamlit
Last synced: 18 May 2026
https://github.com/rohansoni45/movie-recommendation-system
This project is a Content-Based Recommender System that suggests movies to users based on their preferences and watched history. The system leverages cosine similarity to find and recommend movies similar to a selected title. It is built using Python and libraries like Pandas, NumPy, and Scikit-learn.
content-based-filtering cosine-similarity data-analysis data-science machine-learning numpy pandas python recommender-system render scikit-learn
Last synced: 17 Apr 2026
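The cosine-similarity approach that the movie-recommendation-system entry describes can be sketched roughly as follows. This is a minimal illustration and not code from that repository: the movie titles, the four-element genre vectors, and the `recommend` helper are all hypothetical, and it uses only the Python standard library rather than Pandas, NumPy, or Scikit-learn.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # a zero vector has no direction, so treat it as dissimilar
    return dot / (norm_a * norm_b)


# Hypothetical genre features per movie: [action, comedy, romance, sci-fi]
MOVIES = {
    "Movie A": [1, 0, 0, 1],
    "Movie B": [1, 0, 0, 1],
    "Movie C": [0, 1, 1, 0],
    "Movie D": [1, 1, 0, 0],
}


def recommend(title, movies, top_n=2):
    """Return the top_n (title, score) pairs most similar to the given title."""
    target = movies[title]
    scores = [
        (other, cosine_similarity(target, features))
        for other, features in movies.items()
        if other != title
    ]
    scores.sort(key=lambda pair: pair[1], reverse=True)
    return scores[:top_n]


if __name__ == "__main__":
    for name, score in recommend("Movie A", MOVIES):
        print(f"{name}: {score:.2f}")
```

In a real content-based system the vectors would typically come from text features such as genres, keywords, or plot summaries rather than hand-written flags.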
https://github.com/jigyasag18/aircraft-data-management
This repository offers a comprehensive simulation of global military air deployments involving 10 countries, aircraft models, mission types, and strategic zones. It analyzes air power distribution, mission intent (offensive, defensive, support), and geopolitical positioning. The project provides structured insights into regional & zone level threat
aircraft-data aircraft-performance data data-analysis data-visualization database database-management dataset datavisualisation mysql powerbi powerbi-report powerbi-visuals sql
Last synced: 04 Feb 2026
https://github.com/prajjwol09/data-cleaning-project
This project cleans and standardizes a dataset and handles null values from a CSV file named "layoffs" using MySQL, with MySQL Workbench as the workspace environment. The goal is to prepare the data for analysis.
cleaning-data columns data-analysis database duplicates mysql rows standard
Last synced: 20 Apr 2026
https://github.com/abhipatel35/diabetes_ml_classification
Predict diabetes using machine learning models. Experiment with logistic regression, decision trees, and random forests to achieve accurate predictions based on health indicators. Complete lifecycle of ML project included.
classification data-analysis data-science data-visualization descision-tree diabetes-prediction jupiter-notebook logistic-regression machine-learning model-evaluation open-source pandas pycharm-ide python random-forest scikit-learn
Last synced: 20 Jan 2026
https://github.com/privatepeople/dataanalysis
Repository for studying data analysis.
data-analysis data-science data-visualization matplotlib numpy pandas python scikit-learn scipy
Last synced: 26 Jan 2026
https://github.com/neerajcodes888/data-science
This repository is a hub for data science enthusiasts, offering a diverse collection of projects, notebooks, and resources covering topics such as data analysis, machine learning, deep learning, and generative AI. Explore innovative ideas, contribute to cutting-edge research, and enhance your skills in the dynamic field of data science
data-analysis data-science data-visualization deep-learning deep-learning-algorithms eda genai jupyter-notebook machine-learning machine-learning-algorithms openai-api pandas plotting python3 sklearn-library streamlit
Last synced: 01 May 2026
https://github.com/0-mostafa-rezaee-0/sandwich_structures
Impact test of Sandwich Structures
composite-materials data-analysis r
Last synced: 09 Aug 2025
https://github.com/ilke-kas/multivariate-data-analysis
A curated collection of R-based data analysis projects applying regression modeling, clustering, dimensionality reduction, multivariate statistics, and classification. Each project showcases practical data science techniques, interpretability, and domain insights using real-world and academic datasets.
classification data-analysis data-visualization dimensionality-reduction machine-learning multivariate-analysis r regression statistics
Last synced: 05 Oct 2025
https://github.com/codesaadumair/pandas_exercises_personal
Personalized enhancements to pandas exercises with comprehensive solutions and practical insights for mastering data analysis in Python.
data-analysis data-science pandas python
Last synced: 09 May 2026
https://github.com/seblehner/feldprakt
Collection of plotting routines for field exercise work using different measurement tools and Hobo weather stations.
data-analysis data-visualization jupyter-notebook python
Last synced: 05 Oct 2025
https://github.com/paphada1103/data-analysis-with-python
📊 Analyze data efficiently using Python’s top libraries. Learn to explore, clean, and visualize data for meaningful insights in your projects.
carpentries data-analysis data-carpentry data-visualisation dataframe-api dataset english hacktoberfest ibm jovian lsl machine-learning matplotlib programming python realtime social-sciences spark
Last synced: 09 May 2026
https://github.com/harkishen/Agriculture-DS
An agriculture-based M.Tech data science project that predicts the growth of crops based on previous years' records.
Last synced: 11 Dec 2025
https://github.com/pranavsp108/time-series-forcasting
A time-series forecasting project to predict hourly energy consumption using Python, Pandas, and an XGBoost regression model.
data-analysis data-science energy-consumption forecasting matplotlib numpy pandas python scikit-learn sustainability time-series xgboost
Last synced: 10 Apr 2026
https://github.com/vishal786-commits/target-businesscasestudy-sql
This project analyzes Target’s e-commerce transactions in Brazil between 2016 and 2018 using SQL. The goal was to explore customer behavior, order patterns, payments, delivery times, and freight costs to generate actionable business insights.
Last synced: 05 Oct 2025
https://github.com/egbe34/sql-portfolio
SQL portfolio showcasing business-focused queries for KPIs, retention, churn, RFM, and Pareto analysis. Built with sample commerce data for analytics and BI use cases.
bigquery business-intelligence churn-analysis cohort-analysis data-analysis kpi postgresql rfmsegmentation sql windowfunction
Last synced: 19 May 2026
https://github.com/affan005-ai/tesla-stock-prediction
This project analyzes Tesla stock data and builds machine learning models to predict and classify stock movements. The analysis includes EDA, feature correlation, moving averages, and two models
data data-analysis data-science data-visualization-project eda machine-learning matplotlib pandas predictive-analytics predictive-modeling python scikit-learn
Last synced: 05 Oct 2025
https://github.com/trismald/eurosoccer1023
Data Analyst - European Soccer 2010 2023
data-analysis data-visualization jupyter-notebook pandas powerbi python
Last synced: 06 May 2026
https://github.com/manishkaa/google_data_analytics_capstone_case_study
This case study is part of the Google Data Analytics Capstone Project.
bigquery data-analysis sql tableau
Last synced: 05 Oct 2025
https://github.com/thenazar9/user-behavior-email-campaign-analysis-sql
Analysis of user behavior and email campaign performance using BigQuery and Looker Studio, focusing on account creation trends, email engagement, and user segmentation.
analytics bigquery data-analysis data-visualization etl looker-studio sql structured-query-language
Last synced: 16 Oct 2025
https://github.com/nahiyanhkhan/sales-insight-dashboard_powerbi
Builds a dashboard that displays insights from a company's sales data over a four-year period, including revenue and sales quantity in different regions over the years.
dashboard data-analysis data-analytics data-visualization powerbi salesdashboard
Last synced: 08 Jan 2026
https://github.com/mikma03/datascience_python_datacamp
DataScience with Python. Code and examples. Python libraries, including pandas, NumPy, Matplotlib, and many more.
data-analysis data-science datacamp datascience numpy pandas python
Last synced: 06 May 2026
https://github.com/subhamghimire/dataanavis
Learning Data analysis and visualization
data-analysis data-science data-visualization dataset
Last synced: 06 Oct 2025
https://github.com/farhad-here/height-distribution-analysis
Statistical comparison of height distributions in two groups using mean, standard deviation, and boxplots.
coefficient-of-variation data-analysis interquartile-ranges matplotlib mean numpy python scipy standard-deviation variance
Last synced: 13 Apr 2026
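The kind of two-group comparison that the height-distribution-analysis entry describes (mean, standard deviation, and the spread a boxplot shows) might look like the sketch below. It is an assumption-laden illustration, not the repository's code: the `summarize` helper and both sample lists are made up, and it relies on the standard-library `statistics` module instead of NumPy or SciPy.

```python
import statistics


def summarize(values):
    """Summary statistics for one group of measurements."""
    mean = statistics.mean(values)
    stdev = statistics.stdev(values)  # sample standard deviation (n - 1)
    q1, _median, q3 = statistics.quantiles(values, n=4)  # quartile cut points
    return {
        "mean": mean,
        "stdev": stdev,
        "cv": stdev / mean,  # coefficient of variation: spread relative to mean
        "iqr": q3 - q1,      # interquartile range, the box height in a boxplot
    }


# Hypothetical height samples in centimetres
group_a = [160, 165, 170, 175, 180]
group_b = [150, 160, 170, 180, 190]

if __name__ == "__main__":
    for label, group in (("A", group_a), ("B", group_b)):
        stats = summarize(group)
        print(label, {key: round(value, 3) for key, value in stats.items()})
```

Both hypothetical groups share the same mean, so the difference shows up only in the spread measures, which is the situation where the coefficient of variation and interquartile range are most informative.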
https://github.com/data-edd/mastering_sql
A repo documenting my progress in mastering SQL.
data-analysis mysql mysql-database sql
Last synced: 06 Oct 2025
https://github.com/davifeliciano/modern_physics_experiments
Collection of data analysis and visualization scripts developed in Python around some modern physics experiments
data-analysis data-visualization modern-physics physics physics-experiments
Last synced: 18 Jan 2026
https://github.com/sora468/best-of-ml-python
🏆 Discover top-ranked Python libraries for machine learning, updated weekly to help you find the best tools for your projects.
airport airport-simulation chatgpt configuration data-analysis data-science data-visualization data-visualizations gpt keras machine-learning nlp python scikit-learn tensorflow transformer usg-ai-training-data usg-artificial-intelligence
Last synced: 09 May 2026
https://github.com/chaitanyac22/investment-analysis-for-an-asset-management-company
Data analysis to identify the best sectors, countries, and a suitable investment type for making investments.
business-analytics business-intelligence data-analysis data-cleaning data-insights data-manipulation data-preparation data-visualization decision-making finance python3 risk-management statistics
Last synced: 06 May 2026
https://github.com/swatisinghit/treadmill-customer-profiling-for-aerofit
Create comprehensive customer profiles for each AeroFit treadmill product through descriptive analytics. Develop two-way contingency tables and analyze conditional and marginal probabilities to discern customer characteristics, facilitating improved product recommendations and informed business decisions.
analytics conditional-probability data-analysis data-science data-visualization eda numpy pandas probability statistics
Last synced: 08 May 2026
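The two-way contingency table with marginal and conditional probabilities mentioned in the AeroFit customer-profiling entry can be illustrated with a small sketch. Everything here is hypothetical: the product labels `P1` to `P3`, the gender codes, the six records, and the helper function names are invented for illustration and do not come from that project or its dataset.

```python
from collections import Counter

# Hypothetical purchase records: (product, gender)
RECORDS = [
    ("P1", "F"), ("P1", "M"), ("P1", "M"),
    ("P2", "F"), ("P2", "F"),
    ("P3", "M"),
]


def contingency_table(records):
    """Count how often each (product, gender) pair occurs."""
    return Counter(records)


def marginal_probability(table, product):
    """P(product): share of all purchases that are this product."""
    total = sum(table.values())
    product_count = sum(n for (p, _), n in table.items() if p == product)
    return product_count / total


def conditional_probability(table, product, given_gender):
    """P(product | gender): share of this gender's purchases that are this product."""
    gender_total = sum(n for (_, g), n in table.items() if g == given_gender)
    if gender_total == 0:
        return 0.0
    return table[(product, given_gender)] / gender_total


if __name__ == "__main__":
    table = contingency_table(RECORDS)
    print("P(P1) =", marginal_probability(table, "P1"))
    print("P(P1 | M) =", conditional_probability(table, "P1", "M"))
```

Comparing a conditional probability with the matching marginal one indicates whether a customer attribute shifts the likelihood of buying a product, which is the basis for the profiling the entry describes.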
https://github.com/zachbateman/easy_plot
Easy Statistical Visualization in Python
data-analysis data-visualization graphics matplotlib python seaborn
Last synced: 18 Jan 2026
https://github.com/surbhi242singh/pizza_sales_project
Used SQL to analyze pizza sales data
data-analysis mysql pizza-sales sql
Last synced: 07 Oct 2025
https://github.com/nuriadevs/informes-powerbi
This repository contains reports built with Power BI.
Last synced: 18 Feb 2026
https://github.com/junpenglao/spafv
SPAFV - Surface Profile Analysis for Free Viewing eye movement experiment in 2AFC task
data-analysis statistics temporal-logic
Last synced: 31 Mar 2025
https://github.com/michaelcurrin/yahoo-finance-reports
Use the Yahoo Finance API to get info on shares of interest and report on them
data-analysis data-science python reporting shares stock-market yahoo-finance yahoo-finance-api
Last synced: 07 Oct 2025
https://github.com/bocchio01/skyward_recruitment_assignment
Assignment to join the PoliMi SkyWard software team
data-analysis kalman-filter model-rocket
Last synced: 15 Mar 2025
https://github.com/prajjwol09/sql_retail_analysis_project
This project demonstrates SQL-based data cleaning, exploration, and business analysis on a retail sales dataset. It involves setting up a database, removing null values, performing EDA, and using SQL queries to extract key insights such as top customers, best-selling categories, and monthly sales trends.
data data-analysis datacleaning dataexploration pgadmin4 sql
Last synced: 15 Feb 2026
https://github.com/anuragmudgal96/data-warehouse-project
Designing and implementing a modern data warehouse on SQL Server, covering ETL pipelines, dimensional modeling, and analytical reporting.
data-analysis data-engineering data-warehouse datawarehousing etl etl-job etl-pipeline sql sql-server
Last synced: 09 Oct 2025
https://github.com/shourya1997/boston_housing
In this project, you will apply basic machine learning concepts on data collected for housing prices in the Boston, Massachusetts area to predict the selling price of a new home.
boston-housing-dataset data-analysis jupyter-notebook machine-learning python unsupervised-machine-learning
Last synced: 18 May 2026
https://github.com/rusiru-erandaka/pupil-dilation-signal-classification-pipeline-with-noise-filtering-feature-extraction
In this repository I worked on a pupil diameter time-series dataset. The work covers data sampling, blink detection and noise handling, stimulus onset alignment and ensemble averaging, baseline correction, feature extraction, and finally a patient classification ML pipeline.
anomaly-detection classification-pipeline data-analysis data-preprocessing data-science time-series
Last synced: 08 Oct 2025
https://github.com/omarsolieman/socialgiveawaydataanalysis
This project involved cleaning, analyzing, and processing data from an Instagram giveaway to ensure a fair and data-driven winner selection process. The primary goal was to automate the process of identifying valid entries, weighting them based on engagement (likes and multiple entries), and performing a post-giveaway analysis
data-analysis data-science data-visualization instagram scraping threejs
Last synced: 14 May 2026
https://github.com/maccccd/sql-proficiency-journey
A technical journey of my SQL understanding.
data-analysis sql systems-analysis-and-design uml-class-diagram
Last synced: 15 Feb 2026
https://github.com/mansogf/datascience_introduction
Introductory data science practice exercises.
data-analysis data-science data-visualization graph
Last synced: 04 Apr 2025
https://github.com/friendotjava/air-quality-analysis
Data analysis of air quality in different districts of Beijing.
data-analysis explanatory-data-analysis exploratory-data-analysis jupyter-notebook pytho streamlit
Last synced: 06 May 2026
https://github.com/pranavsp108/market_basket_analysis-instacart
Customer segmentation and market basket analysis using the Instacart dataset with Python, Pandas, and K-Means clustering.
customer-segmentation-and-buying-behavior data-analysis data-visualization instacart jupyter-notebook kmeans-clustering market-basket-analysis pandas python scikit-learn
Last synced: 05 May 2026
https://github.com/ndomah1/learning-probability-and-statistics
This repo is a comprehensive learning resource that covers fundamental to advanced topics in probability and statistics, including probability theory, descriptive and inferential statistics, probability distributions, regression analysis, and data exploration techniques.
correlation-analysis data-analysis descriptive-statistics exploratory-data-analysis hypothesis-testing inferential-statistics probability regression statistics
Last synced: 18 Jan 2026
https://github.com/shiva16/da
Data Analytics - Study materials
analytics data-analysis data-science data-structures
Last synced: 07 Feb 2026
https://github.com/bhaveshbhakta/student-performance-prediction-using-ml
Student Performance Prediction
data-analysis data-visualization linear-regression machine-learning student-performance-analysis student-performance-prediction
Last synced: 08 Oct 2025
https://github.com/alexquilis1/spanish-fuel-stations-analysis
Real-time analysis of Spanish fuel prices using government API data with interactive maps and regional comparisons
data-analysis data-visualization fuel-prices geospatial-analysis ggplot2 government-data leaflet open-data r shiny spain tidyverse
Last synced: 08 Oct 2025
https://github.com/sorebit/pdrpy-pd-2
Data analysis of various stackexchange.com archives.
data-analysis stackexchange time-travel university-project
Last synced: 08 Oct 2025
https://github.com/souravsuvarna/whatsapp-chat-analyzer-and-visualizer-web-application
The WhatsApp chat analyzer and visualizer uses NLP algorithms to analyze chat data, tracking usage patterns and presenting insights through visually appealing charts and graphs. It helps users understand communication patterns and behaviors on WhatsApp.
data-analysis data-science data-visualization python python3 streamlit
Last synced: 18 Apr 2026
https://github.com/tyriek-cloud/power-bi-nyc-housing-financial-report
This report was conducted to provide a comprehensive analysis of various NYC housing and financial data.
dashboard data-analysis data-visualization financial-analysis powerbi statistics
Last synced: 21 Jan 2026
https://github.com/debjyotisaha/power-bi-projects-phase-2
Created interactive dashboards and reports using Power BI to visualize complex datasets. Demonstrated proficiency in data modelling, DAX calculations, and storytelling through data to provide actionable insights.
dashboards data-analysis data-modeling data-visualisation power-query powerbi
Last synced: 18 Jan 2026
https://github.com/faisal-khann/ipl-analysis
The IPL Analysis project is a comprehensive data-driven exploration of the Indian Premier League (IPL), analyzing historical match data to uncover patterns in team performance, player statistics, and match outcomes.
data-analysis exploratory-data-analysis jupyter-notebook matplotlib numpy pandas seaborn
Last synced: 08 May 2026
https://github.com/abdelmajidlh/cours
Course on data engineering and data analysis.
apache-spark big-data data-analysis data-engineering docker jupyter-notebook pyspark
Last synced: 06 May 2026
https://github.com/l1ght14/tradersentiment_primetrade
Analyzes Bitcoin market sentiment's impact on Hyperliquid trader PnL & behavior. Uncovers patterns using Python (Pandas, Seaborn) to derive actionable trading insights. Junior Data Scientist assignment for PrimeTrade
bitcoin crypto-trading cryptocurrency data-analysis financial-data-analysis jupyter-notebook market-sentiment pandas python trader-behavior web3
Last synced: 20 Oct 2025
https://github.com/izzyl3333/mosquito_analysis
An exercise applying Python and statistical analysis to mosquito data to understand the relationship between the different variables and the number of mosquitoes.
chicago data-analysis data-science exploratory-data-analysis mosquitoes python statistical-analysis west-nile-virus
Last synced: 19 Jan 2026
https://github.com/marianamartiyns/api-logisticregression
Data analysis, modeling, and deployment of a logistic regression model for churn prediction, integrating a FastAPI backend and a Streamlit frontend.
data-analysis data-science fastapi logistic-regression pyhton streamlit
Last synced: 29 Apr 2026
https://github.com/curtisalexander/cramisc
Personal R functions for data analysis
Last synced: 12 Mar 2025
https://github.com/debopriya2320/classification-of-biological-databases
A project for categorizing and analyzing various biological databases based on their type and content.
bioinformatics biological-databases computational-biology data-analysis data-classification databases genomics proteomics research-tools systems-biology
Last synced: 09 Oct 2025
https://github.com/aalekhpatel07/billie-ai-lish-and-oscar-w-ai-lde
Generating music and poetry using RNNs.
data-analysis generating rnn-tensorflow tensorflow-keras
Last synced: 16 Mar 2025
https://github.com/mirwais-farahi/data-visualization-with-tableau-specialization
The specialization teaches Tableau for data visualization and business intelligence. The series covers skills like assessing data quality, designing visualizations and dashboards, and combining data sources to create compelling, data-driven stories.
dashboard data-analysis geospatial map tableau visualization
Last synced: 16 Feb 2026
https://github.com/priyanshubiswas-tech/priyanshubiswas-tech
SWE-Data Engineer @ EDN | Kubeflow-MLOps | Kubernetes | Databricks | AWS EMR-Lambda-Glue, Eventbridge, SQS-SNS | OCI Multi-Cloud Architect Professional | GCP GA4 | Gen AI | IEEE Brand Amb. | Ex-Chair, PES | Ex-Sec, SB
apache-spark aws data-analysis data-engineering data-visualization dbt hadoop kubernetes python3 sql
Last synced: 21 Jan 2026
https://github.com/gabrieladados/analise-ecommerce
SQL Analysis for E-commerce: Growth Strategies to Boost Sales
bigquery data-analysis ecommerce sql
Last synced: 31 Mar 2025
https://github.com/atiqisrak/py
This repository houses the code and resources for the 100 Days of Python Challenge, an intensive learning journey designed to propel you from beginner to a confident Python programmer in just 100 days.
data-analysis data-science machine-learning python3
Last synced: 10 Oct 2025
https://github.com/lunafrost-lab/berry-donut
Exploring berry combinations to produce Donut in Pokémon Legends: Z-A: Mega Dimensions.
data-analysis data-filtering parquet pokemon winforms
Last synced: 13 Jan 2026