Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-23 00:07:29 UTC
- JSON Representation
https://github.com/mateib20/proiect-achizi-ia-i-prelucrarea-datelor
Procesarea semnalului, analiza datelor și analiza spectrală pentru semnal sonor
c c-language c-programming c-programming-language data-analysis data-engineering data-science data-visualization datascience python python-lambda python-library signal-analysis signal-processing
Last synced: 11 Jun 2025
https://github.com/hilalguleryuz/excel_supermarket_data_analysis_project
Supermarket Data Analysis with Excel
dashboard data-analysis data-visualization excel microsoft-excel supermarket supermarket-data-analysis supermarket-dataset
Last synced: 06 Jan 2026
https://github.com/hadeel-13/new_home
New Home is a Website for Buying and Selling Real Estate with user preferences, it is my Graduation project with a grade of 93%.
bootstrap5 chartjs css css3 data-analysis data-mining google-maps html html5 javascript jquery
Last synced: 12 Apr 2026
https://github.com/vikpires/ds_tips-dataset
Projeto individual do bootcamp de ciência de dados avanti 2024.2, com o objetivo de analisar e observar padrões no conjunto de dados "Tips".
data-analysis data-science data-visualization exploratory-data-analysis jupyter-notebook matplotlib numpy pandas python seaborn tips
Last synced: 17 Sep 2025
https://github.com/wardenkenny/data-analyst-portfolio
A repository I have created to show and explore data analytics.
data-analysis excel r spreadsheets sql tableau
Last synced: 02 Apr 2025
https://github.com/eubrunoo/beer-consumption-predictor
An R project analyzing the impact of environmental factors on beer consumption in São Paulo, with a predictive linear regression model.
data-analysis data-science data-visualization machine-learning r statistical-analysis statistics
Last synced: 02 Apr 2025
https://github.com/hari7261/data-visualization
Python-based application built using CustomTkinter for the graphical user interface (GUI) and Matplotlib for data visualization. It allows users to import datasets, perform real-time data visualization, and analyze data using various chart types and machine learning techniques.
data-analysis data-visualization export hari7261 import python realtime-visualization
Last synced: 17 Jun 2025
https://github.com/gaboelc/analysis-of-the-employment-situation-in-costa-rica-2018-2022
This is an analysis with data extracted from the INEC in order to identify the changes that occurred in the Costa Rican labor market before, during and after the COVID-19 pandemic.
costa-rica data-analysis empleo employment
Last synced: 24 Mar 2025
https://github.com/shreeparab1890/chat-analyzer
This project is a Data Analysis project to analyze the WhatsApp chats.
data-analysis numpy pandas python
Last synced: 12 Apr 2026
https://github.com/m4tice/qm_project
Bicycle project crowd evaluation.
data-analysis data-engineering data-visualization
Last synced: 16 Mar 2025
https://github.com/parthds02/pizza_sales_sql
SQL project analyzing pizza sales data. Includes creating tables, executing queries, and solving basic to advanced analytical questions to derive insights from sales data.
analytics data-analysis data-science pizza-sales sql sql-query
Last synced: 04 Mar 2026
https://github.com/arthurosipyan/jamascript
JamaScript, your personal data assistant
blueprint data-analysis data-mining data-science exceldatareader jama jama-api python python-3 python-script python3
Last synced: 03 Sep 2025
https://github.com/k8hertweck/intro_r
data-analysis data-analysis-in-r r tidyverse training
Last synced: 29 May 2026
https://github.com/v-mayya/quantitative-analysis-data-dashboard
Quantitative survey data analysis using R
data data-analysis data-visualization flourish r
Last synced: 01 Apr 2025
https://github.com/suchi25sathavara/r-projects
R projects in Real world Scenerios for Data Analysis
data data-analysis datavisualization r
Last synced: 01 Apr 2025
https://github.com/seif-elkateb/dataset-analysis-r
cu-boulder data data-analysis datamodeling datascience ms-ds msds434 r
Last synced: 01 Apr 2025
https://github.com/torchstack-ai/cancer-biomarker-discovery
scRNASeq drug discovery and biomarker project
bioinformatics cancer-research data-analysis data-visualization r scrna-seq-analysis startup
Last synced: 01 Apr 2025
https://github.com/saiteja-talluri/data-analytics-assignement
Report on World Happiness Data (Data Analysis and Visualisation of the data)
data-analysis data-visualization ipynb-jupyter-notebook
Last synced: 20 Jan 2026
https://github.com/danielafishwickinacap/coderhouse_da
Data analyst Final Project files
Last synced: 18 Jan 2026
https://github.com/azaz9026/car_price_prediction_model
This repository contains a machine learning model designed to predict car prices based on various features. Using historical data on car attributes such as make, model, year, mileage, and other relevant factors, the model aims to provide accurate and reliable price estimates for used cars.
data-analysis data-engineering liner-regestion machine-learning modeling numpy pandas python3 rendering
Last synced: 09 Apr 2026
https://github.com/ajay1214/credit-card-transaction-dashboard
Credit Card weekly dashboard that provides real-time insights into key performance metrics and trends
Last synced: 04 Feb 2026
https://github.com/shridhar1504/tableau-visualization-viz.-project-
This repository contains Visualization Projects which is visualized through Tableau Software, by using the visualization we can gain multiple insights and strategies which helps to develop the business for gaining high profit margins and also it provides social values in some cases to calculate damages and intensity of calamities.
dashboards data-analysis data-science data-visualization exploratory-data-analysis tableau tableau-dashboards tableau-public tableau-workbooks visualization
Last synced: 04 Feb 2026
https://github.com/tomijuarez/lemmatisation
Lemmatisation fully implemented in Java.
algorithms data-analysis data-science java-8 lemmatization oop
Last synced: 08 Apr 2025
https://github.com/farzeen-2001/financial-analysis-report-using-powerbi
comprehensive analysis of financial report
data-analysis data-visualization datacleaning dax powerbi
Last synced: 17 Feb 2026
https://github.com/pedramjlo/car_sales_analysis
Car sales analysis
data-analysis jupyter-notebook pandas python
Last synced: 01 Apr 2025
https://github.com/callmezoe/neo4j-supplychainmanagement
cypher data-analysis data-visualization graphdatabase neo4j
Last synced: 08 Apr 2025
https://github.com/aldrinjenson/smart-qa
Query any structured data and find relations using natural language
Last synced: 06 Apr 2025
https://github.com/vaishnavi502/data-analysis-work
A set of Google colab notebooks with my work on data analysis
data-analysis deep-learning facial-emotion-recognition facial-expression-recognition fer2013-dataset machine-learning python unemployment-rate
Last synced: 28 Apr 2026
https://github.com/irsol/udacity-data-foundations-nd
data data-analysis data-visualization exel sql udacity udacity-data udacity-nanodegree
Last synced: 05 Mar 2026
https://github.com/bryanfks-dev/klempoken-analysis
Analysis and forcasting model for Klempoken MSMEs
big-data-analytics data-analysis data-forecast data-visualization
Last synced: 01 Apr 2025
https://github.com/rahulsm20/insurance-data
A data analytics project dealing with risk assessment and it's effects in health insurance.
data-analysis data-analytics machine-learning matplotlib numpy pandas python scikit-learn
Last synced: 12 Apr 2026
https://github.com/parth-jatav/super-store-analysis-project
The Super Store Analysis project leverages Python libraries such as pandas, matplotlib, and numpy to perform a comprehensive analysis of a retail store's data. This project includes data cleaning, visualization, and statistical analysis to identify key trends, optimize inventory, enhance decision-making processes for improved business performance.
data-analysis matplotlib numpy pandas python super-store
Last synced: 12 Apr 2026
https://github.com/pranav016/exploratory-data-analysis-of-sp500-dataset
This a data-analysis that I performed on the S&P 500 dataset and answered a few questions through data visualization techniques.
Last synced: 30 Oct 2025
https://github.com/soumya-thoutam/covid-19-impact-on-u.s.-states-and-colleges
Covid-19 analysis and impact on United States Colleges and States using SQL and Tableau.
covid-19 dashboard data-analysis data-visualization dataset sql sql-server tableau
Last synced: 04 Sep 2025
https://github.com/satvikpraveen/rsvp_case_study
A comprehensive IMDB dataset analysis using SQL. Includes database setup, advanced queries, and actionable insights. Organized with files for database creation, queries, and solutions. Features an Entity-Relationship Diagram (ERD), executive summary, and SQL scripts. Perfect for SQL workflows and business intelligence in the film industry.
aggregate-functions business-intelligence common-table-expressions data-analysis data-driven-decisions data-querying database-design entity-relationship-diagram imdb-dataset relational-database sql subqueries-and-joins
Last synced: 11 Jan 2026
https://github.com/yash-3-bit/online-sales-analysis
Project-Merging the different months datasets and performing the data cleaning ,Analysis and Visualization
data-analysis data-visualization pandas-library
Last synced: 27 Mar 2025
https://github.com/noodleslove/house-of-representatives-analysis-ii
In this project, we want to estimate if a transaction will have capital gains exceeding $200 using the provided dataset.
coursework data-analysis data-science eda feature-engineering pandas python3
Last synced: 12 Apr 2026
https://github.com/ernanej/data-science-dca0131
Files, developed throughout the 2024.1 semester of the Data Science discipline taught at the Federal University of Rio Grande do Norte by the Department of Computer Engineering and Automation (DCA). 📚
big-data data-analysis data-science ia
Last synced: 30 Mar 2025
https://github.com/hemangsharma/breast-cancer-patient-dashboard
This interactive Streamlit dashboard visualizes insights from the SEER Breast Cancer Dataset (2006-2010)
data-analysis streamlit streamlit-dashboard streamlit-webapp
Last synced: 05 May 2026
https://github.com/theveryhim/massive-text-processing-1
cleaning, processing and analysis of papers' dataset in pyspark(rdd) framework
big-data data-analysis frequent-itemsets massive-datasets pyspark text-preprocessing
Last synced: 03 Jul 2025
https://github.com/mpoojithavigneswari/bangalore-house-price-prediction
This project involves creating a website that predicts Bangalore house prices with 94.65% accuracy using a machine learning algorithm.
data-analysis data-science flask-server machine-learning matplotlib numpy pandas python scikit-learn seaborn
Last synced: 12 Apr 2026
https://github.com/sarveshdhond/top_25_cad_stocks
In this project I have used Python Jupyter lab and Pandas to import data set from Yahoo stocks website. I have imported the top 25 most active Canadian stocks on 12th July 2024. This project shows skills such as Python, Web Scrapping and Pandas.
data-analysis pandas-dataframe python webscraping
Last synced: 01 Apr 2025
https://github.com/bpkaur/exploring-the-evolution-of-linux
This project explores the evolution of the Linux kernel by finding top 10 contributors and visualization of commits over the years.
data-analysis data-science datacamp ipynb-jupyter-notebook python3
Last synced: 21 Feb 2026
https://github.com/vasulab/knightshock
Shock tube experiment planning and data analysis package.
cantera data-analysis matplotlib numpy shock-tube
Last synced: 18 Jul 2025
https://github.com/cassandrajm/reddit-dashboard
INTERACTIVE DASHBOARD: Analyzing Political Discourse on Reddit: A Multi-Faceted NLP Approach to Toxicity, Bias, and Political Stance
capstone data data-analysis data-science politics python reddit
Last synced: 09 Apr 2025
https://github.com/nilayhangarge/data-analysis-with-python
This repository provides a practical introduction to data acquisition and analysis using Pandas. It covers loading datasets, exploring data, manipulating data, and gaining insights through statistical summaries. Ideal for beginners, it offers code examples and explanations to enhance your data manipulation skills using Pandas for Python.
data-acquisition data-analysis data-analytics data-binning data-cleaning data-engineering data-fundamentals data-insights data-integration data-preprocessing data-science data-wrangling numpy pandas python
Last synced: 12 Apr 2026
https://github.com/noorulhudaajmal/customer-segmentation-analysis
Customer segmentation and analysis of purchasing behaviour
cluster-analysis customer-segmentation data-analysis
Last synced: 07 Oct 2025
https://github.com/chinmayee4/vrinda_store_data_analysis
Analyzed Data By Creating Interactive Dashboard Using MS Excel
data-analysis data-cleaning data-visualization excel-dashboard pivot-tables power-query
Last synced: 07 Jan 2026
https://github.com/jbalooshie/election_analysis
A Python script built to analyze specific election's results, and be re-purposed to analyze the results of other elections. The script provides you with different breakdowns of the vote based on candidate and county,
data-analysis data-science elections python
Last synced: 09 Apr 2025
https://github.com/bkataru/physics-e.e
Project repository for IB physics extended essay. Topic: Predictive data modeling of a variable binary star’s brightness over a period of time using astrostatistics.
astrometry astronomical-algorithms astronomical-images astronomy astrophotography astrostatistics data-analysis data-science data-visualization modeling physics polynomial-regression regression-analysis
Last synced: 09 Apr 2025
https://github.com/rohitblaze10/netflix_analysis_using_tableau
The Netflix dashboard in Tableau provides a professional and visually captivating interface for users to explore a vast collection of TV shows and series. With seamless navigation and interactive filters, users can easily personalize their recommendations based on release year, genre, duration, and rating.
data data-analysis data-science data-visualization netflix tableau
Last synced: 04 Feb 2026
https://github.com/saigeethika05/global-connect
International Student Engagement Platform
data-analysis figma prototyping ui-design ux-design wireframes
Last synced: 04 Jul 2025
https://github.com/ankitmishralive/machinelearning
Continuously deep diving in understanding & advancing my expertise in Machine Learning through ongoing education and hands on experience with practical learning.
artificial-intelligence data-analysis data-cleaning data-gathering machine-learning machinel-learning-algorithms matplotlib numpy pandas python seaborn
Last synced: 22 Mar 2025
https://github.com/faysalalmahmud/bd-med-professional-analysis
Analysis of healthcare professionals in Bangladesh through web scraping, data processing, and interactive visualization.
data-analysis data-visualization jupyter-notebook python scraper selenium selenium-webdriver tableau
Last synced: 04 Sep 2025
https://github.com/prakshal0809/power-bi-analytics-dashboard
I have developed a dashboard in Power BI utilizing data from an Excel file. The dashboard effectively visualizes and analyzes the given data.
Last synced: 22 Feb 2026
https://github.com/noeldevelops/stem-degrees-analysis-cpp
C++ Data Analysis, I/O - takes an external data file for processing, performs some statistical analysis, and displays the results in the console
Last synced: 29 May 2026
https://github.com/trivediayush/analysis-work
anayltics business-analytics data-analysis excel excel-dashboard powerbi powerbidashboard
Last synced: 04 Feb 2026
https://github.com/LipunKumarDalai/Youtube-Analysis
A Simple DataAnalysis Project On Youtube-Data.
apache-superset beautifulsoup bootstrap5 data-analysis data-visualization django html jupyter-notebook postgresql-database python scraping selenium-webdriver sqlite-database youtube-api
Last synced: 30 Dec 2025
https://github.com/shinie19/sql-data-warehouse-project
Build a modern Data Warehouse from scratch with SQL Server, including ETL processes, data modeling and analytics.
data-analysis data-analytics data-cleaning data-engineering data-lake data-lakehouse data-modeling data-normalization data-science data-standardization data-warehouse etl-pipeline medallion-architecture sql-server
Last synced: 11 Mar 2025
https://github.com/dmdlgg/spotify-analysis
An interactive data analysis app built with Python, Pandas, Plotly, and Streamlit, showcasing insights about the top 1000 most played songs on Spotify. Dataset sourced from Kaggle. Users can explore the frequency, popularity, and most played songs by artist in a clean and intuitive interface.
data-analysis data-visualization pandas plotly python streamlit
Last synced: 11 May 2026
https://github.com/xza85hrf/excel-comparison-app
Excel Comparison Application is a Python-based tool that compares two Excel files and generates a new Excel file with the differences. It's primarily designed to help in database updating by identifying new clients. The app also has a graphical user interface for easier use and logs operations for potential troubleshooting.
case-sensitive-comparison data-analysis data-difference database-comparison database-updates excel-comparison file-merging file-processing gui-application new-client-detection python
Last synced: 25 Mar 2025
https://github.com/fbarffmann/python-challenge
Automated financial and election data analysis using Python. Cleaned and transformed large CSV datasets, calculated key business metrics, and generated automated reports for stakeholders.
automation csv data-analysis data-cleaning election-analysis financial-analysis python reporting
Last synced: 24 Apr 2025
https://github.com/siddhant2105s/airman-database-system
This repository contains the design and implementation of the AirMan System for managing airport operations at London Biggin Hill Airport. It includes an ERD diagram, MySQL scripts for database creation, data insertion, and queries, as well as detailed data definitions and system requirements documentation.
data-analysis database-design database-normalization entity-relationship-diagram entity-relationship-models mysql relational-databases sql-queries
Last synced: 25 Mar 2025
https://github.com/avratanubiswas/fluorpenplugin
A matlab user interface for analysing OJIP curve datasets from FluorPen instrument. That is, serving as an additional plug in for "quick categorical analysis".
data-analysis fluorpen ojip-curve
Last synced: 18 Mar 2026
https://github.com/anniefib/otherprojects
Powering Data Dreams: From Orchestration to Analytics with Cloud Precision
airflow-etl-orchestration aws cloud-native-data-solutions data-analysis data-visualization database datamodelling datawarehousing eda end-to-end-data-pipelines machine-learning-models pgadmin4 spark-analytics sql
Last synced: 07 May 2026
https://github.com/anas436/predictive-modelling-urban-growth-ai
Predictive Modelling for Urban Growth using AI
artificial-intelligence dashboard data-analysis data-collection data-preprocessing data-science data-visualization deep-learning deployment jupyterlab machine-learning python3 remote-sensing streamlit webapplication webscraping
Last synced: 05 Sep 2025
https://github.com/zkan/python-for-data-scientists
Python for Data Scientists
data-analysis data-science data-scientists machine-learning pandas python
Last synced: 13 Apr 2026
https://github.com/analysisbyvivek/Crime-data
Analyzes crime patterns across different areas, exploring factors such as crime type, weapon usage, demographic influences, and geographic distribution to uncover trends in frequency, correlations, and hotspots.
apache-superset data-analysis eda jupyter-notebook python
Last synced: 29 Jan 2026
https://github.com/asergioscosta/portfolio-dados
Portfolio of dashboards and data analysis projects.
business-intelligence dashboard data-analysis data-visualization kpi looker-studio powerbi
Last synced: 22 Feb 2026
https://github.com/amoghkori/effect-of-box-office-on-unemployment
Data preparation and cleaning process for movie ratings and reviews dataset and US unemployment rate dataset, involving an 8-step data wrangling process to create an Analytic Base Table (ABT) structure, emphasizing data structuring techniques, cleaning for outliers and missing values, and the importance of accurate and reliable data for analysis.
data-analysis data-cleaning data-preprocessing data-validation data-wrangling model-selection
Last synced: 13 Jun 2025
https://github.com/zen204/renewable-energy-usage-v-electricity-access
Interactive data visualization project created for COSI 116A: Introduction to Information Visualization at Brandeis University (Fall 2024). The project showcases data-driven insights using advanced visualization techniques and user interactivity. Hosted on GitHub Pages.
d3js data-analysis data-visualization electricity github-pages html-css-javascript information-visualization interactive python renewable-energy tableau web-development
Last synced: 08 Feb 2026
https://github.com/spacebakery/variance-in-weather-project
Statistics for Data Analysis | Variance and Standard Deviation
data-analysis python standard-deviation statistics variance
Last synced: 05 Jul 2025
https://github.com/ray-chew/pycsam
pyCSAM is a robust approach for approximating geodesic subgrid-scale orographic spectra with applications to weather forecasting and broader data analysis
data-analysis gmted icon-model merit-dem orographic spectral-analysis topography weather-forecast
Last synced: 28 Feb 2025
https://github.com/busradeveci/student-performance-prediction
A machine learning project to predict student exam performance based on academic, social, and personal features. Built with Python and scikit-learn.
data-analysis kaggle linear-regression machine-learning predictive-modeling python scikit-learn student-performance
Last synced: 25 Apr 2025
https://github.com/shivamsharma32/customer-churn-analysis-power-bi-
This project is about analyzing and visualizing customer churn data using Power BI. Customer churn is the percentage of customers who stop doing business with a company over a given period of time. It is an important metric for businesses to understand why customers leave and how to retain them.
data-analysis dataanalytics datavisualization powerbi
Last synced: 15 Jan 2026
https://github.com/gintuvedula/crime-data-analysis-with-mysql-and-python
This project aims to analyze crime data using MySQL for database management and Python for data analysis and visualization. The objective is to uncover crime trends, hotspots, and patterns to support law enforcement and urban planning efforts.
data-analysis data-exploration database mysql python
Last synced: 05 May 2026
https://github.com/mxagar/data_science_udacity
My personal notes, code and projects of the Udacity Data Science Nanodegree.
dashboard data-analysis data-engineering data-science machine-learning-pipelines
Last synced: 09 Apr 2025
https://github.com/manojrathod0777/loan-prediction
Predict loan approval status using machine learning techniques. This project demonstrates data preprocessing, feature engineering, model training, and evaluation, along with an interactive Streamlit app for real-time predictions. Ideal for financial decision-making.
classification-models data-analysis data-science financial-analytics jupyter-notebook loan-prediction machine-learning predictive-modeling python streamlit-app
Last synced: 13 Apr 2026
https://github.com/luminati-io/Indeed-dataset-samples
A sample dataset of over 1000 Indeed job listings, extracted using the Bright Data API, ideal for market analysis and growth.
api data-analysis datasets indeed jobs web-scraping
Last synced: 09 Apr 2025
https://github.com/luminati-io/Target-dataset-samples
A sample dataset of over 1000 target products, extracted using the Bright Data API, ideal for brand reputation, tracking inventory, and optimizing prices.
api data-analysis data-mining datasets target web-scraper web-scraping
Last synced: 09 Apr 2025
https://github.com/leandrocollares/nyc-film-permits
NYC film permits: an exploratory data analysis
data-analysis data-visualization pandas plotly
Last synced: 05 Jul 2025
https://github.com/khushi-sabarad/8-week-sql-challenge
Case studies' solutions for the #8WeekSQLChallenge by Danny Ma
8weeksqlchallenge case-study data-analysis mysql sql
Last synced: 06 Sep 2025
https://github.com/code-jl/dna-sequence-analyzer
A robust Python-based bioinformatics tool for comprehensive DNA sequence analysis and manipulation.
bio-tools bioinformatics biological-data computational-biology data-analysis dna-analysis dna-sequencing fasta gc-content gene-detection genetics genomics molecular-biology motif-finding nucleotide-analysis python python3 scientific-computing sequence-analysis sequence-manipulation
Last synced: 11 Mar 2025
https://github.com/shellynagar27/business-insights-360-project
A comprehensive Dashboard which provides better understanding of the business's market standing, key focus areas for optimization, underperforming customers, and year-wise financial insights, aiding in better inventory planning and performance tracking. Further it can be used in answering n number of why questions based on the situations.
dashboard data-analysis data-visualization dax-languague dax-studio excel performance-optimization power-bi reporting sql storage-manager
Last synced: 27 Jan 2026
https://github.com/mehedi-hassan81/mastercourse
Data analysis project analysing renewable energy production across 212 countries, visualizing trends with Tableau. Highlights China's dominance (2,894 TWh) and Paraguay's 100% renewable share.
data-analysis pandas python renewable-energy selenium tableau-dashboards tableau-public web-scraping
Last synced: 08 May 2026
https://github.com/ndiplacide7/r-project
Explore diverse data analysis techniques using R programming combined with advanced machine learning algorithms to uncover insights and create powerful predictive models.
data-analysis data-visualization machine-learning-algorithms r
Last synced: 25 Mar 2025
https://github.com/allanotieno254/powerbi-dax-filter-context
This repository contains a Power BI project that explores **DAX Filter Context**, a crucial concept in DAX calculations. The project focuses on **Bank Loan Analysis**, demonstrating how different filter contexts affect DAX formulas.
business-intelligence data data-analysis dax dax-functions powerbi powerbi-visuals visualization
Last synced: 08 Jan 2026
https://github.com/chiragkumargohil/co2-emissions-data-analysis
A Python programme that analyses CO2 emission data from 1997 to 2010. This programme prints data, provides brief of a given year, displays and compares Year vs. Emission graphs for chosen countries, and generates a separate data file for chosen countries. It was a self-paced project that Guru 99 provided.
co2-emission data-analysis matplotlib python
Last synced: 28 Aug 2025
https://github.com/pawlo77/smarty
End-to-End Data Science tool
data-analysis data-processing pandas pipeline
Last synced: 08 May 2026
https://github.com/anushkundu/student-performance-analysis
Exploring Student Performance Factors
classification-algorithm clustering-algorithm data-analysis data-science exploratory-data-analysis machine-learning matplotlib numpy pandas python seaborn
Last synced: 13 Apr 2026
https://github.com/giyanellow/time-series-analysis-on-philippine-debt-and-inflation
A Time Series Analysis on the Philippine Inflation Rate with some predictions using RandomForest.
data-analysis data-analysis-python machine-learning python random-forest
Last synced: 18 Mar 2026
https://github.com/meokullu/prefill
PreFill adds desired characters onto output values to increase their legibility.
alignment data data-analysis data-engineering data-science legibility
Last synced: 17 Jan 2026
https://github.com/sadia-khan13/data-preprocessing
Welcome to the Data preprocessing Repository! This repository is dedicated to showcase the comprehensive resources and implementations related to Data Preprocessing using Python and Jupyter Notebook.
artificial-intelligence data-analysis data-mining data-preprocessing data-science jupyter-notebook matplotlib numpy pandas python seaborn-python sklearn
Last synced: 11 Apr 2026
https://github.com/marielachirinosr/pandas-weather-project
Pandas Weather Data. Explore straightforward Python scripts for weather information analysis.
Last synced: 29 Apr 2026
https://github.com/abhisek-13/whatsapp-chat-analyzer
The WhatsApp Chat Analyzer is a data analysis project that provides insights into WhatsApp chats. It analyzes chat data to show metrics like the number of lines, most used letter, chatting duration, media files shared, most used emojis, and group member activity. The results are displayed on a user-friendly dashboard built with Streamlit.
data-analysis data-mining data-visualization eda machine-learning machine-learning-algorithms matplotlib numpy pandas python seaborn sklearn
Last synced: 13 Apr 2026
https://github.com/isaqueiros/newspapersales-predictions-linearregression_and_regularisation
This notebook is a study on the sales of newspapers of a local stand, with intention to predict the newspaper sales performance based on the different features available. For this, 4 sklearn models are applied: Linear Regression, Lasso Regression, Ridge Regression and Elastic Net Regression.
data-analysis data-science linear-regression machine-learning python regularization-methods sklearn-library sklearn-linear-regression
Last synced: 02 May 2026