Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-23 00:07:29 UTC
- JSON Representation
https://github.com/tomijuarez/lemmatisation
Lemmatisation fully implemented in Java.
algorithms data-analysis data-science java-8 lemmatization oop
Last synced: 08 Apr 2025
https://github.com/zen204/accenture-tech-news-summarization-engine
A tool developed to analyze knowledge graphs from technology news articles, uncovering insights and trends about technology products, platforms, services, and their industry impact. Built during an internship at Accenture to inform decision-making in the tech landscape.
data-analysis decision-making graph-visualization industry-insights jupyter-notebook knowledge-graph machine-learning python tech-news tech-trends
Last synced: 29 Apr 2026
https://github.com/pedramjlo/car_sales_analysis
Car sales analysis
data-analysis jupyter-notebook pandas python
Last synced: 01 Apr 2025
https://github.com/callmezoe/neo4j-supplychainmanagement
cypher data-analysis data-visualization graphdatabase neo4j
Last synced: 08 Apr 2025
https://github.com/vaishnavi502/data-analysis-work
A set of Google colab notebooks with my work on data analysis
data-analysis deep-learning facial-emotion-recognition facial-expression-recognition fer2013-dataset machine-learning python unemployment-rate
Last synced: 28 Apr 2026
https://github.com/cyberoctane29/noaa-lightning-analysis
This project explores lightning strike data from the National Oceanic and Atmospheric Administration (NOAA) to identify seasonal trends and analyze strike frequency across months. It demonstrates data manipulation, aggregation, and visualization using Python, providing insights into lightning activity patterns.
data-analysis data-science data-visualization eda python
Last synced: 20 Apr 2026
https://github.com/brianrscode/delitos-cdmx
Página simple que muestra estadísticas sobre los delitos ocurridos en CDMX
analisis-de-datos data-analysis django pandas plotly python python3
Last synced: 18 Apr 2026
https://github.com/shellynagar27/transportation-and-logistics-challenge
Analyzing logistics data to optimize shipment efficiency, reduce delays, and enhance supply chain visibility using Power BI. Insights include top routes, delays, supplier trends, and peak shipments.
cleaning-data critical-thinking data-analysis data-visualization exploratory-data-analysis feature-engineering powerbi preprocessing-data problem-solving python
Last synced: 16 May 2026
https://github.com/soumya-thoutam/covid-19-impact-on-u.s.-states-and-colleges
Covid-19 analysis and impact on United States Colleges and States using SQL and Tableau.
covid-19 dashboard data-analysis data-visualization dataset sql sql-server tableau
Last synced: 04 Sep 2025
https://github.com/jofaval/iris-flowers
Multilabel Classification of the famous Iris Flowers Dataset from Ronald Aylmer Fisher in 1936
classification data-analysis data-science data-visualization google-colab iris-flowers kaggle machine-learning python scikit-learn xgboost
Last synced: 05 Apr 2026
https://github.com/yash-3-bit/online-sales-analysis
Project-Merging the different months datasets and performing the data cleaning ,Analysis and Visualization
data-analysis data-visualization pandas-library
Last synced: 27 Mar 2025
https://github.com/apoorvalal/misc_stata_ados
Misc Utility programs in Stata.
data-analysis stata stata-command
Last synced: 04 Feb 2026
https://github.com/dulajkavinda/pandas-exploring-data-ml
🐼 Exploring data with pandas library.
data-analysis machine-learning pandas python
Last synced: 09 May 2026
https://github.com/theveryhim/massive-text-processing-1
cleaning, processing and analysis of papers' dataset in pyspark(rdd) framework
big-data data-analysis frequent-itemsets massive-datasets pyspark text-preprocessing
Last synced: 03 Jul 2025
https://github.com/mpoojithavigneswari/bangalore-house-price-prediction
This project involves creating a website that predicts Bangalore house prices with 94.65% accuracy using a machine learning algorithm.
data-analysis data-science flask-server machine-learning matplotlib numpy pandas python scikit-learn seaborn
Last synced: 12 Apr 2026
https://github.com/sarveshdhond/top_25_cad_stocks
In this project I have used Python Jupyter lab and Pandas to import data set from Yahoo stocks website. I have imported the top 25 most active Canadian stocks on 12th July 2024. This project shows skills such as Python, Web Scrapping and Pandas.
data-analysis pandas-dataframe python webscraping
Last synced: 01 Apr 2025
https://github.com/abdelrahmanbayoumi/titanic-machine-learning-from-disasters
Knowing from a training set of samples listing passengers who survived or did not survive the Titanic disaster, can our model determine based on a given test dataset not containing the survival information, if these passengers in the test dataset survived or not.
data-analysis data-science data-visualization machine-learning pandas
Last synced: 09 Apr 2025
https://github.com/malk97sc/data_science
Data Science Projects
data-analysis data-science data-visualization
Last synced: 20 Jun 2026
https://github.com/kevingastelum/mydataanalysis
My DataAnalyst Projects | Python, SQL, Excel, PowerBI & Tableau
data-analysis python sql visualization
Last synced: 20 May 2026
https://github.com/sreekar0101/electric-vehicle-market-growth-and-incentive-impact-analysis-dashboard
About This project involves the development of a comprehensive Tableau dashboard to analyze the growth and market dynamics of electric vehicles (EVs). The dashboard reveals key insights, including a 20% increase in EV adoption over five years, the dominance of Battery Electric Vehicles (BEVs) which make up 60% of the market
data-analysis data-visualization tableau-desktop
Last synced: 07 Jan 2026
https://github.com/kernix13/github-readme-seo-analysis
A Jupyter Notebook GitHub README and Repo SEO Analysis to determine what makes a repo rank in the SERPS
accessibility data-analysis readme seo seo-analysis
Last synced: 29 May 2026
https://github.com/yanny-alt/competitor-sales-analysis-in-power-bi
This project aims to analyze competitor sales for a fictional manufacturing company, Sintec, using Power BI. The focus is on integrating, cleaning, and modeling data from multiple sources to generate insightful reports on company and competitor performance.
data-analysis powerbi sales-analysis
Last synced: 07 Jan 2026
https://github.com/who-else-but-arjun/isro_xrf_sr
Source Codes for super resolution of the lunar elemental abundance map using a semi-supervised deep spatial interpolation model. This hybrid approach combined ResNet50 for spatial feature extraction with Graph Neural Network (GATv2Conv) layers and Convolutional Neural Networks (CNNs), followed by fusion layers.
cnn data-analysis graph-neural-networks pytorch semi-supervised-learning spatial-interpolation super-resolution
Last synced: 30 Apr 2026
https://github.com/sgb31/covid-19-data-analysis
"In this project, I analyzed COVID-19 data to explore trends, case growth, and key patterns. I worked on cleaning the data, performing exploratory analysis, and visualizing infection rates, recoveries, and fatalities. The goal was to gain insights into how the pandemic evolved and its overall impact.
data-analysis data-visualization matplotlib pandas python seaborn
Last synced: 13 May 2026
https://github.com/kseniatyschuk/excel-data-matcher
Compare and match Excel files via a simple Python GUI
automation data-analysis etl excel gui pandas python3 tkinter
Last synced: 23 Apr 2025
https://github.com/ehsan-behzadi/online-retail-data-analysis-and-preprocessing
This project analyzes and preprocesses the Online Retail dataset to uncover insights into customer purchasing behaviors, sales trends, and product performance. It includes data cleaning, exploration, and visualization, with the goal of enhancing understanding of online retail dynamics.
cohort-analysis data-analysis data-cleaning data-exploration duplicate-detection exploratory-data-analysis-eda feature-encoding feature-engineering handling-missing-values online-retail outlier-detection preprocessing trends-visualization visualization z-score-method
Last synced: 16 Apr 2026
https://github.com/noorulhudaajmal/customer-segmentation-analysis
Customer segmentation and analysis of purchasing behaviour
cluster-analysis customer-segmentation data-analysis
Last synced: 07 Oct 2025
https://github.com/giatraskon/clustering_algorithms_analytical_and_computational
Analytical and computational exploration of clustering algorithms, focusing on k-means and k-medians, with MATLAB implementations and synthetic dataset analyses.
clustering computational-mathematics data-analysis data-science data-visualization k-means k-means-clustering k-median-clustering k-medians k-medoids k-medoids-clustering machine-learning matlab noise-robustness numerical-methods outlier-detection possibilistic-clustering-algorithms statistical-analysis synthetic-data unsupervised-learning
Last synced: 21 Mar 2025
https://github.com/hermaeus1618/patternrecognition
Stock Pattern Recognition Dataset
data-analysis machine-learning pattern-recognition python stock-market
Last synced: 28 Dec 2025
https://github.com/muthukumar0908/imdb_movie_analysis_with_powerbi
The project aim is to analyze the dataset using Power Bi, The dataset is related to IMDB Movies.
data-analysis data-visualization powerbi
Last synced: 12 Jun 2025
https://github.com/vavarm/data-analysis-french-electric-automobile-infrastructure
Data analysis realized in R Shiny and Python about the French electric vehicle and charging station infrastructure
data-analysis data-science data-visualization factominer geojson ggplot2 plotly python r rshiny
Last synced: 15 May 2026
https://github.com/lord3008/instances-of-data-analysis
This repository of mine shows my work on data analysis of various projects that I made. I feel data analysis is the very key to investigate a solution. Further more it enlightens the direction towards model building.
Last synced: 03 Mar 2025
https://github.com/jonek/pv-city-mastr
Extract and analyze data about photovoltaic systems in Germany
data-analysis germany jupyter-notebook pandas photovolatic-power photovoltaic
Last synced: 11 May 2026
https://github.com/sciencesar-labs/py485-final-project
ROOT-based muon data analysis using Python & Jupyter – final project for PY485E @ CERN
cern computational-physics data-analysis jupyter-notebook muons python root uproot
Last synced: 15 May 2026
https://github.com/dina-hosny/sparkify---data-modeling-with-cassandra
Sparkify - Data Modeling with Cassandra - Udacity Data Engineering Expert Track.
cassandra cql data-analysis data-engineering data-modeling data-warehousing etl python
Last synced: 11 Apr 2026
https://github.com/chingu-voyages/v47-tier3-team-30
An easily accessible tool for calculating electricity-related carbon emissions, along with insights for reducing environmental impact. | Voyage-47 | https://chingu.io/ | Twitter: https://twitter.com/ChinguCollabs
carbon-emissions carbon-footprint data-analysis data-engineering data-science
Last synced: 10 May 2026
https://github.com/satyacoder29/comparison-of-region-based-sales-tableau
The region-based sales comparison analyzes sales performance across different regions. It identifies trends, top-performing regions, and areas needing improvement by comparing metrics like revenue, growth rate, and product demand. This analysis helps optimize sales strategies and resource allocation for better performance.
data-analysis data-cleaning data-collection data-visualization powerquerym relationships tableau tableau-desktop unions
Last synced: 02 Feb 2026
https://github.com/janashanaa/flightanalysis
This Jupyter Notebook presents an exploratory data analysis of data derived from a flight booking website.
data-analysis data-visualization exploratory-data-analysis jupyter-notebook python
Last synced: 15 May 2026
https://github.com/emmarhoffmann/analysis-of-california-real-estate-market-factors-influencing-home-prices
Investigates how home size, number of bedrooms, and bathrooms influence home prices, with comparisons across California, New York, New Jersey, and Pennsylvania.
data-analysis r real-estate statistical-models
Last synced: 17 Mar 2025
https://github.com/emmarhoffmann/analysis-of-student-debt-among-first-generation-college-students
Explores the financial landscape of first-generation college students, analyzing patterns in student debt based on factors like median income, net price of attendance, and enrollment size.
data-analysis first-generation-college-students r statistical-models
Last synced: 17 Mar 2025
https://github.com/satyacoder29/crm-analytics
CRM Analytics Dashboard – An interactive dashboard using Tableau, SQL, and Salesforce CRM Analytics (CRMA) to analyze sales performance, customer segmentation, and churn prediction. Features automated ETL pipelines, predictive analytics, and real-time insights for data-driven decision-making. 🚀📊
advanced-excel data-analysis data-cleaning data-collection data-transformation data-visualization matplotlib numpy pandas powerbi python seaborn sql tableau
Last synced: 03 Mar 2025
https://github.com/mindlessmuse666/apartment-price-predictor
Python-проект по прогнозированию стоимости аренды квартир с помощью линейной регрессии. Практическая работа по теме: "Основы машинного обучения" дисциплины "МДК 13.01: Основы применения методов искусственного интеллекта в программировании".
apartment-price-prediction data-analysis data-science linear-regression linear-regression-models machine-learning matplotlib python regression sklearn unit-testing
Last synced: 11 Apr 2026
https://github.com/mindlessmuse666/iris-knn
Проект демонстрирует применение алгоритма k-ближайших соседей (KNN) для классификации набора данных Iris. Включает загрузку данных, обучение модели, оценку производительности и визуализацию результатов с использованием библиотек Pandas, Scikit-learn, Matplotlib, Seaborn и Plotly.
algorithm classification data-analysis data-visualization iris-dataset knn lazy-learning machine-learning python scikit-learn
Last synced: 17 Aug 2025
https://github.com/ljadhav25/knn-algorithm-data-science-
This repository contains a project demonstrating the implementation and application of the K-Nearest Neighbors (K-NN) algorithm in Data Science. The objective is to provide a comprehensive understanding of the K-NN algorithm, including data preprocessing, model training, evaluation, and visualization of results. This project is ideal for beginners
data-analysis data-science knn-classification machine-learning matplotlib-pyplot numpy pandas-library seaborn
Last synced: 16 Apr 2026
https://github.com/pramodkondur/dataspark-end-to-end-dataanalytics
Cleaned, performed EDA and stored data in MySQL. Queried, and analyzed data, uncovering opportunities to drive revenue growth and optimize operations, with a potential revenue growth of $30.03 million. Reported key insights using Power BI.
data-analysis data-visualization eda powerbi python sql
Last synced: 21 May 2026
https://github.com/spshah1701/world-development-indicators
Analysis of World Development Indicators (WDI) using big data technologies, specifically Databricks, Apache Spark, and Scala.
apache-spark big-data data-analysis spark-sql
Last synced: 17 Mar 2025
https://github.com/mosalem149/pythonutilities
A collection of Python scripts for common utility tasks including file manipulation, word counting, longest word detection, and grade categorization. Perfect for quick and easy solutions to everyday programming problems.
data-analysis educational-tools file-io file-manipulation grade-calculation python text-analysis text-processing utility word-counting
Last synced: 15 May 2026
https://github.com/oshinrathor/data-science-systems-and-analytics-projects
Dive into my Data Science Projects Repository, featuring a Spam SMS Classifier, NIA Dashboard, H1N1 Vaccine Prediction, and NYC Taxi Fare Prediction. Each project showcases my skills in data cleaning, exploratory analysis, modeling, and visualization, offering valuable insights and methodologies for data enthusiasts and practitioners.
dashboard data-analysis data-driven-decisions data-presentation data-science data-visualization dataexploration eda insights nia webanalytics
Last synced: 02 Mar 2025
https://github.com/darshan1924/house-price-pridiction
This repository contains a machine learning project for predicting house prices based on various features, including geographical coordinates. The project includes data preprocessing steps to handle# House Price Prediction Project
data-analysis data-preprocessing house-prices jupyter-notebook machine-learning prediction
Last synced: 27 Mar 2025
https://github.com/sondosaabed/data-visualization-in-tableau
data-analysis data-visualization nanodegree plot tableau udacity
Last synced: 08 Sep 2025
https://github.com/bjornmelin/minneanalytics
MinneAnalytics project work.
competitive-programming data-analysis data-visualization r
Last synced: 09 Jul 2025
https://github.com/thecoderpinar/globalwarmingforecast
🌍 Global Warming Forecast Tool An advanced tool for analyzing and forecasting climate trends using ARIMA and Prophet models, with interactive visualizations and scenario simulations.
arima climate-change data-analysis environmental-science forecasting global-warming machine-learning prophet streamlit time-series-analysis visualization
Last synced: 27 Mar 2025
https://github.com/nehul1149/olympic-data-analysis
This project is an interactive data visualization and analytics platform for exploring historical Olympic Games data. Built with Python and Streamlit, it offers an in-depth analysis of medal tallies, athlete statistics, and country-wise performance trends, providing users with powerful insights into the world's biggest sporting event.
analysis data-analysis data-science data-visualization matplotlib python streamlit
Last synced: 18 May 2026
https://github.com/brevex/hotel-booking-demand-data-analysis
Data analysis in Python of demand for urban hotels and resorts showing their causes and relationships
data-analysis data-science hotel-booking-analysis kaggle python
Last synced: 08 May 2026
https://github.com/tknishh/investing-platform
An investing platform application to help users get information and analyze various foreign currency assets. The investing platform uses an ETL pipeline to insert new batches of Forex data once a day.
data-analysis investing-platform pipeline
Last synced: 18 Mar 2025
https://github.com/abhinav-codealchemist/open-government-data-analysis
Data Analysis Using Pandas
data-analysis data-science jupyter-notebook python
Last synced: 18 May 2026
https://github.com/bilalhameed248/power-bi-learning-and-dev
Power BI Learning And Development
chats data-analysis data-preprocessing dataanalysis dax powerbi statistics visualization
Last synced: 06 Mar 2026
https://github.com/iamsainikhil/us-births-analysis
Analysis of US-Births during 1994-2003 based on CDC-NCHS data set.
Last synced: 16 May 2026
https://github.com/ebrizzzz/data-visualization-project-using-tableau
A data visualization project for the Visual Data Analysis course (Spring Term 2025) at the University of Skövde. This project explores the factors influencing national happiness scores across different global regions from 2005 to 2022.
analytics data data-analysis data-science data-visualization python regression tableau
Last synced: 16 Jun 2025
https://github.com/qorah/vic-edu-housing-insights
Analysis of education outcomes and housing affordability in Victoria, Australia.
data-analysis jupyter-notebook
Last synced: 18 Mar 2025
https://github.com/clemence-g/heat-dome-analysis
atmospheric-science data-analysis geopotential heat-wave jupyter
Last synced: 18 Mar 2025
https://github.com/aalkiyumi/project-4-big-data-analysis-with-pyspark-on-weather-data
In this project, I analyzed weather data from the NCEI Global Surface Summary of Day dataset using PySpark in Jupyter Notebook. Tasks included data cleaning, statistical analysis, and forecasting for temperature, wind speed, precipitation, and extreme weather events. The project also predicts future weather patterns for Cincinnati and Florida.
big-data-analytics cs5165 data-analysis data-cleaning data-engineering data-science introduction-to-cloud-computing jupyter-notebook machine-learning precipitation-analysis predictive-modeling pyspark statistical-analysis temperature-forecasting time-series-forecasting uc uc2026 university-of-cincinnati wind-speed-data
Last synced: 17 Mar 2025
https://github.com/betkh/datascieneinpython
Jupiter Notebook files
data-analysis data-visualization
Last synced: 16 Jun 2025
https://github.com/abidshafee/google.colaboratory_projects
This repository contains the collections of interactive python notebooks (ipynb) that are some of my projects on Data Science, Machine Learning (ML), and Natural Language Processing (NLP).
colaboratory data-analysis data-science lstm machine-learning nlp statistics time-series
Last synced: 09 Jul 2025
https://github.com/fabricioism/nyc-schools-perceptions
data-analysis data-science data-visualization dataquest r
Last synced: 29 Oct 2025
https://github.com/cyberoctane29/deutsche-bank-customer-churn-prediction-end-to-end-analysis-and-modeling
In this project, I aim to predict customer churn for Deutsche Bank using supervised machine learning. It involves data exploration, feature engineering, and building Naive Bayes, Decision Tree, Random Forest, and XGBoost models. Models are tuned, evaluated, and compared to identify the best approach for churn prediction.
bank-customer-churn churn-analysis churn-prediction customer-churn-analytics data-analysis data-analytics data-visualization decision-tree eda gaussian-naive-bayes machine-learning random-forest supervised-learning xgboost
Last synced: 11 Oct 2025
https://github.com/felipe-veas/visor-sueldos-publicos
Herramienta interactiva para visualizar y analizar remuneraciones del sector público en Chile, construida con Streamlit.
audit chile data-analysis python streamlit transparency
Last synced: 16 May 2026
https://github.com/czesctuklap/sustainable-fashion-database-analysis
This project, analyzes a dataset of sustainable fashion trends for 2024. It includes data preprocessing, exploration, visualization, and insights on environmental impact factors such as carbon footprint, water usage, waste production, and sustainability practices.
data-analysis data-visualization database dataset keggle sustainable-fashion
Last synced: 30 Apr 2026
https://github.com/krypten/nycsubwayturnstileweatheranalysis
Analyzing the NYC Subway Dataset
data-analysis machine-learning machinelearning python
Last synced: 01 Sep 2025
https://github.com/kfrural/dashboard_agro
Dashboard Agro is a technological platform that integrates several components to support Brazilian agribusiness through data analysis, visualization and forecasts. This innovative solution was developed to serve three main groups: farmers, researchers and public managers.
big-data data-analysis predictive-analytics python
Last synced: 15 May 2026
https://github.com/fmind/malpop
Rank the popularity of malware applications by their occurrence on VirusTotal
data-analysis malware popularity ranking virustotal
Last synced: 11 Apr 2025
https://github.com/estevan-ulian/py-agent-voice
Um projeto para lidar com interações de voz entre humano e agente de I.A. permitindo a leitura e análise de dados de um arquivo CSV.
agent-based-modeling data-analysis python3 whisper-ai
Last synced: 11 Apr 2025
https://github.com/badranalyst/startup-expansion-analysis-with-pandas-matplotlib-and-power-bi
Analyzes startup growth and expansion factors using Pandas for data analysis and Matplotlib for visualizations. Complements findings with data visualizations in Power BI, providing actionable insights into funding and market trends.
dashboard data-analysis data-visualization dataset matplotlib matplotlib-pyplot pandas power-bi powerbi
Last synced: 16 May 2026
https://github.com/josedanielchg/nyc-schools-test-scores-exploration
DataCamp project analyzing NYC public school test scores to identify top math-performing schools, the best overall SAT scores, and borough-level variability using Python and pandas
data-analysis jupyter-notebook python
Last synced: 19 Mar 2025
https://github.com/coslynx/fitness-tracker-mvp-community
Project: Track fitness goals, log workouts, and share progress with friends. Created at https://coslynx.com
code-generation data-analysis developer-tools devops fitness-tracker goal-setting machine-learning mvp mvp-development nextjs postgresql prisma react social-community software-development tailwindcss typescript user-authentication workout-tracking zustand
Last synced: 06 Jan 2026
https://github.com/swat1563/recommendation-system
This repository features a recommendation system and analytics engine using datasets on users, organizations, contents, contacts, events, and recommendations. It includes data preprocessing, building a recommendation system, and creating visual reports with Power BI.
analytics data-analysis data-visualization engine kaggle numpy pandas powerbi powerbi-dashboards powerbi-desktop powerbi-reports python recommendation-engine recommendation-system recommender-systems scikit-learn scipy
Last synced: 07 Jan 2026
https://github.com/damianmarti/big-mac-index
Data analysis from BigMac index
Last synced: 03 Apr 2025
https://github.com/rijul007/diamonds-analysis-using-r
Diamonds data analysis using R, exploring relationships between diamond attributes (such as carat, cut, color, and clarity) and price, with a focus on providing insights for engagement ring selection through various statistical techniques and data visualizations including histograms, boxplots, scatter plots, and bar charts.
Last synced: 25 Jan 2026
https://github.com/istinnew/enaic-s-discount-strategy-analysis
**(Open to Collaboration):** This project evaluates the impact of discounts on sales and customer retention for Eniac. It includes data cleaning, visualization, storytelling, and strategic insights to optimize discount strategies while maintaining brand reputation. 📊🛍️✨
cleaning-data cleaning-data-in-python cost-optimization data-analysis data-science data-visualization library presentation python visualization
Last synced: 03 Apr 2025
https://github.com/bhaveshbhakta/blood-glucose-prediction-using-ann
Blood Glucose Prediction
ann artificial-neural-networks blood-glucose-prediction data-analysis data-visualization deep-learning
Last synced: 16 May 2026
https://github.com/elliotone/nl-semantic-kernel-sales-analyzer
A console project showing Microsoft Semantic Kernel examples for sales data analysis using local AI models via LM Studio.
ai csharp data-analysis dotnet lm-studio local-ai machine-learning semantic-kernel
Last synced: 16 May 2026
https://github.com/sadia-khan13/modern_arts_data_cleaning
Welcome to the Data Cleaning project! This repository is dedicated to showcasing best practices and techniques for cleaning data using Pandas within Jupyter Notebook
data-analysis data-analysis-python data-cleaning data-science jupyter-notebook pandas-python
Last synced: 10 May 2026
https://github.com/as16082023/goodcabs-performance-analysis
Codebasics Resume Challenge 13 Analysing Goodcabs' performance in transportation across India from January to June 2024
codebasicsresumeprojectchallenge data-analysis goodcabs mysql sql
Last synced: 03 Apr 2025
https://github.com/malucor/analise_exploratoria_dados
Programa em Python para fazer uma Análise Exploratória de Dados de Logística.
analise-de-dados analise-exploratoria analise-exploratoria-de-dados data-analysis ebac exploratory-data-analysis ipynb jupyter-notebook python
Last synced: 16 May 2026
https://github.com/alejandrolara11/desafio_latam_introduccion_analisis_de_datos
Repositorio del curso "Introducción al Análisis de Datos" de Desafío Latam. Ejercicios prácticos realizados durante el curso, enfocados en análisis de datos con Python, Pandas, y visualización básica.
data-analysis data-science data-visualization matplotlib numpy pandas python seaborn statsmodels
Last synced: 29 Apr 2026
https://github.com/jwt218/isonq
MATLAB package for Qtegra-generated data file processing.
data-analysis geochemistry isotopes matlab
Last synced: 03 Apr 2025
https://github.com/yasir-arafah/nyc-trip-fare-prediction-using-tcn
"NYC Trip Fare Prediction Using Temporal Convolutional Networks (TCN)" is a Data Analytics Project where the trip and fare data of NYC taxi are combined and then analyzed using Pyspark and visualized using Matplotlib library. The project predicts the fare by using Temporal Convolutional Neural Network.
colab data-analysis matplotlib nyc-taxi-dataset pyspark python
Last synced: 29 Apr 2026
https://github.com/ggarciajavier/udacity-dalf-project3-test-perceptual-phenomenom
Work performed for the 3rd project of Udacity Data Analyst Nanodegree: statistical testing of a perceptual phenomenom (Stroop task).
data-analysis python statistical-inference udacity-data-analyst-nanodegree
Last synced: 18 May 2026
https://github.com/pdiegel/currencytracker
A Python application that fetches real-time currency exchange rates from an API, securely stores the data in an SQLite database, and includes error handling, logging, and good programming practices for reliable and periodic data capturing.
analysis api currency data-analysis data-capture logging python python3 sqlite3 tracker
Last synced: 09 Sep 2025
https://github.com/dylanbk/exploring-data
A collection of programs that explore data engineering and analysis.
data-analysis data-engineering matplotlib pandas python
Last synced: 02 Mar 2025
https://github.com/michael-angelo-mootoo/quanta-app
Quanta is an open source statistical package app / toolkit for neuroscience and general computational descriptive and inferential statistics.
computational-statistics customtkinter data-analysis descriptive-statistics gui-application inferential-statistics neuroscience python r statistical-analysis statistics tkinter-python
Last synced: 16 May 2026
https://github.com/grindelfp/two-data-manipulative-tasks
Two simple tasks on data analysis and processing.
Last synced: 17 Feb 2026