Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-07-01 00:07:23 UTC
https://github.com/mothraa/etl-marketanalysis-webscraping-poo
OC project 2 refactoring (OOP version not yet completed)
data-analysis etl poo python web-scraping
Last synced: 20 Oct 2025
https://github.com/omr5221/bi-scripts
Example of DI BI tool scripting
automation configuration-files data-analysis data-warehousing dimensional-analysis diver etl-pipeline lookup modeling perl-script python shell-script sql summarization
Last synced: 15 Mar 2026
https://github.com/jigyasag18/bird-strikes-in-aviation-project
This project analyzes over a decade of U.S. bird strike data (2000–2011) to evaluate safety risks, damage trends, and cost implications in aviation. Using PostgreSQL for database management and Power BI for dashboard visualization, it uncovers critical insights into when, where, and how wildlife impacts aircraft. Key findings inform strategically.
bird-strike-prevention bird-strike-prevention-in-real-airport data data-analysis data-analysis-project data-visualisation data-visualization data-visualization-project data-visualizations database dataset dax-query postgresql postgresql-database powerbi powerbi-desktop powerbi-report powerbi-visuals sql sql-database
Last synced: 09 May 2026
https://github.com/browndwarf/contracosta
Wavelength dependent starspot contrast with Kepler/K2 and TESS
Last synced: 23 Jan 2026
https://github.com/sumitkundu102022/air-quality-report
Air Quality Report using PowerBI
data data-analysis data-visualization powerbi
Last synced: 23 Jan 2026
https://github.com/brianlesko/r_data_science_stat5730
Written by Brian Lesko, this repository contains R scripts demonstrating data science topics, largely originating from study at Ohio State. Contents are written in RStudio using R Markdown files. As of 1/21/23, future projects concerning data science, statistics, and machine learning will be in Python in my machine learning repository.
data data-analysis flight-data ggplot2 olympics-data r-markdown tidyverse
Last synced: 23 Jan 2026
https://github.com/satyacoder29/crm-analytics-power-bi
CRM Analytics Dashboard – An interactive dashboard using Tableau, SQL, and Salesforce CRM Analytics (CRMA) to analyze sales performance, customer segmentation, and churn prediction. Features automated ETL pipelines, predictive analytics, and real-time insights for data-driven decision-making. 🚀📊
advanced-excel data-analysis data-cleaning data-collection data-transformation data-visualization matplotlib numpy pandas powerbi python seaborn sql tableau
Last synced: 14 Apr 2026
https://github.com/alessandroryo/bike-rental-data-analysis
A data analysis project focused on understanding and predicting bike rental patterns. This project utilizes data processing, visualization, and predictive modeling techniques to gain insights into bike rental usage, fulfilling the final submission requirement for Dicoding Indonesia's Data Analysis course.
bike-rental data-analysis data-visualization jupyter-notebook machine-learning python streamlit
Last synced: 09 Apr 2026
https://github.com/psychelzh/cogstruct-old
Data Analysis on Cognitive Structure
cognition data-analysis intelligence psychology
Last synced: 25 Oct 2025
https://github.com/madhursinghbhadoriya/data_analysis_fifa-players
Used NumPy, Matplotlib, Pandas, etc. to process important information and characteristic traits in a Jupyter Notebook.
analysis data-analysis data-science graphs jupyter-notebook pandas python
Last synced: 07 May 2026
https://github.com/campagnucci/exercitando_pandas
Practical pandas exercises using open education data from São Paulo
data-analysis data-science education-data exercises pandas-tutorial
Last synced: 28 Jan 2026
https://github.com/code-jl/nfl-kicker-predictor
A sophisticated Python application that provides real-time NFL kicker statistics and performance analysis with an intuitive graphical interface.
beautifulsoup data-analysis data-visualization espn football gui nfl prediction python real-time-analytics real-time-data sport-analytics sports-data statistics tkinter web-scraping
Last synced: 01 Jun 2026
https://github.com/gaurabkundu1/road-accident-data-analysis
This is an Excel project on Road Accident Data Analysis in the form of an interactive Dashboard.
dashboard data-analysis data-vizualisation excel road-accidents
Last synced: 24 Jan 2026
https://github.com/diegopino/publibdata_codexhackathon
Public Library Data processing/analysis codex hackathon attempt
data-analysis data-visualization libraries public
Last synced: 24 Jan 2026
https://github.com/annnieglez/fraud-detection-eda
Fraud Detection - Exploratory Data Analysis (EDA). Analyzing financial transactions to detect fraud patterns using Python and Tableau. Libraries: Pandas, Seaborn and Matplotlib. Key Focus: Data cleaning, fraud trends, high-risk transactions, time-based patterns
data-analysis data-science data-visualization eda fraud-detection fraud-prevention matplotlib seaborn
Last synced: 28 Jan 2026
https://github.com/yash1882/music-store-data-analysis
A project focused on analyzing music store data using SQL ♬
begineer-friendly data-analysis music music-store-data music-store-data-analysis sql-project
Last synced: 28 Jan 2026
https://github.com/angchekar28/sales-report-power-bi
A Power BI sales report analyzing country-wise and product-wise sales trends. Includes dashboards, decomposition trees, and key influencers analysis for business insights.
dashboard data-analysis data-cleaning data-visualization powerbi sales-report
Last synced: 16 Mar 2026
https://github.com/smahala02/magnetism-lab
This repository contains Python scripts and data for analyzing inductance in toroidal coils to calculate the magnetic permeability of ferrite materials. The project helps classify materials as soft or hard magnets based on experimental data.
data-analysis inductance jupyter-notebook magnetism python toroids
Last synced: 29 Jan 2026
https://github.com/edumoraes1/comissao-reduzida
Audience segmentation built in SQL for enjoei's new reduced-commission feature
bq data-analysis salesforce sql
Last synced: 06 Feb 2026
https://github.com/surajwate/datalab
DataLab is a versatile toolkit designed to simplify data exploration, analysis, and visualization for data scientists.
data-analysis data-science python visualization
Last synced: 30 Jan 2026
https://github.com/aygp-dr/values-compass
Tools for exploring and analyzing Anthropic's Values-in-the-Wild dataset for AI ethics research
ai-ethics anthropic-claude data-analysis nlp values
Last synced: 25 Feb 2026
https://github.com/jaseel342/ecommerce_sales_dashboard
The E-commerce Sales Dashboard project offers a comprehensive view of e-commerce sales performance using interactive Power BI dashboards. It focuses on key metrics like YTD Sales, YTD Profit, YTD Profit Margin, and Quantity of Products sold, analyzing data by product categories, states, and regions.
data-analysis data-modelling dax-expression excel power-query powerbi visualization
Last synced: 07 Feb 2026
https://github.com/luminati-io/indeed-dataset-samples
A sample dataset of over 1000 Indeed job listings, extracted using the Bright Data API, ideal for market analysis and growth.
api data-analysis datasets indeed jobs web-scraping
Last synced: 07 Feb 2026
https://github.com/traore-07/fedex-sales-analysis
Analysis of the FedEx Sales Transaction
data-analysis data-visualization sales-analysis tabeau
Last synced: 31 Jan 2026
https://github.com/shafaq-aslam/pandas-lab
A comprehensive collection of Jupyter notebooks exploring Pandas, from Series and DataFrames to data cleaning, aggregation, merging, and visualization. A complete hands-on guide for mastering data manipulation and analysis with Python.
analytics data-analysis data-cleaning data-science data-visualization dataframe jupyter-notebook machine-learning pandas pandas-dataframe pandas-library pandas-series python python3 series
Last synced: 15 Apr 2026
https://github.com/ajmannust41288/data-analyst
Data Analyst, Microsoft Professional expert, Power BI Desktop, Tableau, and dashboards with ChatGPT-4 AI uses
business-analytics data-analysis data-analyst data-analytics eda
Last synced: 01 Feb 2026
https://github.com/bineet-ratna-shakya/data-science-salary-analysis
Analyzing a dataset containing salaries of data science professionals from 2020 to 2023.
data-analysis data-science data-visualization jupyter numpy pandas python
Last synced: 01 Feb 2026
https://github.com/tr41z/machine-learning
machine learning models
ai artificial-intelligence data-analysis data-preprocessing google-colab jupyter-notebook machine-learning models python tensorflow
Last synced: 01 Feb 2026
https://github.com/rissh/titanicsurvivalpredictionusingml
Predicting Titanic passenger survival through machine learning. This project includes data preprocessing, exploratory data analysis, feature engineering, and model training using Python. 🚢
data data-analysis data-science data-visualization dataanalysis jupiter-notebook machine-learning machine-learning-algorithms machinelearning matplotlib numpy pandas prediction prediction-model python python3 seaborn tenserflow tflearn titanic
Last synced: 01 Feb 2026
https://github.com/vishnu-vamshii/data-science-jobs-salaries
Created an interactive dashboard to analyze data science job salaries across regions of the world, experience levels, average salaries in USD, and types of employment, along with a geographical visual.
data-analysis data-science data-visualization tableau tableau-dashboard
Last synced: 01 Feb 2026
https://github.com/yeuner/file-analysis-sql-demo
Streamlit-based application that leverages pandas, sqlite3, and file handling libraries (OpenPyXL and PyArrow) to practice SQL queries, analyze datasets, and export results. A personal project to enhance Python and SQL skills.
data-analysis dataset pandas sql sqlite streamlit vizualization
Last synced: 15 Apr 2026
https://github.com/vladimiracunadev-create/python-data-science-program
Python Data Science Program: 197 classes in 9 parts. Advanced syllabus derived from Géron, VanderPlas, Huyen, ISLP, and Barocas/Hardt/Narayanan. A personal resource for learning, teaching, and continuous improvement.
bootcamp data-analysis data-science education jupyter machine-learning matplotlib numpy pandas python scikit-learn
Last synced: 01 Jun 2026
https://github.com/rodrigojunqueiradev/curso-sql-para-analise-de-dados
data-analysis data-science nosql pg pgadmin4 postgresql sql
Last synced: 08 Feb 2026
https://github.com/sroman0/data-analytics
Data Analytics Exercises is a collection of comprehensive university-level exercises aimed at enhancing skills in data analytics. The repository includes practical notebooks covering data manipulation, exploratory data analysis (EDA), statistical analysis, data visualization, and machine learning fundamentals.
data-analysis data-analytics data-science data-visualization education exercises exploratory-data-analysis hands-on-practice jupyter-notebook machine-learning python statistics
Last synced: 15 Apr 2026
https://github.com/grindelfp/datasets-analysis
A Machine Learning and Data Analysis course task dedicated to practicing data normalization and preprocessing skills.
data-analysis datasets ipynb mlda
Last synced: 05 Mar 2026
https://github.com/siddhant2105s/airline-performance-analysis-dashboard
Enhancing Airline Performance Analysis for the Department of Transport
data-analysis data-visualization tableau
Last synced: 08 Feb 2026
https://github.com/michalspano/maturitna-skuska-proj
Maturita (school-leaving exam) 2021/2022 - objective data processing and analysis
Last synced: 19 Mar 2026
https://github.com/jweinst1/xenon
A processing-based language
data-analysis interpreter reactive-programming
Last synced: 15 Apr 2026
https://github.com/themihirmathur/uber-data-analytics
The goal of this project is to perform comprehensive data analytics on Uber trip data using a modern data engineering stack on Google Cloud Platform (GCP).
bigquery data-analysis data-engineering etl-pipeline google-cloud-platform looker python
Last synced: 09 Feb 2026
https://github.com/aisurjyasamantaray/-optimizing-target-s-brazilian-operations-insights-from-order-processing-pricing-and-payment-trends-
This project offers an in-depth analysis of consumer behavior, logistical performance, and payment preferences within the e-commerce sector. By examining order costs, delivery times, and payment methods, businesses can uncover valuable insights into operational efficiency and customer preferences.
bigquery consumer-insights data-analysis database sql target
Last synced: 26 Feb 2026
https://github.com/nulltea/kicksware-scrapebot
Web scraping tool to retrieve sneaker details & images from web store sites
bot data-analysis pandas python sneakers web-scraping
Last synced: 15 Apr 2026
https://github.com/vanajmoorthy/bibliotype
Find out your bibliotype!
alpinejs data-analysis django goodreads
Last synced: 09 Feb 2026
https://github.com/shruti23-ui/blinkit-powerbi-dashboard
A comprehensive Power BI dashboard analyzing Blinkit's sales performance, outlet metrics, and multi-tier market analytics with interactive visualizations and business intelligence insights.
data-analysis data-visualization microsoft-excel microsoft-power-bi powerbi sales-analysis sql
Last synced: 09 Feb 2026
https://github.com/animesh-chourey/power-bi
Various projects from my attempt to learn Power BI
business-analytics data-analysis data-visualization powerbi
Last synced: 10 Feb 2026
https://github.com/tushar2704/imdb-movie-analysis
This project extracts meaningful insights, trends, and patterns from the data, shedding light on various aspects of the movie industry. By leveraging this analysis, filmmakers, studios, and enthusiasts can gain valuable information to inform decision-making, understand audience preferences, and contribute to the creation of successful movies.
artificial-intelligence data-analysis data-science imdb project tushar2704
Last synced: 10 Feb 2026
https://github.com/vikktor93/proyecto-final-python-datascience
Dataset analysis of worldwide sales of video games on different platforms in 2020
data-analysis data-science jupyter-notebook kaggle matplotlib pandas python seaborn
Last synced: 16 Apr 2026
https://github.com/ohimoiza1205/mastercard-cybersecurity-simulation
Served as an analyst on Mastercard’s Security Awareness Team to identify and report security threats
cybersecurity data-analysis data-presentation security-awareness-training technical-security-awareness
Last synced: 11 Feb 2026
https://github.com/rodrigojunqueiradev/python-exercises
Repository for storing exercises written in the Python language
data-analysis data-science data-structures data-visualization database math pandas pandas-python python python-3 python3 sql statistics
Last synced: 16 Apr 2026
https://github.com/thanaraklee/exploring-and-analyzing-data-in-oracle-database
This project focuses on data analysis using SQL with Oracle Database 21c. It aims to build familiarity with data management and data analysis using SQL commands and Oracle Database 21c.
data-analysis oracle-database sql sql-developer
Last synced: 12 Feb 2026
https://github.com/rohitblaze10/-excel-_seller_store_analysis
A collection of data analysis projects showcasing data cleaning, exploration, visualization, and machine learning. Using "Excel" and more to uncover insights and drive data-driven decision-making. Feel free to explore, contribute, or collaborate!
data-analysis data-visualization excel excel-export
Last synced: 12 Feb 2026
https://github.com/koldlight/bluetab-data-science-2017
Repository for sharing material and publishing the challenges
course data-analysis data-science exercises
Last synced: 12 Feb 2026
https://github.com/nabilshadman/power-bi-essential-training
Exercise files for Power BI Essential Training (2024): datasets and dashboards for hands-on learning
dashboard data-analysis data-science data-visualization power-bi power-bi-dashboard
Last synced: 12 Feb 2026
https://github.com/edoaltamura/rotational-ksz-macsis
Repository for supplementary material from my publication on the rotational kinetic SZ effect in MACSIS
cosmology data-analysis galaxy-clusters high-performance-computing hydrodynamics
Last synced: 28 Feb 2026
https://github.com/karlyndiary/data-visualisation-empowering-business-with-effective-insights
This Tata Group Sales Insights Dashboard uses a dataset provided by Forage.
analysis-and-presentation analytics-and-insights dashboard data-analysis data-cleanup data-interpretation data-visualization forage tableau tata-group visualisation
Last synced: 28 Feb 2026
https://github.com/m-ah07/text-sentiment-analysis-api
A lightweight Python project for analyzing the sentiment of textual data using the TextBlob library. This project provides a simple and effective way to measure the polarity and subjectivity of any given text.
data-analysis machine-learning python python-project sentiment-analysis text-analysis text-mining
Last synced: 14 Feb 2026
https://github.com/balajimohan18/tableau-visualization-project
This repository contains visualization projects built with Tableau. The visualizations yield insights and strategies that help businesses grow and achieve higher profit margins, and in some cases provide social value by reducing damage from calamities.
data-analysis data-science data-visualization exploratory-data-analysis tableau tableau-public
Last synced: 19 Mar 2026
https://github.com/fhdsl/seattlestatsummer_r
A 4-day introduction to R programming, focused on Fred Hutch Research Interns
beginner beginner-friendly course data-analysis data-science introduction-to-programming r-programming tidyverse
Last synced: 19 Mar 2026
https://github.com/kunalkumar2001/data-analyst-power-bi
Data Analyst Power BI Project for Portfolio
data-analysis data-analyst data-analyst-power-bi powerbi
Last synced: 16 Feb 2026
https://github.com/arnoudbuzing/iowa-caucus
Data Analysis on 2020 Iowa Caucus results
caucus data-analysis iowa iowa-caucus mathematica primaries primary-election wolfram-language
Last synced: 01 Mar 2026
https://github.com/oscarmtr/metrov
Interactive viewer for tropospheric meteorological soundings
climate data-analysis meteorology skew-t soundings temperature tropospheric web
Last synced: 01 Mar 2026
https://github.com/johannaschmidle/road-collisions-project
Analyzed road accident data in the UK from 2019 to 2022 to identify patterns and trends in road accidents, for Effective Road Management [Excel]
data-analysis data-visualization excel pivot-tables traffic-analysis
Last synced: 01 Mar 2026
https://github.com/yash22222/pwc-power-bi-virtual-case-experience
The Power BI PwC Virtual Case Experience is an exciting and educational program designed to provide participants with hands-on exposure to Power BI, a prominent business intelligence and data visualization tool, within the context of consulting at PwC.
business-analyst business-analytics business-intelligence dashboard data-analysis data-analyst data-analytics dax microsoft-power-bi powerbi powerbi-dashboards powerbi-visuals pwc
Last synced: 02 Mar 2026
https://github.com/paladitya/cn_term_project
Code for testbed
automated-testing data-analysis reliability-score tcl testbed wrapper-library
Last synced: 02 Mar 2026
https://github.com/soumya-kushwaha/uber-analysis
data-analysis data-science data-visualization uber-analysis
Last synced: 16 Apr 2026
https://github.com/huseyincenik/looker_studio
Looker Studio
dashboard data-analysis data-visualization looker-studio lookerstudio
Last synced: 03 Mar 2026
https://github.com/grindelfp/logistic-regression-study
Example of logistic regression data analysis and an exercise on it.
data-analysis ipynb logistic-regression python
Last synced: 03 Mar 2026
https://github.com/banner-19/extraction-and-analysis-of-text
The objective is to analyze text content from a list of URLs. This involves extracting article titles and text, then performing natural language processing to generate metrics like sentiment, readability, and word usage. Finally, the results are stored for further analysis or visualization.
data-analysis data-analytics data-science nlp nltk python3 text-analysis text-extraction
Last synced: 03 May 2026
https://github.com/bishopce16/school_district_analysis
The school board requested an analysis on the various performance metrics for the school district.
data-analysis jupyter-notebook numpy pandas python visual-studio-code
Last synced: 16 Apr 2026
https://github.com/santiago-giordano/ahora12project
Excel, SQL, and Python processing of Excel files
data-analysis excel jupyter-notebook microsoft-sql-server pandas sql sqlalchemy sqlserver
Last synced: 16 Apr 2026
https://github.com/marben06/rent-in-germany
Interactive visualizations and maps depicting topics around rent prices and income in Germany built with Svelte.
charts d3 d3-visualization d3js data-analysis data-visualization gis gis-data infographic infographics map mapbox mapbox-gl mapbox-gl-js mapboxgl svelte
Last synced: 27 Apr 2026
https://github.com/satyacoder29/e-commerce-sales-analysis
Performed E-commerce Sales Analysis to identify trends, optimize sales, and improve decision-making. Analyzed customer patterns, seasonal trends, and product performance using Python, SQL, and Power BI. Delivered actionable insights to enhance revenue, streamline inventory management, and boost customer engagement.
data-analysis data-visualization datacleaning msexcel pivottables powerquerym visualisation vlookups
Last synced: 05 Mar 2026
https://github.com/shashwat9kumar/us-accidents-data-analysis
Analysis of the US accidents using the US-Accidents dataset (4.2 million entries) from Kaggle
accidents accidents-analysis data-analysis data-analytics data-visualisation data-visualization matplotlib numpy pandas python
Last synced: 17 Apr 2026
https://github.com/kheriberto/knn_project
This is a simple project that uses dummy data to practice and demonstrate my knowledge of the KNN algorithm.
data-analysis knn-classifier numpy python scikit-learn seaborn
Last synced: 02 Apr 2026
https://github.com/jabercrombia/video-game-data
This project integrates FastAPI as the backend and Next.js as the frontend to create a full-stack web application. It processes and displays video game sales data, enabling seamless API communication while maintaining a scalable and efficient architecture.
data-analysis nextjs nintendo playstation python typescript video-game
Last synced: 02 Apr 2026
https://github.com/hugo-hattori/rpa_email_report
Robotic Process Automation Project.
automation data-analysis data-analysis-python data-analytics jupyter jupyter-notebook pandas pandas-dataframe pandas-python pyautogui pyautogui-automation pyperclip python time
Last synced: 17 Apr 2026
https://github.com/eliasdehondt/learn-r
Welcome to the Learn-R repository! This is your go-to resource for learning the R programming language, whether you're a beginner or looking to enhance your skills.
data-analysis data-visualization education machine-learning programming r statistics tutorials
Last synced: 03 Apr 2026
https://github.com/coder36459/fcc-projects
freeCodeCamp projects
bash bootstrap c-sharp css d3 data-analysis html javascript matplotlib numpy pandas postgresql programming python react seaborn sql topojson xml xslt
Last synced: 03 Apr 2026
https://github.com/mahmoudwal27/manufacturing_downtime
This project focuses on improving manufacturing efficiency by analyzing production data. Using Python, SQL, and Power BI, we built interactive dashboards to uncover patterns, minimize downtime, and optimize operations. The goal is to help stakeholders make data-driven decisions for enhanced productivity.
data-analysis data-analysis-python data-visualization google-colab powerbi python sql
Last synced: 17 Apr 2026
https://github.com/sharmas1ddharth/data-analysis-with-python
Code for freeCodeCamp's Data Analysis with Python projects
data-analysis data-analysis-with-python freecodecamp-project
Last synced: 03 Jun 2026
https://github.com/kgotsosm/fcc-data-analysis
Notebooks created for the Data Analysis Course on freeCodeCamp
data-analysis data-visualization matplotlib pandas seaborn
Last synced: 17 Apr 2026
https://github.com/victoorv/prediction_covid19
Predict whether an individual is positive for COVID-19 or not.
classification covid-19-classifier covid-19-data-analysis covid19-data data-analysis data-science data-visualization exploratory-data-analysis hyperparameter-tuning machine-learning machine-learning-algorithms neural-networks oversampling-algorithms python statistical-tests statistics
Last synced: 04 Apr 2026
https://github.com/victoorv/criminalite_us
An analysis of crime as a function of socio-economic variables, including the selection and comparison of multiple regression models as well as hypothesis tests on the coefficients and the significance of the models.
data-analysis data-science r regression regression-analysis regression-models statistical-analysis statistical-tests statistics
Last synced: 04 Apr 2026
https://github.com/davidmalko87/steam-library-exporter
Python script to export your Steam game library to CSV — playtime, genres, reviews, metacritic scores, prices, tags & estimated owners via Steam Web API + Store API + SteamSpy
csv-export data-analysis game-data metacritic playtime-tracker python steam steam-api steam-games steam-library steamspy
Last synced: 04 Apr 2026
https://github.com/ahmad-ali-rafique/decision-tree-regressor-modeling
Comprehensive exploration of decision tree regressors, including data cleaning, model building, and performance evaluation on various datasets.
artificial-intelligence data data-analysis dataanalytics decision-trees decisiontreeregressor modeling models regression-models
Last synced: 17 Apr 2026
https://github.com/sevilaymuni/project-no.3-seaborn-plots
Pandas and Seaborn Mediated Comprehensive Analysis on Differentiated Thyroid Cancer
data-analysis data-structures data-visualization mathplotlib pandas python seaborn
Last synced: 18 Apr 2026
https://github.com/prangonghose/wikipedia-blocking-policies
This study investigates the relationship between editors’ disruptive behavior and regulation policies on English Wikipedia, focusing on the Blocking Policy page. The study collects and analyzes data from 2004 to 2022 using the Wikipedia API, page statistics, and keyword extraction.
data-analysis data-visualization matplotlib open-source pandas python3 seaborn
Last synced: 18 Apr 2026
https://github.com/andryadsm/predicting-house-prices
🏘️ Project Predicting House Prices (Python)
data-analysis data-preprocessing data-visualization feature-engineering house-prices machine-learning matplotlib numpy pandas python real-estate seaborn sklearn
Last synced: 04 Apr 2026
https://github.com/vansh-py04/data-analysis-questions-pandas-numpy-sql
Solutions to 450+ Data Science Tech Stack questions essential for Data Analysts and Scientists!
data-analysis data-science deepnote machine-learning numpy pandas python sql
Last synced: 18 Apr 2026
https://github.com/akhundmuzzammil/energyconsumptionprediction
This repository contains code and resources for training a linear regression model to predict energy consumption based on various building parameters.
data-analysis energy-consumption linear-regression machine-learning python scikit-learn streamlit visualization
Last synced: 18 Apr 2026
https://github.com/wang-q/tva
tva: Tab-separated Values Assistant
cli command-line-tool csv data-analysis data-processing etl high-performance rust streaming tabular-data tsv unix-philosophy
Last synced: 05 Apr 2026
https://github.com/manalisbhavsar/mall-customers-clustering
Applies K-Means clustering to mall customer data, segmenting customers by annual income and spending score to identify patterns and group customers for targeted marketing.
data-analysis data-visualization matplotlib numpy pandas python scikit-learn
Last synced: 18 Apr 2026
https://github.com/al-ghaly/prosper-loans-analysis
A statistical analysis project that analyzes a finance company's loan data using Python packages (pandas, NumPy, seaborn, matplotlib)
data-analysis matplotlib numpy pandas python python-data-analysis seaborn statistical-analysis statistics
Last synced: 18 Apr 2026
https://github.com/bolshovaelizaveta/covid19_spark_analysis
Course project for the discipline 'Databases for Computer Vision'. Development of an analytics platform for epidemiological monitoring of COVID-19 using Apache Hadoop and Spark
apache-hadoop apache-spark covid-19 data-analysis jupyter-notebook machine-learning medical-imaging pyspark sql
Last synced: 18 Apr 2026
https://github.com/arv-anshul/notebooks
My Jupyter notebooks in which I practice data science.
data-analysis data-science jupyter-notebook llm machine-learning marimo matplotlib regression transformers
Last synced: 19 Apr 2026