Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2025-02-20 00:07:05 UTC
- JSON Representation
https://github.com/nafisalawalidris/sales-performance-dashboard
Sales Performance Dashboard: Analyze and visualize sales data using Power BI. Gain insights into trends, customer segments, product performance, and geographic distribution. Make data-driven decisions to optimize sales strategies and maximize revenue.
analytics-revenue dashboard-power-bi data data-analysis intelligence-sales optimization performance sales visualization-business
Last synced: 23 Jan 2025
https://github.com/prime-infinity/type-one
Software to visualize and analyze GitHub repos based on certain statistics such as stars, forks and issues
data-analysis data-visualization
Last synced: 24 Nov 2024
https://github.com/nafisalawalidris/dr.-semmelweis-and-the-discovery-of-handwashing
Uncover the revolutionary impact of handwashing on mortality rates in healthcare. Explore the story of Dr. Semmelweis and his groundbreaking findings.
data data-analysis handwashing healthcare-analysis medical-breakthrough mortality-rates
Last synced: 23 Jan 2025
https://github.com/nafisalawalidris/buybuy-e-commerce-company
The BuyBuy E-commerce Company repository is a comprehensive hub for the company's e-commerce platform. It includes source code, documentation, and data analysis insights, providing a data-driven approach to improve customer experience, drive revenue, and inform decision-making.
buybuy cleaning-data company customer-experience data data-analysis decision-making documentation e-commerce excel insights postgresql repository revenue source-code sql
Last synced: 23 Jan 2025
https://github.com/ysayaovong/stockroom_management
The Stockroom Management project is a comprehensive tool that automates and simplifies the process of managing inventory in stockrooms. By incorporating features like real-time updates, report generation, and low-stock alerts, it helps businesses save time, reduce errors, and optimize their inventory operations.
business-applications data-analysis data-visualization database-management inventory-control inventory-management logistics sql warehouse warehouse-management
Last synced: 30 Jan 2025
https://github.com/antononcube/wl-quantileregression-paclet
Wolfram Language (aka Mathematica) paclet that provides various Quantile Regression functions.
data-analysis machine-learning quantile-regression time-series time-series-analysis
Last synced: 08 Feb 2025
https://github.com/akash1070/freecodecamp-data-analysis-with-python-
contains study notes and assignments from freecodecamp of Data Analysis With Python
data-analysis demographic-analysis mean-variance-standard-calculator medical-data-visualisation numpy-library pandas-library python3 sea-level-predictor time-series-analysis
Last synced: 29 Jan 2025
https://github.com/jku-vds-lab/loops
Loops is a JupyterLab extension to support iterative and exploratory data analysis in computational notebooks.
data-analysis data-science data-visualization jupyter jupyter-notebook notebook provenance
Last synced: 20 Feb 2025
https://github.com/datawithbaraa/sql-modern-warehouse-and-analytics
A comprehensive guide to building a modern data warehouse with SQL Server, including ETL processes, data modeling, and analytics.
data-analysis data-analytics data-cleaning data-engineering data-lake data-lakehouse data-science data-warehouse data-warehousing database datalake datascience datawarehouse datawarehousing etl medallion-architecture pipeline sql sql-query sql-server
Last synced: 15 Feb 2025
https://github.com/grypesc/graduateadmissions
Visualization, analysis and predictive modeling of a Kaggle graduate admissions dataset.
data-analysis data-mining data-science data-visualization dataset
Last synced: 08 Feb 2025
https://github.com/akash1070/data-science-virtual-internship-by-anz
Exploratory data analysis and prediction of annual salary for customers from the dataset provided by ANZ.
data-analysis data-science predictive-analytics presentation-slides
Last synced: 29 Jan 2025
https://github.com/magnaopus1/synthron-cfd-trader-pro
SYNTHRON CFD Trader PRO is a cutting-edge trading platform featuring raw, custom-designed machine learning models. From reinforcement learning for dynamic strategies to predictive analytics, sentiment analysis, and optimization techniques, it empowers trading across stocks, forex, indices, commodities, futures, and crypto with precision.
ai backtesting cfd commodities data-analysis data-science data-structures forex futures indices machine-learning trading
Last synced: 05 Feb 2025
https://github.com/nishnash54/recomax---recommendation-platform
P&G Hack - Recommendation platform
data-analysis data-science data-visualization machine-learning prediction-model recommendation-engine
Last synced: 08 Feb 2025
https://github.com/shelton-beep/predicting-gpa-using-lifestyle-factors
Predicting student GPA using lifestyle factors like study habits, sleep, and stress levels. A machine learning model built to help students and educators understand the impact of lifestyle choices on academic performance.
data-analysis data-preprocessing data-science feature-engineering gpa-prediction machine-learning model-interpretability predictive-modeling python regression-analysis student-performance xgboost
Last synced: 29 Jan 2025
https://github.com/rayyan9477/diamond-price-forecasting
This is a comprehensive machine learning project focused on predicting diamond prices. Using a dataset of diamond attributes, the project implements various machine learning models to forecast prices. Key features include data preprocessing, exploratory data analysis (EDA), and model training with algorithms such as Linear Regression, Decision Tree
data-analysis data-science decision-trees eda linear-regression machine-learning
Last synced: 10 Jan 2025
https://github.com/rayyan9477/multiple-disease-prediction-system
This repository contains a Multiple Disease Prediction System leveraging machine learning techniques for accurate predictions. It utilizes Python, Pandas, Scikit-learn, and Flask for data preprocessing, model building, and web deployment. Explore the project and connect on LinkedIn for collaborations.
data-analysis data-science machine-learning python streamlit
Last synced: 10 Jan 2025
https://github.com/rayyan9477/coin-detection-project
This Coin Detection Project leverages machine learning techniques to identify coins using a dataset from Kaggle. Key libraries utilized include OpenCV for image processing, TensorFlow for model training, and Pandas for data manipulation. The project also employs NumPy for numerical operations and Matplotlib for visualization.
computer-vision data-analysis data-science data-visualization machine-learning notebook python
Last synced: 10 Jan 2025
https://github.com/rayyan9477/youtube-spam-detection-with-flask-and-machine-learning
This is a web application built using Flask that detects spam comments on YouTube using a Naive Bayes classifier. It leverages techniques such as CountVectorizer for feature extraction and scikit-learn for machine learning. The application reads data from a CSV file and predicts whether a comment is spam or not.
data-analysis data-science machine-learning nlp-machine-learning spam-detection
Last synced: 10 Jan 2025
https://github.com/x1ao4/doc-merger
通过 python 脚本将两个相对不完整的文档合并为一个完整的文档 / merge two relatively incomplete documents into one complete document via python script
data-analysis data-merging document-analysis document-comparison document-processing documents filtering filtering-data merge merge-documents
Last synced: 20 Feb 2025
https://github.com/dogan-the-analyst/developer_survey_analysis
Analysis of the 2024 Stack Overflow developer survey. Tools used include Python, Pandas, Matplotlib, and IBM Cognos.
data-analysis data-visualization ibm-cognos-analytics matplotlib pandas python
Last synced: 16 Feb 2025
https://github.com/rayyan9477/household-transactions-analysis-and-clustering
This project involves analyzing household transaction data to gain insights into spending patterns and behaviors. The analysis includes data cleaning, exploratory data analysis (EDA), clustering using K-Means, and visualization of customer segments.
customer-segmentation data-analysis data-cleaning data-science exploratory-data-analysis kmeans-clustering machine-learning
Last synced: 10 Jan 2025
https://github.com/seabbs/explorebcgonoutcomes
Analysis to explore the association of BCG vaccination and TB outcomes.
bcg data-analysis regression rstats tuberculosis
Last synced: 01 Jan 2025
https://github.com/patilni3/seaborn-in-depth
Python's Seaborn Library for Data Analysis, Machine Learning, Data Science and many more...
data-analysis data-reporting data-representation data-science data-visualization plots-in-python powerbi seaborn sns
Last synced: 08 Feb 2025
https://github.com/patilni3/numpy-in-depth
Python's NumPy Library for Data Analysis, Machine Learning, Data Science and many more...
data-analysis data-engineering data-science machine-learning numpy pandas
Last synced: 08 Feb 2025
https://github.com/patilni3/matplotlib-in-depth
Python's Matplotlib Library for Data Analysis, Machine Learning, Data Science and many more...
data-analysis data-representation data-science data-visualization matplotlib matplotlib-pyplot plots-in-python powerbi seaborn
Last synced: 08 Feb 2025
https://github.com/billgewrgoulas/hypothesis-testing-on-37-seasons-of-nba
First assignment for the course Data Mining @CSE.UOI
data-analysis data-science numpy scipy seaborn statistics
Last synced: 16 Jan 2025
https://github.com/phillbertnevinemmanuel/movieindustryanalysis-correlation
This project is a comprehensive data analysis endeavor within the Movie Industry, spanning from Data Cleaning to Exploratory Data Analysis, Correlation Analysis, and Temporal Analysis. The dataset was sourced from Kaggle, purportedly scraped using the IMDb API. Python was the primary tool utilized for analysis.
data-analysis data-cleaning python
Last synced: 15 Feb 2025
https://github.com/phillbertnevinemmanuel/coviddeathvaceda
an exploratory data analysis based on dataset of covid statisics from 2020-2022
Last synced: 15 Feb 2025
https://github.com/phillbertnevinemmanuel/dataprofessionalquestionnaireanalysis_pbix
This project delves into the demographics and job satisfaction of data professionals, presenting insights through a user-friendly dashboard built with Power BI. The dataset has been graciously provided by Alex The Analyst
dashboard data-analysis powerbi visualization
Last synced: 15 Feb 2025
https://github.com/listiangr/ecommerce_sales_data_analysis
Proyek ini menganalisis data penjualan e-commerce untuk membantu bisnis memahami tren penjualan, performa produk, dan segmen pelanggan. Tujuan utamanya adalah memberikan wawasan yang dapat meningkatkan strategi pemasaran dan pengelolaan produk.
dashboard data-analysis data-cleaning data-collection data-penjualan data-visualization exploratory-data-analysis microsoft-excel
Last synced: 09 Feb 2025
https://github.com/listiangr/product_sales_data_analysis
Proyek ini menganalisis data penjualan untuk memberikan wawasan tentang tren penjualan, profitabilitas, dan permintaan produk, guna membantu perusahaan merencanakan strategi harga, promosi, dan pengelolaan inventaris yang lebih efektif.
corrplot data-analysis data-preprocessing data-visualization dplyr ggcorrplot ggplot2 product-sales r-language rstudio
Last synced: 09 Feb 2025
https://github.com/bgr8/bokeh-ile-veri-gorsellestirme
Data visualization with Bokeh Library
bokeh color data-analysis data-visualization hbar html python vbar
Last synced: 08 Feb 2025
https://github.com/huseyincenik/tableau
This repository contains Tableau visualizations and related resources for my project.
analytics api bianalyst business-analytics business-intelligence business-solutions dashboard data data-analysis data-science data-structures dataanalysis dataset datavisualization drilldown interactive-visualizations tableau tableau-dashboards viz
Last synced: 29 Jan 2025
https://github.com/emredurukn/data-analysis
Example notebooks for analyzing data
data-analysis data-visualization python
Last synced: 10 Jan 2025
https://github.com/jrbourbeau/cr-composition
IceCube cosmic-ray composition analysis
cosmic-rays data-analysis machine-learning physics python
Last synced: 09 Feb 2025
https://github.com/discdiver/new-belgium-ratings
Find the most popular New Belgium beers of all time!
beautifulsoup data-analysis pandas python seaborn webscraping
Last synced: 10 Jan 2025
https://github.com/wiseaidev/corona-virus-data-analysis-modeling-and-visualization
Data analysis of covid-19 and SEIRD model implementation.
coronavirus coronavirus-tracking covid-19 data-analysis data-analysis-python data-visualization folium-maps modeling-dynamic-systems numpy ploty population python3 science science-research seird-model seird-simulator simulation
Last synced: 10 Dec 2024
https://github.com/vijayjoshi16/credit-card-fraud-detection-using-ml-in-python
Credit Card Fraud Detection Using ML in Python
data-analysis jupyter-notebook logistic-regression machine-learning matplotlib-pyplot numpy pandas python regression seaborn
Last synced: 23 Jan 2025
https://github.com/hyperspy/exspy-demos
eXSpy Jupyter Notebook demos
data-analysis data-visualization eds edx eels electron-energy-loss-spectroscopy hyperspy life-sciences materials-science multi-dimensional physical-sciences spectroscopy tutorial x-ray-spectroscopy xrf
Last synced: 18 Nov 2024
https://github.com/shridhar1504/foreign-exchange-rate-time-series-datascience-project
This project will use time series analysis to forecast the exchange rate between the euro and the US dollar. The project will use a variety of statistical techniques, such as ARIMA to model the data and forecast the exchange rate.
data-analysis data-preprocessing data-science data-transformation data-visualization eda exploratory-data-analysis foreign-exchange-rates machine-learning model-fitting predictive-modeling python3 time-series time-series-analysis
Last synced: 15 Feb 2025
https://github.com/sanikamal/machine-learning-atoz
Beginner-friendly machine learning tutorials and mini-projects.
collaborative-filtering data-analysis data-visualization decision-trees kmeans-clustering knn machine-learning machine-learning-algorithms recommender-system regression svm
Last synced: 10 Jan 2025
https://github.com/ernestaroozoo/memestocks.net
MemeStocks.net is a Python web app that tracks the historical popularity of specific stocks by monitoring Reddit mentions. Users can search for a stock symbol and view information such as the stock's name, price, and historical popularity data. The data is gathered using the Pushshift API and stored in a PostgreSQL database.
dashboard data-analysis financial-data meme-stock python reddit-scraper scraping sql stock streamlit
Last synced: 05 Feb 2025
https://github.com/shridhar1504/loan-clustering-datascience-project
This project uses Machine Learning to Cluster loan together based on their similarities. The project uses a dataset of loan application which includes information about the Loan amount and Balance. The project then use the clustering algorithm to group the loan together based on the similarities.
clustering-algorithm data-analysis data-science data-visualization datanalysis eda kmeans-clustering machine-learning python sql sql-server unsupervised-learning
Last synced: 15 Feb 2025
https://github.com/abhi-lab2/ipl-data-analysis
IPL data analysis for future predictions
data-analysis data-science python
Last synced: 19 Feb 2025
https://github.com/shridhar1504/power-bi-visualization-project
This repository contains Visualization Projects which is visualized through Power BI Software, by using the visualization we can gain multiple insights and strategies which helps to develop the business for gaining high profit margins and by the insights we can reduce the damages by accidents & calamities.
dashboard data-analysis data-cleaning data-science data-visualization exploratory-data-analysis microsoft-excel microsoft-power-bi microsoft-powerpoint powerbi powerbi-report powerbi-visuals powerpoint-slides
Last synced: 15 Feb 2025
https://github.com/frankelavsky/political-polarization-challenge
I had 8 hours to build a solution to the research claim that "politics have become more divided in the past 50 years." You can navigate views of congressional voting patterns using arrows. I used d3, require, MVC pattern, and vanilla js. Pre-processed the data in node.js. Data is from DW-NOMINATE: ftp://k7moa.com/junkord/HANDSL01114A20_STAND_ALONE_30.DAT
client-side css d3 d3js data-analysis data-visualization frontend frontend-app html interactive interactive-visualizations javascript modular nodejs political-science politics requirejs research single-page-app visualization
Last synced: 19 Jan 2025
https://github.com/vitia-fritelle/ipynb_converter
Jupyter notebook to Python file conversor
data-analysis data-science jupyter-notebook python
Last synced: 09 Feb 2025
https://github.com/azmainadel/twitter-data-neo4j
Playing with graph database on a large dataset of twitter data.
data-analysis data-visualization neo4j-database snap
Last synced: 12 Feb 2025
https://github.com/mohamedhany99/collecting-sound-features-frequency-from-an-audio-file-and-save-it-in-excel
This script takes a single audio file and collect all the sound features in it including (Frequency - mean - variance - minimum value - maximum value - Median - Kurtosis - Skewness) and save it in an array and import this array in a row in the datasheet.
automation data-analysis data-science dataset-generation excel-import signal-processing
Last synced: 17 Jan 2025
https://github.com/mrjxtr/coffee_sales_analysis
Full data analytics process from data gathering, data processing, data visualization and reporting on a small coffee shop sales data.
dashboard data-analysis data-cleaning data-visualization kpi-report pandas python3 spreadsheet tableau-public
Last synced: 16 Feb 2025
https://github.com/eesunmoon/on-device_multimodal_er
[Research] Multimodal Emotion Recognition for On-device AI
artificial-intelligence data-analysis deep-learning embedded-systems emotion-recognition heart-rate-analysis multimodal-fusion npu on-device python speech-processing speech-recognition tensorflow wearable-devices
Last synced: 22 Dec 2024
https://github.com/mayankagg9722/movie-recommendation
Collaborative Filtering is performed over Movie Lens Dataset.
collaborative-filtering data-analysis jupyter-notebook movie-recommendation python-script website
Last synced: 29 Jan 2025
https://github.com/faezeh-gholamrezaie/coronavirus-d3-chart
Tracing the spread of the coronavirus from the White House under the title of diagram D3
coronavirus coronavirus-analysis coronavirus-outbreak-tracking d3 data-analysis datavisualization white-house-coronavirus-outbreak-tracking
Last synced: 19 Feb 2025
https://github.com/aravind-selvam/bikeshare-company-analysis
Google Data Analytics Professional Certificate program's Capstone project, of a bike sharing company
analytics business-analytics business-intelligence data data-analysis data-visualization dataanalytics google-data-analytics postgresql sql sql-server
Last synced: 14 Jan 2025
https://github.com/thecoderpinar/worldpopulationanalysis2024
World Population Analysis 2024: An In-Depth Exploration of Urban and Rural Populations and Infrastructure Accessibility
data-analysis data-science economic-indicators machine-learning population-growth prophet-forecasting
Last synced: 09 Feb 2025
https://github.com/jfjlaros/spreadscript
SpreadScript: Use a spreadsheet as a function.
automation command-line data-analysis evaluation function interface spreadsheet
Last synced: 12 Jan 2025
https://github.com/hfxbse/dhbw-data-analysis
Exploratory data analysis R notebook for the module T3INF4333 "Grundlagen Data Science" held in 2024 by Lothar B. Blum at the DHBW Stuttgart.
data-analysis data-science dhbw dhbw-stuttgart ggplot2 r r-notebook
Last synced: 12 Feb 2025
https://github.com/ivanildobarauna-dev/currency-quote
Complete solution for extracting currency pair quotes data with comprehensive testing, parameter validation, flexible configuration management, Hexagonal Architecture, CI/CD pipelines, code quality tools, and detailed documentation.
data-analysis data-analytics data-engineering library pypi-packages python
Last synced: 12 Feb 2025
https://github.com/thecoderpinar/telecommunication-customer-churn-analysis-and-prediction
📊 This project focuses on customer churn analysis and prediction in the telecommunications sector. Using data analysis, modeling, and predictive techniques, it aims to understand and mitigate customer loss by developing strategies.
churn churn-prediction classification customer data-analysis data-science deep-learning machine-learning neural-network telecom
Last synced: 09 Feb 2025
https://github.com/rani-sikdar/python-data-structures
A repository for sharing Python implementations of common data structures and algorithms, including arrays, linked lists, stacks, queues, trees, graphs, and sorting/searching techniques. Perfect for learning, revisiting fundamentals, and collaborating on computer science concepts. Contributions welcome! 🚀
data-analysis data-structure data-visualization numpy-library pandas-dataframe python python-3
Last synced: 05 Feb 2025
https://github.com/as16082023/music-store-analysis
This project involves analyzing music store data using SQL queries in MySQL workbench to enhance decision-making, identify trends, and understand customer behavior
data-analysis music-store-analysis mysql sql
Last synced: 15 Feb 2025
https://github.com/as16082023/restaurant-order-analysis
Analyzing order data to identify the most and least popular menu items and types of cuisine
data-analysis maven-analytics mysql restaurant-order sql
Last synced: 15 Feb 2025
https://github.com/as16082023/hotel-booking-analysis-eda-
Exploratory Data Analysis on hotel booking data using Python
data-analysis data-visualization exploratory-data-analysis jupyter-notebook python
Last synced: 15 Feb 2025
https://github.com/thecoderpinar/customer-segmentation-clv-analysis
Optimize marketing strategies and enhance decision-making. Explore customer data, segment behavior, calculate CLV, analyze demographics, and visualize insights. 🚀
clv-analysis customer-segmentation data-analysis data-science data-visualization jupyter-notebook machine-learning marketing-strategy python
Last synced: 09 Feb 2025
https://github.com/mardavsj/weather-prediction
Weather prediction model which mainly focuses on visualization.
data-analysis data-visualization matplotlib numpy pandas pandas-dataframe
Last synced: 14 Feb 2025
https://github.com/nafiealhilaly/analyzing-sa-schools-data
A simple python streamlit app to explore and analyze Saudi Arabia schools dataset from data.gov.sa
data-analysis data-visualization eda python streamlit
Last synced: 08 Feb 2025
https://github.com/viper373/163-buff
爬取网易BUFF平台CS:GO武器皮肤交易数据
163 arima crawler-python csgo data-analysis prediction python
Last synced: 05 Feb 2025
https://github.com/supersjgk/data-analysis-dns-over-https
A Data Analytics + ML project to classify Benign and Malicious DNS-over-HTTPS traffic
classification-model data-analysis data-analysis-python data-analytics datamining decision-trees dns dns-over-https doh gradient-boosting knn machine-learning random-forest
Last synced: 25 Jan 2025
https://github.com/kmihajlo/dataprocessing_graduatesadmissionprediction
Statistical processing of a data set using R.
data-analysis data-processing r statistical-analysis
Last synced: 09 Feb 2025
https://github.com/virajbhutada/coursera-google-data-analytics-capstone
A repository containing the Capstone project for the Google Data Analytics Professional Certificate, focusing on analyzing FitBit fitness tracker usage data to derive insights relevant to Bellabeat, a wellness technology company.
coursera data-analysis data-visualization fitbit fitness-tracker google google-data-analytics-capstone-project python-programming
Last synced: 10 Jan 2025
https://github.com/luiscib3r/streamlit-examples
Streamlit examples.
data-analysis data-science machine-learning python streamlit
Last synced: 15 Feb 2025
https://github.com/sarincr/basics-of-julia-programming-language
Julia is a high-level, high-performance, dynamic programming language. While it is a general purpose language and can be used to write any application, many of its features are well-suited for high-performance numerical analysis and computational science.
data data-analysis data-mining data-science data-visualization dataanalysis dataanalytics datascience julia julia-language julia-library julia-package julialang machine-learning
Last synced: 21 Jan 2025
https://github.com/nakshjainsonigara/vba-canteenmanagementsystem
The Canteen Management System is a comprehensive software solution designed to modernize and optimize canteen operations. It aims to simplify the complexities of managing a canteen by automating key processes such as order management, payment processing, and report generation.
canteen canteen-mangement-system charts data-analysis email excel microsoft payment-gateway vba vba-excel vba-macros word
Last synced: 15 Feb 2025
https://github.com/savinrazvan/heredity
An AI that assesses the likelihood of genetic traits in individuals using a Bayesian Network to analyze family genetic data, modeling genetic inheritance and mutations to infer probabilities of gene presence and trait expression.
ai bayesian-network biological-data-analysis data-analysis educational-project family-genetics genetic-inheritance genetic-traits heredity mutation-modeling probability-calculation python
Last synced: 10 Jan 2025
https://github.com/savinrazvan/degrees
A program that computes the "degrees of separation" between two actors by identifying the sequence of movies connecting them, inspired by the Six Degrees of Kevin Bacon game. Uses IMDb-based datasets for actors, movies, and their relationships.
actor-connections ai data-analysis degrees-of-separation educational-project graph-theory imdb movie-database python six-degrees-of-kevin-bacon
Last synced: 10 Jan 2025
https://github.com/bretsw/subreddits-over-time
Study of the r/Teachers and r/education subreddits over time
Last synced: 06 Feb 2025
https://github.com/ituvtu/datamining-ab-testing
This project focuses on conducting A/B testing to evaluate the effectiveness of two marketing campaigns. Using statistical analysis and hypothesis testing, we determine which campaign is more effective in improving conversion rates.
a-b-testing data-analysis data-analysis-python data-mining ipynb jupyter jupyter-notebook python
Last synced: 16 Jan 2025
https://github.com/as16082023/nashville-housing-data-cleaning-project
This project involved using MySQL to clean and optimize a Nashville housing dataset, addressing key data quality issues to ensure it was ready for accurate analysis.
data-analysis data-cleaning mysql nashville-housing-data
Last synced: 15 Feb 2025
https://github.com/revan-alqahmi/summarize-talabat-company-reviews
Natural Language Processing Project, which is a program that analyzes Arabic comments at Talabat Company and classifies them into positive, negative, and neutral using machine learning algorithms and natural language processing techniques.
artificial-intelligence data-analysis machine-learning-algorithms natural-language-processing python
Last synced: 19 Feb 2025
https://github.com/arielle0222/data_analysis
📊 Data analysis projects for autonomous driving and smart mobility engineering using Python and SQL.
autonomous-driving composite data-analysis electric-vehicles environmental-data python visualizatoin
Last synced: 06 Feb 2025
https://github.com/pranavarora1895/proteintypeprediction
Data Analysis on Protein Type Prediction
bioinformatics data-analysis supervised-learning
Last synced: 18 Jan 2025
https://github.com/17bit0216/machine-learning
All of my data analysis and Machine learning Projects.
analysis data-analysis linearr logistic logisticregression machine-learning python3 random-forest
Last synced: 18 Jan 2025
https://github.com/dhairyac/customer-churn-prediction
Analyze, visualize and predict customer churn using Machine Learning
data-analysis data-visualization ensemble-classifier machine-learning performance-metrics python-3 random-forest-classifier softmax-regression svm-classifier
Last synced: 22 Jan 2025
https://github.com/noturlee/imdb-dataanalysis
A data model that predicts the IMDb rating of a movie based on features like genre, director, and actors. Using regression techniques to tackle this problem.
data-analysis data-cleaning data-modeling data-science data-visualization
Last synced: 14 Feb 2025
https://github.com/sevdanurgenc/python-for-data-science-lecture-notes
In this repo, I have the course contents of Python for Data Science training, which will be given to Siemens by the cooperation of Academy Peak Information Technologies Training and Consultancy between 28 June - 1 July 2022.
data-analysis data-mining data-modeling data-science data-structure data-visualization matplotlib-tutorial numpy-tutorial pandas-tutorial
Last synced: 29 Jan 2025
https://github.com/ansh420/mcdonald_case-study
It is basically depend on the market Segment Analysis. It is a case study of mcDonald.
algorithms-implemented data-analysis python3 segmentation
Last synced: 19 Jan 2025
https://github.com/anilkumarteegala/aspiration.ai-ml-internship
This repo contains the internship project by Career Launcher.
data-analysis data-science financial internship machine-learning python3 stock-analysis stock-market visualization
Last synced: 13 Nov 2024
https://github.com/phillbertnevinemmanuel/automotivesalesdataanalysis
This marks my inaugural venture into personal data analysis, employing SQL and Python for Correlation Analysis. I've sourced the dataset from Kaggle, specifically focusing on automotive sales. You can find the dataset linked on my website below. I'm excited to share that I've independently managed the majority of tasks involved in this project.
data-analysis dataset microsoft-sql-server python python-lambda sql ssms tsql
Last synced: 18 Jan 2025
https://github.com/mxagar/airbnb_data_analysis
An analysis of the AirBnB dataset from Euskadi / the Basque Country.
airbnb data-analysis data-science eda feature-engineering modeling pandas regression
Last synced: 15 Feb 2025
https://github.com/chengkangzai/malaysia-pandemic-dashboard
covid-19 data-analysis pandemic-dashboard
Last synced: 03 Feb 2025
https://github.com/abdelhakim-gh/machine-learning_data-analysis_project
recognizing handwritten numbers & comparing the Life Expectancy vs Fertility in 1960 & 2013 of regions
data-analysis jupyter-notebook machine-learning python r r-studio
Last synced: 30 Jan 2025
https://github.com/harshmule1/school-data-analysis-
School Data Analysis Using SQL
Last synced: 18 Jan 2025
https://github.com/alejo1630/sport_stats
Data analysis of information from the summer and winter Olympic games over the years. UC Davis SQL Specialization Final Project
data-analysis jupyter-notebook olympics-dataset plotly python seaborn sql
Last synced: 31 Dec 2024
https://github.com/graphieros/data-visualisation
data visualisation solutions in vanilla js
data-analysis data-visualization pure-javascript svg-manipulating
Last synced: 07 Feb 2025
https://github.com/shrawans007/data_science_students
Customer Engagement Analysis in Excel for 365datascience.com
2021-2022 365datascience case-study case-study-analysis case-study-project customer-engagement-analysis data-analysis data-analytics data-science data-science-students free-plan indian-students microsoft-excel ms-excel ms-excel-addin ms-excel-data-analytics ms-excel-task paid-plan us-students
Last synced: 17 Jan 2025
https://github.com/prernarohra/mental-health-prediction
This project focuses on predicting mental health outcomes using machine learning algorithms. By analyzing various psychological, social, and lifestyle factors, the model aims to identify individuals at risk, enabling early intervention and support.
data-analysis data-science data-visualization machine-learning mental-health python
Last synced: 23 Jan 2025
https://github.com/prernarohra/quakeguard
QuakeGuard is an innovative project for reducing earthquake intensity and structural damage. It takes a proactive approach to seismic activity, by using complex algorithms and real-time data to improve safety and resilience for people in earthquake-prone areas.
artificial-intelligence backend data-analysis data-science earthquake-intensity final-year-project front-end geology machine-learning open-source python visualization
Last synced: 23 Jan 2025