Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2025-01-29 00:07:16 UTC
- JSON Representation
https://github.com/spacebakery/variance-in-weather-project
Statistics for Data Analysis | Variance and Standard Deviation
data-analysis python standard-deviation statistics variance
Last synced: 03 Jan 2025
https://github.com/nikitalpopov/news
v semester project
data-analysis data-science python scikit-learn
Last synced: 25 Jan 2025
https://github.com/krypten/nycsubwayturnstileweatheranalysis
Analyzing the NYC Subway Dataset
data-analysis machine-learning machinelearning python
Last synced: 11 Jan 2025
https://github.com/chaitanyaprasad60/sql-queries
This is a list of complex SQL Queries I have practiced.
data-analysis sql window-functions
Last synced: 08 Jan 2025
https://github.com/27ahmad/ibm-data-science-capstone
The Capstone is the final course in the IBM Data Science Professional Certificate program. It's a project that combines all the skills and knowledge you've gained throughout the specialization.
data-analysis data-science folium-maps machine-learning plotly-dash python sql
Last synced: 15 Jan 2025
https://github.com/27ahmad/amazon-sales-analysis
This repository contains an exploratory data analysis (EDA) and visualization project of Amazon sales data. The goal is to uncover insights and present key metrics through a Tableau dashboard.
data-analysis eda pandas python seaborn tableau
Last synced: 15 Jan 2025
https://github.com/souravsuvarna/whatsapp-chat-analyzer-and-visualizer-web-application
The WhatsApp chat analyzer and visualizer uses NLP algorithms to analyze chat data, tracking usage patterns and presenting insights through visually appealing charts and graphs. It helps users understand communication patterns and behaviors on WhatsApp.
data-analysis data-science data-visualization python python3 streamlit
Last synced: 05 Jan 2025
https://github.com/27ahmad/foreign-direct-investment-analytics
This repository contains an exploratory data analysis (EDA) and visualization project on a dataset of Foreign Direct Investment (FDI) by companies. The objective is to analyze FDI trends and present key insights through an interactive Tableau dashboard.
data-analysis eda matplotlib pandas python seaborn tableau
Last synced: 15 Jan 2025
https://github.com/27ahmad/heart-disease-diagnostic-eda
This project conducts Exploratory Data Analysis on a dataset related to heart diagnostic disease, aiming to derive valuable insights from the analysis.
data-analysis data-visualization pandas python
Last synced: 15 Jan 2025
https://github.com/parthshah02/customer_churn_dashboard
This repository features a comprehensive project showcasing data analysis and interactive dashboard using Python
data-analysis matplotlib numpy pandas python
Last synced: 25 Jan 2025
https://github.com/prgermux/yield-reporter
This Python application provides a graphical user interface (GUI) for analyzing and visualizing production data from various machines. It uses the PyQt5 framework for the GUI and Matplotlib for plotting data.
automation data-analysis python reporting
Last synced: 07 Jan 2025
https://github.com/kelvintechnical/web-scraper
Tableau Book Price Analysis
data data-analysis data-science tableau tableau-public
Last synced: 19 Dec 2024
https://github.com/elzasimoes/challenge-xplab
Challenge for data analysis course.
data-analysis data-science jupyter-notebook python
Last synced: 05 Jan 2025
https://github.com/27ahmad/netflix_sql_project
The Netflix SQL Project analyzes the Netflix dataset using SQL queries to gain insights into its content, identify trends, and address business problems related to movies and TV shows.
data-analysis postgresql-database sql
Last synced: 15 Jan 2025
https://github.com/al-ogr/sf_pr2_job_analysis_hh_sql
SkillFactory DataScience PROJECT-2. Анализ вакансий из HeadHunter
data-analysis data-science ipynb plotly python sql
Last synced: 08 Jan 2025
https://github.com/dcostachar/telco-customer-churn-dashboard
An interactive Tableau dashboard using the Telco Customer Churn dataset to analyze key drivers of customer churn and develop data-driven retention strategies for the telecommunications industry.
business-intelligence customer-churn-analysis data-analysis data-visualization marketing-analytics tableau
Last synced: 15 Jan 2025
https://github.com/rahul-jha98/restauranttrends.stats-backend
Application that scrapes the Zomato Dataset and enables the user to visualise the results.
data-analysis data-extraction firebase-storage web-scraping zomato-api
Last synced: 19 Jan 2025
https://github.com/samuelbarbosadev/casting
A Solução Casting é uma empresa que presta serviços para outras empresas, e hoje a predição do faturamento para os clientes é realizada de forma manual e simplificada, mas estão em busca de automatizar para gerar mais valor aos clientes, reduzir custos e otimizar o tempo da equipe.
data-analysis model python skit-learn time-series
Last synced: 27 Jan 2025
https://github.com/odinsride/monpl
PL/SQL Data Load Monitoring Tools
data-analysis database etl logging logging-framework monitoring oracle plsql
Last synced: 23 Jan 2025
https://github.com/fortunewalla/flight-delays
Data Expo 2009: Airline on time data
airlines data-analysis data-science data36 database dataexpo dataset flightdelays flights ontimedata pgsql postgres postgresql sql tomimester
Last synced: 08 Jan 2025
https://github.com/achique-luisdan/tops-songs-db
Base de datos de Tops Semanales de Canciones🎵 más reproducidas en Spotify🎶. Prácticas de SQL enfocadas en el Análisis de Datos (Data Analysis).
Last synced: 08 Jan 2025
https://github.com/abdelmajidlh/cours
Cours Data engineering et data analyse.
apache-spark big-data data-analysis data-engineering docker jupyter-notebook pyspark
Last synced: 08 Jan 2025
https://github.com/damiieibikun/web-scrapping-and-python-data-visualization-on-top-500-movies-imdb
Web Scrapping and Python Data visualization on Top 500 movies IMDb
beautifulsoup4 data-analysis data-visualization matplotlib-pyplot numpy pandas plotly-express python requests seaborn web-scraping
Last synced: 15 Jan 2025
https://github.com/robinmillford/sales-metrics-dashboard-streamlit
This Streamlit dashboard provides an interactive and comprehensive analysis of customer behavior, regional sales trends, and revenue insights. The dashboard enables businesses to identify key performance metrics, customer segments, and revenue drivers, supporting data-driven decision-making.
dashboard data-analysis data-visualization duckdb sales-analysis sales-dashboard streamlit-dashboard
Last synced: 15 Jan 2025
https://github.com/sakan811/stress-pattern-occurrence-in-english-words
This project is intended to provide English learners with data that allows them to make a data-driven guess when encountering words that they aren't sure where to stress
data-analysis data-visualization english english-language english-learning language powerbi powerbi-report powerbi-visuals
Last synced: 05 Jan 2025
https://github.com/sakan811/find-common-japanese-character-from-news
Showcase visualizations about common Japanese characters that appear in the news
beautifulsoup beautifulsoup4 data-analysis dataanalysis japanese japanese-language language news powerbi requests sqlite sqlite3 visualization webscraper webscraping
Last synced: 05 Jan 2025
https://github.com/raul23/dev-jobs-insights
Data analysis of developer job posts from Stack Overflow
data-analysis data-mining data-visualization python
Last synced: 13 Jan 2025
https://github.com/hrolive/patc-big-data-analytics-bsc
Introduction to the main concepts and technologies related to Big Data and Data Analytics and its applications to real projects.
analytics bias big-data data-analysis hadoop hpc machine-learning mapreduce nosql python spark spark-streaming visualization
Last synced: 04 Jan 2025
https://github.com/sakan811/honkai-star-rail-a-few-fun-insights-with-data-analysis
The project gives insights that delve into the Honkai Star Rail's character's stats of all available characters as of the given date.
data data-analysis data-science data-visualization game honkai honkai-star-rail honkai-starrail webscraping webscraping-data webscraping-selenium
Last synced: 05 Jan 2025
https://github.com/phomint/udacity_free_datawragling_with_mongodb
Udacity Free course to use MongoDB
data-analysis mongodb udacity-course
Last synced: 15 Jan 2025
https://github.com/csoren66/customer-personality-analysis
Predict how different customer segments will respond for a particular product or service.
data-analysis data-visualization python
Last synced: 13 Jan 2025
https://github.com/shrutiijoshi/restaurant-order-analysis
Analyze order data to identify the most and least popular menu items and types of cuisine
analytics data-analysis mysql sql
Last synced: 09 Jan 2025
https://github.com/rafgpereira/obmep-analise
Código que analisa a retrospectiva das premiações da Obmep em determinada localidade e escola
data-analysis excel pandas python
Last synced: 28 Jan 2025
https://github.com/cano1998/data-visualization-project
A project focused on data visualization to explore various aspects of a car dataset. The visualizations provide insights into car performance, efficiency, and characteristics based on different manufacturers and features.
bar-pl bar-plot data-analysis data-visualization histogram jupyter-notebook line-plot
Last synced: 04 Jan 2025
https://github.com/krzysikd/uber_fare_prediction
Predicting uber fares using advanced machine learning models and feature engineering techniques
data-analysis data-processing eda hyperparameter-tuning jupyter machine-learning regression-models
Last synced: 15 Dec 2024
https://github.com/rohithsaji97/toll_gate
This is a electronic toll collection system.
data-analysis digital-image-processing ocr-text-reader opencv python3 trained-models
Last synced: 20 Jan 2025
https://github.com/akmj1011/hill-and-valley-prediction-using-logistic-regression
Created A Prediction System Using Logistic Regression For Figuring Out The Hall And Valley From The Given Datasets
cloud-computing data-analysis data-manipulation data-preprocessing data-transformation data-visualization google-colab
Last synced: 09 Jan 2025
https://github.com/dcs-training/regressionandmixedeffectsmodelling
This course will introduce you to regression and linear mixed-effects models (LMMs). It will help to develop your theoretical understanding and practical skills for running such models in R. Go to the readme file
data-analysis r rmarkdown statistics
Last synced: 07 Jan 2025
https://github.com/dcs-training/datavisualisationwithr2021
Data Visualisation with R Course (delivered by the Centre in October/November 2021). This workshop is focusing on good practice of creating graphs with R and R Studio. Go to the readme file
data-analysis data-visualisation data-wrangling r
Last synced: 07 Jan 2025
https://github.com/dcs-training/data-wrangling-and-vis-pandas
Introduction to analyzing structured data with the Python libraries pandas, for CSV and TSV data, and ElementTree, for XML data. Go to the readme file
data-analysis data-visualisation data-wrangling python
Last synced: 07 Jan 2025
https://github.com/dcs-training/interactive-analysis-reports-with-r-markdown.github.io
This workshop will help you create your own reproducible, customisable, and interactive analysis reports through R Markdown. By building on the basics of R, we will show you how to instantly prepare your results into a ready-made document (No more copy and pasting your results! Less human error!). Go to the readme file
data-analysis data-visualisation data-wrangling r rmarkdown statistics
Last synced: 10 Nov 2024
https://github.com/dcs-training/introtostatistics_2023
This is a repository which contains all the materials to be used in the introduction to statistics course in December 2023. Go to the readme file
data-analysis r rmarkdown statistics
Last synced: 10 Nov 2024
https://github.com/mahmoudnamnam/superstore-analysis
This project explores the SuperStore dataset to uncover insights into sales, profit, and customer behavior. It identifies key trends, regional variations, and product performance, using data analysis and machine learning techniques to guide business strategy and optimize performance.
clustering data-analysis data-science data-visualization geopandas jupyter-notebook machine-learning numpy pandas plotly regression seaborn sklearn
Last synced: 16 Jan 2025
https://github.com/dcs-training/much-ado-about-nothing-missing-data-in-research
Repo for the Much ado about nothing workshop. Go to the Readme file
data-analysis data-cleaning data-wrangling r
Last synced: 07 Jan 2025
https://github.com/matteofasulo/algos
Progetto di Algoritmi e Strutture Dati
algorithms-and-data-structures astar-algorithm data-analysis dijkstra fibonacci-heap haversine pandas python
Last synced: 20 Jan 2025
https://github.com/dcs-training/scottishaccounts
This repo contains various examples of analysis that can be performed on the Statistical Accounts of Scotland dataset. Go to the readme file
data-analysis data-visualisation data-wrangling geographical-data r rmarkdown text-analysis
Last synced: 07 Jan 2025
https://github.com/mehassanhmood/storyboard
data-analysis framer-motion javascript react scrollytelling storyboard tailwind-css vite
Last synced: 21 Jan 2025
https://github.com/matteofasulo/cdc-finf
Project of fundamentals of Computer Science
data-analysis data-science data-visualization numpy pandas python python3
Last synced: 20 Jan 2025
https://github.com/jendives2000/regressions
Performing of a Linear Regression analysis to determine the strength of the relationship between the number of reviews and sales for a retail company.
data-analysis linear-regression pearson-correlation-coefficient regression
Last synced: 26 Jan 2025
https://github.com/nemat-al/multivariate_data_analysis
Tasks for Multivariate Data Analysis Course @ ITMO University
data-analysis multivariate-analysis python
Last synced: 23 Jan 2025
https://github.com/aleskandro/r-hadoop-madreduce-examples
A lot of examples about using R with hadoop for MapReduce with and without libraries as rhadoop/rhipe - [email protected] - Advanced Programming Languages
data-analysis hadoop mapreduce r
Last synced: 27 Dec 2024
https://github.com/koushikphy/kfutils
A common file operation utility
data-analysis data-files data-operations file-operations interpolation numerical-analysis python python-library python-package
Last synced: 13 Jan 2025
https://github.com/yaser-123/energy-consumption-dashboard
A Power BI dashboard to analyze energy consumption for water, gas, and electricity across cities and buildings. Features include interactive charts, drill-down insights, and dynamic filters for easy monitoring and optimization.
dashboard data-analysis data-analytics data-visualization energy-consumption energy-efficiency powerbi
Last synced: 28 Jan 2025
https://github.com/ddihora1604/dataaichemy
This project creates a Generative AI system to produce high-quality synthetic data using minimal real data. It combines VAEs, Tabular GANs, and Copula-based methods to generate realistic and diverse datasets, helping overcome data shortages, privacy issues, and biases.
copula-models data-analysis data-visualization gans-models nextjs python statistical-analysis vae vae-gan
Last synced: 28 Jan 2025
https://github.com/guilherme-marcello/r-data-analysis-piechart
Reading RDS files, processing and presentation in pie charts
data-analysis data-visualization pie-chart r
Last synced: 08 Jan 2025
https://github.com/terrelbrinkley/r-projects
Data Analyst & Visualization Projects
data-analysis data-science data-visualization
Last synced: 08 Jan 2025
https://github.com/sejalkoli/british-airways-web-scraping
Data science virtual internship program by British Airways through Forage!
british-airways data-analysis data-science internship-project internship-task machine-learning present-insights project reporting web-scraping
Last synced: 08 Jan 2025
https://github.com/brownred/python-and-sql
Python and SQL (postgreSQL & mySQL) for data analysis.
data-analysis databases python3 sql
Last synced: 26 Jan 2025
https://github.com/meinhere/dicoding-analisis-data
Submission Analisis Data dengan tema E-Commerce Streamlit App
data-analysis data-mining e-commerce python streamlit
Last synced: 25 Dec 2024
https://github.com/rdrahul123/ecommerce-sales-dashboard
This project focuses on analyzing e-commerce sales data to uncover actionable insights and improve business decision-making. Using interactive dashboards and data analysis techniques, the project evaluates key performance metrics, customer behavior, sales trends, and payment modes across different categories and regions.
data-analysis data-science excel powerbi
Last synced: 28 Jan 2025
https://github.com/sadratehranian/prediction-of-covid-19-diagnosis
Build an algorithm in MATLAB using ML techniques to predict if a person is having COVID-19 or not depending on the existing medical conditions. Further research has been conducted on identifying the most suitable machine learning techniques and increase their prediction accuracy.
covid-19 data-analysis data-science data-visualization machine-learning matlab prediction visualization
Last synced: 22 Jan 2025
https://github.com/prakshal0809/sql-data-analysis-project
This project involves analyzing pizza sales data using SQL to address various data analysis questions, providing essential foundational to advanced SQL knowledge.
Last synced: 28 Jan 2025
https://github.com/elcarrillo/computational_bootcamp_material
Material for a Computational Bootcamp
bootcamp-project computational-physics data-analysis data-visualization jupyter-notebooks
Last synced: 10 Nov 2024
https://github.com/meinhere/ta-pendat
Proyek Akhir Mata Kuliah Penambangan Data - Klasifikasi Trauma Pasien Menggunakan Metode Naive Bayes
data-analysis data-mining python
Last synced: 25 Dec 2024
https://github.com/jihoonerd/national_health_insurance_sharing_service_project
국민건강보험 데이터를 활용한 EDA
data-analysis exploratory-data-analysis health insurance
Last synced: 15 Jan 2025
https://github.com/progdrummer1/cyclistic-data-analysis-in-sql-and-r
Study Case: Cyclistic
Last synced: 13 Dec 2024
https://github.com/jpcadena/ventas-facturas
Ventas con facturas
data data-analysis data-exploration data-extraction data-science excel feature-engineering matplotlib microsoft numpy pandas powerbi product-sales pylint python receipts sales
Last synced: 15 Jan 2025
https://github.com/jpcadena/tweets-classification-frontend
Frontend project for the Classification Tweets project
api axios css data-analysis data-science data-visualization ecuador eslint frontend html insecurity json machine-learning node npm openapi-typescript-generator react tweets-classification twitter typescript
Last synced: 15 Jan 2025
https://github.com/jpcadena/pharmacy-prices-prediction
Prices prediction project for Pharmacy products.
artificial-intelligence data-analysis data-science deep-learning keras machine-learning machine-learning-models neural-network numpy pandas pharmacy prediction price-prediction pylint python scikit-learn supervised-learning tensorflow
Last synced: 15 Jan 2025
https://github.com/jpcadena/cancer-classification
Breast cancer classification project.
cancer-detection classification data-analysis data-science deep-learning imblearn machine-learning neuronal-network numpy pandas pylint python scikit-learn supervised-learning tensorflow
Last synced: 15 Jan 2025
https://github.com/eslamdyab21/weratedogs-twitter-data-analysis
In this challenging project, I do data wrangling processes
csv data-analysis data-wrangling data-wrangling-twitter json-data pandas python twitter udacity-data-analyst-nanodegree
Last synced: 22 Jan 2025
https://github.com/eslamdyab21/apara-data-gui
Custom application for Apara's data wrangling scripts, Technologies used are Qt-designer, PyQt5 for the GUI and Pandas, Numpy for the data work.
csv data data-analysis data-wrangling gui pandas pyqt5-desktop-application qt5-gui
Last synced: 22 Jan 2025
https://github.com/eslamdyab21/data-visualization-using-matplotlib-and-seaborn
This is the last project in the nanodegree udacity program. it's about data visualization.
data data-analysis data-visualization matplotlib pandas python seaborn udacity udacity-data-analyst-nanodegree
Last synced: 22 Jan 2025
https://github.com/eslamdyab21/a-b-test-to-an-e-commerce-website
A/B test to an e-commerce website
csv data-analysis data-science hypothesis-testing pandas python udacity-data-analyst-nanodegree
Last synced: 22 Jan 2025
https://github.com/netcodez/analysing-unicorn-companies---sql
Analysing Unicorn Companies using SQL
data-analysis data-structures database postresql sql
Last synced: 15 Jan 2025
https://github.com/karlyndiary/spotify-excel-dashboard
Data Analysis on the Spotify Dataset using Microsoft Excel and VBA.
charts data-analysis data-cleaning data-visualization excel excel-export excel-vba pivot-tables
Last synced: 28 Jan 2025
https://github.com/karlyndiary/bellabeat-eda
Bellabeat Case Study - Google Data Analytics Capstone using Python.
bellabeat bellabeat-case-study bellabeat-eda bellebeat-data-analysis case-study case-study-analysis data-analysis data-visualization eda python reports
Last synced: 28 Jan 2025
https://github.com/chitranjan806/greyatom_learning_repo
A Collection of Projects, Tasks and Challenges as part of Data Science Masters - Transition Program at GreyAtom.
data-analysis data-science greyatom python3
Last synced: 08 Jan 2025
https://github.com/errea/vet_clinic_database
For this project you need special preparation. As the goal of this project is to solve some performance issue, first we need to introduce those issues. In order to do that, you will populate your database with a significant number of data.
data data-analysis data-structures data-visualization database
Last synced: 15 Jan 2025
https://github.com/greenpau/esqrunner
Run Elasticsearh queries and create metrics based on the result of the queries in Elasticsearch database.
data-analysis elasticsearch query-builder querydsl
Last synced: 26 Jan 2025
https://github.com/teamtigers/echartify
A web application built with .net core 2.2 that has come with the idea of reading the National Election's Data-set of Bangladesh in a fastest possible time and then representing the data-set with different statistical charts.
bangladesh chartjs code-first-migration cross-platform data-analysis data-structures data-visualization dotnet-core election-analysis election-data entity-framework-core materializecss mvc npoi razor-pages
Last synced: 16 Jan 2025
https://github.com/edoaltamura/rotational-ksz-macsis
Repository for suppelementary material from my publication on the rotational kinetic SZ effect in MACSIS
cosmology data-analysis galaxy-clusters high-performance-computing hydrodynamics
Last synced: 05 Jan 2025
https://github.com/lotfiferaga/sig_explore
3d-graphics api data-analysis data-visualization openstreetmap python
Last synced: 15 Jan 2025
https://github.com/giatraskon/sandbox.bio-solutions
Bash scripts replicating the commands from sandbox.bio's interactive bioinformatics tutorials, organized by categories such as Data Exploration, File Formats, Quality Control, and Data Analysis.
bam-files bash bed-files bioinformatics bioinformatics-workflows command-line-tools computational-biology data-analysis data-exploration data-wrangling fasta-files fastq-files file-formats genomic-data quality-control sandbox-bio sandbox-bio-tutorials sequence-alignment unix-shell variant-calling
Last synced: 13 Dec 2024
https://github.com/faizantkhan/python_matplotlib
Matplotlib is a powerful Python library for creating visualizations and plots. It’s widely used for data representation, making complex information more accessible and interpretable. It offers various types of plots, including line graphs, scatter plots, bar charts, histograms, and more
data-analysis data-analytics data-engineering data-science data-visualization deep-learning graphs line machine-learning machine-learning-algorithms matplotlib matplotlib-pyplot matplotlib-python python
Last synced: 15 Jan 2025
https://github.com/faizantkhan/automated-eda
This repository showcases tools for automatic Exploratory Data Analysis (EDA) in Python. These tools help you quickly understand your datasets and generate insightful reports.
automatic automation autoviz data-analysis data-analysis-python data-science data-visualization dtale dtale-library eda exploratory-data-analysis ml pandas pandas-profiling python python-library sweetviz
Last synced: 15 Jan 2025
https://github.com/atharvapathak/rsvp_movies_case_study
SQL queries performed on IMDb database to provide recommendations to RSVP Movies based on insights.
data-analysis data-cleaning data-science imdb-dataset rsvp-movies sql
Last synced: 15 Jan 2025
https://github.com/pablo1785/receipt-rs
Receipt processing backend built with Shuttle.rs, Axum and Azure Form Recognizer API
api-rest axum azure backend cognitive-services computer-vision data-analysis rust shuttle-rs sqlx
Last synced: 22 Jan 2025
https://github.com/alefrp/properties_dbt
A DBT project for analyzing city property data.
data-analysis data-warehouse dbt python sql
Last synced: 13 Jan 2025
https://github.com/bibymaths/python_snippets
A collection of Python scripts for bioinformatics data analysis, including tools for transcription counts, nucleotide composition, and protein sequence evaluation.
amino-acid-scoring bioinformatics data-analysis fasta-generation mathematical-evaluation nucleotide-analysis protein-sequence-analysis transcription-counts
Last synced: 28 Jan 2025
https://github.com/namratagulati/tweets_analysis
This repository focuses on sentiment analysis of Twitter data using Python, Natural Language Processing (NLP), and the Natural Language Toolkit (NLTK). The goal is to extract valuable insights from social media discussions, such as word frequency, hashtag trends, and sentiment patterns.
analysis data-analysis natural-language-processing nlp-machine-learning nltk-corpus nltk-python sentiment-analysis twitter-sentiment-analysis
Last synced: 08 Jan 2025
https://github.com/namratagulati/fraud_detection
This fulfills all the requirements of a fraud detection model developed on linear regression using feature scaling, engineering and testing model with the help of auc-roc curve and others.
data-analysis data-visualization machine-learning machine-learning-algorithms machinelearning-python
Last synced: 08 Jan 2025
https://github.com/vaishnavipaithane/cyclistic-bike-share-analysis-case-study
This capstone project was done as a part of Google Data Analytics Professional Certificate course.
data-analysis r-programming-language rstudio
Last synced: 15 Jan 2025
https://github.com/florence-nyokabi/house-power-consumption
Machine Learning: Exploring Regression Analysis
data-analysis data-cleaning data-science data-visualization feature-engineering jupyter-notebook jupyterlab machine-learning pandas-python regression-analysis regression-models
Last synced: 13 Jan 2025
https://github.com/ahmedkhaled404/data-cleaning-and-eda-layoffs-mysql
This project involves cleaning a dataset containing information about layoffs from companies around the world.
data data-analysis data-cleaning data-preprocessing datacleaning eda exploratory-data-analysis mysql sql
Last synced: 12 Jan 2025
https://github.com/dina-hosny/retail-store-data-modeling-and-analysis-using-datastage
The project implements a star-schema data warehousing flow, then utilize IBM InfoSphere DataStage to develop efficient ETL pipelines to create data marts and perform some analysis on them.
data-analysis datastage datawarehousing etl extract ibm load transform
Last synced: 13 Jan 2025
https://github.com/dina-hosny/sparkify---data-modeling-with-cassandra
Sparkify - Data Modeling with Cassandra - Udacity Data Engineering Expert Track.
cassandra cql data-analysis data-engineering data-modeling data-warehousing etl python
Last synced: 13 Jan 2025