Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-22 00:07:31 UTC
https://github.com/madhuresh2011/telco-customer-churn-analysis-using-python
The analysis primarily investigates factors influencing customer churn, particularly focusing on payment methods and contract types.
csv data-analysis matplotlib numpy pandas pyhton seaborn vizualisation
Last synced: 02 May 2026
https://github.com/ferrangarciarovira/premier-league-betting-analysis
Comprehensive Python analysis of Premier League betting market inefficiencies (2005–2024). Evaluates bookmaker biases, betting strategies, and market efficiency using statistical methods and Monte Carlo simulations.
betting-strategies bias-detection data-analysis market-efficiency monte-carlo-simulation premier-league python sports-analytics
Last synced: 03 May 2026
https://github.com/mirseo/pandas_learning
pandas_learning
data-analysis data-analysis-python data-science data-visualization numpy numpy-example pandas pd python python-3 python3
Last synced: 03 May 2026
https://github.com/anandanraju/youtube-data-api-model
The YouTube Analytics API enables you to generate custom reports containing YouTube Analytics data. The API supports reports for channels and for content owners. Report fields are characterized as either dimensions or metrics
analytics data-analysis data-science metrics model python telemetry youtube youtube-api
Last synced: 03 May 2026
https://github.com/manikantasanjay/time_series_data_analysis_on_stocks
Time series data analysis project on the daily stock prices of four companies (Apple, Microsoft, Google, Amazon) over a span of 5 years.
data-analysis pandas stock time-series time-series-analysis
Last synced: 03 May 2026
https://github.com/theairbend3r/mice-memory-response
Effect of memory on current response in mice using methods from computational neuroscience and machine learning.
computational-neuroscience data-analysis data-science machine-learning neuroscience python
Last synced: 09 Jun 2026
https://github.com/shelton-beep/predicting-gpa-using-lifestyle-factors
Predicting student GPA using lifestyle factors like study habits, sleep, and stress levels. A machine learning model built to help students and educators understand the impact of lifestyle choices on academic performance.
data-analysis data-preprocessing data-science feature-engineering gpa-prediction machine-learning model-interpretability predictive-modeling python regression-analysis student-performance xgboost
Last synced: 04 May 2026
https://github.com/cliffordnwanna/climate_data_analysis_and_visualization
The Climate Data Visualization and Analysis project showcases a comprehensive exploration of African climate data, with an emphasis on identifying patterns and trends in average temperatures across different countries and time periods. It serves as a practical demonstration of my data analysis, data visualization, and problem-solving skills.
data-analysis data-science mathplotlib pandas plotly visualization
Last synced: 04 May 2026
https://github.com/ahmad-ali-rafique/handwritten-digit-recognition-mnist
This project demonstrates a complete pipeline for recognizing handwritten digits using the MNIST dataset. The project is implemented in Python using Jupyter Notebook, and it covers data loading, preprocessing, model training, and performance evaluation of a Fully Connected Neural Network (FCNN).
ai artificial-intelligence data data-analysis datascience deep-learning deep-neural-networks fcnn fully-connected-network machine-learning machine-learning-algorithms ml modeling
Last synced: 09 Jun 2026
https://github.com/kimtth/agent-data-analyst-stream-chainlit
⚡️Chainlit-based Data Analyst Chat Agent (Responses API, Server Sent Events) 📈
agent azure-openai chainlit code-interpreter data-analysis server-sent-events stream-response
Last synced: 09 Jun 2026
https://github.com/gursv/autoworth
Used Car Price Prediction (India)
data-analysis data-analysis-python data-analytics data-cleaning data-preprocessing data-science-projects eda fine-tuning gridsearchcv machine-learning matplotlib-pyplot pandas python3 random-forest-regressor scikit-learn seaborn
Last synced: 05 May 2026
https://github.com/shuddha2021/stellar-candidate-selector
A sophisticated candidate selection algorithm leveraging multi-criteria analysis and machine learning to identify top software engineering candidates. This tool features flexible filtering, score adjustment, and detailed visualizations to streamline the recruitment process.
candidate-selection data-analysis data-visualization machine-learning pandas plotting-in-python python python-data-analysis recruitment scikit-learn
Last synced: 05 May 2026
https://github.com/kiranmayi5/python-projects
A collection of Python projects showcasing skills in data analysis and visualization.
data-analysis data-visualization machine-learning nlp python
Last synced: 05 May 2026
https://github.com/githubuseraccountamazing/the-amari-project
A project in which I attempted to push some of the limits of Stable Diffusion while collecting some data along the way.
ai ai-generated-images bash data-analysis machine-learning stable-diffusion textual-inversion
Last synced: 05 May 2026
https://github.com/kirkalyn13/opensignal_autogenerate_report
Script used to generate results and a summary, including the trends of flagged provinces, from the raw Excel data file.
data-analysis data-science data-visualization matplotlib numpy pandas python
Last synced: 06 May 2026
https://github.com/mrankitgupta/titanic-survival-prediction-93-xgboost
Titanic Survival Prediction Project (93% Accuracy) 🛳️ In this notebook, the goal is to correctly predict whether someone survived the Titanic shipwreck using different machine learning models and hyperparameter tuning.
classification data-analysis data-science data-visualization gradient-boosting kaggle-competition linear-regression logistic-regression machine-learning machine-learning-algorithms ml ml-models nlp prediction predictive-modeling random-forest titanic titanic-kaggle titanic-survival-prediction xgboost
Last synced: 06 May 2026
https://github.com/scarblase/portfolioprojects
A collection of data analysis and business intelligence projects using SQL, Python, and visualization tools to uncover insights from real-world datasets. 🚀📊
csv data-analysis data-engineering data-mining data-science data-visualization matplotlib matplotlib-pyplot pandas python python3 seaborn sql
Last synced: 06 May 2026
https://github.com/eslamdyab21/imdb-data-analysis
This data set contains information about 10,000 movies collected from The Movie Database (TMDb), including user ratings and revenue
data-analysis pandas python udacity-data-analyst-nanodegree
Last synced: 06 May 2026
https://github.com/freebirdscrew/covid-19-data-analysis
Coronavirus data analysis with live data streamed from the website, delivered as a Dash web app.
coronavirus coronavirus-real-time coronavirus-tracking countryinfo covid-19 covid-19-india covid19 covid19-data dash dash-button dashboard-application data data-analysis data-cleaning data-science data-visualization github jupyter pycountry python
Last synced: 07 May 2026
https://github.com/sivas-2/coffee-sales-visualization
This repository contains data visualization scripts and notebooks analyzing coffee sales data from a vending machine, sourced from Kaggle. The visualizations explore sales trends, customer preferences, and product popularity over time.
data data-analysis data-science data-visualization python visualization
Last synced: 07 May 2026
https://github.com/backdoorali/insider-threat-detection-project
Personal data analysis project combining insider threat detection, cybersecurity, and exploratory data analytics. Built for portfolio showcase and practical skills demonstration.
cybersecurity data-analysis data-analysis-excel data-analysis-project data-analyst data-analytics data-visualization eda excel insider-threat jupyter-lab jupyter-notebook matplotlib numbers pandas portfolio-project python python3 threat-detection threat-intelligence
Last synced: 07 May 2026
https://github.com/gorodroz/crypto-tracker
Realtime Bitcoin price tracker using Binance WebSocket and REST API. Logs prices to CSV and supports Pandas for data analysis.
binance bitcoin crypto csv-logger data-analysis pandas python rest-api websocket
Last synced: 07 May 2026
https://github.com/dogan-the-analyst/web_scraping_job_vacancies
data-analysis python web-scraping
Last synced: 07 May 2026
https://github.com/guglielmo/datalab-notebooks
Data analysis at openpolis
data-analysis data-science jupyter-notebooks pandas python3
Last synced: 08 May 2026
https://github.com/iguptashubham/ott-churn-eda-ml
Understanding why customers discontinue their subscriptions is crucial for optimizing the user experience, reducing churn, and maximizing customer lifetime value. This project uses a machine learning model to predict customer churn.
data-analysis data-analysis-project data-science data-science-portfolio data-science-projects data-visualization machine-learning python
Last synced: 08 May 2026
https://github.com/cagandemirmr/google-play-yorum-analizi
I gathered insights by analyzing variables such as the sentiment of reviews, the number of likes on reviews, the company's responses, and user nicknames for My Supermarket Simulator 3D, the most liked game in Turkey in 2024.
bert data-analysis data-science nlp
Last synced: 10 Jun 2026
https://github.com/aekanshd/crazytics-suicidesindia
Basic interpretation of the Suicides in India data-set using R.
data-analysis data-science graph india r suicides
Last synced: 10 Jun 2026
https://github.com/md-emon-hasan/data_analytics_project
Data analytics tasks and solutions, featuring hands-on exercises for data cleaning, visualization, and analysis using Python libraries.
cars-dataset census-data covid19-data data-analysis london-house-price police-data weather-data
Last synced: 08 May 2026
https://github.com/sermonzagoto/data_manipulation_with_pandas
Data Manipulation with Pandas - Part 1
data-analysis data-science jupyter-notebook pandas-python python
Last synced: 09 May 2026
https://github.com/aminzibayi/atfc
Technology forecasting toolkit
data-analysis data-visualization graph technology-forecasting
Last synced: 09 May 2026
https://github.com/akshat0427/python_youtube_history
A set of data science operations performed on YouTube history data.
data-analysis data-science extracting-features
Last synced: 10 Jun 2026
https://github.com/dogan-the-analyst/developer_survey_analysis
Analysis of the 2024 Stack Overflow developer survey. Tools used include Python, Pandas, Matplotlib, and IBM Cognos.
data-analysis data-visualization ibm-cognos-analytics matplotlib pandas python
Last synced: 09 May 2026
https://github.com/blagojeblagojevic/lol_data_analysis
classification data-analysis data-science jupyter-notebook kaggle python3
Last synced: 09 May 2026
https://github.com/mariam-badr-mb/gtc-ml-project2-diabetes-prediction
This project is part of the GTC Machine Learning Program. It demonstrates the end-to-end ML workflow by building a predictive model for diabetes detection
classification-algorithm data-analysis data-visualization diabetes-prediction gridsearchcv hyperparameter-tuning machine-learning python
Last synced: 09 May 2026
https://github.com/sathyasris27/data-analysis-on-adult-smoking-patterns-in-the-uk
The aim of this analysis is to understand the smoking patterns among adults in the UK.
data data-analysis data-visualization python3
Last synced: 09 May 2026
https://github.com/musaibnagani/fraud-detection
End-to-end fraud detection simulation using Python — Phase 1 (SQLite + Rules) and Phase 2 (MSSQL + Velocity/Behavioral Features) with synthetic banking data.
data-analysis fraud-detection fraudulent-transactions mssql mssql-database pandas python sqlite3 time-series
Last synced: 10 May 2026
https://github.com/chayandatta/got_script_manipulation
Game of Thrones Script - String & file manipulation
data-analysis data-science pandas python3
Last synced: 11 May 2026
https://github.com/ahmednasef3/heart-attack-full-eda
Simple EDA for Heart Attack Dataset.
data-analysis data-science data-visualization eda exploratory-data-analysis heartattack matplotlib pandas seaborn
Last synced: 11 May 2026
https://github.com/is-leeroy-jenkins/sherpa
A budget execution & data analysis tool based on Winforms, .NET 6, and written in C# for EPA analysts
budget-management data-analysis data-science data-visualization federal-government
Last synced: 13 May 2026
https://github.com/zpreisler/modules
Python libraries and modules for processing simulation outputs
data-analysis python scripts tensorflow
Last synced: 13 May 2026
https://github.com/reinmagine/eliminating-no-sensor
Contains my project that analyzes air quality sensor data to determine if the NO (Nitric Oxide) sensor in N. Mai, Los Angeles, CA can be removed without affecting data accuracy.
air-quality-sensor colab-notebook cost-optimization data-analysis data-optimization matplotlib-python nitric-oxide pyspark-python python sql
Last synced: 14 Jun 2026
https://github.com/dogan-the-analyst/model_car_warehouse_analysis
This is a SQL project.
Last synced: 15 Jun 2026
https://github.com/ensinho/data-analysis
My repository for data analysis studies in Python.
csv data-analysis graphs python python-documentation
Last synced: 15 Jun 2026
https://github.com/metalwarrior665/actor-results-checker
apify data-analysis json-schema-checker
Last synced: 16 Jun 2026
https://github.com/techshot25/baltimore-911-calls
Analysis of 911 calls provided by the city of Baltimore.
data-analysis data-science decision-tree-classifier logistic-regression machine-learning machine-learning-algorithms statistics
Last synced: 16 Jun 2026
https://github.com/mohnoor94/datasciencefundementalsusingpython
My journey to learn Data Science with Python
data data-analysis data-science data-visualization learning learning-by-doing python python3
Last synced: 19 Jun 2026
https://github.com/souravsuvarna/whatsapp-chat-analyzer-api
The WhatsApp Chat Analyzer API is a public API designed for frontend enthusiasts who are interested in building a WhatsApp Chat Data Visualizer project. Built on FastAPI, it offers an efficient way to process chat data and returns the processed results in JSON format.
api data-analysis data-science fastapi publicapi python
Last synced: 20 Jun 2026
https://github.com/alicankaya192/world-happiness-report-2025
Comprehensive exploratory data analysis (EDA) and visualization of the World Happiness Report 2025. Analyzes global rankings, regional distributions, key happiness factors, and detects wealth-happiness paradox outliers using Python (Pandas, Matplotlib, SciPy).
correlation-analysis data-analysis data-science data-visualization eda exploratory-data-analysis global-happiness happiness-index matplotlib pandas python scipy statistics whr-2025 world-happiness-report
Last synced: 21 Jun 2026
https://github.com/alicankaya192/ai-jobs-market-2025-2026-salaries
🤖 Global AI & LLM jobs market analysis (2025–2026). Salary trends, remote work premiums, top paying skills, and LLM engineering vs traditional AI comparisons. 📈
ai-jobs data-analysis data-science data-visualization eda exploratory-data-analysis generative-ai jobs jupyter-notebook llm-learning market-analysis matplotlib pandas salary-analysis statistics
Last synced: 21 Jun 2026
https://github.com/ituvtu/Data-Science-AB-Testing
This project focuses on conducting A/B testing to evaluate the effectiveness of two marketing campaigns. Using statistical analysis and hypothesis testing, we determine which campaign is more effective in improving conversion rates.
a-b-testing data-analysis data-analysis-python data-mining ipynb jupyter jupyter-notebook python
Last synced: 26 Sep 2025
https://github.com/vimal0156/ruaroa-ai
🧙♂️ Zero-Code Machine Learning Wizard - Transform ideas into intelligent solutions without writing code. AI-powered ML pipeline automation with interactive web interface.
ai-agents ai-assistant artificial-intelligence automated-machine-learning code-generation data-analysis data-science deep-learning jupyter machine-learning machine-learning-pipeline neural-networks no-code openai python scikit-learn streamlit visualization
Last synced: 09 Apr 2026
https://github.com/nafisalawalidris/northwind-traders-sales-analysis
Northwind Traders Sales Analysis project, which analyses sales data for a fictitious company. It utilises the Northwind Database and includes SQL queries to provide insights on employees, products, suppliers and revenue. The project aims to help the company gain valuable information for business decision-making.
business-insights data-analysis database northwind-traders sales sql
Last synced: 07 Aug 2025
https://github.com/simranjeet97/ipl-dataanalysis
Data Analysis performed on IPL Dataset with Data Profiling, Data Pre-Processing, Data Manipulation, and Data Visualization.
artificial-intelligence data-analysis data-manipulation data-mining data-preprocessing data-science data-visualization indian-premier-league-2008-2018 ipl ipl-dataset iplayer python
Last synced: 08 May 2026
https://github.com/prankshaw/election-analytica
Analyzes previous election results for the Haryana Vidhan Sabha, along with other factors, and compares them across various parameters to draw conclusions.
anaconda collection data-analysis data-science data-visualization elections jupyter-notebook python python-3 wrangling
Last synced: 16 May 2026
https://github.com/airscholar/data_analysis_with_ai
A repository showing how to use AI and ChatGPT for Data Analysis with Pandas and Python
chatgpt data-analysis gpt4 openai pandas pandasai python
Last synced: 10 Apr 2026
https://github.com/devexpress-examples/web-forms-pivot-grid-implement-editable-aspxpivotgrid
This example demonstrates how to allow end-users to modify data cell values in Pivot Grid for Web Forms.
asp-net-web-forms data-analysis dotnet pivot-grid pivot-grid-for-web-forms
Last synced: 09 Mar 2026
https://github.com/rayyan9477/multiple-disease-prediction-system
This repository contains a Multiple Disease Prediction System leveraging machine learning techniques for accurate predictions. It utilizes Python, Pandas, Scikit-learn, and Flask for data preprocessing, model building, and web deployment. Explore the project and connect on LinkedIn for collaborations.
data-analysis data-science machine-learning python streamlit
Last synced: 10 Apr 2026
https://github.com/mohamed3nan/udacity
Udacity Data Analysis Nanodegree Program
data-analysis data-visualization numpy pandas python
Last synced: 10 Apr 2026
https://github.com/chen0040/pyspark-advanced-algorithms
Samples of Advanced Algorithms and Data Analysis implemented in pyspark
advanced-algorithms data-analysis map-reduce pyspark
Last synced: 12 Jan 2026
https://github.com/ahmad-ali-rafique/weather-prediction-fcnn
This project demonstrates a complete pipeline for weather prediction using a Fully Connected Neural Network (FCNN). The project is implemented in Python using Jupyter Notebook, and it covers data loading, preprocessing, model training, and performance evaluation.
ai artificial-intelligence data-analysis data-science deep-learning deep-neural-networks fully-connected-network machine-learning machine-learning-algorithms weather-information
Last synced: 28 Aug 2025
https://github.com/Zen204/airbnb-availability
A machine learning model that predicts Airbnb listing availability, utilizing feature engineering and supervised learning techniques to improve guest experience and optimize host management.
binary-classification data-analysis data-preprocessing data-visualization feature-engineering machine-learning matplotlib model-evaluation nlp pandas predictive-modeling python scikit-learn seaborn supervised-learning
Last synced: 02 Apr 2025
https://github.com/nirmit27/book-recommender-system
A book recommendation system built with Flask, using a memory-based, item-based collaborative filtering model.
data-analysis data-science flask python python3 recommender-system render
Last synced: 05 May 2026
https://github.com/umutsevdi/hr-management
HR Management, Analytics and Salary Determination System
analytics data-analysis java java17 postgresql python spring spring-boot vaadin vaadin-flow
Last synced: 10 Apr 2026
https://github.com/rkirlew/workoutrecommendationsdataset
This repository contains a synthetic dataset designed for building personalized workout recommendation models. The data is generated for educational and experimental purposes, allowing users to practice machine learning techniques such as classification, k-NN, and clustering, as well as explore fitness-related data analysis.
classification data-analysis dataset k-nearest-neighbours machine-learning
Last synced: 08 May 2026
https://github.com/virajbhutada/global-universities-success-analysis-powerbi-sql-excel
This capstone project conducts in-depth analysis using Power BI, SQL, and Excel to explore complex dynamics shaping global university success. Integrating data from diverse ranking systems and criteria, our aim is to unravel the factors influencing universities worldwide.
capstone capstoneproject data-analysis data-analytics data-insights data-science data-science-projects data-visualization excel exploratory-data-analysis mece mysql powerbi powerpoint sql
Last synced: 20 Jun 2025
https://github.com/allanotieno254/codsoft
This repository showcases a series of data science projects completed during an internship with CODESOFT. Each project utilizes Python and various machine learning techniques to solve specific problems in data analysis, classification, regression, and predictive modeling.
classification data-analysis data-science feature-engineering machine-learning model-evaluation predictive-modeling python-programming regression
Last synced: 15 May 2025
https://github.com/antonijn/polyfit
Fits a polygon to a given data input
c data-analysis linear-algebra toy
Last synced: 16 Jul 2025
https://github.com/5ekastanx/data-analysis
Extracting data through parsing (for example, hacking-style scraping) with Python, using a variety of functions and methods.
Last synced: 14 Mar 2025
https://github.com/kishlayjeet/zomato-data-exploration
In this project, we will be exploring a dataset containing information on various restaurants and their ratings, location, and other attributes.
data-analysis eda matplotlib numpy pandas zomato-data-exploration
Last synced: 10 Apr 2026
https://github.com/ivanildobarauna-dev/api-to-dataframe
Python library that simplifies obtaining data from API endpoints by converting them directly into Pandas DataFrames. This library offers robust features, including retry strategies for failed requests.
data-analysis data-analytics data-engineering library pypi-packages python
Last synced: 06 Mar 2025
https://github.com/prime-infinity/type-one
Software to visualize and analyze GitHub repos based on certain statistics such as stars, forks and issues
data-analysis data-visualization
Last synced: 03 Feb 2026
https://github.com/happybono/sonatasmooth
Provides three different noise reduction algorithms for smoothing data: Rectangular Averaging, Binomial Median Filtering, and Binomial Averaging. It processes data from one list and displays the results in another list.
algorithms average binomial binomial-coefficient binomial-theorem calibration csharp data-analysis data-calibration dynamic-noise-reduction median noise-algorithms noise-reduction noise-reduction-kernel outliers rectangular-averaging windows-desktop windows-desktop-application windows-forms winforms
Last synced: 30 Oct 2025
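As a rough illustration of two of the smoothing techniques named in the entry above, the sketch below shows rectangular averaging (equal weights) and binomial averaging (weights taken from binomial coefficients). It is not the repository's code, which is a C# Windows Forms application. The function names, the `radius` parameter, and the edge handling (shrinking the window and renormalizing the weights at the ends of the list) are assumptions made for this example. The binomial median variant is omitted.

```python
from math import comb


def rectangular_average(data, radius):
    """Smooth with an equal-weight window of up to 2*radius + 1 points.

    Assumption: near the edges the window simply shrinks to the
    points that exist.
    """
    out = []
    for i in range(len(data)):
        lo = max(0, i - radius)
        hi = min(len(data), i + radius + 1)
        window = data[lo:hi]
        out.append(sum(window) / len(window))
    return out


def binomial_average(data, radius):
    """Smooth with binomial-coefficient weights (e.g. 1, 2, 1 for radius 1).

    Assumption: near the edges, weights that fall outside the list are
    dropped and the remaining weights are renormalized.
    """
    n = 2 * radius
    weights = [comb(n, k) for k in range(n + 1)]
    out = []
    for i in range(len(data)):
        total = 0.0
        weight_sum = 0
        for k, w in enumerate(weights):
            j = i + k - radius  # index of the neighbour this weight applies to
            if 0 <= j < len(data):
                total += w * data[j]
                weight_sum += w
        out.append(total / weight_sum)
    return out


if __name__ == "__main__":
    noisy = [0, 0, 4, 0, 0]  # hypothetical input with a single spike
    print(rectangular_average(noisy, 1))
    print(binomial_average(noisy, 1))
```

Binomial weights put more emphasis on the centre point than a rectangular window does, so a spike is flattened less aggressively while the output still varies smoothly.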
https://github.com/ryanfranklin237/data-visualization-python
A tool that allows you to visualize data from a CSV or Excel file as graphs or charts.
data-analysis data-science data-visualization matplotlib pandas-dataframe python
Last synced: 11 Jun 2026
https://github.com/shrawans007/hotel_customers_sentiments
Sentiment Analysis for a Hotel Based on Customer's Reviews
2018-2019 data-analysis data-analysis-in-excel data-cleaning data-cleaning-and-preprocessing data-visualization excel excel-pivot-tables github hotel-review-sentiments hotel-service ms-excel ms-excel-data-analytics pivot-tables sentiment-analysis tableau tableau-public text-reviews treemap
Last synced: 22 Mar 2025
https://github.com/airdac/sim-telco_customer_churn
Prediction of customer churn with logistic regression in R. Team project from UPC's Master's Degree in Data Science
classification data-analysis data-science logistic-regression r statistical-models upc
Last synced: 28 May 2026
https://github.com/quantitext/quantitext
Official repository for QuantiText applications in the .NET ecosystem.
api aspnet-core csharp data-analysis dotnet-core mvc-architecture
Last synced: 30 Mar 2025
https://github.com/mokeddembillel/student-performance-prediction
Using machine learning to predict a student's final grade
data-analysis data-exploration feature-extraction feature-importance feature-selection linear-regression machine-learning power-bi principal-component-analysis regression spyder student-performance-prediction svm-regressor
Last synced: 15 Mar 2025
https://github.com/faisal-khann/diwali-sales-analysis
The "Diwali Sales Analysis" project aims to analyze the sales data during the Diwali festival period to uncover insights and trends that can help improve marketing strategies and sales performance in the future
csv data-analysis eda jupyter-notebook matplotlib numpy pandas python seaborn
Last synced: 11 Apr 2026
https://github.com/luabagg/worldwide-trends
Worldwide Google Trends visualization and classification
data-analysis data-visualization google-trends trends
Last synced: 03 Feb 2026
https://github.com/karthikmprakash/911-call-dataanalysis
Data Analysis of Emergency (911) Calls: Fire, Traffic, EMS for Montgomery County, PA
911-call-analysis data-analysis data-visualization python3 united-states-data
Last synced: 10 May 2026
https://github.com/renatomaynard/statistical-modeling-and-regression-analysis-life-expectancy
Statistical Modeling and Regression Analysis for Life Expectancy
data-analysis healthcare linear-regression machine-learning predictive-modeling r regression-analysis statistical-models statitics
Last synced: 23 Mar 2025
https://github.com/shadan100/stroke-prediction-analysis
A web-based application to predict whether a patient is likely to have a stroke based on input parameters such as gender, age, various diseases, and smoking status. Each row in the data provides relevant information about the patient.
artificial-intelligence data-analysis data-science django django-framework jupyter-notebook machine-learning matplotlib pandas predictive-modeling python stroke-prediction web-application
Last synced: 08 Mar 2026
https://github.com/agungbudiwirawan/sales-analysis-using-excel-formulas
The objective of this project is to analyze supermarket sales data using formulas in Microsoft Excel.
data-analysis excel excel-formulas microsoft-excel spreadsheet
Last synced: 08 Jan 2026
https://github.com/misszeferino/sql-projects
bigquery data-analysis mysql queries sql sqlite3
Last synced: 29 Jan 2026
https://github.com/gracysapra/r-in-data-science
This repository contains essential guides for data analysis using R, covering topics like data preparation, data reshaping, and data visualization. Each file focuses on fundamental techniques to manipulate, clean, and visualize data effectively using R programming.
data-analysis data-preparation data-reshaping data-science data-visualization data-visualizations ggplot r r-for-data-science
Last synced: 19 Apr 2026
https://github.com/aymane-maghouti/sentiment-analysis-for-jumia-reviews-and-smartphone-price-prediction-system
The project focuses on customer sentiment analysis for Jumia, aiding informed online decisions. It collects and analyzes product comments to determine sentiments and implements a decision-making algorithm. Additionally, it includes a product price prediction system using regression techniques.
beutifulsoup data-analysis data-cleaning data-collection data-preprocessing data-scraping data-visualization eda falsk machine-learning python web-application
Last synced: 18 Apr 2026
https://github.com/bilal-belli/personalacademicdocuments
This repository contains some personal academic assignments, maybe it will help someone!
compilation computer-architecture data-analysis data-structures-and-algorithms database front-end hpc networking operating-systems signal-processing
Last synced: 20 Apr 2026
https://github.com/babak2/synthea-data-analysis
Synthea Data Analysis
data-analysis data-visualization jupyter-notebook jupytext matplotlib numpy pandas python3 seaborn synthea
Last synced: 11 Apr 2026
https://github.com/dcs-training/good-data-visualisation-with-r
Our guide on how we create data visualisations through R. Go to the readme file
data-analysis data-visualisation r rmarkdown
Last synced: 16 Jun 2026
https://github.com/samuelsoaress/python-study-datascience-ia
My data science and AI studies
data-analysis data-crawler data-mining data-science deep-learning machine-learning-algorithms
Last synced: 13 May 2026
https://github.com/vidhi1290/zomato-data-analysis
Zomato Data Analysis - Explore the world of Zomato restaurant data through Python and data analysis. Uncover trends and insights using Pandas for data manipulation and Matplotlib for visualization. Join us in this journey to reveal the hidden stories within the data!
data-analysis data-analysis-python data-science data-visualization dataprocessing machine-learning machine-learning-algorithms matplotlib numpy pandas python scikit-learn zomato-data-analysis
Last synced: 11 Apr 2026
https://github.com/boardgameanalytics/bga-notebooks
Exploratory notebooks using the BGG dataset
data-analysis data-exploration data-visualization ipython-notebook python
Last synced: 28 Jan 2026
https://github.com/savinrazvan/heredity
An AI that assesses the likelihood of genetic traits in individuals using a Bayesian Network to analyze family genetic data, modeling genetic inheritance and mutations to infer probabilities of gene presence and trait expression.
ai bayesian-network biological-data-analysis data-analysis educational-project family-genetics genetic-inheritance genetic-traits heredity mutation-modeling probability-calculation python
Last synced: 27 Feb 2025
https://github.com/easonlai/eda_for_prudential_life_insurance_sample_data
Notebook sample of Exploratory Data Analysis (EDA) for Prudential Life Insurance Sample Data
azure-databricks azuredatabricks data-analysis data-analysis-python data-analytics databricks databricks-notebooks eda exploratory-data-analysis insurance insurance-sample-data jupyter-notebook python python3
Last synced: 14 May 2026
https://github.com/priboy313/pandasflow
A set of custom python modules for friendly workflow on pandas
catboost data-analysis data-science pandas phik python scikit-learn shap
Last synced: 20 Jan 2026
https://github.com/vre-hub/science-projects
VRE example science projects
dark-matter data-analysis docker extreme-universe jupyter-notebook
Last synced: 18 Jan 2026
https://github.com/muhammadhilmyputrarisma/ab-test
Python code for A/B testing on Cookie Cats game data. This project analyzes the impact of moving the first gate from level 30 to level 40 on player retention and game rounds, helping to evaluate if delaying the gate improves player engagement and gameplay experience.
ab-testing cookie-cats data-analysis data-visualization game-analytics python statistics
Last synced: 18 May 2026
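One common way to compare retention between two variants in an A/B test like the one described above is a two-proportion z-test. The sketch below is an illustration only and is not taken from the repository, which may use a different method. The function name and all counts are hypothetical.

```python
from math import erfc, sqrt


def two_proportion_z_test(success_a, n_a, success_b, n_b):
    """Return (z, two-sided p-value) for the difference of two proportions.

    Uses the pooled proportion under the null hypothesis that both
    groups share the same underlying rate.
    """
    p_a = success_a / n_a
    p_b = success_b / n_b
    pooled = (success_a + success_b) / (n_a + n_b)
    std_err = sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    z = (p_a - p_b) / std_err
    # Two-sided p-value from the standard normal: 2 * (1 - Phi(|z|))
    p_value = erfc(abs(z) / sqrt(2))
    return z, p_value


if __name__ == "__main__":
    # Hypothetical counts: retained players out of players in each group.
    z, p = two_proportion_z_test(60, 100, 40, 100)
    print(f"z = {z:.3f}, p = {p:.4f}")
```

A small p-value suggests the difference in retention is unlikely under the null hypothesis of equal rates. The normal approximation assumes reasonably large groups, and a resampling approach such as bootstrapping would be a reasonable alternative.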
https://github.com/bretsw/beds
Bookdown project for an open education resource (OER) book: Becoming Educational Data Scientists
analytics data-analysis data-analytics data-science
Last synced: 31 Mar 2025
https://github.com/robcyberlab/linear-regression-application
🔢Linear Regression Application💻
artificial-intelligence data-analysis data-science data-visualization linear-regression machine-learning python python-programming regression-analysis statistics
Last synced: 31 Mar 2025