Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-23 00:07:29 UTC
- JSON Representation
https://github.com/mtholahan/advanced-mysqlquery-tuning-mini-project
Analyzed EuroCup 2016 data with advanced SQL queries. Imported CSV datasets into MySQL, designed schema with match, player, and referee details, and implemented queries covering match outcomes, penalty shootouts, player stats, bookings, substitutions, and referee activity to explore tournament dynamics.
bootcamp data-analysis data-engineering data-modeling database eurocup football mysql queries soccer sports springboard sql
Last synced: 15 May 2026
https://github.com/sotirismos/pattern-recognition-labs
Lab exercises and quizzes for Pattern Recognition course, Auth winter semester 20-21
classification clustering data-analysis machine-learning pattern-recognition
Last synced: 17 Jun 2025
https://github.com/rachelresende/regressaolinear
Este repositório é destinado as aulas de regressão linear que realizei em um curso da Udemy sobre o assunto em 2025. Sendo um curso de reciclagem, pois estudei esse tratamento também em 2020 em um curso de estatística da Alura.
data-analysis data-science linear-regression
Last synced: 11 Sep 2025
https://github.com/haroontrailblazer/user_behavioral_analysis
Social Media User Engagement Analysis Using Power BI
data-analysis data-science data-visualization database powerbi
Last synced: 29 Mar 2025
https://github.com/mainak-97/pizza-sales-analysis-project
Pizza Sales Analysis Project: This project optimizes a pizza restaurant's operations by analyzing demand patterns, revenue, and efficiency, providing insights to enhance profitability, streamline production, and improve customer satisfaction.
business-analytics business-intelligence dashboards data-analysis operations-optimization peak-hours power-bi restaurant-analysis revenue-analysis
Last synced: 06 Jan 2026
https://github.com/parth-jatav/ipl-data-analysis-mentorness
This project uses Power BI to analyze IPL cricket data, featuring dashboards with insights on batting averages, strike rates, and player roles. It identifies the top 11 players and includes navigable pages focused on specific roles like Anchors, Finishers, and All-Rounders.
dashboard data-analysis ipl ipl-dashboard powerbi
Last synced: 07 Mar 2026
https://github.com/alansteinbarth/eksploracyjna-analiza-danych-o-pasazerach-statku-titanic
🔍 Titanic EDA: odkrywanie wzorców przeżywalności przez analizę danych. Profesjonalny projekt z wizualizacjami i insights
analytics csv data-analysis data-science data-visualization dataset eda exploratory-data-analysis jupyter-notebook kaggle machine-learning matplotlib numpy pandas portfolio python seaborn statistics titanic visualization
Last synced: 11 Apr 2026
https://github.com/syarwinaaa09/analyzing-students-mental-health
data-driven exploration into student mental health trends using survey data
csv-dataset data-analysis education jupyter-notebook mental-health-awareness pandas psychology student-mental-health visualization
Last synced: 11 Sep 2025
https://github.com/eesunmoon/genai_cor-recom
[Project] Outfit Coordination Recommender System using KoAlpaca
data-analysis fine-tuning generative-ai huggingface keyword llm numpy pandas python python3 selenium
Last synced: 06 Apr 2026
https://github.com/rahulsm20/trackbyte
A full-stack web application that helps users keep track of their playlist and provides analytics based on their music taste. Built using React, Node.js, Express.js, MySQL and Bootstrap.
bootstrap data-analysis expressjs mysql nodejs reactjs sql
Last synced: 07 Apr 2026
https://github.com/pylena/movies-prediction
This project focuses on clustering movies based on their genres using machine learning techniques. By analyzing genre data, the model groups similar movies together, facilitating recommendations and insights into genre-based patterns.
data-analysis machine-learning render streamlit unsupervised-learning
Last synced: 18 May 2026
https://github.com/judyway2/de-data
A brief analysis on schools ARR data
data-analysis jupyter-notebook
Last synced: 11 May 2025
https://github.com/natanel567/university_machine_learning_project
Machine Learning final project Tel Aviv University
data-analysis jupyter-notebook machine-learning
Last synced: 11 May 2025
https://github.com/victoorv/detection_malwares
L'objectif de ce projet est de développer un classifieur capable de différencier les logiciels malwares des goodwares.
classification data-analysis data-science machine-learning machine-learning-algorithms malware-analysis malware-detection oversampling-algorithms python scikit-learn supervised-learning undersampling-algorithms
Last synced: 28 Apr 2026
https://github.com/prakhar-code/british_airways_review_analysis
Analysis of the British Airways Reviews by Customers, filtered by several different factors such as food, entertainment, services, etc.
data-analysis data-cleaning excel tableau-dashboards tableau-public tableau-visualization
Last synced: 15 Jan 2026
https://github.com/ziaeemehr/neuro_toolbox
Single Header File C++ library for analysis of neurophysiological and simulated data.
data-analysis data-science signal-processing synchronization
Last synced: 21 Jul 2025
https://github.com/rafinha0rafinha/web-analyzer-backend
(Legacy) This is the backend for Mazaoro SARLU's lead magnet "Web Analyzer". This project analyzes websites using Google Lighthouse and returns a detailed report consumed by the frontend.
azure-app-service azure-devops chartjs cicd data-analysis data-science data-visualization express flask hacktoberfest lighthouse numpy sentiment-analysis vader-sentiment-analyzer
Last synced: 10 Apr 2026
https://github.com/mfakhriazhar/stock-price-prediction
Stock prices are highly volatile and influenced by various factors, making accurate prediction a major challenge in investment decisions.
data-analysis data-science deep-learning python recurrent-neural-networks
Last synced: 18 May 2026
https://github.com/spring-0/netflix-media-data-analysis
Exploring and analyzing Netflix data to uncover trends through data visualization and statistical analysis.
Last synced: 27 Mar 2025
https://github.com/jasonsu131/cps188-term-project
A data analysis program developed in C to extract information about diabetic patients across Canada from a governmental spreadsheet available online. The program showcases summaries and averages based on the extracted data.
c data-analysis data-statictics file-reading
Last synced: 28 Mar 2025
https://github.com/velut/thesis-sw
Software and datasets used in the "Cost-effective and Scalable Activity Matching using Crowdsourcing" thesis
bpmn cost crowdflower crowdsourcing data-analysis dataset performance-analysis plotting-algorithms r thesis
Last synced: 19 Jun 2025
https://github.com/mae776569/weratedogs-wrangling
Wrangling WeRateDogs Twitter data to create interesting and trustworthy analyses and visualizations
data-analysis data-science data-visualization tweets twitter-api
Last synced: 25 Jan 2026
https://github.com/simranjeet97/covid-19
Covid-19 Data Analysis and Important Topics to be Covered to get the Impact and Solution.
coronavirus coronavirus-analysis coronavirus-dataset coronavirus-prediction coronavirus-tracking covid-19-data-analysis covid19 covid19-data covid19-india dash dash-app dash-plotly data data-analysis data-science data-science-projects data-visualization python3
Last synced: 18 May 2026
https://github.com/qalita-io/tutorials
Tutorials how to best use Qalita Products
data-analysis data-engineering data-quality data-quality-checks documentation qalita tutorials
Last synced: 17 Jan 2026
https://github.com/mfakhriazhar/ecom-qtt-prediction
In e-commerce, understanding seasonal sales trends and best-selling products is critical to business strategy. However, companies often struggle with predicting sales, determining factors that influence sales (discounts, product categories, locations), and optimizing stock and marketing.
data-analysis data-science data-visualization e-commerce-project eda machine-learning python
Last synced: 19 May 2026
https://github.com/kenwuqianghao/scotiabank-datathon-2023
Code and data analysis done for 2023 Scotiabank Datathon
data-analysis fraud-detection jupyter-notebook python
Last synced: 18 May 2026
https://github.com/yash22222/british-airways-data-science-internship
All 2 Task Assigned By British Airways Data Science Virtual Internship Programme
csv data-analysis data-science data-visualization google-colaboratory jyputer-notebook machine-learning microsoft-excel microsoft-powerpoint python
Last synced: 16 May 2026
https://github.com/yash22222/web-scraping-for-data-analysis-predictive-model-on-customer-data
Utilized web scraping for customer feedback at Air India, conducting robust data analysis, and applying machine learning for predictive modeling. Drove data-driven decisions, enhancing services, and elevating customer satisfaction. Expertise in web scraping, analysis, and predictive modeling for actionable insights.
data-analysis data-preprocessing data-science data-visualization exploratory-data-analysis machine-learning powerbi random-forest-classifier sentiment-analysis tableau web-scraping
Last synced: 30 May 2026
https://github.com/sabdikay/telco-customer-churn-analysis-ibm-dataset
This project explores customer churn trends for a company in California using an IBM dataset. Built in a Jupyter Notebook, it employs pandas, NumPy, matplotlib, seaborn, plotly, and scipy to clean, analyze, and visualize data. Through statistical tests and interactive maps, it uncovers key drivers behind customer cancellations
business-intelligence customer-churn data-analysis data-analysis-python data-visualization exploratory-data-analysis jupyter-noteboook matplotlib numpy pandas plotly predictive-modeling python scipy seaborn statistical-analysis
Last synced: 07 Apr 2026
https://github.com/annaanastasy/classification-project-student-grades
A machine learning project to predict students' academic performance using features like demographics, study habits, and parental involvement, achieving 74% accuracy with the CatBoost model.
catboost-classifier classification data-analysis data-visualization machine-learning-algorithms predictive-modeling
Last synced: 29 Mar 2025
https://github.com/manuelgil/vscode-data-pack
This extension pack includes the essential extensions for data analysts.
data-analysis data-science data-structures data-visualization vscode-extension
Last synced: 07 Apr 2026
https://github.com/prarthana-singh/heart-attack-prediction-model
A Machine Learning model that predicts the risk of a heart attack based on health parameters like cholesterol levels, blood pressure, BMI, smoking habits, and age. Built using Classification models, Scikit-Learn, Pandas, and Python.
classification data-analysis data-science heart-attack-prediction logistic-regression machine-learning numpy pandas python scikit-learn
Last synced: 25 Jun 2025
https://github.com/sparkerdata/hockeyshotmap
Interactive Streamlit app for NHL shot maps & player analysis. Pulls live (or demo) play-by-play data, normalizes rink coordinates, and visualizes shots with context filters (strength, period, player).
data-analysis data-visualization duckdb hockey hockey-analytics ice-hockey nhl nhl-data python sports sports-analytics
Last synced: 18 May 2026
https://github.com/robinmillford/analyzing-e-commerce-transactions---data-cleaning-cohort-analysis-and-sql
In this project, I aimed to analyze the profitability of products in an e-commerce dataset. I performed various SQL queries to extract valuable insights about product profitability, including the identification of the top 5 products with the highest profit margin, and unique combinations of brands and product lines with the highest profitability.
cohort-analysis data-analysis data-visualization excel jupyter-notebook powerbi python3 sql
Last synced: 18 May 2026
https://github.com/ivanayala96/end-to-end-business-intelligence-solution-logistics-financial-performance-dashboard
Project Overview: This project features a comprehensive Power BI solution developed for Ayala's Consultancy. It transforms raw operational data (generated via Python) into a strategic decision-making tool, managing a dataset of $7.71M in total sales and over 2,500 transactions.
anlytics bussines-report bussiness-intelligence data-analysis dax power-bi powerbi python
Last synced: 22 Apr 2026
https://github.com/dacosmicgiant/marketing-sms-analyser
Mini project for R language SEM - V
Last synced: 21 Mar 2025
https://github.com/kammarah/studentdata
I created & deployed a Streamlit app to store, manage & analyze student data. 📊🎓
connection data data-analysis data-visualization deploy deployments libraries python streamlit streamlit-webapp webapp
Last synced: 18 May 2026
https://github.com/stefagnone/moneyball_project
Data-driven analysis inspired by the Moneyball approach, identifying affordable replacements for key Oakland A's players using R and sabermetrics to support cost-effective recruitment.
baseball-statistics data-analysis data-driven-decision-making player-replacement-strategy r-programming sabermetrics sports-analytics
Last synced: 05 Apr 2025
https://github.com/jatin-mehra119/car_price_prediction
Predicting price of the cars using small dataset.
data-analysis data-visualization jupyter-notebook machine-learning python regression-models sklearn sklearn-pipeline
Last synced: 07 Apr 2026
https://github.com/ahadly/sql-data-analytics-project
This repository contains a collection of SQL scripts demonstrating various analytical techniques, such as changes over time, cumulative, performance, data segmentation, part-to-whole analysis.
analytics business-analytics business-intelligence data data-analysis data-analyst data-analytics data-engineering data-science data-scientist database datascience query reporting sql sql-queries sql-query sql-server window-functions window-functions-in-sql
Last synced: 18 May 2026
https://github.com/enayar478/nomad_machine_learning_dash_app
An interactive Machine Learning app built with Dash and Plotly, developed as part of the Data Analytics Bootcamp at Le Wagon Bordeaux. It allows users to visualize data, make real-time predictions, and explore various model insights.
analytics cachetools dash dashboard-application data-analysis data-science deployment gunicorn interactive-visualization machine-learning pandas plotly plotly-dash prediction-model python python3 render scikit-learn web-application
Last synced: 02 Jan 2026
https://github.com/danicaalana/sales-review-sentiment-analysis
This project is a sentiment analysis project using a machine learning model. It analyzes Amazon product reviews to determine whether the sentiment expressed is positive, negative, or neutral using Multinomial Naive Bayes Method.
amazon data-analysis data-science machine-learning naive-bayes python sales-review sentiment-analysis
Last synced: 15 May 2026
https://github.com/hoxo-m/blog
HOXO-M Blog
data-analysis data-science r-package
Last synced: 30 Oct 2025
https://github.com/akash1070/predicting-zomato-restaurant-ratings
Perform extensive Exploratory Data Analysis(EDA) on the Zomato Dataset. Building an appropriate Machine Learning Model that will help various Zomato Restaurants to predict their respective Ratings based on certain features deploy the Machine learning model via Flask
data-analysis extratreesregressor flask linear-regression machine-learning random-forest zomato-bangalore zomato-data-analysis
Last synced: 18 May 2026
https://github.com/sebastianurdaneguibisalaya/enfermedades-fissal
Análisis holístico de atenciones por enfermedades raras, huérfanas y transplantes coberturados por FISSAL en el Perú.
data-analysis data-visualization python
Last synced: 24 Feb 2025
https://github.com/jerinpious/house-price-prediction
This project is a machine learning-based application to predict house prices. A frontend interface has been developed using Streamlit to make the prediction process user-friendly for regular customers. The project is structured
data-analysis data-engineering data-science eda machine-learning pandas python random-forest scikit-learn streamlit
Last synced: 05 Apr 2026
https://github.com/nishumehta/uber-rides-data-analysis
An in-depth analysis of Uber ride data for the year 2016, to uncover patterns in ride behavior, mileage trends, and frequent start locations to generate actionable insights for business decisions.
data-analysis jupyter-notebook matplotlib-pyplot pandas python tableau-dashboards
Last synced: 09 May 2026
https://github.com/sreejabethu/smart-report-analyzer
An AI-powered app to analyze and summarize Excel, CSV, and PDF reports using Hugging Face language models. Built with Streamlit.
data-analysis huggingface llm nlp pdf-analysis python question-answering streamlit summarization
Last synced: 18 May 2026
https://github.com/cowboymrzamo2380/json-to-excel-converter
This repository provides a tool to convert JSON data to Excel format (.xlsx). It allows you to easily transform structured JSON data into a well-organized spreadsheet for better analysis and visualization.
automation-script automation-tools data-analysis data-converter data-export data-formatting data-tools data-visualization excel excel-automation excel-converter excel-tools json json-exporter json-parser json-processing json-to-csv json-to-excel programming-tools spreadsheet-tools
Last synced: 05 Apr 2025
https://github.com/saadhaniftaj/logistic--lasso-regression-data-analysis
Iris dataset analysis with logistic and Lasso regression, using coordinate descent for feature selection and binary classification. Includes preprocessing and data visualizations
data-analysis lasso-regression-model logistic-regression python statistics
Last synced: 18 May 2026
https://github.com/andremenezesds/house_rocket_sales_insights
Data analysis for House Rocket Sales Company database, including insights for sales optimization.
data-analysis data-visualization geopandas git github jupyter-notebook linux numpy pandas powerbi python seaborn seaborn-python streamlit streamlit-webapp ubuntu vscode windows10
Last synced: 21 Jan 2026
https://github.com/bhaveshbhakta/amazon-sales-data-visualization
Amazon Sales Data Visualization
amazon-sales-data-analysis data-analysis data-preprocessing data-visualization machine-learning
Last synced: 18 May 2026
https://github.com/thoratstuti/power-bi-dashboards-for-finance-analysis
Power BI can group and gather information from multiple systems to present the whole picture of business data analytics in one “single view”. It made the staff of the financial institution work in a collective digital platform, where they can compute and share relevant data.
data-analysis data-visualizations excel graph pie-chart powerbi
Last synced: 07 Mar 2026
https://github.com/hcrlau/cyclistic-bike-share-analysis
Google Data Analytics Capstone Project
bigquery cyclistic-bike-share-analysis-case-study data-analysis data-visualization sql tableau
Last synced: 05 Apr 2025
https://github.com/ljadhav25/django-data-analyzer
Django Data Analyzer is a web application built using the Django framework, designed to streamline data analysis tasks. Users can upload CSV files containing data for analysis. The application utilizes the powerful data manipulation capabilities of Python libraries like pandas and numpy to perform various analyses on the uploaded data.
data-analysis data-visualization django-application matplotlib numpy pandas python seaborn
Last synced: 01 Mar 2026
https://github.com/syarwinaaa09/exploring-airbnb-market-trends
a data analysis project exploring NYC Airbnb listings, using data visualization and pandas for price trends, room types, and reviews.
airbnb data-analysis data-science data-visualization jupyter-notebook new-york-city nyc pandas price-analysis reviews room-types
Last synced: 30 Apr 2026
https://github.com/a19xys/dm-csgo_analysis
Analysis to address the most important aspects of the knowledge discovery process from data.
data-analysis data-mining data-science dataset jupyter-notebook python
Last synced: 18 May 2026
https://github.com/daveornedo/ejercicios-machine-learning
Proyecto académico de Machine Learning realizado con Python
algoritmos data-analysis data-visualization diccionario hyperparameter-tuning jupyter-notebook jupyter-notebooks knn lambda-functions lasso list machine-learning phd rstudio
Last synced: 10 Apr 2025
https://github.com/1adityakadam/carnegie-classifications-ancestry-grid
A concise, interactive tool for exploring the historical lineage of U.S. higher education institutions using Carnegie Classification data from 1973–2021.
dash data-analysis html javascript pandas python
Last synced: 25 Jun 2025
https://github.com/andersoncrs/analisis-de-texto-tweets
En este proyecto exploro el análisis de texto de tweets para descubrir tendencias, opiniones y temas relevantes en redes sociales. Usando herramientas de procesamiento de lenguaje natural, convierto grandes volúmenes de mensajes en información clara y visualmente atractiva.
data-analysis data-visualization eda text-mining
Last synced: 21 Jul 2025
https://github.com/BingyanStudio/github-analyzer
锐评一下你都在 GitHub 写了什么
data-analysis github llm reports selfhosted typescript
Last synced: 12 May 2025
https://github.com/nadamarei/data-analyzer
The Qualitative Data Analysis Tool is a powerful Streamlit application designed for researchers to analyze word frequencies in corporate documents. This tool processes PDF reports, identifies target words and their contextually relevant synonyms, and generates comprehensive reports with document statistics, summary analysis, and per-file breakdowns
data-analysis data-visualization python-3 streamlit
Last synced: 18 May 2026
https://github.com/jelhamm/model-ensembles-boosting-in-machine-learning
"This repository contains implementations of Boosting method, popular techniques in Model Ensembles, aimed at improving predictive performance by combining multiple models. by using titanic database."
boosting boosting-algorithms boosting-ensemble boosting-machine data-analysis database-analysis datamining datamining-algorithms jupyter-notebook machine-learning machine-learning-models machine-learning-projects matplotlib-python model-ensemble module numpy-library pandas-library python sklearn-library
Last synced: 16 May 2026
https://github.com/jelhamm/singular-value-decomposition-data-mining
"This repository hosts an implementation of the Singular Value Decomposition (SVD) algorithm tailored for data mining tasks. SVD is utilized for efficient dimensionality reduction, aiding in the extraction of key patterns and features from large and complex datasets."
data-analysis dimension-reduction jyputer-notebook machine-learning matplotlib numpy-library pandas-library preprocessing python scipy-library singular-value-decomposition sklearn-library standardscaler svd svd-matrix-factorisation
Last synced: 18 May 2026
https://github.com/jlee9503/defense-risk-prediction
Build a machine learning pipeline that ingests defense procurement data, identifies high-risk contracts, and visualizes the results in an interactive dashboard.
data-analysis data-visualization exploratory-data-analysis python
Last synced: 25 Jan 2026
https://github.com/mituskillologies/aiml-dypiemr-sep24
Programs conducted at DYPIEMR, Pune in training on AIML during September 2024.
artificial-intelligence data-analysis data-science machine-learning matplotlib neural-network numpy pandas python3
Last synced: 05 Apr 2025
https://github.com/Fisseha-Estifanos/telecom
A showcase repository for a specific telecommunication company. Used to analyze several telecommunication data set features and generate useful insights accordingly. Insights generated could be seen at https://github.com/Fisseha-Estifanos/telecom-visualizer or at https://fisseha-estifanos-telecom-visualizer-home-huxgy0.streamlitapp.com/
data-analysis notebooks-jupyter python visual-studio-code visualization
Last synced: 11 Mar 2025
https://github.com/alvarezekiel19/movie-data-analysis
A Data Science elective activity
data-analysis data-science data-visualization jupyter-notebook python python3
Last synced: 18 May 2026
https://github.com/martachesnova/python-apis
A weather analysis that randomly selects more than 500 cities across the globe, pulls data from the OpenWeatherMap API for each city. Analysis of the weather and perfect vacation spot is viewable on my Jupyter Notebook.
Last synced: 24 Feb 2025
https://github.com/smsraj2001/sds-datathon
A simple data science project/hackathon done as part of SDS course
data-analysis data-analysis-python data-cleaning data-science statistics statistics-for-data-science
Last synced: 16 Jul 2025
https://github.com/gui-sitton/games
Identify patterns that determine whether a game is successful or not. This will allow you to identify potential big winners and plan advertising campaigns.
data data-analysis data-analysis-python data-science data-visualization python
Last synced: 18 May 2026
https://github.com/pyramidheadshark/ai-mirea-sem1p
Completed set of all MIREA AI an DA practices (1 sem.)
beginner-friendly data-analysis data-science jupyter mirea
Last synced: 05 Apr 2025
https://github.com/quocduyenanhnguyen/yelp-analysis
Yelp data analysis of business rating, categories, any trends/patterns, correlation, etc.
csv-to-database data-analysis data-analytics data-visualization database json json-parsing json-to-csv mysql mysql-database mysql-workbench pycharm python python3 restaurant sql tableau tableau-dashboards tableau-public yelp-dataset
Last synced: 27 Jan 2026
https://github.com/ddihora1604/social_media_analysis
A powerful, interactive dashboard for analyzing social media conversations, trends, and network dynamics. This tool allows researchers and analysts to explore patterns in social media data, identify key trends, and detect coordinated behavior.
aiml css data-analysis data-visualization html javascript python
Last synced: 30 Oct 2025
https://github.com/jonnor/acm-2019-dbscan
clustering data-analysis data-science health machine-learning nhanes nutrition
Last synced: 03 Apr 2025
https://github.com/liebsen/overlemon
Overlemon institutional application
data-analysis design devops sysadmin webdev
Last synced: 21 Jul 2025
https://github.com/lavkalsi/tableau-project-stock-market-analysis
The Tableau Project: Stock Market Analysis features a dashboard that combines Descriptive, Diagnostic, Predictive, and Prescriptive analytics to provide insights into stock market trends. Using Python for data processing and an LSTM model for forecasting, this project visualizes historical and predicted stock prices, helping make informed decision.
dashboard data-analysis deep-learning lstm-model python tableau
Last synced: 18 May 2026
https://github.com/riciokzz/mental-health-in-tech-analysis
Analysis of the Mental Health in the Tech Industry.
data-analysis data-engineering data-science exploratory-data-analysis
Last synced: 21 Jul 2025
https://github.com/rathod-shubham/google-data-analytics
Learning a wide range of skills that are useful in everyday life as well as being a data analyst.
data-analysis data-analysis-in-r data-analyst data-analyst-nanodegree data-analytics data-visualization google
Last synced: 03 Feb 2026
https://github.com/adikahnf/Data-analysis-with-Python
data-analysis numpy pandas python streamlit
Last synced: 31 Dec 2025
https://github.com/kevin-rsj/the-substance-sentiment-analysis
Se analiza los comentarios de usuarios de Reddit sobre la película The Substance (2024) usando técnicas de NLP. Se obtuvo un sentiment score promedio de 0.19, y palabras clave como "horror" y "like" destacan entre las opiniones.
data-analysis notebook python sentiment-analysis tableau visualization
Last synced: 19 May 2026
https://github.com/marcogdepinto/olympichistoryanalysis
Python visual analysis of the Olympic Games history. Kaggle gold medal with 15000+ views, 200+ upvotes and 100+ comments.
data-analysis data-science jupyter-notebook olympic-games python seaborn
Last synced: 29 Apr 2026
https://github.com/shrunga92/5g_qos_data_transformation_python
Resource Allocation in 5G Network Service
Last synced: 19 May 2026
https://github.com/first-coding/aidanalyst
AIDAnalyst is an AI-powered data analysis tool that leverages large language models (LLMs) to generate SQL queries from natural language prompts. Upload CSV files, explore the data schema, and retrieve insights with ease. The system ensures error correction in SQL queries, delivering detailed reports and visualizations in a streamlined workflow
data-analysis llm openai prompt-engineering python
Last synced: 19 May 2026
https://github.com/hamzacham/data_set-projet-8
Analyzing a real world data-set with SQL and Python
data-analysis database dataset jupyter-notebook paython sql
Last synced: 19 May 2026
https://github.com/abdoomohamedd/data-science-projects
A collection of data science projects ranging from exploratory data analysis to predictive modeling and clustering. Each project is designed to solve specific problems or explore particular datasets using various data science techniques and tools.
data-analysis data-analysis-python data-cleaning data-science data-visualization machine-learning machine-learning-algorithms
Last synced: 14 May 2025
https://github.com/sakan811/find-common-japanese-character-from-news
Showcase visualizations about common Japanese characters that appear in the news
beautifulsoup beautifulsoup4 data-analysis dataanalysis japanese japanese-language language news powerbi requests sqlite sqlite3 visualization webscraper webscraping
Last synced: 19 May 2026
https://github.com/jyrki69pro/pdf-insight-agent
📄 Extract insights from PDFs effortlessly with this AI-powered summarizer, transforming documents into structured, actionable points.
agent-based-model agentic-ai agentic-workflow agents ai-agent data-analysis finance-management financial-analysis generative-ai langchain langgraph llama3 llm multiagent-systems pdf phidata python toolcalling
Last synced: 11 Apr 2026
https://github.com/touppercase78/salary-prediction-collection
Salary predictions with ML models and analyses on datasets from several other GitHub repos
data-analysis data-visualization datasets machine-learning python3 regression-models
Last synced: 02 May 2026
https://github.com/ramonanf/tc1002s_semanatec
Herramientas computacionales: El arte de la analítica
data-analysis data-visualization jupiter-notebook pandas-python
Last synced: 15 Jun 2025
https://github.com/eco786786/salaries
This analysis explores the factors influencing salaries for data professionals from 2020 to 2024, including job titles, experience levels, remote work ratios, employment types, company locations and sizes. Using data from Kaggle, the project uncovers trends and insights to guide both companies and professionals in the tech industry.
data-analysis git postgresql powerbi
Last synced: 19 May 2026
https://github.com/mimi-netizen/python-and-machine-learning-in-financial-analysis
This comprehensive repository covers financial data analysis using Python and machine learning techniques, including time series modeling, portfolio optimization, risk assessment, credit risk prediction, and deep learning applications in finance.
data-analysis data-science data-visualization finance financial-analysis financial-data financial-modeling
Last synced: 19 May 2026
https://github.com/mysftz/statistics-analysis
A python statistical analysis of a dataset and probability.
data-analysis matplotlib python python3 statistical-analysis
Last synced: 29 Jun 2025
https://github.com/iamsainikhil/data-visualization
Visualization of Web data using Python
data-analysis data-visualization python webscraping
Last synced: 13 Jun 2026
https://github.com/manishbisht/machine-learning
Machine Learning
data-analysis data-mining machine-learning machine-learning-algorithms machinelearning numpy pandas python
Last synced: 13 Apr 2026
https://github.com/jabulente/kruskall-wallis-test
This repository contain project that provides a reusable Python function to perform the Kruskal-Wallis H-test across multiple continuous variables, grouped by a categorical feature
data-analysis data-science eda hypothesis-tests kruskal-wallis kruskals-algorithm scipy-stats statistics
Last synced: 22 Jul 2025
https://github.com/ireneflorez/e_commerce_a_b_test_analysis
website A/B test data analysis
data-analysis jupyter-notebook matplotlib numpy pandas python statsmodels
Last synced: 14 Apr 2026