Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-23 00:07:29 UTC
- JSON Representation
https://github.com/mariam-badr-mb/gtc-ml-project2-diabetes-prediction
This project is part of the GTC Machine Learning Program. It demonstrates the end-to-end ML workflow by building a predictive model for diabetes detection
classification-algorithm data-analysis data-visualization diabetes-prediction gridsearchcv hyperparameter-tuning machine-learning python
Last synced: 09 May 2026
https://github.com/sivkri/perseus-ms-proteomics-venn
Mass spectrometry Perseus Data analysis
data-analysis mass-spectrometry perseus proteomics proteomics-data proteomics-data-analysis proteomics-data-integration
Last synced: 14 Apr 2026
https://github.com/anthonybench/datapeek
Peek summary of datafile in a succinct, opinionated manner.
Last synced: 02 Mar 2026
https://github.com/mohamedhany99/human-voice-identifier-counter
the application developed in (KIVY) it can identify the users imported into the dataset based on the support vector machine training model it has two features ( Importing new voice - Detection to detect the human voices and count them)
android android-app android-application automation automation-framework data data-analysis data-mining data-science data-visualization datascience kivy kivy-framework machine-learning python
Last synced: 27 Mar 2026
https://github.com/mayankyadav23/air-bnb-data-analysis
Data analysis and insights from NYC Airbnb listings, focusing on key metrics such as host performance, neighborhood trends, pricing, and customer reviews. Comprehensive documentation of ETL processes and analytical methodologies is provided. Perfect for understanding Airbnb dynamics and decision-making in the NYC market.
advanced-excel business-intelligence data-analysis data-analytics data-visualization power-bi ppt
Last synced: 19 Mar 2026
https://github.com/musaibnagani/fraud-detection
End-to-end fraud detection simulation using Python — Phase 1 (SQLite + Rules) and Phase 2 (MSSQL + Velocity/Behavioral Features) with synthetic banking data.
data-analysis fraud-detection fraudulent-transactions mssql mssql-database pandas python sqlite3 time-series
Last synced: 10 May 2026
https://github.com/santiagortiiz/snowflake-data-warehousing
Snowflake University. Snowflake Data Warehousing. Foundamentals
big-data data-analysis data-warehouse olap snowflake
Last synced: 19 Mar 2026
https://github.com/jofaval/pima-indian-diabetes
Data Analysis and Classification of Pima Indian Women's Diabetes in 1988
data-analysis data-science deep-learning google-colab kaggle logistic-regression machine-learning pima-diabetes-data python scikit-learn xgboost
Last synced: 16 Apr 2026
https://github.com/kinshuk-code-1729/data-visualisation-using-python
This Repository consists of several python snippets for creating Two-Dimensional (2D) Graphics
data-analysis data-science data-visualization matplotlib visualization
Last synced: 02 Jun 2026
https://github.com/nomadsdev/financial-trend-analyzer
FinancialTrendAnalyzer helps analyze and visualize sales data to uncover financial trends. It uses Python to calculate total sales, track changes, and generate insightful charts for better decision-making.
business-intelligence data-analysis data-visualization financial-analysis matplotlib numpy pandas python revenue-trends sales-data seaborn time-series-analysis
Last synced: 19 Jan 2026
https://github.com/strampelligiovanni/straklip
An HST pipeline for reducing wide-field imaging observations not specifically designed for High Contrast Imaging analysis. Published in Strampelli et al. 2022.
binaries data-analysis data-reduction direct-imaging exoplanets high-contrast-imaging hst wide-field-surveys
Last synced: 12 Oct 2025
https://github.com/pratik-khose/data-analysis-with-pandasai
PandasAI with Llama3 for Interactive Data Analysis
data-analysis llama3 llma pandasai streamlit visualization
Last synced: 11 May 2026
https://github.com/chayandatta/got_script_manipulation
Game of Thrones Script - String & file manipulation
data-analysis data-science pandas python3
Last synced: 11 May 2026
https://github.com/srinivasrm/graphics_cards_analysis_and_application
In the current project I have extracted graphics card current prices from an authorizer retailer in India and performed analysis
beautifulsoup data-analysis data-science data-visualization etl graphic-card-price-prediction graphics-card graphics-card-analysis heroku-database machine-learning matplotlib pgsql python regression scikit-learn seaborn sql streamlit webapplication
Last synced: 04 Mar 2026
https://github.com/easycris-software/easycris
Professional statistical analysis and RNA-seq for researchers — no coding required
anova bioinformatics data-analysis desktop-app genomics pharmacology research-tools rna-seq statistics tauri
Last synced: 11 May 2026
https://github.com/ahmednasef3/heart-attack-full-eda
Simple EDA for Heart Attack Dataset.
data-analysis data-science data-visualization eda exploratory-data-analysis heartattack matplotlib pandas seaborn
Last synced: 11 May 2026
https://github.com/mgobeaalcoba/matplotlib_y_seaborn
Aquí dejaré trabajos de visualización realizados con ambas librerías de Python.
data-analysis data-science data-visualization dataset matplotlib numpy pandas python seaborn
Last synced: 13 Apr 2026
https://github.com/listiangr/ecommerce_sales_data_analysis
Proyek ini menganalisis data penjualan e-commerce untuk membantu bisnis memahami tren penjualan, performa produk, dan segmen pelanggan. Tujuan utamanya adalah memberikan wawasan yang dapat meningkatkan strategi pemasaran dan pengelolaan produk.
dashboard data-analysis data-cleaning data-collection data-penjualan data-visualization exploratory-data-analysis microsoft-excel
Last synced: 19 Jan 2026
https://github.com/luminati-io/Airbnb-dataset-samples
A sample dataset of over 1000 Airbnb listings, extracted using the Bright Data API, ideal for competitor tracking, brand reputation, and market analysis.
airbnb airbnb-listings api data-analysis datasets web-scraper web-scraper-api web-scraping
Last synced: 09 Apr 2025
https://github.com/thatsinewave/radiosonde-data-analyzer
A web-based tool for visualizing and analyzing radiosonde flight data from log files generated by the Radiosonde Decoder by 9A4AM
data-analysis data-analytics data-visualization good-first-contribution good-first-issue good-first-issues good-first-pr good-first-pr-first-contribution good-first-project good-first-prs good-practices html-css-javascript html-css-js radiosonde radiosonde-hunting radiosondes rtl-sdr sdr sdr-tool thatsinewave
Last synced: 15 Oct 2025
https://github.com/shubham5027/kisanai--the-ultimate-ai-ml-powered-platform-smart-farming-platform
KisanAI – The Ultimate AI/ML-Powered Smart Farming Platform KisanAI leverages AI/ML to optimize farming practices, enhance crop yields, and empower small-scale farmers with data-driven insights.
ai api aws chatbot crm data-analysis deep-learning deplyment farming llm mapping ml nodejs predictive-modeling reactjs supabase sustainability
Last synced: 30 May 2026
https://github.com/robinmillford/analytics_for_fashion_supply_management
This Streamlit dashboard provides a comprehensive analysis of supply chain data, focusing on key metrics such as production volumes, stock levels, order quantities, revenue, manufacturing costs, lead times, shipping costs, transportation routes, risk factors, and sustainability factors
dashboard data-analysis data-visualization streamlit supply-chain-management
Last synced: 07 Sep 2025
https://github.com/kalfasyan/filoma
profiling files, directories, image data
data-analysis profiler validation
Last synced: 05 Apr 2026
https://github.com/vvipjain/hockey-tournament-analysis
Hockey Tournament Analysis
beautifulsoup data data-analysis data-visualization databases pandas pandas-dataframe powerbi python python-library python-script requests-library-python sql sql-server sqlalchemy
Last synced: 27 Jan 2026
https://github.com/ayobami6/tweet-data-analysis
WeRateDogs Tweets Scrape using twitter Api
data-analysis data-science twitter webscraping
Last synced: 31 May 2026
https://github.com/is-leeroy-jenkins/sherpa
A budget execution & data analysis tool based on Winforms, .NET 6, and written in C# for EPA analysts
budget-management data-analysis data-science data-visualization federal-government
Last synced: 13 May 2026
https://github.com/jo-phil/hopkins-statistic
A Python package for computing the Hopkins statistic to assess clustering tendency.
cluster-analysis cluster-tendency clustering data-analysis data-science hopkins-statistic machine-learning python statistical-analysis statistics
Last synced: 13 May 2026
https://github.com/sejalmankar1012/yuvaco_data_analysis_assessment
This assignment involves writing a Python script to calculate the cost of package deliveries based on provided data and a cost grid. The script takes package details such as weight, distance, and delivery type, applies the cost calculation rules, and saves the results in an output file. You can also run the script in Google Colab for convenience.
csv-file-handling data-analysis google-colab package-delivery python python-scripting
Last synced: 29 Apr 2026
https://github.com/olob0/badwords-pt-br
💬 Wordlist com palavrões em pt-BR para análise de dados, filtros, ou texto considerado "evitável"
badword-filter badwords brasil data-analysis filter filter-lists filterlist portugues portuguese text-analysis wordlist
Last synced: 06 Jan 2026
https://github.com/gholamrezadar/favourite-youtube-channels
this program goes through your youtube watch history and sorts channels based how many of their videos you have watched!
data-analysis data-visualization python
Last synced: 16 Jan 2026
https://github.com/lightbridge-ks/zoominterface
A data analysis Shiny app of program Zoom report files.
data-analysis r shiny-apps zoom-class zoom-meetings
Last synced: 01 Jun 2026
https://github.com/meetup-python-grenoble/datasette-workshop
Exploration de données avec Datasette
data-analysis data-science data-visualization datasette exploratory-data-analysis python sql workshop
Last synced: 13 May 2026
https://github.com/john-science/data_science_by_example
Examples of Data Science Tools & Libraries
data-analysis data-science ipython pandas
Last synced: 12 May 2025
https://github.com/arielle0222/data_analysis
📊 Data analysis projects for autonomous driving and smart mobility engineering using Python and SQL.
autonomous-driving composite data-analysis electric-vehicles environmental-data python visualizatoin
Last synced: 30 Apr 2026
https://github.com/solrikk/pictrace-web
PicTraceV2 is a highly efficient image matching platform that leverages computer vision using OpenCV, deep learning with TensorFlow and the ResNet50 model, asynchronous processing with aiohttp, and Selenium for browser automation. PicTraceV2 allows users to upload images directly or provide URLs, quickly scanning a vast database to find image
automation computer-vision data-analysis data-extraction deep-learning image-processing image-search machine-learning natural-language-processing opencv openpyxl pandas python selenium tensorflow web-scraping yandex yandex-api
Last synced: 12 Apr 2026
https://github.com/prajakta1321/kaggle-ai-report-2023
A Report describing the trends in emergence of AI over the years !
data-analysis data-visualization python3
Last synced: 28 Jun 2025
https://github.com/karthikudyawar/passwordometer
To predict the strength of the password
cybersecurity data-analysis data-visualization dataset docker exploratory-data-analysis-eda fastapi jupyter-notebook mongodb password-security password-strength-meter
Last synced: 30 Apr 2026
https://github.com/m-faizan-mahmood/detailed-exploratory-data-analysis-eda-marketing-recomendations.
This project focuses on cleaning, preprocessing, and analyzing data using Pandas and NumPy. Key steps include handling missing values, removing outliers, feature engineering, and exploratory data analysis (EDA). Visualizations with Matplotlib and Seaborn highlight trends in customer spending, campaign performance, and product sales.
big-data data-analysis data-processing data-science eda exploratory-data-analysis numpy pandas python
Last synced: 11 Apr 2026
https://github.com/jxareas/de-zoomcamp-2024
Solutions for @datatalksclub's Data Engineering Zoomcamp 2024.
data-analysis data-engineering data-science database datascience de-zoomcamp docker docker-compose etl etl-pipeline mage-ai orchestration python workflow
Last synced: 09 Apr 2026
https://github.com/mkk-1817/hr-attrition
This project, conducted during my internship at MeriSKILL, focuses on HR Attrition Prediction using advanced Machine Learning models. The initiative includes the development of a dynamic Dashboard and in-depth Analysis to offer actionable insights for proactive human resource strategies.
data-analysis data-science data-visualization jupyter-notebook machine-learning-algorithms powerbi python
Last synced: 03 May 2026
https://github.com/renanmoliveir/analise_de_dados_bikestore_power-bi_atualizan-o
Projeto de análise de dados do banco de dados Bike Store com Power BI.
data-analysis dax-languague powerbi query
Last synced: 15 Mar 2026
https://github.com/nhsdigital/sde_summary_notebooks
Notebooks provided by the Wranglers for users to quickly gain insights on datasets inside the Secure Data Environment (SDE)
data-analysis data-linkage data-quality data-summary metrics statistics
Last synced: 12 Aug 2025
https://github.com/dwidevelopes/database-input-pelanggran-mahasiswa
Menginput data Mahasiswa Yang Melakukan Pelanggran yang siap di data dan di hukum Dan juga siap Terkena Sanksi
aplikasi aplikasi-sekolah data data-analysis database input-method mahasiswa sekolah siswa siswi website
Last synced: 02 May 2026
https://github.com/scarblase/salary-comparison
Submission for the DataCamp Salary Competition(1 level). 🏆
data data-analysis data-science data-visualization engineering python sql structured-data
Last synced: 01 May 2026
https://github.com/nmsby/pca-machine-learning-lab
Principal Component Analysis (PCA) implementation and analysis lab for Machine Learning. Features manual PCA implementation, scikit-learn applications, data compression, and feature extraction with detailed visualizations.
data-analysis dimensionality-reduction jupyter-notebook machine-learning numpy pca python scikit-learn visualization
Last synced: 01 May 2026
https://github.com/cano1998/eda-survival-of-the-titanic
This project focuses on Exploratory Data Analysis (EDA) to identify the key determinants that influenced survival during the infamous Titanic accident.
data-analysis data-cleaning data-preprocessing data-visualization exploratory-data-analysis jupyter-notebook titanic-survival-exploration
Last synced: 21 Jun 2026
https://github.com/helosantosdesousa/analise-previsao-de-rotatividade-ml
Projeto final do Bootcamp Data Girls 2025 que analisa a rotatividade de funcionários usando Machine Learning. Com base no dataset IBM HR Analytics Attrition, o projeto identifica os principais fatores de risco e cria modelos preditivos (SVC e Random Forest) com até 89% de acurácia para antecipar saídas e apoiar decisões estratégicas de RH.
analise-de-dados analise-exploratoria bootcamp ciencia-de-dados colab-notebook dados data data-analysis data-science dataanalytics dataframe eda machine-learning machine-learning-algorithms pandas python random-forest svc
Last synced: 16 Apr 2026
https://github.com/billgewrgoulas/hypothesis-testing-on-37-seasons-of-nba
First assignment for the course Data Mining @CSE.UOI
data-analysis data-science numpy scipy seaborn statistics
Last synced: 01 May 2026
https://github.com/nouman6093/advanced-statistical-models
in this repository i will upload everything i have learned about data science advanced statistical models. there are over 42 statistical models. each of them work on algorithms. and there are over 32 algorithms. each library has its own way of writing such statistical models. after learning i will try to upload as much statistical models as possibl
data data-analysis data-science data-visualization
Last synced: 11 Jun 2026
https://github.com/easonlai/eda_for_prudential_life_insurance_sample_data
Notebook sample of Exploratory Data Analysis (EDA) for Prudential Life Insurance Sample Data
azure-databricks azuredatabricks data-analysis data-analysis-python data-analytics databricks databricks-notebooks eda exploratory-data-analysis insurance insurance-sample-data jupyter-notebook python python3
Last synced: 14 May 2026
https://github.com/aaryan-agr/canadian-energy
This project analyzes Canada's energy trade, focusing on imports, exports, and market trends in the energy sector.
data-analysis data-cleaning data-manipulation data-processing data-science data-vizualisation energy-sector time-series-analysis
Last synced: 10 Jun 2025
https://github.com/dcs-training/null-hypothesis-testing-with-r
This two-class course will focus on developing theoretical and practical skills for null hypothesis testing in R. Go to the readme file
data-analysis data-wrangling r statistics
Last synced: 24 Oct 2025
https://github.com/kentlouisetonino/sw-project-data-analysis
My project for AMA MATH 6200 course.
data-analysis python school-project
Last synced: 28 Feb 2025
https://github.com/film2549/data-analysis-of-a-simulated-marketing-business-case-using-python-sql-and-power-bi
Data Analysis of a Simulated Marketing Business Case Using Python, SQL and Power BI
chulalongkorn computer-engineering computer-science data-analysis data-visualization database marketing nltk-library pandas powerbi pyodbc python simulation sql sqlserver
Last synced: 01 May 2026
https://github.com/gabriela1dc/dashboard-de-analise-de-salarios-na-area-de-dados
Dashboard de análise de salários na área da tecnologia
country data-analysis data-science data-visualization graphics jobs payments python streamlit
Last synced: 09 Apr 2026
https://github.com/ahmad-ali-rafique/weather-prediction-fcnn
This project demonstrates a complete pipeline for weather prediction using a Fully Connected Neural Network (FCNN). The project is implemented in Python using Jupyter Notebook, and it covers data loading, preprocessing, model training, and performance evaluation.
ai artificial-intelligence data-analysis data-science deep-learning deep-neural-networks fully-connected-network machine-learning machine-learning-algorithms weather-information
Last synced: 28 Aug 2025
https://github.com/nafisalawalidris/data-analysis-with-python
This repo features Jupyter Notebook labs for learning data analysis with Python. Explore data acquisition, wrangling, visualization, modeling, and evaluation. Enhance your skills in Python data analysis.
data-acquisition data-analysis data-science data-wrangling exploratory-data-analysis feature-engineering machine-learning model-development model-evaluation-and-refinement pandas
Last synced: 02 May 2026
https://github.com/heiderjeffer/evaluating-rule-offsetting-schemes-for-sustainable-policy-growth-in-modern-democracies
Python Java. Research Proposal RP
artificial-intelligence data-analysis data-collection data-merging python qualitative-data-analysis quantitative-analysis statistical-analysis
Last synced: 09 Jun 2026
https://github.com/2kabhishek/pyramen
Data Analysis for Ramen 🍜💹
csv data data-analysis fun python report
Last synced: 26 Oct 2025
https://github.com/gher-uliege/bluecloud-plankton
Spatial interpolation of plankton data using a neural network
data data-analysis data-visualization neural-network oceanography
Last synced: 30 Mar 2025
https://github.com/fybex/chatgpt-conversations-analysis
Analysis of 89,000 ChatGPT conversations to understand interaction patterns and response behaviors.
chatgpt conversation-analysis data-analysis data-visualization language-analysis prompt-patterns sentiment-analysis
Last synced: 02 May 2026
https://github.com/sustentarea/gs-data-analysis-report-3
📓 Exploring potential associations between childhood undernutrition and the Standardized Precipitation Evapotranspiration Index (SPEI) in Brazilian municipalities (2008–2019)
brazil climate-change data-analysis data-science food-systems global-syndemic ibge malnutrition nutrition obesity r rstats sisvan spei sustainable-eating wasting worldclim
Last synced: 27 Oct 2025
https://github.com/randomshek/Working-With-Excel
Using Excel Power Query and PowerPivot, reorganise the data into a star schema and showcasing reports that can be created by data analysts using DAX formulae and PowerPivot
data-analysis excel power-pivot power-query
Last synced: 20 Jul 2025
https://github.com/gauthamnairvm/trex-app
Text Refinement EXplorer - An EDA tool for text based data.
data-analysis data-visualization groq-api large-language-models llama3 natural-language-processing text2sql
Last synced: 03 May 2026
https://github.com/ferrangarciarovira/premier-league-betting-analysis
Comprehensive Python analysis of Premier League betting market inefficiencies (2005–2024). Evaluates bookmaker biases, betting strategies, and market efficiency using statistical methods and Monte Carlo simulations.
betting-strategies bias-detection data-analysis market-efficiency monte-carlo-simulation premier-league python sports-analytics
Last synced: 03 May 2026
https://github.com/mirseo/pandas_learning
pandas_learning
data-analysis data-analysis-python data-science data-visualization numpy numpy-example pandas pd python python-3 python3
Last synced: 03 May 2026
https://github.com/anandanraju/youtube-data-api-model
The YouTube Analytics API enables you to generate custom reports containing YouTube Analytics data. The API supports reports for channels and for content owners. Report fields are characterized as either dimensions or metrics
analytics data-analysis data-science metrics model python telemetry youtube youtube-api
Last synced: 03 May 2026
https://github.com/manikantasanjay/time_series_data_analysis_on_stocks
Time Series Data Analysis project on Daily Stock Prices of the following companies(Apple, Microsoft, Google, Amazon) for a span of 5 years.
data-analysis pandas stock time-series time-series-analysis
Last synced: 03 May 2026
https://github.com/suhas-005/power-bi-dashboard
Power BI Dashboard Projects
data-analysis data-visualization dataset power-bi-project powerbi
Last synced: 01 Apr 2025
https://github.com/banyc/dfsql
SQL REPL/lib for Data Frames
cli csv data-analysis jsonl ndjson repl sql
Last synced: 31 Jul 2025
https://github.com/zen204/airbnb-availability
A machine learning model that predicts Airbnb listing availability, utilizing feature engineering and supervised learning techniques to improve guest experience and optimize host management.
binary-classification data-analysis data-preprocessing data-visualization feature-engineering machine-learning matplotlib model-evaluation nlp pandas predictive-modeling python scikit-learn seaborn supervised-learning
Last synced: 21 Jan 2026
https://github.com/chouaib-629/customersegmentation
Hadoop-based Customer Segmentation project using the Online Retail Dataset. Implements MapReduce for processing and Python for preprocessing to uncover customer purchasing patterns for targeted marketing.
big-data customer-segmentation data-analysis data-science distributed-computing hadoop hadoop-mapreduce java mapreduce marketing-analytics python
Last synced: 04 May 2026
https://github.com/dannyben/datamix
DSL for manipulating tabular data
csv data data-analysis data-engineering gem ruby tabular-data
Last synced: 31 Jul 2025
https://github.com/vhawk19/ambaan
just wants the average analyst to be happi
data-analysis duckdb-wasm sql vue
Last synced: 01 Mar 2026
https://github.com/robthepcguy/ahk-mouse-heatmap
An AutoHotkey script that records left, right, and middle mouse clicks, logging the date, time, and x and y coordinates. It features automatic GUI updates and generates a visual heatmap via a Python script, accessible from the system tray. This tool is ideal for analyzing user interaction and creating detailed mouse activity maps.
autohotkey-script click-counter click-log click-map data-analysis data-visualization heatmap heatmap-visualization mouse-events mouse-tracking python
Last synced: 01 Apr 2025
https://github.com/incubrain/awesome-maharashtra-data
A collection of datasets specific to Maharashtra, India. WIP
ai artificial-intelligence data data-analysis data-science datasets maharashtra marathi
Last synced: 23 May 2026
https://github.com/prernarohra/quakeguard
QuakeGuard is an innovative project for reducing earthquake intensity and structural damage. It takes a proactive approach to seismic activity, by using complex algorithms and real-time data to improve safety and resilience for people in earthquake-prone areas.
artificial-intelligence backend data-analysis data-science earthquake-intensity final-year-project front-end geology machine-learning open-source python visualization
Last synced: 21 May 2026
https://github.com/chen0040/pyspark-advanced-algorithms
Samples of Advanced Algorithms and Data Analysis implemented in pyspark
advanced-algorithms data-analysis map-reduce pyspark
Last synced: 12 Jan 2026
https://github.com/seekinginfiniteloop/fedcal
A feature-rich Python calendar that enables time series analyses of changes in federal workforce schedules and shifts in executive department funding status.
data-analysis data-science econometrics economic-data economics federal federal-government hr pandas pandas-library pandas-python pydata python
Last synced: 15 Apr 2026
https://github.com/sandergi/designbuildfly
Useful tools made for Design Build Fly at UW, hosted on Glitch so teammates can easily access. Check out our optimization tools here: https://github.com/JPaonaskar/DBF-Optimization
data-analysis inav-blackbox webapp
Last synced: 01 Apr 2025
https://github.com/soumya-thoutam/revenue-and-demand-forecasting-analysis
End-to-end data analysis project with SQL and Power BI for revenue and demand forecasting in bike-sharing.
data-analysis data-visualization powerbi sql sql-server
Last synced: 16 Mar 2025
https://github.com/kimtth/agent-data-analyst-stream-chainlit
⚡️Chainlit-based Data Analyst Chat Agent (Responses API, Server Sent Events) 📈
agent azure-openai chainlit code-interpreter data-analysis server-sent-events stream-response
Last synced: 09 Jun 2026
https://github.com/jinkogule/multi-analyst
O Multi Analyst é uma ferramenta de análise de dados com uma usabilidade simples, que utiliza inteligência artificial para interpretar os resultados das análises realizadas, retornando insights úteis aos usuários.
apriori-algorithm bootstrap css data-analysis django html numpy open-ai pandas python web-application
Last synced: 12 Apr 2026
https://github.com/aryansharma5/data-visualization-and-thorough-analysis
comprehensive guide for data analysis and visualization
data-analysis data-visualization
Last synced: 18 Mar 2025
https://github.com/wizardoftrap/football-team-analytics
This Jupyter notebook, created on Kaggle, analyzes football player and team statistics for the 2024-2025 season. It provides insights into player performance, team metrics, and playing styles across major European leagues using data from the dataset players_data-2024_2025.csv.
data-analysis data-visualization jupyter-notebook pandas python
Last synced: 05 May 2026
https://github.com/kiranmayi5/python-projects
A collection of Python projects showcasing skills in data analysis and visualization.
data-analysis data-visualization machine-learning nlp python
Last synced: 05 May 2026
https://github.com/swarnim1812/crime_project
AI-Driven Crime Forecasting Across Indian States — A pioneering machine learning project that harnesses time series modeling (SARIMAX, Ridge Regression) to uncover patterns and forecast crime trends using real-world multi-state temporal and socio-economic data.
analytics crime-locator crime-prediction data-analysis deep-learning machine-learning prophet-facebook sarimax-model time-series-forecasting
Last synced: 31 Jan 2026
https://github.com/matthewgrosman/messenger-analytics
Project that ingests Facebook Messenger conversations and generates analytics.
analytics data-analysis excel facebook facebook-messenger java mongodb
Last synced: 15 Apr 2025
https://github.com/mirokeimioniemi/classifying-software-pirates
Exploring the factors driving people into software piracy by training two machine learning models to predict whether a person with certain characteristics and sentiments is likely to possess any pirated software or not using a dataset collected via a survey targeting users of music production software.
data-analysis data-science decision-tree-classifier logistic-regression machine-learning piracy python software-piracy survey
Last synced: 06 May 2026
https://github.com/thevinh-ha-1710/rstudio-statistics
This project deeply studies 2 datasets using applied statistics techniques.
applied-statistics data-analysis data-science data-visualization rmarkdown rstudio
Last synced: 31 Jan 2026
https://github.com/rekha0suthar/e-commerce-shopper-s-behaviour-understanding
Understand the online shopper purchasing pattern through Machine learning
data-analysis data-preprocessing data-visualization logistic-regression machine-learning numpy pandas python3 scikit-learn seaborn-plots
Last synced: 12 Apr 2026
https://github.com/mafesan/2021-tfm-code
Revelio: Machine-Learning classifier to identify Bots integrable with GrimoireLab
bot-accounts data-analysis data-analytics data-science grimoirelab machine-learning metrics open-source open-source-community project-health python scikit-learn
Last synced: 15 Apr 2026
https://github.com/rahul-404/full_stack_data_science_with_generative_ai
Welcome to the repository for the course "Full Stack Data Science with Generative AI". This repository is designed to accompany the course and provide resources, exercises, and projects related to the study of data science and generative AI techniques.
data-analysis data-science data-visualization database deep-learning exploratory-data-analysis feature-engineering generative-ai machine-learning nlp python statistics
Last synced: 12 Apr 2026
https://github.com/zpreisler/modules
Python libraries and modules for processing simulation outputs
data-analysis python scripts tensorflow
Last synced: 13 May 2026
https://github.com/akash1070/data-science-virtual-internship-by-anz
Exploratory data analysis and prediction of annual salary for customers from the dataset provided by ANZ.
data-analysis data-science predictive-analytics presentation-slides
Last synced: 24 Mar 2025
https://github.com/monish-nallagondalla/diamondpriceprediction
Diamond Price Prediction is an end-to-end machine learning project that predicts diamond prices based on attributes like carat, cut, color, clarity, and dimensions. It features a Flask web application for real-time predictions and utilizes models such as Linear Regression, Lasso, and Ridge.
data-analysis data-science flask jupyter-notebooks machine-learning predictive-modeling python
Last synced: 06 May 2026
https://github.com/yashodatta15/maven_unicorn_company_challenge
An analysis on Unicorn companies.
data-analysis data-cleaning data-visualization powerbi unicorn-companies
Last synced: 19 Feb 2026
https://github.com/danhenriquex/data-science-project
The main goal of this project was to apply the concepts of data visualization and analysis.
data-analysis data-science numpy pandas python
Last synced: 12 Apr 2026