Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-07-03 00:07:37 UTC
- JSON Representation
https://github.com/valeriopagliarino/electronics-2021-unito-public
Data analysis and simulations for the course "Electronics laboratory" held at Physics Dep. - University of Turin, 2021
data-analysis electronics physics
Last synced: 27 Mar 2025
https://github.com/anilkumarteegala/aspiration.ai-ml-internship
This repo contains the internship project by Career Launcher.
data-analysis data-science financial internship machine-learning python3 stock-analysis stock-market visualization
Last synced: 05 May 2025
https://github.com/rdrahul123/sales-dashboard
The Sales Analysis Dashboard was developed to provide insights into sales, profits, and product performance across different categories, timeframes, and geographic locations. By leveraging Power BI, the project aimed to transform raw data into actionable visualizations, facilitating better decision-making for stakeholders.
data-analysis data-science data-visualization dax powerbi
Last synced: 06 Jan 2026
https://github.com/vara-co/asteroid-impact-predictions
Group Project using Supervised Learning and Neural Network Models
asteroids data-analysis data-science neural-network prediction-model predictions supervised-learning
Last synced: 04 Oct 2025
https://github.com/enamhasan/analyzing-the-impact-of-recession-on-automobile-sales
Data Analyis and Visualization Dashboard of the Impact of Recession on Automobile Sales
dashboard data-analysis data-science data-visualization pandas plotly plotly-dash python
Last synced: 05 May 2026
https://github.com/17bit0216/machine-learning
All of my data analysis and Machine learning Projects.
analysis data-analysis linearr logistic logisticregression machine-learning python3 random-forest
Last synced: 16 May 2026
https://github.com/nabilshadman/r-data-analysis
A modular R framework for data analysis, with emphasis on data processing and reproducible workflows.
data-analysis data-cleaning data-manipulation data-science descriptive-statistics programming r r-studio statistical-analysis statistical-computing t-test
Last synced: 04 Apr 2025
https://github.com/shoyebmd424/design-and-analysis-algorithm
algorithm daa data-analysis data-structures
Last synced: 16 May 2026
https://github.com/reusjimenez/data-analysis-labs
Casos completos y ejercicios prácticos de análisis de datos. 📊
data-analysis data-visualization jupyter-notebook machine-learning matplotib numpy panel python sklearn
Last synced: 04 Apr 2025
https://github.com/foxriver76/iobroker.intelliflow
Stream data analysis adapter for ioBroker.
data-analysis iobroker machine-learning streaming-data
Last synced: 04 Apr 2025
https://github.com/rishabhraj43/diwali-sales-analysis
A Data Analysis project made in Python
Last synced: 01 May 2026
https://github.com/nafiealhilaly/analyzing-sa-schools-data
A simple python streamlit app to explore and analyze Saudi Arabia schools dataset from data.gov.sa
data-analysis data-visualization eda python streamlit
Last synced: 16 May 2026
https://github.com/quantumudit/sales-statistical-analysis
This project focuses on a statistical analysis (using SQL queries) of various key metrics that impacts the overall sales of a certain fictitious store.
data-analysis postgresql sales-analysis sql statistics
Last synced: 16 May 2026
https://github.com/avikdatta/python_data_docker_files
A repository for docker files for data analysis using Python and Hadoop
data-analysis dockerfile python-docker raspbian spark ubuntu1604
Last synced: 06 May 2025
https://github.com/adityav42/deloitte-forage-virtual-internship
About Submission for Deloitte's STEM Virtual Program on Forage, focusing on data analysis, forensic technology, and cybersecurity.
coding cybersecurity data-analysis deloitte development forage forensics-technology virtual-program
Last synced: 29 Oct 2025
https://github.com/anoni-net/onionoo-fastapi
Semantic/OpenAPI proxy for the Tor Metrics Onionoo API, built with FastAPI for easier integration and automated analysis.
agentic-ai ai-agents data-analysis fastapi network-metrics observability onionoo openapi privacy pydantic python semantic-apis tor tor-metrics
Last synced: 16 May 2026
https://github.com/cuadernin/regex_importance
Un simple ensayo sobre expresiones regulares
clean-code data-analysis data-mining data-science python r regex
Last synced: 05 Apr 2025
https://github.com/evardnk/dataanalyticsportfolio
Собрание моих проектов по аналитике данных
api automation data-analysis etl-pipeline jupyter-notebook jupyterlab kpis numpy pandas pipeline postgresql powerbi python sql visualization
Last synced: 17 Feb 2026
https://github.com/priyanshubiswas-tech/customer-churn-analysis-eda-
This project analyzes customer churn in a telecom company using Python, Pandas, SQL, and data visualization. It identifies key factors like contract type, payment method, and tenure to provide insights for improving retention. The skills gained are applicable in customer retention, user behavior analysis, fraud detection, and HR analytics.
data-analysis data-visualization dataset explo exploratory-data-analysis matplotlib matplotlib-pyplot numpy pandas-dataframe python seaborn seaborn-python
Last synced: 08 May 2026
https://github.com/celineboutinon/drinking-water-for-all
OpenClassrooms Data Analyst 2022-2023 - Projet 8 using PowerBI Desktop
data-analysis data-analytics data-structures data-visualisation database-design database-schema databases mysql-connector-python mysql-workbench power-bi-dashboard python sql
Last synced: 27 Apr 2026
https://github.com/jpcadena/car-sales-etl
ETL process for a Car Sales project.
asyncpg car-sales data-analysis data-engineering data-visualization database etl etl-pipeline postgresql python sqlalchemy
Last synced: 04 May 2026
https://github.com/spacetelescope/jwst_da_roadmap
Roadmap document for the JWST data analysis tools
astronomy data-analysis documentation jwst
Last synced: 17 Feb 2026
https://github.com/bcko/ud-da-datawrangling
Udacity Data Analyst Nanodegree Project : Wrangle and Analyze Data
data-analysis data-analyst-nanodegree data-wrangling python tweepy twitter-api udacity udacity-data-analyst-nanodegree udacity-nanodegree we-rate-dogs
Last synced: 25 Oct 2025
https://github.com/richardwarepam16/hotel_analysis_using_python
Unlocking Insights: Analyzing Hotel Reservation Data to Boost Business Performance
data data-analysis data-visualization hotel-booking hotel-cancellation-solution hotel-management-system jupyter-notebook python python3
Last synced: 02 Jul 2026
https://github.com/eesunmoon/on-device_multimodal_er
[Research - MINES Lab] Multimodal Emotion Recognition for On-device AI
artificial-intelligence data-analysis deep-learning embedded-systems emotion-recognition heart-rate-analysis multimodal-fusion npu on-device python speech-processing speech-recognition tensorflow wearable-devices
Last synced: 08 Feb 2026
https://github.com/pranabdas/suvtools
Python library for analyzing and visualizing SSLS SUV Beamline data.
data-analysis data-visualisation python
Last synced: 07 May 2025
https://github.com/akash1070/data-science-advanced-analytics-virtual-experience-program
The BCG Open-Access Data Science & Advanced Analytics Virtual Experience Program
data-analysis data-science machine-learning-algorithms
Last synced: 16 May 2026
https://github.com/priyanshubiswas-tech/data-analysis-with-python
This repository showcases Python projects completed for a Data Analysis with Python certification, demonstrating skills in data manipulation, visualization, and statistical analysis using libraries like NumPy, Pandas, Matplotlib, Seaborn, and SciPy.
data-analysis demographic-data-analyzer mean-variance-standard-deviation-calculator medical-data-visualizer page-view-time-series-visualizer python scipy-stats sea-level-predictor seaborn
Last synced: 07 May 2025
https://github.com/priyanshubiswas-tech/airflow_dbt_superset_project
End-to-end ITSM data engineering pipeline using PostgreSQL, DBT, Airflow, and Superset. Covers ingestion, cleaning, transformation, orchestration, and visualization, validated across Docker Toolbox and Docker Desktop environments.
apache-airflow apache-superset dags data-analysis dbt docker etl etl-automation etl-pipeline postgresql
Last synced: 07 May 2025
https://github.com/draym/swmanager
Web-app to help you in your daily life raids in SpacesWars thanks to game statistics and data management
dashboard-application data-analysis data-visualization game-data game-utility
Last synced: 19 Jun 2025
https://github.com/drcbeatz/aynm-data
Python scripts for data cleaning and processing for AYNM (Pandas/NumPy/Selenium/AWS Textract)
automation aws-textract csv data-analysis data-cleaning ipynb numpy ocr pandas python reverb selenium shopify webscraping xml
Last synced: 07 Mar 2026
https://github.com/luizassimoes/bachelor-thesis
Bachelor Thesis developed for the completion of the graduation in Electrical Engineering.
5g-networks data-analysis data-visualization python
Last synced: 30 Apr 2026
https://github.com/kushagrakumar04/traffic-accident-analysis
This project analyzes traffic accident data to identify patterns based on road conditions, weather, and time of day. Visual representations of accident hotspots and contributing factors are created to offer a comprehensive understanding of the dynamics involved. The insights from this analysis aim to develop targeted strategy to improve safety.
data-analysis matplotlib pandas visualization
Last synced: 15 May 2026
https://github.com/navdeep-g/data-quality-checker
A comprehensive Python tool for data analysis and data quality
data-analysis data-science pandas python
Last synced: 16 May 2026
https://github.com/abhaysingh71/laptop-price-predictor
Laptop Price Predictor is a Dockerized machine learning project that predicts laptop prices based on specs using ensemble models like Random Forest, XGBoost, and Gradient Boosting.Including Streamlit UI, and full Docker support.
data-analysis data-science deployment docker docker-image ensemble-learning laptop-price-prediction machine-learning-algorithms streamlit xgboost
Last synced: 05 May 2026
https://github.com/atxtechbro/glassdoorwebscraping
"Scraping Glassdoor: A GraphQL Journey" is an advanced data harvesting tool leveraging GraphQL and an API-first strategy to extract and analyze Glassdoor data for business intelligence and predictive analytics.
api-first-approach business-intelligence data-analysis data-harvesting data-mining data-science glassdoor-scraper graphql html machine-learning performance-optimization predictive-analytics python requests-library-python scaleability scraper system-design web-scraping
Last synced: 16 May 2026
https://github.com/thanaphongk37/data-science-and-data-analyst-project
Portfolio Data Analysis and Data Science projects and Data Engineer built using Azure Service, SQL and Python.
apache-superset azure-storage dashboards data-analysis data-science databricks dataengineering datafactory datapipeline powerbi python sisense sql sql-server visualization
Last synced: 11 May 2026
https://github.com/muhozgu/bi_image_classifier
Binary image classifier (dogs vs cats) using a convolutional neural network (CNN)
cnn computer-vision data-analysis data-analytics data-science data-visualization deep-learning keras machine-learning matplotlib numpy pandas pyhton pytorch seaborn tensorflow torchvision
Last synced: 07 Apr 2026
https://github.com/valeriopagliarino/esp2-2021-unito-public
Physics laboratory 2 course (electromagnetism, optics and modern physics)
data-analysis electronics optics physics
Last synced: 22 Jun 2025
https://github.com/luiscib3r/streamlit-examples
Streamlit examples.
data-analysis data-science machine-learning python streamlit
Last synced: 16 May 2026
https://github.com/oguzgn/budget-checker-for-campaign-budget-allocation
This project focuses on modeling campaign performance data for Looker, helping determine which campaigns to scale up or cut back. It aggregates metrics over the last 7 and 30 days, providing actionable insights for budget optimization and performance improvement.
budget-allocation budget-controller budget-management calculated-fields campaign-analytics data-analysis data-modeling looker-studio sql
Last synced: 17 Feb 2026
https://github.com/nafisalawalidris/dr.-semmelweis-and-the-discovery-of-handwashing
Uncover the revolutionary impact of handwashing on mortality rates in healthcare. Explore the story of Dr. Semmelweis and his groundbreaking findings.
data data-analysis handwashing healthcare-analysis medical-breakthrough mortality-rates
Last synced: 13 Jul 2025
https://github.com/cosmoduende/r-uber-trips-analyisis
Explore your activity on Uber with R: How to analyze and visualize your personal data history. Find out how you consume the Uber App using a copy of your data.
analisis-de-data data-analysis data-analytics data-science data-visualisation data-visualization data-viz eda flexdashboard ggmap ggplot2 mobility-as-a-service qmplot r-language r-programming ridesharing uber uber-data visualizacion-de-datos
Last synced: 14 Jul 2025
https://github.com/deepanshkhurana/cloudsimplifier
Simple helper functions to fetch and read data from various formats stored on Amazon AWS S3 Buckets. Most functions are essentially wrapping over cloudyR.
amazon aws cloudyr data-analysis data-fetching data-science package r rpackage s3
Last synced: 20 May 2026
https://github.com/ahmednurabdii/data-analytics-portfolio-superstore
My first portfolio project showcasing data cleaning, analysis, and visualization of Superstore sales data.
data-analysis data-visualization jupyter-notebook matplotlib numpy pandas portfolio-project python sales-analysis scipy seaborn superstore-dataset
Last synced: 07 Apr 2026
https://github.com/naruaika/eruo-data-studio
A powerful yet friendly ETL tool powered by Polars backend
data-analysis data-science desktop-app gnome-desktop gtk4 proof-of-concept python spreadsheet
Last synced: 18 Jul 2025
https://github.com/maheshthedev/twitter-analysis
Analysis on Various Topics with Twitter Data
data-analysis twitter-analysis
Last synced: 18 Jul 2025
https://github.com/vara-co/pandas-challenge
PyCitySchools - Analysis between budget and academic performance in schools
budget-analysis data-analysis jupiter-notebook pandas-dataframe python school-performances
Last synced: 17 May 2026
https://github.com/shubhammohanty680/uber_data_analysis
bigquery data-analysis gcp-compute gcp-project looker-studio mageai python
Last synced: 17 Feb 2026
https://github.com/kumaranand05/suicide-rate-analysis
Analysis of Mortality data of WHO and visualization using Power BI
analytics data-analysis data-visualization mortality-rates powerbi python suicide-dataset suicide-rate
Last synced: 04 May 2026
https://github.com/andryadsm/ibrd-statement-loans
🏦 Project IBRD Statement of Loans (Python, SQL, Excel, Power BI, Tableau)
bank bank-loans dashboard data-analysis data-transformation data-visualization database-management excel finance international-development loans mssqlserver mysql powerbi python sql tableau
Last synced: 08 Apr 2026
https://github.com/thecoderpinar/telecommunication-customer-churn-analysis-and-prediction
📊 This project focuses on customer churn analysis and prediction in the telecommunications sector. Using data analysis, modeling, and predictive techniques, it aims to understand and mitigate customer loss by developing strategies.
churn churn-prediction classification customer data-analysis data-science deep-learning machine-learning neural-network telecom
Last synced: 07 Aug 2025
https://github.com/msthamizh/airbnb_analysis
Developing a Streamlit application enabling users to explore and analyze Airbnb listing data. This application allows users to interactively visualize geospatial distributions of listings, analyze pricing trends, and explore availability patterns across different locations. Integrates MongoDB Atlas for data storage and PowerBi for advanced insights
data-analysis data-cleaning data-visualization json mongodb pandas-dataframe plotly powerbi python streamlit
Last synced: 11 Apr 2026
https://github.com/shridhar1504/foreign-exchange-rate-time-series-datascience-project
This project will use time series analysis to forecast the exchange rate between the euro and the US dollar. The project will use a variety of statistical techniques, such as ARIMA to model the data and forecast the exchange rate.
data-analysis data-preprocessing data-science data-transformation data-visualization eda exploratory-data-analysis foreign-exchange-rates machine-learning model-fitting predictive-modeling python3 time-series time-series-analysis
Last synced: 17 May 2026
https://github.com/al-ghaly/power-bi-dashboard
A dashboard to analyze data specializations job market.
dashboard data-analysis powerbi
Last synced: 02 Feb 2026
https://github.com/khuyentran1401/sample_datapane_script
This repo shows how to use Datapane create a simple script to see the rank of the authors or publications with respect to publishing frequency
data-analysis data-science datapane python
Last synced: 21 May 2026
https://github.com/JovaniPink/excel-powerbi
The folder of my work with Excel, VBA, and PowerBI for Data Analysis & Visualization.
data-analysis data-visualization dax excel excel-vba power-pivot power-query powerbi vba-macros
Last synced: 20 Jul 2025
https://github.com/denko5/sales-analysis
A complete SQL-based sales analysis project covering Africa, showcasing data cleaning, exploratory analysis, insights, and lessons learned. The project highlights sales trends, regional performances, and marketing effectiveness across multiple platforms.
africa data data-analysis data-science exploratory-data-analysis insights kenya sales sql
Last synced: 24 Jan 2026
https://github.com/ghackenberg/kurs-datenanalyse
This repository contains material for my data analysis course. In this course we first introduce the concept of databases and SQL, before diving into OLAP and other data analysis tools.
data-analysis data-structures data-warehouse entity-relationship-diagram etl graph list olap relational-algebra relational-database sql tree
Last synced: 17 Feb 2026
https://github.com/neemiasbsilva/datascience-portfolio
Hello guys, welcome to my Data Science Portfolio. I include some knowledges I earn in my journey. I included some case study, papers, and code. Please check the readme.
case-study churn-prediction code-challenges data-analysis data-science deep-learning forecasting fundamental-of-statistics health-care image-recognition machine-learnin machine-learning math mathematics pattern-recognition portfolio programming-skills speech-emotion-detection statistics voice-activity-detection
Last synced: 14 May 2026
https://github.com/priyanshubiswas-tech/ev-data-analysis-dashboard
An interactive dashboard analyzing EV trends, including total vehicles, BEV vs. PHEV breakdown, model popularity, state-wise distribution, and CAFV eligibility. Visualizes key insights for data-driven decisions in the EV industry. 📊
dashboard data data-analysis data-science data-visualization tableau tableau-public
Last synced: 17 Feb 2026
https://github.com/morphclue/godot-trend
R-Code and data for game engines on itch.io
data-analysis game-engines trends
Last synced: 05 Apr 2025
https://github.com/olekscode/covidanalysis
A setup for COVID-19 data analysis in Pharo
coronavirus covid-19 data-analysis pharo
Last synced: 05 Apr 2025
https://github.com/yash-kavaiya/ai-analytics
This is a Streamlit app that uses Pandas and AI to perform data analytics on uploaded CSV files.
data-analysis generative-ai pandas streamlit
Last synced: 20 Jul 2025
https://github.com/defrecord/value-alignment-toolkit
A comprehensive toolkit for implementing, analyzing, and validating AI value alignment based on Anthropic's 'Values in the Wild' research.
ai anthropic data-analysis ethics privacy python simulation value-alignment
Last synced: 20 Jul 2025
https://github.com/nafisalawalidris/logistic-regression-model-for-breast-cancer-recurrence-prediction
Predicting Breast Cancer Recurrence - A logistic regression model using patient attributes to classify recurrence risk. Dataset analysis and model evaluation. Contributions welcome.
breast-cancer classification-model data-analysis data-science healthcare logistic-regression machine-learning python recurrence-prediction scikit-learn
Last synced: 17 May 2026
https://github.com/chengkangzai/malaysia-pandemic-dashboard
covid-19 data-analysis pandemic-dashboard
Last synced: 29 Mar 2025
https://github.com/nysportsfan/Gun-Violence-in-the-US
This repository contains all the relevant files for my first capstone project as part of the Springboard Data Science Career Track.
data-analysis data-science data-visualization machine-learning python3 statistics
Last synced: 10 May 2025
https://github.com/brain-image-library/py-brain-inventory
Python package that can get data from BIL inventories
brain-image-library data-analysis data-viz
Last synced: 03 Jul 2026
https://github.com/vkbo/osirisanalysis
Matlab toolbox for analysing simulation results from Osiris 3
data-analysis matlab matlab-gui physics-simulation
Last synced: 10 May 2025
https://github.com/ibensusan/wine-properties-assessment
Wine Properties Assessment using Microsoft Excel
data-analysis data-visualization excel
Last synced: 20 Mar 2026
https://github.com/ncasuk/decades-pp
Post processing library for the data from the FAAM aircraft
atmospheric-sciences data-analysis data-processing meteorology science
Last synced: 07 Mar 2026
https://github.com/sharathsphd/coffee_causality
Data-driven analysis of coffee shop sales using correlation, regression, and causal inference. A Jupyter Book project exploring foot traffic, weather patterns, and business analytics.
business-analytics causal-inference correlation data-analysis foot-traffic forecasting github-pages jupyter-notebook machine-learning open-source python regression retail-analytics statistics storytelling time-series visualization weather-analysis
Last synced: 18 May 2026
https://github.com/fbraza/paris_airbnb
Analysis of Paris AirBnB data using R and Shiny
analysis data data-analysis paris-airbnb r shiny
Last synced: 21 Mar 2025
https://github.com/mariam-badr-mb/gtc-land-type-classification
This project develops a machine learning model to classify land cover types in Egypt using Sentinel-2 satellite imagery. The system detects categories such as agriculture, water bodies, urban areas, deserts, roads, and tree cover.
data-analysis data-visualization deep-neural-networks eda machine-learning model-architecture streamlit
Last synced: 12 Jun 2026
https://github.com/devandrenicolas/analise-de-vendas
This project is a comprehensive data analysis tool designed to analyze sales performance data. It includes modules for generating fake sales data, cleaning and preprocessing the data, and performing exploratory data analysis (EDA) with advanced visualizations.
data-analysis data-visualization faker-generator matplotlib pandas python
Last synced: 07 May 2026
https://github.com/adityakumarsingh01/customer-purchase-behaviour-analysis
A data analysis project exploring online consumer behavior and FOMO effects using EDA on survey data.
consumer-behavior data-analysis eda fomo online-shopping python survey-data
Last synced: 25 Apr 2026
https://github.com/nishnash54/recomax---recommendation-platform
P&G Hack - Recommendation platform
data-analysis data-science data-visualization machine-learning prediction-model recommendation-engine
Last synced: 15 Jan 2026
https://github.com/islam-hady9/heart-attack-analysis-prediction-using-ann
Heart Attack Analysis & Prediction using Artificial Neural Network
artificial-neural-networks data-analysis deep-learning eda machine-learning matplotlib numpy pandas python tensorflow
Last synced: 04 Apr 2026
https://github.com/ehopperdietzel/billionaires-analysis
Análisis de la cantidad de billonarios por país. Inspirado en el artículo "Russian Billionaires"
bootstrap data-analysis poisson-distribution prediction
Last synced: 18 May 2026
https://github.com/agustinmusanti/sqlchallenge-4
Desafio de creación de una base de datos SQL para una plataforma de streaming. Incluye DDL, DML y consultas avanzadas.
data-analysis database mysql sql streaming
Last synced: 18 May 2026
https://github.com/anurag-kumar-molankala/data-professional-survey
This Power BI dashboard analyzes survey responses from data professionals, covering key aspects such as salary distribution, job satisfaction, and preferred programming languages. The insights help understand trends in the data industry and what matters most to professionals.
dashboard data-analysis data-visualization dax-measures dax-query demographics etl-process excel-import power-bi salary-analysis sql-server survey-analysis trend-analysis
Last synced: 02 Feb 2026
https://github.com/danhenriquex/final-project-ia
Artificial Intelligence Project - Analysis of sentiments of news that impact the value of shares.
data-analysis machine-learning supervised-learning
Last synced: 25 Jun 2025
https://github.com/hshadman/idp_global_local_conformational_landscapes
PyHeteroMap: A python package that maps and analyzes the conformational ensembles of Intrinsically Disordered Proteins (IDPs) from simulations.
biophysics coarse-grained-molecular-dynamics data-analysis data-science data-visualization intrinsically-disordered-proteins molecular-dynamics molecular-dynamics-simulation monte-carlo-simulation object-oriented-programming polymer-physics protein-design protein-structure proteins python3 sequence-structure
Last synced: 11 May 2026
https://github.com/apache/cloudberry-gpbackup-s3-plugin
S3 plugin for Apache Cloudberry (Incubating) backup utility
ai big-data cloudberry data-analysis data-warehouse database distributed-database gpbackup greenplum mpp olap postgres postgresql s3plugin
Last synced: 12 Sep 2025
https://github.com/aiswarya196/supply-chain-analytics-ai
End-to-End Supply Chain Analytics project using AI tools (n8n, Quadratic) to automate data ingestion, calculate KPIs, and generate business insights.
data-analysis n8n-automation postgres quadratics supabase supply-chain-analytics
Last synced: 18 May 2026
https://github.com/shubhamgoyal575/ecommerce-product-categorization
This project classifies e-commerce products into predefined categories using machine learning. It includes preprocessing steps like stopword removal, punctuation cleaning, and feature extraction. Models, including LSTM, are implemented, and evaluated for better accuracy.
accuracy-score artificial-neural-networks confusion-matrix data-analysis data-cleaning data-preprocessing data-science data-visualization deep-learning exploratory-data-analysis hyperparameter-tuning logistic-regression long-short-term-memory machine-learning machine-learning-algorithms naive-bayes-algorithm natural-language-processing precision-score random-forest-classifier
Last synced: 30 Aug 2025
https://github.com/shubhamgoyal575/spam_detective
This project uses machine learning to classify messages as spam or ham based on text analysis. It includes data preprocessing, feature extraction (TF-IDF), and classification models like Logistic Regression and Naive Bayes for accurate spam detection. Built with Python and Scikit-Learn. 🚀
count-vectorizer data-analysis data-analytics data-cleaning data-preprocessing data-science data-visualization data-wrangling exploratory-data-analysis logistic-regression machine-learning machine-learning-algorithms naive-bayes natural-language-processing spam-detection tfidf-vectorizer
Last synced: 02 Jul 2025
https://github.com/dr-saad-la/r-distilled
R Programming Language distilled
data data-analysis learning programming-language r rlanguage rprogramming statistical-analysis
Last synced: 18 May 2026
https://github.com/dcs-training/interactive-analysis-reports-with-r-markdown.github.io
This workshop will help you create your own reproducible, customisable, and interactive analysis reports through R Markdown. By building on the basics of R, we will show you how to instantly prepare your results into a ready-made document (No more copy and pasting your results! Less human error!). Go to the readme file
data-analysis data-visualisation data-wrangling r rmarkdown statistics
Last synced: 25 Jun 2025
https://github.com/haloapping/pisangijo
Kumpulan library dan framework untuk analisa data, data science, machine learning, deep learning dan masih banyak lagi berbasis bahasa pemrograman Python 🐍.
belajar data-analysis data-science deep-learning forecasting libraries machine-learning perkakas pustaka python3 recommender-system referensi tools
Last synced: 13 Jun 2026
https://github.com/vara-co/sql-challenge
EmployeeSQL "Data modeling, data engineering, and data analysis."
data-analysis data-engineering data-modeling databases employee-database erd erdiagram postgres postgresql schema sql tables
Last synced: 18 May 2026
https://github.com/richardwarepam16/rental_analysis_using_python_and_sql
Maximizing Rental Profits: Data-Driven Strategies for a Movie Rental Store
data-analysis data-analytics python3 rental-management sakila-db sqlite3
Last synced: 18 May 2026
https://github.com/drill-n-bass/data-analysis-projects
Projects related to my Data Analyst path.
analysis data-analysis data-visualization matplotlib matplotlib-pyplot mysql mysql-database numpy pandas pandas-dataframe pandas-library pandas-python python python3 seaborn seaborn-plots static-analysis statistics
Last synced: 07 Apr 2026
https://github.com/kiranmayi5/r-projects
This repository showcases R projects designed to tackle real-world problems through data-driven solutions.
data-analysis exploratory-data-analysis predictive-modeling r statistical-analysis
Last synced: 25 Jun 2025
https://github.com/vara-co/space-missions
Space Missions Over Time (1957-2022): Successes vs Failures, and Rocket Usage
data-analysis data-analysis-python history matplotlib pandas pandas-python space space-race spaceships team-project
Last synced: 18 May 2026
https://github.com/gurpreetkaurjethra/ai-data-visualization-agent
This Streamlit application creates an interactive Data Visualization Assistant that can understand Natural Language Queries and generate appropriate Visualizations using LLMs.
aiagents aichatbot aidevelopment artificial-intelligence data-analysis data-visualization generative-ai llms
Last synced: 25 Jun 2025
https://github.com/ajwad-shaikh/sristi-sanshodh-collect
SRISTI Sanshodh Collect is an Android app for filling out forms. It's been used to collect billions of data points in challenging environments. Contribute and make the world a better place! ✨📋✨ https://docs.opendatakit.org/collect-…
collect data-analysis data-collection javarosa odk opendatakit
Last synced: 04 Apr 2025