Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-07-02 00:07:33 UTC
- JSON Representation
https://github.com/amoghkori/working-with-apache-spark-mllib
Implemented Apache Spark MLLib to analyze a large car dataset, predict car selling prices, and gain insights into the car market.
amazon-web-services data-analysis data-visualization exploratory-data-analysis linear-regression machine-learning model-selection pyspark python random-forest sagemaker spark
Last synced: 13 Apr 2026
https://github.com/sun-lab-nbb/sl-shared-assets
A Python library that stores assets shared between multiple Sun (NeuroAI) lab data acquisition and processing repositories.
data-analysis data-collection data-processing experiment sunlab
Last synced: 10 Mar 2026
https://github.com/ireneflorez/nypd-mvc
Analysis of NYPD Motor Vehicle Collisions
basemap data-analysis folium jupyter-notebook matplot pandas python
Last synced: 08 May 2026
https://github.com/first-coding/smart_analysis
Smart Analysis is an AI-powered data analysis tool that leverages large language models (LLMs) to generate SQL queries from natural language prompts. Upload CSV files, explore the data schema, and retrieve insights with ease. The system ensures error correction in SQL queries, delivering detailed reports and visualizations in a streamlined workflow
data-analysis llm openai prompt-engineering python
Last synced: 08 Mar 2025
https://github.com/benami171/ml_knn_decision-trees
A ml implementation comparing Decision Trees and k-Nearest Neighbors (k-NN) algorithms for Iris flower classification. Features comprehensive analysis of different approaches including brute-force and entropy-based decision trees, along with k-NN using multiple distance metrics.
classification cross-validation data-analysis decision-trees iris-dataset k-nearest-neighbours machine-learning nearest-neighbors python
Last synced: 30 Jun 2025
https://github.com/deliprofesor/joblocationmapper
JobLocationMapper is a Python tool that visualizes job listings on an interactive map. It uses city and state data to place job markers accurately and color-codes them by occupation (Software, Marketing, Design). The map clusters markers for better organization, and users can click on them to view job details.
clustrered-markers data-analysis data-visualization folium geocoding geographical-visualization interactive-map job-listings map-visualization pandas python
Last synced: 14 May 2026
https://github.com/vedantshi/coffee-sales-dashboard
This project analyzes coffee sales data using Excel, featuring data cleaning, trend analysis, and an interactive dashboard. Key insights highlight top-performing products, regional sales trends, and seasonal patterns. Recommendations focus on marketing strategies and inventory optimization. Future plans include Power BI integration for visuals.
business-insights data-analysis data-visualization excel-dashboard pivot-tables sales-trends
Last synced: 05 Jan 2026
https://github.com/harshindcoder/salifort_motors_project
This people analytics project analyzes factors influencing employee turnover and predicts whether an employee is likely to leave. It aims to uncover patterns behind departures, helping Salifort improve retention, workplace culture, and professional growth strategies.
data-analysis data-science data-visualization hr-analytics machine-learning tree-models
Last synced: 02 May 2026
https://github.com/chanmeng666/advanced-neural-network-applications
Practical implementations of perceptron and linear neuron models for classification and regression, with mathematical analysis and visualizations in Jupyter notebooks.
classification data-analysis data-science educational gradient-descent jupyter-notebook linear-neuron machine-learning matplotlib neural-network neural-networks numpy perceptron python regression
Last synced: 03 May 2026
https://github.com/ljadhav25/data-engineering-poc
This repository contains a beginner-level Data Engineering Proof of Concept (POC) project designed for practice. The objective is to provide hands-on experience with data engineering concepts, including data extraction, transformation, loading (ETL), and basic data analysis. This project is ideal for those looking to build foundational skills in da
data-analysis etl matplotlib numpy pandas python
Last synced: 13 Apr 2026
https://github.com/mohit01chugh/edu_sql_analysis
SQL queries used to analyze student data.
data-analysis database education plpgsql postgresql sql
Last synced: 17 May 2026
https://github.com/DCS-training/IntroToStatistics
This is a repository which contains all the materials to be used in the introduction to statistics course. Go to the readme file
data-analysis r rmarkdown statistics
Last synced: 25 Apr 2025
https://github.com/jedrzej-wydra/data-analysis-pro
Professional Data Analyst Exam by DataCamp
Last synced: 23 Mar 2025
https://github.com/siddharthbadal/kpmgdataanalysisproject
Data Analytics Consulting Virtual Internship
data-analysis data-cleaning data-visualization googlestudio msexcel powerpoint
Last synced: 05 Jan 2026
https://github.com/damiieibikun/web-scrapping-and-python-data-visualization-on-top-500-movies-imdb
Web Scrapping and Python Data visualization on Top 500 movies IMDb
beautifulsoup4 data-analysis data-visualization matplotlib-pyplot numpy pandas plotly-express python requests seaborn web-scraping
Last synced: 13 Apr 2026
https://github.com/haonamnguyen/costumer-shopping-trends-analysis
This project analyzes a synthetic dataset of customer shopping behavior to see key trends and insights. Using SQL and Tableau, the analysis focuses on customer demographics, purchase patterns, and preferences, including age distribution, payment methods, shipping types, and top product categories.
data-analysis data-visualization sql tableau
Last synced: 05 Jan 2026
https://github.com/adithya17-star/ai-powered-fraud-detection
An AI-powered fraud detection system using machine learning algorithms to identify suspicious transactions and provide interactive visualizations for financial security.
dashboard-visualization data-analysis finance-technology fintech flask fraud-detection machine-learning python security transaction-monitoring
Last synced: 02 May 2026
https://github.com/omnipotence-eth/manufacturing-quality-analytics
SQL + Python pipeline for semiconductor NCR analysis — supplier performance, defect Pareto, yield trends
analytics data-analysis etl manufacturing matplotlib pandas postgresql python quality sql
Last synced: 11 Apr 2026
https://github.com/lucas54neves/financial-organizer
Financial organizer using Streamlit
data-analysis data-science financial-organizer plotly python streamlit
Last synced: 02 May 2026
https://github.com/dpbm/diabetes-analysis
simple diabete analysis with python
analysis data-analysis data-science data-science-projects data-set diabetes-detection diabetes-prediction machine-learning pandas python
Last synced: 11 Apr 2026
https://github.com/malucor/livros
Programa em Python para fazer uma análise de dados sobre livros, a partir de um arquivo Excel.
analise-de-dados book books bookshelf data-analysis ipynb jupyter-notebook livro livros python
Last synced: 16 May 2026
https://github.com/aksoni07/movie-recommendation
A hybrid movie recommendation system designed to deliver personalized and accurate suggestions by combining user preferences, item attributes, and collaborative patterns, ensuring a seamless and engaging experience.
clustering content-based-filtering data-analysis embeddings jupyter-notebook numpy ollaborative-filtering pandas personalization python recommendation-systems scikit-learn user-item-interactions
Last synced: 11 Apr 2026
https://github.com/27ahmad/ibm-data-science-capstone
The Capstone is the final course in the IBM Data Science Professional Certificate program. It's a project that combines all the skills and knowledge you've gained throughout the specialization.
data-analysis data-science folium-maps machine-learning plotly-dash python sql
Last synced: 26 May 2026
https://github.com/mr-chang95/udacity_movie_project
Movie Data Analysis and Visualization Project for Udacity's Data Analyst Program. Using Python in Jupyter Notebook.
data-analysis data-visualization jupyter-notebook movie python
Last synced: 13 Apr 2026
https://github.com/pinedah/sleep-data-analysis-exercise
Análisis de un dataset médico sobre el sueño, explorando duración, calidad y factores relacionados. Incluye limpieza de datos, EDA y visualizaciones con Python (pandas, numpy, matplotlib, seaborn, scipy).
data-analysis data-science escom numpy pandas python school-project scipy
Last synced: 13 Apr 2026
https://github.com/archanakokate/eda_amazon_products_and_discounts_2023
Exploratory Data Analysis (EDA) on Amazon's 2023 Products and Discounts data
data-analysis data-mining data-visualization exploratory-data-analysis
Last synced: 03 Jan 2026
https://github.com/delabrov/jwstoolkit
A python package for handling JWST observations
astronomy-astrophysics data-analysis data-cube data-visualization imagery jwst python3 spectroscopic-data
Last synced: 26 May 2026
https://github.com/felinjob/ibm-applied-data-science-capstone
Este projeto, parte da especialização IBM Data Science Professional Certificate, prevê o sucesso do pouso do Falcon 9 da SpaceX. Usando dados da API da SpaceX e Web Scraping, o projeto inclui análise de dados e Machine Learning para gerar insights sobre os lançamentos.
data-analysis data-science data-visualization ibm jupyter-notebook machine-learning numpy pandas python scikit-learn seaborn sql
Last synced: 11 Apr 2026
https://github.com/ascender1729/leetcode_scraper
Extract topic tags from LeetCode problems to streamline interview preparation.
beautifulsoup coding-interview data-analysis graphql leetcode python scraper web-scraping
Last synced: 20 Jun 2026
https://github.com/busradeveci/student-performance-prediction
A machine learning project to predict student exam performance based on academic, social, and personal features. Built with Python and scikit-learn.
data-analysis kaggle linear-regression machine-learning predictive-modeling python scikit-learn student-performance
Last synced: 25 Apr 2025
https://github.com/whoprashant7/querying-a-large-relational-database-using-ms-sql
Analysing data using Ms Sql Server
data-analysis ms-sql-server sql
Last synced: 05 Jul 2025
https://github.com/amishidesai04/emergency-calls-data-analysis-project
Welcome to the Emergency Calls Data Analysis project repository. This project is dedicated to extracting, processing, and visualizing data from the "Emergency – 911 Calls, Montgomery County" dataset, sourced from Kaggle. The main objective is to analyze trends in emergency calls in Montgomery County, Pennsylvania, spanning multiple years.
analysis data-analysis data-extraction data-processing data-science data-visualization numpy pandas python seaborn
Last synced: 02 May 2026
https://github.com/danpoynor/python-number-guessing-game-with-stats
A number guessing game written in Python 3 that presents median, mode, and mean statistics
console-game data-analysis number-guessing-game python3 statistics
Last synced: 26 May 2026
https://github.com/kittonn/data-analysis-freecodecamp
freecodecamp - data analysis projects.
Last synced: 05 Apr 2025
https://github.com/anthonytlei/graphsql
Lightweight SQL-to-GraphQL connector for querying GraphQL endpoints using SQL syntax.
connector data-analysis dbapi graphql graphsql python sql sqlalchemy superset
Last synced: 09 Apr 2026
https://github.com/neuro-mechatronics-interfaces/matlab_analyses
Tools for analysis, statistics, and/or simulation in Matlab.
data-analysis data-visualization matlab matlab-codes matlab-functions matlab-gui matlab-scripts neuroscience weber-lab
Last synced: 09 Jun 2026
https://github.com/m0saan/python-for-data-analysis
Materials and IPython notebooks for "Python for Data Analysis" by Wes McKinney,
data-analysis data-science ipython-notebook machine-learning matplotlib numpy pandas python
Last synced: 02 May 2026
https://github.com/hemangsharma/streamingcontentanalyzer
This Streamlit application provides an interactive dashboard for analyzing streaming content data. It allows users to explore movie and TV show ratings, distributions, temporal trends, and genre breakdowns through various visualizations and filters.
dashboard data-analysis data-science data-visualization python streamlit-dashboard streamlit-webapp
Last synced: 02 Apr 2025
https://github.com/ved-coder-king/wheat_ai_project
This project, Smart Wheat Farming AI System, was developed as part of the coursework for the Artificial Intelligence program at Esprit School of Engineering.
agriculture data-analysis data-visualization deep-learning image-classification machine-learning object-detection python wheat
Last synced: 15 Apr 2025
https://github.com/mrdandelion6/error-propagator-v2
In development
data-analysis data-science error-propagation flask fullstack-development python react vite
Last synced: 25 Mar 2025
https://github.com/ttwag/p9_pandas
Problems that Introduce the DataFrame Object in Python's Pandas Library
data-analysis pandas-dataframe python
Last synced: 10 Jun 2025
https://github.com/mishaa931/amazon-sales-dashboard-power-bi
This project features a dynamic Power BI dashboard built on dummy Amazon sales data. It visualizes key business metrics such as revenue trends, top-selling categories, discount impact, and geographic performance. The dashboard is designed to help stakeholders make data-driven decisions through clear, interactive visuals.
data-analysis data-quality data-visualization microsoftpowerbi
Last synced: 05 Feb 2026
https://github.com/vatshayan/youtube-user-analysis
Analysis of Youtube Users about their choice and preferences
data data-analysis data-mining data-science data-visualization dataset machine-learning machine-learning-algorithms
Last synced: 05 Feb 2026
https://github.com/se7en69/rna-seq-data-processing-and-analysis-pipeline
This pipeline automates essential steps for RNA-Seq data analysis, including quality control, read trimming, alignment to a reference genome, and coverage quantification. It leverages tools like FastQC, fastp, STAR, and bedtools to ensure high-quality results, with MultiQC reports providing an overview at each stage.
bioinformaitcs-scripting bioinformatics bioinformatics-pipeline data-analysis linux scripts shell
Last synced: 02 May 2026
https://github.com/marianamartiyns/inep-educationperfomance
Data collection, processing, exploratory analysis, and predictive modeling of school performance rates using datasets from INEP.
data-analysis data-cleaning data-science inep predictive-modeling pyhton web-scraping
Last synced: 16 Mar 2025
https://github.com/tolumie/loan-approval-prediction
Loan Approval Prediction using Machine Learning | EDA + Decision Tree, Random Forest & Logistic Regression | Automating loan eligibility for Dream Housing Finance by analyzing customer data and predicting loan approvals.
classification credit-risk-analysis data-analysis decision-tree-classifier finance-analytics loan-approval logistic-regression-algorithm machine-learning predictive-modeling-techniques random-forest
Last synced: 30 Jun 2025
https://github.com/apsinghanalytics/hranalytics_myersbriggspersonalityinsights
A Excel analytics study exploring the correlation between personality traits and key HR-relevant parameters, including tenure and performance
data-analysis data-visualization excel pivot-tables
Last synced: 30 Jan 2026
https://github.com/kath92/my_data_projects
My data projects.
data-analysis data-vizualisation nlp-machine-learning poewrbi python tableau
Last synced: 23 Mar 2025
https://github.com/odinsride/monpl
PL/SQL Data Load Monitoring Tools
data-analysis database etl logging logging-framework monitoring oracle plsql
Last synced: 28 Jul 2025
https://github.com/luminati-io/Walmart-dataset-samples
A sample dataset of over 1000 Walmart products, extracted using the Bright Data API, ideal for consumer market insights and competitor analysis.
api data-analysis dataset walmart walmart-scraper web-scraping
Last synced: 09 Apr 2025
https://github.com/luminati-io/Target-dataset-samples
A sample dataset of over 1000 target products, extracted using the Bright Data API, ideal for brand reputation, tracking inventory, and optimizing prices.
api data-analysis data-mining datasets target web-scraper web-scraping
Last synced: 09 Apr 2025
https://github.com/robertpaulp/expenseadvisor
HackITall 2023- Hackathon
chatgpt-api data-analysis data-processing python scrapping-python
Last synced: 03 May 2026
https://github.com/bhavna-kale/cars-eda-project
Project analyzing used car market data to identify high-impact price drivers and depreciation curves, presented through an interactive web application.
data-analysis excel matplotlib numpy pandas python3 searborn streamlit
Last synced: 03 May 2026
https://github.com/leandrocollares/nyc-film-permits
NYC film permits: an exploratory data analysis
data-analysis data-visualization pandas plotly
Last synced: 05 Jul 2025
https://github.com/khushi-sabarad/8-week-sql-challenge
Case studies' solutions for the #8WeekSQLChallenge by Danny Ma
8weeksqlchallenge case-study data-analysis mysql sql
Last synced: 06 Sep 2025
https://github.com/aicorsair/python-case-study-imdb-movie-reviews-sentiment-analysis-with-nlp
This repository contains a comprehensive case study on sentiment analysis using the IMDb dataset of movie reviews.
ada-boost artificial-intelligence classification data-analysis data-analytics data-cleaning data-preprocessing data-science data-visualization feature-engineering feature-extraction hyperparameter-tuning logistic-regression machine-learning naive-bayes natural-language-processing nltk python random-forest shap
Last synced: 03 May 2026
https://github.com/shivam5509/power-bi-project
Expert in creating interactive dashboards and reports using Power BI, utilizing 10+ visual tools like cards, slicers, and charts. Skilled in cleaning and transforming large datasets with Power Query Editor. Proficient in advanced DAX functions (SUMX, FILTER, CALCULATE) to derive insights and drive data-driven decisions.
advanced-excel computer-science data-analysis data-mining data-visualization engineering mysql numpy pandas powerbi pyhton3 sql sql-server
Last synced: 11 Apr 2026
https://github.com/satvikpraveen/matplotlibmasterpro
📷 MatplotlibMasterPro is a complete, portfolio-ready project to master data visualization using matplotlib. Includes 16 notebooks, real datasets, exportable plots, custom themes, Streamlit dashboard, and Docker support. Ideal for learners and data professionals.
charts custom-plots dashboarding data-analysis data-science data-visualization educational-project interactive-visualizations jupyter-notebook matplotlib notebooks open-source plotting portfolio-project python python-utilities reproducible-research subplots time-series-analysis visualization-tools
Last synced: 14 May 2026
https://github.com/code-jl/dna-sequence-analyzer
A robust Python-based bioinformatics tool for comprehensive DNA sequence analysis and manipulation.
bio-tools bioinformatics biological-data computational-biology data-analysis dna-analysis dna-sequencing fasta gc-content gene-detection genetics genomics molecular-biology motif-finding nucleotide-analysis python python3 scientific-computing sequence-analysis sequence-manipulation
Last synced: 11 Mar 2025
https://github.com/tj2904/lfb-callout-analysis
An investigation into London Fire Brigade's callout data.
data-analysis decsion-tree kmeans lfb-incidents london-fire-brigade pandas python seaborn
Last synced: 13 Apr 2026
https://github.com/shellynagar27/candy-market-share-analysis
Candy Market Share Analysis explores confectionery sales data using Power BI, Python, and Power Query. It uncovers key market trends, top-selling candies, manufacturer performance, and packaging preferences to support data-driven decision-making for industry researchers.
critical-thinking data-analysis data-visualization exploratory-data-analysis powerbi powerquery problem-solving sales-analysis
Last synced: 03 Feb 2026
https://github.com/antoniszks/music-category-identifier
A 'Data-Science & Machine Learning' project where we are training a neural network to identify what kind of music we give to it. Based on a university project.
ai artificial-intelligence data-analysis data-science jupyter-notebook machine-learning ml notebook python
Last synced: 25 Feb 2025
https://github.com/baggiponte/pyconpt-polars
@pola-rs talk @pyconpt
apache-arrow data-analysis data-science etl polars python
Last synced: 03 May 2026
https://github.com/lopes51789/salaryanalysis
This salary dataset is a good candidate for descriptive analysis, and we can identify which demographics experience reduced or increased salaries. For example, we could explore the salary variations by gender, age, industry, and even years of prior work.
data-analysis json mysql python3 sql tableau
Last synced: 13 Apr 2026
https://github.com/dogan-the-analyst/data_analysis_in_the_office
Data analysis with R in the Office.
data-analysis ggplot r theoffice tidyverse
Last synced: 14 Mar 2025
https://github.com/ssoehdata/sql_for_data_science_specialization_course
Materials and Certifications from the SQL for DataScience Course
data-analysis data-science database databricks postgresql sql sqlite
Last synced: 10 Apr 2026
https://github.com/cnoret/retail-data-analysis
Let's analyze historical sales data from a large retail chain and predict weekly sales using machine learning on a Streamlit web app
data-analysis data-analyst data-science data-vizualisation pandas python streamlit streamlit-webapp
Last synced: 10 Apr 2026
https://github.com/bibymaths/python_snippets
A collection of Python scripts for bioinformatics data analysis, including tools for transcription counts, nucleotide composition, and protein sequence evaluation.
amino-acid-scoring bioinformatics data-analysis fasta-generation mathematical-evaluation nucleotide-analysis protein-sequence-analysis transcription-counts
Last synced: 29 Jul 2025
https://github.com/ndiplacide7/r-project
Explore diverse data analysis techniques using R programming combined with advanced machine learning algorithms to uncover insights and create powerful predictive models.
data-analysis data-visualization machine-learning-algorithms r
Last synced: 25 Mar 2025
https://github.com/arturo2r/dashboard
Dashboard of New House Index Pricing
colombia dashboard data-analysis forecasting forecasting-models forecasting-time-series prices r
Last synced: 25 Mar 2025
https://github.com/ravi-prakash1907/covid-19-china
A data-science research work to understand the growth rate of the novel Coronavirus.
china coronavirus covid-19 data-analysis data-mining data-science mathematical-modelling project r research research-paper
Last synced: 06 Sep 2025
https://github.com/paul0vinicius/ad2
Repositório da disciplina de Análise de Dados 2 (Data Analysis II)
Last synced: 08 Jan 2026
https://github.com/mohnish88/e-commerce-data-analysis
I analyzed sales data to identify trends and patterns, which significantly enhanced decision-making processes. Additionally, I created interactive visualizations to present these insights clearly and effectively, facilitating better understanding and communication of the data's implications.
data-analysis data-cleaning jupyter-notebook pandas plotly python python-library sales sales-analysis visulaization
Last synced: 03 May 2026
https://github.com/jcm-ai/Standard-Bank-Data-Science-Virtual-Experience-Programme
This repository has all of the assignments I had to do for the Standard Bank Data Science Virtual Experience Program. 📉👨💻📊📈
automl business-analysis business-solutions client-communication data-analysis data-mining data-science data-visualization machine-learning machine-learning-algorithms matplotlib-pyplot model-evaluation model-interpretation power-point presentation-slides programming-language python3 seaborn sql statical-analysis
Last synced: 19 Aug 2025
https://github.com/gattupalli-saketh/sentiment-analysis-on-products-
Product reviews sentiment analysis.
data-analysis machine-learning nlp review-analysis sentiment-analysis sentiment-classification
Last synced: 18 Apr 2026
https://github.com/krzysikd/uber_fare_prediction
Predicting uber fares using advanced machine learning models and feature engineering techniques
data-analysis data-processing eda hyperparameter-tuning jupyter machine-learning regression-models
Last synced: 02 Apr 2025
https://github.com/dbriane208/python-for-data-science
Machine Learning and Data Science repository. Love crafting Machine Learning models.
data-analysis data-science data-visualization machine-learning numpy pandas python seaborn
Last synced: 13 Apr 2026
https://github.com/giyanellow/time-series-analysis-on-philippine-debt-and-inflation
A Time Series Analysis on the Philippine Inflation Rate with some predictions using RandomForest.
data-analysis data-analysis-python machine-learning python random-forest
Last synced: 18 Mar 2026
https://github.com/diligencefrozen/dcinside-data
Analyzing the Dcinside Frozen Gallery Dataset. #디시
Last synced: 30 May 2026
https://github.com/ababic/dumpling
Fast, flexibile, powerful static data anonymisation for SQL dumps
anonymisation cli data-analysis data-science pii pii-redaction postgres privacy rust rust-lang scrubber scrubbing security tooling
Last synced: 03 May 2026
https://github.com/salma-mamdoh/project-writing-functions-for-product-analysis
My Project to learn the Basics of Analysis on DataCamp
data-analysis data-camp pandas python
Last synced: 03 May 2026
https://github.com/saro0307/exploratory-data-analysis-terrorism
Phase 1 of Data Science project (program) to perform Exploratory Data Analysis on Terrorism using Python On Google Colab for Coderscave Internship sept 2023
colaboratory data-analysis datascience machine-learning numpy pandas python seaborn skit-learn visualization
Last synced: 13 Apr 2026
https://github.com/okwilkins/retailanalysis
A comprehensive exploratory analysis and implementation of kmeans/hierarchical clustering on online retail data.
data-analysis data-science machine-learning statistics
Last synced: 18 Oct 2025
https://github.com/1401dev/customer-lifetime-value-prediction
A data science project leveraging Python and Scikit-Learn to build predictive models that estimate customer lifetime value (CLV). Includes data cleaning, feature engineering, and model selection to identify key drivers of CLV, supporting strategic decision-making in customer retention and marketing.
clv clv-analysis customer-retention data-analysis dataprocessing feature-engineering machine-learning marketing-analytics predictive-modeling python regression-analysis scikit-learn
Last synced: 06 May 2026
https://github.com/tillbiskup/trepr
A Python package based on the ASpecD framework for handling TREPR data.
data-analysis data-processing electron-paramagnetic-resonance reproducible-research reproducible-science spectroscopy time-resolved
Last synced: 06 Sep 2025
https://github.com/ronitjariwala/prodigy_ds_02
Prodigy InfoTech Data Science Internship Task-2
Last synced: 28 Apr 2026
https://github.com/ankitgmishra/machinelearning
Continuously deep diving in understanding & advancing my expertise in Machine Learning through ongoing education and hands on experience with practical learning.
artificial-intelligence data-analysis data-cleaning data-gathering machine-learning machinel-learning-algorithms matplotlib numpy pandas python seaborn
Last synced: 03 May 2026
https://github.com/fer-aguirre/covid19-venezuela
Análisis de datos de muertes por covid-19 en Venezuela
covid-19 data-analysis dataviz line-chart
Last synced: 09 Apr 2025
https://github.com/fbarffmann/vba-challenge
Built an Excel VBA script to automate stock market analysis across multiple years. Programmatically calculated and visualized key financial metrics, reducing manual reporting time and improving data accuracy.
automation data-analysis excel excel-vba financial-analysis reporting stock-market vba
Last synced: 04 Feb 2026
https://github.com/vatshayan/pokemon-analysis
Visualization, Analysis & Predicting the accuracy of finding Pokemon power, attack & speed through Machine Learning
artificial-intelligence data data-analysis data-science data-visualization dataset machine-learning machine-learning-algorithms pokemon scikit-learn
Last synced: 30 May 2026
https://github.com/sadia-khan13/data-preprocessing
Welcome to the Data preprocessing Repository! This repository is dedicated to showcase the comprehensive resources and implementations related to Data Preprocessing using Python and Jupyter Notebook.
artificial-intelligence data-analysis data-mining data-preprocessing data-science jupyter-notebook matplotlib numpy pandas python seaborn-python sklearn
Last synced: 11 Apr 2026
https://github.com/jkaardal/csvnav
A memory-efficient python class for navigating large CSV/text files.
csv data-analysis data-science machine-learning memory-management
Last synced: 14 Jan 2026
https://github.com/phanchenh/youtube_analysis_rlanguage
Insights into YouTube Channel Performance - A Data-Driven Approach
business-analytics data-analysis data-driven data-visualization etl-pipeline preprocessing r-language r-programming-language
Last synced: 10 Mar 2026
https://github.com/fer-aguirre/cookiecutter-data-analysis-lite
A cookiecutter template for data journalism projects that offers a simplified and beginner-friendly structure.
cookiecutter data-analysis data-journalism project-template python
Last synced: 14 Jun 2025
https://github.com/fbarffmann/car_price_prediction
Predicted used car prices with a Random Forest model (R² = 0.96) using Python. Analyzed 2,000+ listings and visualized trends with Tableau.
car-price-prediction data-analysis machine-learning pandas python random-forest regression sklearn tableau
Last synced: 13 Apr 2026
https://github.com/shrikantnaidu/sql-for-data-analysis
SQL for Data Analysis
data-analysis parch-and-posey postgresql
Last synced: 27 Feb 2025