Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-23 00:07:29 UTC
- JSON Representation
https://github.com/giuleo129/dataanalysis
This folder contains two projects focused on data analysis and statistical learning using R, covering exploratory data analysis, modeling, and predictive techniques.
data data-analysis data-science statistical-learning
Last synced: 25 Jan 2026
https://github.com/ashwin331133/hospital_allpatients_waitinglist_data
This Power BI project analyzes patient waiting lists across various medical specialties and case types (Day Case, Inpatient, Outpatient). The goal is to gain insights to improve healthcare management and resource allocation.
data-analysis data-visualization powerbi
Last synced: 03 Jan 2026
https://github.com/v-mayya/quantitative-analysis-data-dashboard
Quantitative survey data analysis using R
data data-analysis data-visualization flourish r
Last synced: 01 Apr 2025
https://github.com/seif-elkateb/dataset-analysis-r
cu-boulder data data-analysis datamodeling datascience ms-ds msds434 r
Last synced: 01 Apr 2025
https://github.com/bhushan148/finance-domain-bank-loan-report-tableau
I analyzed 🏦 bank loan data to reveal trends, KPIs, and insights. Using Tableau 📈 for dashboards and SQL 🗃️ for data extraction, I visualized loan applications, borrower profiles, and repayment behaviors 💡.
bussiness-intelligence dashboard-design data-analysis data-visualization excel figma sql sqlqueries tableau
Last synced: 08 Apr 2025
https://github.com/tomijuarez/lemmatisation
Lemmatisation fully implemented in Java.
algorithms data-analysis data-science java-8 lemmatization oop
Last synced: 08 Apr 2025
https://github.com/farzeen-2001/financial-analysis-report-using-powerbi
comprehensive analysis of financial report
data-analysis data-visualization datacleaning dax powerbi
Last synced: 17 Feb 2026
https://github.com/rahulsm20/insurance-data
A data analytics project dealing with risk assessment and it's effects in health insurance.
data-analysis data-analytics machine-learning matplotlib numpy pandas python scikit-learn
Last synced: 12 Apr 2026
https://github.com/shellynagar27/transportation-and-logistics-challenge
Analyzing logistics data to optimize shipment efficiency, reduce delays, and enhance supply chain visibility using Power BI. Insights include top routes, delays, supplier trends, and peak shipments.
cleaning-data critical-thinking data-analysis data-visualization exploratory-data-analysis feature-engineering powerbi preprocessing-data problem-solving python
Last synced: 16 May 2026
https://github.com/yash-3-bit/online-sales-analysis
Project-Merging the different months datasets and performing the data cleaning ,Analysis and Visualization
data-analysis data-visualization pandas-library
Last synced: 27 Mar 2025
https://github.com/quocduyenanhnguyen/roi-modeling-and-analysis-of-sports-dataset
In this project, you will find my ROI model for retirement savings and PowerPoint presentation of my ROI model, as well as my data analysis/visualization of Sports Ticket Sales dataset that I concluded with a PDF group written report
data-analysis data-visualization microsoft-excel rate-of-return-modeling sports-ticket-sales-dataset
Last synced: 08 Feb 2026
https://github.com/dulajkavinda/pandas-exploring-data-ml
🐼 Exploring data with pandas library.
data-analysis machine-learning pandas python
Last synced: 09 May 2026
https://github.com/theveryhim/massive-text-processing-1
cleaning, processing and analysis of papers' dataset in pyspark(rdd) framework
big-data data-analysis frequent-itemsets massive-datasets pyspark text-preprocessing
Last synced: 03 Jul 2025
https://github.com/jacksonrakena/lcg-toolkit
Simulator and plotter for linear congruential generator (LCG) functions in Python
borland congruence congruent congruential data-analysis data-science generation generator lcg linear linear-congruential-generator numerical-recipes randu randu-function randu-generator randu-rng rng
Last synced: 31 Aug 2025
https://github.com/quantumudit/groceries-basket-analysis
This project performs market basket analysis using Power BI and Python to reveal associations between grocery items. It involves transforming raw transaction data into a processed dataset, creating interactive Power BI reports, and generating key insights through Python, enabling data-driven decision-making.
data-analysis data-visualization pandas powerbi python
Last synced: 12 Apr 2026
https://github.com/bpkaur/exploring-the-evolution-of-linux
This project explores the evolution of the Linux kernel by finding top 10 contributors and visualization of commits over the years.
data-analysis data-science datacamp ipynb-jupyter-notebook python3
Last synced: 21 Feb 2026
https://github.com/charlescro/reddit-classification-nlp
Analyzing subreddit language via Reddit API and NLP techniques.
data-analysis data-science data-visualization nlp-machine-learning reddit-api scikit-learn
Last synced: 03 Apr 2025
https://github.com/cassandrajm/reddit-dashboard
INTERACTIVE DASHBOARD: Analyzing Political Discourse on Reddit: A Multi-Faceted NLP Approach to Toxicity, Bias, and Political Stance
capstone data data-analysis data-science politics python reddit
Last synced: 09 Apr 2025
https://github.com/saroshfarhan/dublin_pedestrian_data_analysis
Pedestrian's footfall data analysis for the city of Dublin
data-analysis data-visualization r-programming
Last synced: 07 Jan 2026
https://github.com/ankitmishralive/machinelearning
Continuously deep diving in understanding & advancing my expertise in Machine Learning through ongoing education and hands on experience with practical learning.
artificial-intelligence data-analysis data-cleaning data-gathering machine-learning machinel-learning-algorithms matplotlib numpy pandas python seaborn
Last synced: 22 Mar 2025
https://github.com/fbarffmann/citibike-covid-analysis
Analyzed NYC CitiBike usage during March 2020 to assess the impact of COVID-19 using Python and Tableau. Includes ridership breakdowns, user type trends, and interactive dashboard.
citibike covid19 data-analysis data-visualization exploratory-data-analysis pandas python tableau transportation
Last synced: 12 Apr 2026
https://github.com/faysalalmahmud/bd-med-professional-analysis
Analysis of healthcare professionals in Bangladesh through web scraping, data processing, and interactive visualization.
data-analysis data-visualization jupyter-notebook python scraper selenium selenium-webdriver tableau
Last synced: 04 Sep 2025
https://github.com/muhammed-fazal/student-success-and-early-intervention-analytics-system
To consolidate scattered student performance records into a unified Data Warehouse in SQL Server. Engineer an Interactive Power BI dashboards that visualize academic trends, identifying student performance and implement predictive analytics.
analysis analytics dashboard data data-analysis data-engineering data-science data-visualization database etl etl-pipeline power-bi powerbi python sql sql-server
Last synced: 29 May 2026
https://github.com/ymorsi7/caliwageanalysis
California employment and wage analysis on data from the past decade.
data-analysis data-science ipynb jupyter-notebook
Last synced: 21 Jan 2026
https://github.com/rb-thompson/smart-finance-analyzer
Module End Project for CY
data-analysis data-visualization python
Last synced: 12 Jun 2025
https://github.com/angchekar28/valorant-gameplay-analysis
This project analyzes Valorant gameplay data to understand key factors affecting match outcomes. It compares various machine learning models to predict player performance, rank classification, and match success.
data-analysis data-science data-visualization exploratory-data-analysis jupyter-notebook machine-learning python
Last synced: 12 Apr 2026
https://github.com/camara94/data_analyse_series_temporelles
Dans ce tutoriel, nous allons répondre aux questions suivantes: 1. Lire les données Microsoft à l'aide du package **Pandas Data reader** 2. Obtenez le **prix maximum** de l'action de **2017 à 2022** 3. Quelle est la **date du cours le plus élevé** de l'action ? 4. Quelle est la **date du cours le plus bas** de l'action ?
data-analysis data-analysis-python data-science data-structures-and-algorithms data-visualization serie series-forecasting
Last synced: 09 Apr 2025
https://github.com/dmdlgg/spotify-analysis
An interactive data analysis app built with Python, Pandas, Plotly, and Streamlit, showcasing insights about the top 1000 most played songs on Spotify. Dataset sourced from Kaggle. Users can explore the frequency, popularity, and most played songs by artist in a clean and intuitive interface.
data-analysis data-visualization pandas plotly python streamlit
Last synced: 11 May 2026
https://github.com/nsandoya/python_scrp_project
This is a tool specially made for Dipaso ecommerce website. You can extract data from there, analyze it and see keywords, brands, and categories frecuency, prices distribution and other market tendencies as well —all in a group of friendly stadistic tables and graphics (exported from a Jupyter notebook) :)
beautifulsoup4 data data-analysis jupyter-notebook pandas python3
Last synced: 28 Apr 2026
https://github.com/sumit0ubey/internship
This repository showcases the tasks and projects I completed during various internships. It includes work across diverse domains such as: Data Analysis: Exploratory data analysis, data visualization, and insights generation using Python and libraries like Pandas, Matplotlib, and Seaborn. Backend Development: Designing and implementing RESTful API
backend-development data-analysis python-developer
Last synced: 05 Sep 2025
https://github.com/fbarffmann/nosql-challenge
Analyzed 28,000+ UK restaurant records using MongoDB and PyMongo. Queried hygiene scores, location data, and customer ratings.
data-analysis data-cleaning database-analysis json mongodb nosql pymongo python restaurant-data
Last synced: 13 Apr 2026
https://github.com/junpenglao/jaefa
Just Another Eye-movement Filtering Algorithm
data-analysis eye-movement-data eye-tracking
Last synced: 12 Jan 2026
https://github.com/anniefib/otherprojects
Powering Data Dreams: From Orchestration to Analytics with Cloud Precision
airflow-etl-orchestration aws cloud-native-data-solutions data-analysis data-visualization database datamodelling datawarehousing eda end-to-end-data-pipelines machine-learning-models pgadmin4 spark-analytics sql
Last synced: 07 May 2026
https://github.com/nullthefirst/py-notebooks
Jupyter Notebooks holding Data Science projects
data-analysis data-science data-visualization datasets jupyter-notebooks python
Last synced: 26 Apr 2026
https://github.com/nafiealhilaly/first-dash-app
A simple dash plotly app to explore and analyze imagined students assessment dataset
data-analysis data-analytics data-visualization eda plotly-dash python
Last synced: 02 Apr 2025
https://github.com/anas436/predictive-modelling-urban-growth-ai
Predictive Modelling for Urban Growth using AI
artificial-intelligence dashboard data-analysis data-collection data-preprocessing data-science data-visualization deep-learning deployment jupyterlab machine-learning python3 remote-sensing streamlit webapplication webscraping
Last synced: 05 Sep 2025
https://github.com/zkan/python-for-data-scientists
Python for Data Scientists
data-analysis data-science data-scientists machine-learning pandas python
Last synced: 13 Apr 2026
https://github.com/lexiortiz/advanced-data-analytics
Structured learning notes, code snippets, and key takeaways from the Google Advanced Data Analytics Professional Certificate. Serves as a personal reference for reinforcing concepts and as a resource for others on a similar learning journey.
data data-analysis data-engineering google python-3 sql
Last synced: 29 May 2026
https://github.com/nurulashraf/polynomial-regression-manufacturing
A Python project implementing polynomial regression to analyse and predict manufacturing-related data. Features include data preprocessing, model training, and visualisation of results. Ideal for exploring machine learning applications in manufacturing process optimisation.
data-analysis data-visualization machine-learning manufacturing polynomial-regression predictive-modeling process-optimization python regression-models scikit-learn
Last synced: 16 Apr 2026
https://github.com/analysisbyvivek/Road-Accident
Analyzes road accident patterns, exploring factors like lighting, weather, speed limits, time of day, and road conditions to uncover trends in severity and frequency.
data-analysis data-visualization eda jupyter-notebook kaggle tableau-public
Last synced: 29 Jan 2026
https://github.com/ashleydavis/brisjs-data-analysis-talk
Code for my talk to BrisJS on data analysis in JavaScript
charting data-analysis data-visualization data-viz javascript node node-js nodejs visualization
Last synced: 25 Mar 2025
https://github.com/parthds02/e-commerce-data-analysis-with-python
This project focuses on analyzing an e-commerce dataset using Python. The goal is to derive meaningful insights through exploratory data analysis (EDA) and uncover trends and patterns that can drive business decisions.
data-analysis ecommerce exploratory-data-analysis jupyter-notebook pytho sales-analysis visualization
Last synced: 13 Jun 2025
https://github.com/amoghkori/working-with-apache-spark-mllib
Implemented Apache Spark MLLib to analyze a large car dataset, predict car selling prices, and gain insights into the car market.
amazon-web-services data-analysis data-visualization exploratory-data-analysis linear-regression machine-learning model-selection pyspark python random-forest sagemaker spark
Last synced: 13 Apr 2026
https://github.com/asergioscosta/portfolio-dados
Portfolio of dashboards and data analysis projects.
business-intelligence dashboard data-analysis data-visualization kpi looker-studio powerbi
Last synced: 22 Feb 2026
https://github.com/amoghkori/effect-of-box-office-on-unemployment
Data preparation and cleaning process for movie ratings and reviews dataset and US unemployment rate dataset, involving an 8-step data wrangling process to create an Analytic Base Table (ABT) structure, emphasizing data structuring techniques, cleaning for outliers and missing values, and the importance of accurate and reliable data for analysis.
data-analysis data-cleaning data-preprocessing data-validation data-wrangling model-selection
Last synced: 13 Jun 2025
https://github.com/hassanislam463/data-cleaning-and-modelling-top-5-categories-analysis-forage
This project involves cleaning, merging, and analyzing datasets to identify the top 5 performing categories based on aggregate popularity scores. It includes cleaned datasets, a final merged dataset, visualizations, and a presentation summarizing the tasks and results. Tools used: Microsoft Excel, Python, and PowerPoint.
data-analysis data-visualization microsoft-excel
Last synced: 07 Jan 2026
https://github.com/zen204/renewable-energy-usage-v-electricity-access
Interactive data visualization project created for COSI 116A: Introduction to Information Visualization at Brandeis University (Fall 2024). The project showcases data-driven insights using advanced visualization techniques and user interactivity. Hosted on GitHub Pages.
d3js data-analysis data-visualization electricity github-pages html-css-javascript information-visualization interactive python renewable-energy tableau web-development
Last synced: 08 Feb 2026
https://github.com/himanshubhosale25/ai-insightful-quiz-analytics
This project analyzes student quiz performance data, providing visualizations and AI-generated feedback. It uses FastAPI for the backend, React for the frontend, and OpenAI LLMs to deliver personalized insights and actionable recommendations for students.
data-analysis fastapi openai-api react student-performance
Last synced: 11 Mar 2025
https://github.com/ljadhav25/data-engineering-poc
This repository contains a beginner-level Data Engineering Proof of Concept (POC) project designed for practice. The objective is to provide hands-on experience with data engineering concepts, including data extraction, transformation, loading (ETL), and basic data analysis. This project is ideal for those looking to build foundational skills in da
data-analysis etl matplotlib numpy pandas python
Last synced: 13 Apr 2026
https://github.com/mrfoxak/movie-recommender-system-project
This is a Machine Learning Recommendation System Project
data-analysis machine-learning python recommender-system regression tokenization
Last synced: 13 Apr 2026
https://github.com/damiieibikun/web-scrapping-and-python-data-visualization-on-top-500-movies-imdb
Web Scrapping and Python Data visualization on Top 500 movies IMDb
beautifulsoup4 data-analysis data-visualization matplotlib-pyplot numpy pandas plotly-express python requests seaborn web-scraping
Last synced: 13 Apr 2026
https://github.com/namratha2301/python-dashboard-streamlit
Experimenting with Streamlit. Streamlit app provides an interactive visualization of the best-selling books, showcasing trends, top-selling books, top authors, genre distributions, and sales by decade.
css dashboard data-analysis pandas plotly python seaborn streamlit
Last synced: 05 May 2026
https://github.com/ikigai-systems/fundamento-standalone
Your data is your treasure—store it on solid foundations
application automation cloud collaboration data-analysis data-visualization docs documentation on-prem ruby-on-rails wiki wiki-engine
Last synced: 11 May 2026
https://github.com/spacebakery/variance-in-weather-project
Statistics for Data Analysis | Variance and Standard Deviation
data-analysis python standard-deviation statistics variance
Last synced: 05 Jul 2025
https://github.com/crazy-dot/instagram_user_analytics
Analysis of Popular Social Media Network - Instagram
data-analysis instagram-analytics project-repository trainity
Last synced: 07 Jan 2026
https://github.com/samruddhi3012/tata-data-visualization
Hi! This repo contains the dashboard I created using Tableau for TATA Data Visualization Training!
data-analysis data-visualization tableau tata
Last synced: 07 Jan 2026
https://github.com/fabriciocovalesci/webscraping
Web Scraping para obter os dados da página
automation beautiful-soup-4 data-analysis data-cleaning data-extraction data-mining data-scraping data-wrangling html-parsing python web-crawling web-data-scraping web-scraping webscraping
Last synced: 13 Jun 2025
https://github.com/abhisek-13/fake_news_classifier
The Fake News Classifier is a TensorFlow-based machine learning project that detects and classifies fake news with 97% accuracy. The repository includes a single Python file with complete code for building and training the model, which you can use to create and deploy your own model.
colab-notebook data-analysis data-engineering deep-learning eda kaggle keras machine-learning nlp pandas python tensorflow
Last synced: 13 Apr 2026
https://github.com/lijesh010/covid-19_global_analytics_power_bi_project
This repository is a data visualization project that offers an in-depth analysis of the Covid-19 pandemic using Microsoft Power BI. This interactive dashboard provides valuable insights into key metrics related to Covid-19 cases, deaths, recoveries, and more, helping users understand the global impact of the pandemic.
dashboard data-analysis data-visualization powerbi report
Last synced: 08 Jan 2026
https://github.com/singhrdeep/croppilot
CropPilot is a lightweight, Python-based command-line tool designed to help small-scale farmers, gardeners, and students manage crop data, track profits, and explore sustainable practices. Built for usability and extensibility.
agriculture data-analysis farm-management open-source python
Last synced: 25 Apr 2025
https://github.com/whoprashant7/querying-a-large-relational-database-using-ms-sql
Analysing data using Ms Sql Server
data-analysis ms-sql-server sql
Last synced: 05 Jul 2025
https://github.com/kittonn/data-analysis-freecodecamp
freecodecamp - data analysis projects.
Last synced: 05 Apr 2025
https://github.com/mxagar/data_science_udacity
My personal notes, code and projects of the Udacity Data Science Nanodegree.
dashboard data-analysis data-engineering data-science machine-learning-pipelines
Last synced: 09 Apr 2025
https://github.com/hemangsharma/streamingcontentanalyzer
This Streamlit application provides an interactive dashboard for analyzing streaming content data. It allows users to explore movie and TV show ratings, distributions, temporal trends, and genre breakdowns through various visualizations and filters.
dashboard data-analysis data-science data-visualization python streamlit-dashboard streamlit-webapp
Last synced: 02 Apr 2025
https://github.com/prady2309/car-price-prediction
Multiple Linear Regression Project
data-analysis data-science machine-learning python
Last synced: 20 May 2026
https://github.com/mrdandelion6/error-propagator-v2
In development
data-analysis data-science error-propagation flask fullstack-development python react vite
Last synced: 25 Mar 2025
https://github.com/aravindnathan02/bi-projects
Data Analysis and Visualization projects involving only BI tools (Power BI, Tableau, MS Excel).
data-analysis data-visualisation ms-excel powerbi tableau
Last synced: 08 Jan 2026
https://github.com/vatshayan/youtube-user-analysis
Analysis of Youtube Users about their choice and preferences
data data-analysis data-mining data-science data-visualization dataset machine-learning machine-learning-algorithms
Last synced: 05 Feb 2026
https://github.com/joaquinmoron/airbnb-eda-python
EDA de Airbnb — limpieza, exploración y visualización en Python (pandas, matplotlib, seaborn).
airbnb data-analysis eda matplotlib pandas python seaborn
Last synced: 13 Apr 2026
https://github.com/marianamartiyns/rfm-cluster-analysis
Customer behavior and sales analysis, including data cleaning, RFM calculation, churn analysis and customer clustering.
cluster-analysis data-analysis data-cleaning data-visualization pyhton
Last synced: 16 Mar 2025
https://github.com/marina-gal/elderly-care-ranking
Data analysis and scoring model for elderly care homes, including data cleaning, transformation, 0–100 scoring, and ranking across multiple quality dimensions.
Last synced: 30 May 2026
https://github.com/luminati-io/Target-dataset-samples
A sample dataset of over 1000 target products, extracted using the Bright Data API, ideal for brand reputation, tracking inventory, and optimizing prices.
api data-analysis data-mining datasets target web-scraper web-scraping
Last synced: 09 Apr 2025
https://github.com/leandrocollares/nyc-film-permits
NYC film permits: an exploratory data analysis
data-analysis data-visualization pandas plotly
Last synced: 05 Jul 2025
https://github.com/saifalibaig/covid-19-infection-rate-analysis-using-python
Analysis of Covid-19 Infection rate and the world happiness report to identify if there is any relationship between infection rate and happiness
data-analysis data-visualization jupyter-notebook numpy pandas python3 sns
Last synced: 18 Apr 2026
https://github.com/hari7261/playwithdata-python
This is one of the repository where I have put lot of data science and machine learning related questions on their solutions I hope you will find something better than some other platforms. Thank you Happy exploring
data-analysis data-science data-science-learning machienlearning matplotlib matplotlib-python ml numpy numpy-arrays numpy-library pandas pandas-dataframe pandas-library python python-script sklearn
Last synced: 13 Apr 2026
https://github.com/tj2904/lfb-callout-analysis
An investigation into London Fire Brigade's callout data.
data-analysis decsion-tree kmeans lfb-incidents london-fire-brigade pandas python seaborn
Last synced: 13 Apr 2026
https://github.com/shellynagar27/candy-market-share-analysis
Candy Market Share Analysis explores confectionery sales data using Power BI, Python, and Power Query. It uncovers key market trends, top-selling candies, manufacturer performance, and packaging preferences to support data-driven decision-making for industry researchers.
critical-thinking data-analysis data-visualization exploratory-data-analysis powerbi powerquery problem-solving sales-analysis
Last synced: 03 Feb 2026
https://github.com/mrprajapati18/100-days-of-code-data-science
100 Days of Code Challenge to learn Data Science from scratch! 📊🔍
anaconda-navigator data-analysis data-science data-visualization machine-learning-algorithms pyhton-library python-3
Last synced: 18 Apr 2026
https://github.com/lopes51789/salaryanalysis
This salary dataset is a good candidate for descriptive analysis, and we can identify which demographics experience reduced or increased salaries. For example, we could explore the salary variations by gender, age, industry, and even years of prior work.
data-analysis json mysql python3 sql tableau
Last synced: 13 Apr 2026
https://github.com/vinayakarannil/covid-19-india-analysis
EDA of covid-19 data in India
covid-19 covid-19-india data-analysis eda
Last synced: 14 Jun 2025
https://github.com/ndiplacide7/r-project
Explore diverse data analysis techniques using R programming combined with advanced machine learning algorithms to uncover insights and create powerful predictive models.
data-analysis data-visualization machine-learning-algorithms r
Last synced: 25 Mar 2025
https://github.com/rupashi03/fitbit-user-eda-case-study
Performed Exploratory Data Analysis (EDA) on Fitbit users' data to uncover trends in activity and health metrics.
business-analysis case-study consumer-insights data-analysis exploratory-data-analysis health-data r user-behavior-analytics
Last synced: 25 Mar 2025
https://github.com/arturo2r/dashboard
Dashboard of New House Index Pricing
colombia dashboard data-analysis forecasting forecasting-models forecasting-time-series prices r
Last synced: 25 Mar 2025
https://github.com/ravi-prakash1907/covid-19-china
A data-science research work to understand the growth rate of the novel Coronavirus.
china coronavirus covid-19 data-analysis data-mining data-science mathematical-modelling project r research research-paper
Last synced: 06 Sep 2025
https://github.com/allanotieno254/powerbi-dax-filter-context
This repository contains a Power BI project that explores **DAX Filter Context**, a crucial concept in DAX calculations. The project focuses on **Bank Loan Analysis**, demonstrating how different filter contexts affect DAX formulas.
business-intelligence data data-analysis dax dax-functions powerbi powerbi-visuals visualization
Last synced: 08 Jan 2026
https://github.com/chiragkumargohil/co2-emissions-data-analysis
A Python programme that analyses CO2 emission data from 1997 to 2010. This programme prints data, provides brief of a given year, displays and compares Year vs. Emission graphs for chosen countries, and generates a separate data file for chosen countries. It was a self-paced project that Guru 99 provided.
co2-emission data-analysis matplotlib python
Last synced: 28 Aug 2025
https://github.com/pawlo77/smarty
End-to-End Data Science tool
data-analysis data-processing pandas pipeline
Last synced: 08 May 2026
https://github.com/dbriane208/python-for-data-science
Machine Learning and Data Science repository. Love crafting Machine Learning models.
data-analysis data-science data-visualization machine-learning numpy pandas python seaborn
Last synced: 13 Apr 2026
https://github.com/rishitabansal9/adult-census-income-prediction
This is a project made for data analysis and income prediction using random forest classifier with 91% accuracy.
data data-analysis data-science feature-engineering random-forest-classifier
Last synced: 25 Mar 2025
https://github.com/meokullu/prefill
PreFill adds desired characters onto output values to increase their legibility.
alignment data data-analysis data-engineering data-science legibility
Last synced: 17 Jan 2026
https://github.com/tillbiskup/trepr
A Python package based on the ASpecD framework for handling TREPR data.
data-analysis data-processing electron-paramagnetic-resonance reproducible-research reproducible-science spectroscopy time-resolved
Last synced: 06 Sep 2025
https://github.com/tatilimongi/first_python_project
Este repositório contém um estudo de caso de automação de planilhas em Python para análise de vendas de carros por fabricante ao longo dos anos
data-analysis email-sending file-manipulation graphical-visualization spreadsheet-automation
Last synced: 26 Mar 2025
https://github.com/fer-aguirre/covid19-venezuela
Análisis de datos de muertes por covid-19 en Venezuela
covid-19 data-analysis dataviz line-chart
Last synced: 09 Apr 2025
https://github.com/weisswuerste/polars-eurovision-analytics
Analytics example using both the Pandas and Polars libraries
data-analysis data-analytics pandas polars python python-3 python3
Last synced: 08 May 2026
https://github.com/jkaardal/csvnav
A memory-efficient python class for navigating large CSV/text files.
csv data-analysis data-science machine-learning memory-management
Last synced: 14 Jan 2026
https://github.com/isaqueiros/newspapersales-predictions-linearregression_and_regularisation
This notebook is a study on the sales of newspapers of a local stand, with intention to predict the newspaper sales performance based on the different features available. For this, 4 sklearn models are applied: Linear Regression, Lasso Regression, Ridge Regression and Elastic Net Regression.
data-analysis data-science linear-regression machine-learning python regularization-methods sklearn-library sklearn-linear-regression
Last synced: 02 May 2026