Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-23 00:07:29 UTC
https://github.com/bala-1409/foreign-exchange-rate-time-series-data-science-project
This project uses time series analysis to forecast the exchange rate between the euro and the US dollar. It applies a variety of statistical techniques, such as ARIMA, to model the data and forecast the exchange rate.
data-analysis data-science data-visualization datapreprocessing eda exploratory-data-analysis forecasting machine-learning-algorithms model modelfitting predictive-modeling python3 scikit-learn statsmodels time-series time-series-analysis
Last synced: 07 May 2026
https://github.com/md-emon-hasan/data-science
Data science tutorials, including data preprocessing, analysis, visualization, project deployment, machine learning and deep learning algorithms.
artificial-intelligence data-analysis data-engineering data-science deep-learning machine-learning-algorithms python
Last synced: 07 May 2026
https://github.com/dogan-the-analyst/web_scraping_job_vacancies
data-analysis python web-scraping
Last synced: 07 May 2026
https://github.com/robson-python/airplane-price-data-analysis
Airplane Price Data Analysis - Airplane Price Prediction
data-analysis data-science data-visualization jupyter-notebook linear-regression machine-learning matplotlib pandas python seaborn vscode
Last synced: 10 Jun 2026
https://github.com/shridhar1504/loan-clustering-datascience-project
This project uses machine learning to cluster loans based on their similarities. It uses a dataset of loan applications that includes the loan amount and balance, then applies a clustering algorithm to group similar loans together.
clustering-algorithm data-analysis data-science data-visualization datanalysis eda kmeans-clustering machine-learning python sql sql-server unsupervised-learning
Last synced: 08 May 2026
https://github.com/guglielmo/datalab-notebooks
Data analysis at openpolis
data-analysis data-science jupyter-notebooks pandas python3
Last synced: 08 May 2026
https://github.com/iguptashubham/ott-churn-eda-ml
Understanding why customers discontinue their subscriptions is crucial to optimizing the user experience, reducing churn, and maximizing customer lifetime value. This project uses a machine learning model to predict customer churn.
data-analysis data-analysis-project data-science data-science-portfolio data-science-projects data-visualization machine-learning python
Last synced: 08 May 2026
https://github.com/jethronap/jstat-gui
Web-based GUI application for data analysis
data-analysis data-visualization java jstat mongodb
Last synced: 08 May 2026
https://github.com/miroslav-reiter/kurz_jazyk_sql_analytici_datovi_vedci
Materials for the course SQL Language 1 for Analysts and Data Scientists
analysis analytics data data-analysis data-science database mysql reiter sql
Last synced: 08 May 2026
https://github.com/allanotieno254/us-largest-companies-by-revenue-web-scraping
A Python project for web scraping and analyzing the largest companies in the United States by revenue from Wikipedia
automation beautifulsoup csv data-analysis data-cleaning data-execution data-extraction pandas python web-scraping
Last synced: 08 May 2026
https://github.com/md-emon-hasan/data_analytics_project
Data analytics tasks and solutions, featuring hands-on exercises for data cleaning, visualization, and analysis using Python libraries.
cars-dataset census-data covid19-data data-analysis london-house-price police-data weather-data
Last synced: 08 May 2026
https://github.com/sermonzagoto/data_manipulation_with_pandas
Data Manipulation with Pandas - Part 1
data-analysis data-science jupyter-notebook pandas-python python
Last synced: 09 May 2026
https://github.com/tanu272004/-income-mortgage-housing-insights-a-state-city-analysis-
Analyzes state and city housing trends and affordability using data analytics.
bigquery business-intelligence data-analysis data-visualization dax googlecloud kpi numpy powerbi predective-modeling python sql
Last synced: 09 May 2026
https://github.com/aminzibayi/atfc
Technology forecasting toolkit
data-analysis data-visualization graph technology-forecasting
Last synced: 09 May 2026
https://github.com/sedatdikbas/aefes-time-series-forecasting
This project uses deep learning methods (LSTM, BiLSTM, CNN+LSTM) to predict future closing prices from market data for Anadolu Efes Biracılık ve Malt Sanayii A.Ş. (AEFES). It details the data preprocessing, model training, and evaluation steps.
bilstm cnn-lstm data-analysis deep-learning financial-forecasting lstm machine-learning python stock-price-prediction tensorflow
Last synced: 09 May 2026
https://github.com/rubinlake/rl-academy-data-analytics
Educational data analysis project demonstrating BMW sales data analysis with AI-powered code assistance using Cursor IDE and Jupyter notebooks
cursor-ide data-analysis educational-project jupyter langchain matplotlib numpy pandas python scipy seaborn
Last synced: 09 May 2026
https://github.com/dogan-the-analyst/developer_survey_analysis
Analysis of the 2024 Stack Overflow developer survey. Tools used include Python, Pandas, Matplotlib, and IBM Cognos.
data-analysis data-visualization ibm-cognos-analytics matplotlib pandas python
Last synced: 09 May 2026
https://github.com/talha-1010/imdb-data-analysis
A data analysis project made with Python using pandas
data-analysis data-visualization jupyter-notebook pandas pandas-dataframe
Last synced: 09 May 2026
https://github.com/blagojeblagojevic/lol_data_analysis
classification data-analysis data-science jupyter-notebook kaggle python3
Last synced: 09 May 2026
https://github.com/mariam-badr-mb/gtc-ml-project2-diabetes-prediction
This project is part of the GTC Machine Learning Program. It demonstrates the end-to-end ML workflow by building a predictive model for diabetes detection
classification-algorithm data-analysis data-visualization diabetes-prediction gridsearchcv hyperparameter-tuning machine-learning python
Last synced: 09 May 2026
https://github.com/sathyasris27/data-analysis-on-adult-smoking-patterns-in-the-uk
The aim of this analysis is to understand the smoking patterns among adults in the UK.
data data-analysis data-visualization python3
Last synced: 09 May 2026
https://github.com/chayandatta/got_script_manipulation
Game of Thrones Script - String & file manipulation
data-analysis data-science pandas python3
Last synced: 11 May 2026
https://github.com/easycris-software/easycris
Professional statistical analysis and RNA-seq for researchers — no coding required
anova bioinformatics data-analysis desktop-app genomics pharmacology research-tools rna-seq statistics tauri
Last synced: 11 May 2026
https://github.com/ahmednasef3/heart-attack-full-eda
Simple EDA for Heart Attack Dataset.
data-analysis data-science data-visualization eda exploratory-data-analysis heartattack matplotlib pandas seaborn
Last synced: 11 May 2026
https://github.com/is-leeroy-jenkins/sherpa
A budget execution & data analysis tool for EPA analysts, based on WinForms and .NET 6 and written in C#
budget-management data-analysis data-science data-visualization federal-government
Last synced: 13 May 2026
https://github.com/zpreisler/modules
Python libraries and modules for processing simulation outputs
data-analysis python scripts tensorflow
Last synced: 13 May 2026
https://github.com/iguptashubham/pizzahut-analysis-sql
A dataset well suited to data analysis. Pizza Hut data analysis done by Shubham Gupta in MySQL. The dataset was provided by a friend who interned at Pizza Hut, where it is used for training and for asking practice questions. The data does not reveal anything about Pizza Hut and is safe to share.
data-analysis data-analytics database dataset datasets mysql mysql-database pizzahut
Last synced: 14 May 2026
https://github.com/pferreirafabricio/data-immersion
🏊🏻♂️ Activities and exercises from 'Imersão Dados' event
data data-analysis data-science dataset jupiter-notebook python
Last synced: 14 May 2026
https://github.com/abhi18av/innovation-competition
Submission for a programming challenge
clojure clojurescript data-analysis
Last synced: 13 Jun 2026
https://github.com/dogan-the-analyst/model_car_warehouse_analysis
This is a SQL project.
Last synced: 15 Jun 2026
https://github.com/ensinho/data-analysis
My repository for data analysis studies in Python.
csv data-analysis graphs python python-documentation
Last synced: 15 Jun 2026
https://github.com/chetanmalviya513/Firm-Financial-Transaction-Analysis
📊 Financial Analysis & Forecasting: processed large-scale financial data using Python for trend analysis and insights. Developed interactive Tableau dashboards to improve forecasting accuracy and reduce costs by 25%.
data-analysis financial-data forecasting insights msexcel pandas python reporting tableau-dashboards
Last synced: 15 Jun 2026
https://github.com/techshot25/baltimore-911-calls
Analysis of 911 calls provided by the city of Baltimore.
data-analysis data-science decision-tree-classifier logistic-regression machine-learning machine-learning-algorithms statistics
Last synced: 16 Jun 2026
https://github.com/duoan/machine-learning-notebook
A notebook repository for tracking progress in learning machine learning.
data-analysis decision-tree ensemble-model gbdt machine-learning numpy pandas xgboost
Last synced: 18 Jun 2026
https://github.com/shrawans007/google_cyclistic_2023
Google Data Analytics Capstone Case Study (SQL and Tableau)
big-query bigquery coursera-assignment cyclistic cyclistic-bike-share-analysis-case-study cyclistic-bikshare data-analysis data-analysis-project data-analytics data-cleaning data-combination data-exploration data-science google-data-analytics sql tableau tableau-dashboard tableau-public
Last synced: 19 Jun 2026
https://github.com/mohnoor94/datasciencefundementalsusingpython
My journey to learn Data Science with Python
data data-analysis data-science data-visualization learning learning-by-doing python python3
Last synced: 19 Jun 2026
https://github.com/alicankaya192/ai-jobs-market-2025-2026-salaries
🤖 Global AI & LLM jobs market analysis (2025–2026). Salary trends, remote work premiums, top paying skills, and LLM engineering vs traditional AI comparisons. 📈
ai-jobs data-analysis data-science data-visualization eda exploratory-data-analysis generative-ai jobs jupyter-notebook llm-learning market-analysis matplotlib pandas salary-analysis statistics
Last synced: 21 Jun 2026
https://github.com/paweljakubas/j-data-analysis
Data analysis using J
data-analysis j machine-learning statistics
Last synced: 23 Jun 2026
https://github.com/rogernet/desafio-profissional-produto-data-driven
Helps train product analysts, PMs, and business managers capable of making strategic, data-driven decisions.
data-analysis data-science data-visualization product
Last synced: 23 Jun 2026
https://github.com/rudra-g-23/find-my-joint
A utility to find potential join keys (matching columns) across multiple DataFrames.
data-analysis data-visualization join network-graph pandas pandas-dataframe
Last synced: 24 Jun 2026
https://github.com/olob0/badwords-pt-br
💬 Wordlist of pt-BR profanity for data analysis, filters, or text considered "avoidable"
badword-filter badwords brasil data-analysis filter filter-lists filterlist portugues portuguese text-analysis wordlist
Last synced: 06 Jan 2026
https://github.com/ryannapp12/quant_trading_engine
A modular and scalable quantitative trading engine built in Python. This project demonstrates efficient data caching with SQLite, concurrent backtesting, and advanced risk analytics, showcasing best practices in clean code architecture and performance optimization.
algorithmic-trading backtesting dash data-analysis data-visualization fintech lstm machine-learning numpy pandas plotly python quantitative-finance real-time risk-management sqlite technical-analysis tensorflow time-series-analysis trading-strategies
Last synced: 11 Apr 2026
https://github.com/rapter1990/data-visualization-examples
Data Visualization Examples
data data-analysis data-visualization folium matplotlib plot plotly python seaborn visualization
Last synced: 13 Apr 2026
https://github.com/banyc/dfsql
SQL REPL/lib for Data Frames
cli csv data-analysis jsonl ndjson repl sql
Last synced: 31 Jul 2025
https://github.com/jeniljani-4444/end-to-end-world-cup-analysis-web-app
Our streamlined Streamlit web app fetches and processes ESPN CricInfo data, delivering dynamic graphs for a quick and engaging cricket experience. Deployed on AWS EC2 with CI/CD pipelines.
aws-ec2 data-analysis plotly preprocessing streamlit-webapp
Last synced: 02 Apr 2025
https://github.com/celineboutinon/bottleneck
ENSAE-ENSAI Formation Continue (Cepe)/OpenClassrooms Data Analyst 2022-2023 - Project 5
data-analysis data-analytics data-visualisation dataframes market-intelligence marketing-analytics matplotlib-pyplot missingno numpy pandas python seaborn
Last synced: 07 Sep 2025
https://github.com/rajnish93/jpandas
A lightweight JavaScript library for working with tabular data, inspired by Pandas in Python. Built with TypeScript, it provides an intuitive API for data manipulation and analysis.
data-analysis data-analytics data-manipulation data-science dataframe javascript pandas stream-processing table typescript
Last synced: 11 Jun 2025
https://github.com/ansh420/mcdonald_case-study
A McDonald's case study based on market segment analysis.
algorithms-implemented data-analysis python3 segmentation
Last synced: 12 Apr 2026
https://github.com/azmainadel/twitter-data-neo4j
Playing with a graph database on a large dataset of Twitter data.
data-analysis data-visualization neo4j-database snap
Last synced: 06 Apr 2025
https://github.com/nikita-data/eda_projects
Exploratory data analysis projects
cac data-analysis data-visualization eda folium-maps hypothesis-testing ltv math matplotlib numpy plotly python regular roi scipy seaborn segmentation statistics unit-economics
Last synced: 06 Jan 2026
https://github.com/victoriapm/analyze_a-b_test_results
Understand the results of an A/B test run by an e-commerce website.
ab-testing data-analysis ecommerce-website
Last synced: 06 Oct 2025
https://github.com/macdon112/layoff-analysis
SQL data cleaning & analysis of global layoffs
data-analysis data-cleaning data-exploration sql
Last synced: 21 Feb 2026
https://github.com/pzim-devdata/data-developer
All my DATA developer projects
correlation data-analysis data-mining data-science data-visualization database folium folium-maps mongodb mysql python spark sql
Last synced: 11 Apr 2026
https://github.com/gholamrezadar/most-profitable-actors
Finds the list of actors with the most box-office profit using the TMDB API.
Last synced: 16 Jan 2026
https://github.com/ilchen/eu_economic_data_analysis
Jupyter notebooks for analysis of Eurozone GDP, yields on government bonds, inflation expectations, unemployment and participation rates, money supply, personal consumption and savings, stock market. Using APIs from Eurostat, ECB, OECD and Yahoo-Finance.
data-analysis disposable-income finance gdp hicp inflation interest-rates jupyter-notebook money-supply participation-rate risk-free-interest-rate savings stock-market unemployment-rate
Last synced: 10 Oct 2025
https://github.com/airscholar/data_analysis_with_ai
A repository showing how to use AI and ChatGPT for Data Analysis with Pandas and Python
chatgpt data-analysis gpt4 openai pandas pandasai python
Last synced: 10 Apr 2026
https://github.com/targetta/ankaflow
YAML-based data pipeline framework that runs both locally and fully in-browser, designed for data engineers, ML teams, and SaaS developers who need flexible, SQL-powered pipelines.
bigquery clickhouse data-analysis dataops deltalake duckdb elt-pipeline etl etl-automation motherduck parquet python sql
Last synced: 09 Oct 2025
https://github.com/muneeb1030/webscrapper_mastodon
The Mastodon Social Platform Scraper is a Python-based web scraping tool designed to explore and extract valuable data from the Mastodon social platform.
data-analysis data-collection mastodon python3 scrapy scrapy-spider selenium-python webscraping
Last synced: 09 Oct 2025
https://github.com/vi/rendercsv
Tool to convert CSV table to a picture.
animation csv csv2pic csv2png data-analysis picture png table table-renderer visualization
Last synced: 01 Apr 2025
https://github.com/datasciencelovers/ai-financial-market-data-analysis
Analyse Financial Market Data of AI companies with Python
ai artificial-intelligence big-data-analytics chatgpt data-analysis data-analytics data-science data-visualization financial-analysis gemini google llama machine-learning market-data-analysis matplotlib-python meta openai pandas-python python
Last synced: 05 May 2026
https://github.com/lostvikx/fintech-pg
Files related to my FinTech course
analytics data-analysis data-science data-visualization finance fundamental-analysis numpy pandas python technical-analysis
Last synced: 14 Apr 2026
https://github.com/rayyan9477/multiple-disease-prediction-system
This repository contains a Multiple Disease Prediction System leveraging machine learning techniques for accurate predictions. It utilizes Python, Pandas, Scikit-learn, and Flask for data preprocessing, model building, and web deployment. Explore the project and connect on LinkedIn for collaborations.
data-analysis data-science machine-learning python streamlit
Last synced: 10 Apr 2026
https://github.com/smusab9152/pokemon_data_analysis
This repo explores and analyzes a dataset of Pokémon attributes. The analysis includes data cleaning, exploratory data analysis (EDA), and visualizations.
analytics data-analysis data-visualization exploratory-data-analysis jupyter-notebook matplotlib numpy pandas pokemon python seaborn statistical-analysis
Last synced: 02 May 2026
https://github.com/cosmoduende/r-twitter
Explore your Twitter activity with R: Sentiment Analysis and Data Visualization. How to analyze your Twitter account (or any account), discover your habits and sentiments with the "rtweet" package and NLP.
data-analysis data-visualization lemmatization nlp nlp-library nlp-resources nltk nltk-library r-package r-programming r-studio rtweet stemming twitter twitter-api twitter-data twitter-data-analysis twitter-data-extraction twitter-sentiment-analysis udpipe
Last synced: 10 Oct 2025
https://github.com/prangonghose/analysis_of_bangladesh_economic_complexity
In this project, our team briefly analyzes the export economy of Bangladesh over the past three decades.
data-analysis data-science data-visualization inequalipy matplotlib pandas plotly
Last synced: 22 May 2026
https://github.com/selcuk05/forbes_top_100_celebrities_data_analysis
Forbes Top 100 Celebrities since 2005 Data Analysis and Visualization
Last synced: 11 Oct 2025
https://github.com/rupav/fifa17-detailed-analysis
⚽ FIFA 17 data analysis using various Machine Learning Algorithms. ⚽
data-analysis data-visualization fifa17 machine-learning-algorithms radar-chart
Last synced: 16 May 2026
https://github.com/vatshayan/hospital-discharge-analysis
Analysis of hospitalization discharge rates in Lake County, Illinois, across attributes such as anxiety, alcohol, mood, diabetes, asthma, etc.
data-analysis data-visualization jupyter-notebook machine machine-learning machine-learning-algorithms scikit-learn
Last synced: 04 Mar 2025
https://github.com/lmuffato/analise-de-diarias-prefeituas-do-es
This code is part of a project to uncover and combat corruption schemes by processing and cross-referencing open data made available by various municipal governments in Espírito Santo through the transparency portal. It joins and analyzes several tables imported from CSV.
data-analysis personal-project r rstudio
Last synced: 12 Jun 2025
https://github.com/colburncodes/se_pudding_2023
This project is a React app designed to showcase research conducted by a team of data scientists and data analysts. The app uses React and React-Chartjs-2.
chartjs-2 data-analysis data-science data-visualization react-chartjs-2 reactjs
Last synced: 11 May 2026
https://github.com/lmuffato/dados-meteorologicos-inmet-tratamento
Processing and analysis of meteorological data from local stations provided by INMET, using the R language
data-analysis personal-project r rstudio
Last synced: 12 Jun 2025
https://github.com/shrawans007/data_science_students
Customer Engagement Analysis in Excel for 365datascience.com
2021-2022 365datascience case-study case-study-analysis case-study-project customer-engagement-analysis data-analysis data-analytics data-science data-science-students free-plan indian-students microsoft-excel ms-excel ms-excel-addin ms-excel-data-analytics ms-excel-task paid-plan us-students
Last synced: 11 Mar 2026
https://github.com/raad07/sql_project-world_layoffs_dataset
A SQL project comprising data cleaning in the first part and exploratory data analysis (EDA) in the second.
data-analysis database mysql sql
Last synced: 27 Jan 2026
https://github.com/agustinmusanti/delitosencaba-proyectofinal-dataanalytics-coderhouse
In this repository I present my final project for Coderhouse's "Data Analytics" course.
Last synced: 22 Jan 2026
https://github.com/rkolehov/retail-sales-analysis-project
End-to-end e-commerce analysis showcasing SQL and data visualization skills. Tracks sales, customer behavior, product performance, and delivery efficiency. Interactive dashboards provide actionable insights for business decision-making
analytics dashboard data-analysis ecommerce jupyter-notebook postgresql python sql tableau vscode
Last synced: 19 Apr 2026
https://github.com/ryanfranklin237/data-visualization-spreadsheets
Data visualization done with Microsoft Excel and Google Spreadsheets
data-analysis data-science data-visualization google-spreadsheets microsoft-excel
Last synced: 22 Feb 2026
https://github.com/mafda/seattle_airbnb_data_analysis
This repository contains a comprehensive analysis of the Seattle Airbnb dataset, conducted using the CRISP-DM (Cross Industry Standard Process for Data Mining) methodology.
crisp-dm data-analysis data-science jupyter-notebook pandas-python seattle-data
Last synced: 29 May 2026
https://github.com/nafisalawalidris/springforth-university-foodbank
Springforth University Food Bank: A collaborative initiative with UNESCO to address student food insecurity. Contains code and resources for the web application, data analysis, and insights into the prevalence and impact of food insecurity on academic performance.
academic-performance collaborative-initiative data-analysis data-visualization excel pivot-tables powerbi springforth-university-food-bank student-food-insecurity unesco
Last synced: 17 Feb 2026
https://github.com/vruddhi18/e-commerce_data_analysis_powerbi_dashboard
The E-Commerce Data Analysis project leverages Power BI to analyze sales and customer insights from Blinkit, Zepto, Myntra, and Flipkart, providing interactive dashboards to enhance e-commerce strategies.
Last synced: 27 Feb 2026
https://github.com/sunnybibyan/call_centre_power_bi_dashboard
Create a dashboard in Power BI to visualize relevant KPIs and metrics that will help the call center manager understand trends.
call-centre-analysis dashboard data-analysis data-visualization powerbi
Last synced: 19 Mar 2026
https://github.com/vipul2001/cousera-courses
This repo covers solutions to the assignments of various courses on algorithms, deep learning, and data analytics
coursera-courses data-analysis data-analytics-ibm deep-learning divide-and-conquer neural-network
Last synced: 29 May 2026
https://github.com/gattiharishkumar/blinkit-sales-analysis-dashboard
This project presents a comprehensive sales analysis dashboard for Blinkit, an Indian last-minute delivery app. The dashboard was created using Power BI and provides a detailed overview of the company's sales performance across various outlets and product categories.
dashboard data-analysis data-transformation data-visualization ms-excel-data-analytics power-query powerbi powerbi-visuals
Last synced: 19 Mar 2026
https://github.com/tqhungdev0605/crawl_200_jd_dataanalyst
Automate job data scraping for 200 Data Analyst postings on https://vn.indeed.com using Python
data-analysis jupyter-notebook python3 scraping selenium
Last synced: 11 Apr 2026
https://github.com/ankitml/underscore
collections data-analysis json python3 underscore
Last synced: 14 Apr 2026
https://github.com/randomshek/Working-With-Excel
Uses Excel Power Query and PowerPivot to reorganise the data into a star schema and showcase reports that data analysts can create using DAX formulae and PowerPivot
data-analysis excel power-pivot power-query
Last synced: 20 Jul 2025
https://github.com/allanotieno254/british-airways-review-dashboard-using-tableau
An interactive dashboard built using Tableau, visualizing British Airways customer review data. This project analyzes customer sentiment, satisfaction levels, and other key metrics to provide insights into passenger experiences and identify improvement areas for British Airways.
customer-satisfaction-analysis data-analysis data-visualisation tableau-dashboards tableau-public
Last synced: 19 Mar 2026
https://github.com/jabhij/fbi_nics-firearm-background-checks
This project attempts to showcase the use of guns across the US.
data-analysis data-analytics data-science data-visualization tableau
Last synced: 23 Feb 2026
https://github.com/carusel02/sequential-data-processing-and-analysis
Sequential data processing and analysis using a linked list in C
data-analysis data-processing linked-list
Last synced: 26 Oct 2025
https://github.com/zen204/airbnb-availability
A machine learning model that predicts Airbnb listing availability, utilizing feature engineering and supervised learning techniques to improve guest experience and optimize host management.
binary-classification data-analysis data-preprocessing data-visualization feature-engineering machine-learning matplotlib model-evaluation nlp pandas predictive-modeling python scikit-learn seaborn supervised-learning
Last synced: 21 Jan 2026
https://github.com/shadowk29/cusumtools
An eclectic collection of Python scripts I have found to be useful in processing nanopore data
data-analysis data-visualization time-series-analysis
Last synced: 16 Mar 2026
https://github.com/yonatanadam/film-success-prediction
Analyzing Hollywood movie success based on genre, target audience, and runtime using machine learning
data-analysis ipynb machine-learning
Last synced: 25 Jan 2026
https://github.com/apache/cloudberry-devops-release
DevOps and Release for Apache Cloudberry (Incubating)
ai big-data cloudberry data-analysis data-warehouse database devops distributed-database greenplum mpp olap postgres postgresql
Last synced: 04 Sep 2025
https://github.com/mgimond/meteo_waterville
Waterville (Maine) meteorological data
data-analysis data-science exploratory-data-analysis meteorology r
Last synced: 24 Jan 2026
https://github.com/shadan100/stroke-prediction-analysis
A web-based application that predicts whether a patient is likely to have a stroke based on input parameters such as gender, age, various diseases, and smoking status. Each row in the data provides relevant information about the patient.
artificial-intelligence data-analysis data-science django django-framework jupyter-notebook machine-learning matplotlib pandas predictive-modeling python stroke-prediction web-application
Last synced: 08 Mar 2026
https://github.com/narenkhatwani/arkouda-projects
This repository contains the source code of projects built with Arkouda, a software package that allows a user to interactively issue massively parallel computations on distributed data using functions and syntax that mimic NumPy, the underlying computational library used in most Python data science workflows.
arkouda data-analysis data-analytics data-science high-performance high-performance-computing highperformancecomputing numpy pandas parallel-computing parallel-processing parallelization python
Last synced: 17 Apr 2026
https://github.com/agungbudiwirawan/sales-analysis-using-excel-formulas
The objective of this project is to analyze supermarket sales data using formulas in Microsoft Excel.
data-analysis excel excel-formulas microsoft-excel spreadsheet
Last synced: 08 Jan 2026
https://github.com/jamiemagee/rhi
Collating the data on the Renewable Heat Incentive scheme, and presenting it in a more readable format.
data-analysis open-data open-government rhi
Last synced: 25 Feb 2026