An open API service indexing awesome lists of open source software.

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/natanel567/university_machine_learning_project

Machine Learning final project Tel Aviv University

data-analysis jupyter-notebook machine-learning

Last synced: 11 May 2025

https://github.com/prakhar-code/british_airways_review_analysis

Analysis of the British Airways Reviews by Customers, filtered by several different factors such as food, entertainment, services, etc.

data-analysis data-cleaning excel tableau-dashboards tableau-public tableau-visualization

Last synced: 15 Jan 2026

https://github.com/satyacoder29/comparison-of-region-based-sales-tableau

The region-based sales comparison analyzes sales performance across different regions. It identifies trends, top-performing regions, and areas needing improvement by comparing metrics like revenue, growth rate, and product demand. This analysis helps optimize sales strategies and resource allocation for better performance.

data-analysis data-cleaning data-collection data-visualization powerquerym relationships tableau tableau-desktop unions

Last synced: 02 Feb 2026

https://github.com/ziaeemehr/neuro_toolbox

Single Header File C++ library for analysis of neurophysiological and simulated data.

data-analysis data-science signal-processing synchronization

Last synced: 21 Jul 2025

https://github.com/rafinha0rafinha/web-analyzer-backend

(Legacy) This is the backend for Mazaoro SARLU's lead magnet "Web Analyzer". This project analyzes websites using Google Lighthouse and returns a detailed report consumed by the frontend.

azure-app-service azure-devops chartjs cicd data-analysis data-science data-visualization express flask hacktoberfest lighthouse numpy sentiment-analysis vader-sentiment-analyzer

Last synced: 10 Apr 2026

https://github.com/mfakhriazhar/stock-price-prediction

Stock prices are highly volatile and influenced by various factors, making accurate prediction a major challenge in investment decisions.

data-analysis data-science deep-learning python recurrent-neural-networks

Last synced: 18 May 2026

https://github.com/dina-hosny/sparkify---data-modeling-with-cassandra

Sparkify - Data Modeling with Cassandra - Udacity Data Engineering Expert Track.

cassandra cql data-analysis data-engineering data-modeling data-warehousing etl python

Last synced: 11 Apr 2026

https://github.com/spring-0/netflix-media-data-analysis

Exploring and analyzing Netflix data to uncover trends through data visualization and statistical analysis.

data-analysis netflix

Last synced: 27 Mar 2025

https://github.com/jasonsu131/cps188-term-project

A data analysis program developed in C to extract information about diabetic patients across Canada from a governmental spreadsheet available online. The program showcases summaries and averages based on the extracted data.

c data-analysis data-statictics file-reading

Last synced: 28 Mar 2025

https://github.com/sciencesar-labs/py485-final-project

ROOT-based muon data analysis using Python & Jupyter – final project for PY485E @ CERN

cern computational-physics data-analysis jupyter-notebook muons python root uproot

Last synced: 15 May 2026

https://github.com/velut/thesis-sw

Software and datasets used in the "Cost-effective and Scalable Activity Matching using Crowdsourcing" thesis

bpmn cost crowdflower crowdsourcing data-analysis dataset performance-analysis plotting-algorithms r thesis

Last synced: 19 Jun 2025

https://github.com/mae776569/weratedogs-wrangling

Wrangling WeRateDogs Twitter data to create interesting and trustworthy analyses and visualizations

data-analysis data-science data-visualization tweets twitter-api

Last synced: 25 Jan 2026

https://github.com/jonek/pv-city-mastr

Extract and analyze data about photovoltaic systems in Germany

data-analysis germany jupyter-notebook pandas photovolatic-power photovoltaic

Last synced: 11 May 2026

https://github.com/mfakhriazhar/ecom-qtt-prediction

In e-commerce, understanding seasonal sales trends and best-selling products is critical to business strategy. However, companies often struggle with predicting sales, determining factors that influence sales (discounts, product categories, locations), and optimizing stock and marketing.

data-analysis data-science data-visualization e-commerce-project eda machine-learning python

Last synced: 19 May 2026

https://github.com/c17an/data-analysis-exercise

데이터 분석 수련장

data-analysis python3

Last synced: 05 Apr 2025

https://github.com/kenwuqianghao/scotiabank-datathon-2023

Code and data analysis done for 2023 Scotiabank Datathon

data-analysis fraud-detection jupyter-notebook python

Last synced: 18 May 2026

https://github.com/lord3008/instances-of-data-analysis

This repository of mine shows my work on data analysis of various projects that I made. I feel data analysis is the very key to investigate a solution. Further more it enlightens the direction towards model building.

data data-analysis

Last synced: 03 Mar 2025

https://github.com/yash22222/web-scraping-for-data-analysis-predictive-model-on-customer-data

Utilized web scraping for customer feedback at Air India, conducting robust data analysis, and applying machine learning for predictive modeling. Drove data-driven decisions, enhancing services, and elevating customer satisfaction. Expertise in web scraping, analysis, and predictive modeling for actionable insights.

data-analysis data-preprocessing data-science data-visualization exploratory-data-analysis machine-learning powerbi random-forest-classifier sentiment-analysis tableau web-scraping

Last synced: 30 May 2026

https://github.com/sabdikay/telco-customer-churn-analysis-ibm-dataset

This project explores customer churn trends for a company in California using an IBM dataset. Built in a Jupyter Notebook, it employs pandas, NumPy, matplotlib, seaborn, plotly, and scipy to clean, analyze, and visualize data. Through statistical tests and interactive maps, it uncovers key drivers behind customer cancellations

business-intelligence customer-churn data-analysis data-analysis-python data-visualization exploratory-data-analysis jupyter-noteboook matplotlib numpy pandas plotly predictive-modeling python scipy seaborn statistical-analysis

Last synced: 07 Apr 2026

https://github.com/annaanastasy/classification-project-student-grades

A machine learning project to predict students' academic performance using features like demographics, study habits, and parental involvement, achieving 74% accuracy with the CatBoost model.

catboost-classifier classification data-analysis data-visualization machine-learning-algorithms predictive-modeling

Last synced: 29 Mar 2025

https://github.com/vavarm/data-analysis-french-electric-automobile-infrastructure

Data analysis realized in R Shiny and Python about the French electric vehicle and charging station infrastructure

data-analysis data-science data-visualization factominer geojson ggplot2 plotly python r rshiny

Last synced: 15 May 2026

https://github.com/manuelgil/vscode-data-pack

This extension pack includes the essential extensions for data analysts.

data-analysis data-science data-structures data-visualization vscode-extension

Last synced: 07 Apr 2026

https://github.com/prarthana-singh/heart-attack-prediction-model

A Machine Learning model that predicts the risk of a heart attack based on health parameters like cholesterol levels, blood pressure, BMI, smoking habits, and age. Built using Classification models, Scikit-Learn, Pandas, and Python.

classification data-analysis data-science heart-attack-prediction logistic-regression machine-learning numpy pandas python scikit-learn

Last synced: 25 Jun 2025

https://github.com/sparkerdata/hockeyshotmap

Interactive Streamlit app for NHL shot maps & player analysis. Pulls live (or demo) play-by-play data, normalizes rink coordinates, and visualizes shots with context filters (strength, period, player).

data-analysis data-visualization duckdb hockey hockey-analytics ice-hockey nhl nhl-data python sports sports-analytics

Last synced: 18 May 2026

https://github.com/robinmillford/analyzing-e-commerce-transactions---data-cleaning-cohort-analysis-and-sql

In this project, I aimed to analyze the profitability of products in an e-commerce dataset. I performed various SQL queries to extract valuable insights about product profitability, including the identification of the top 5 products with the highest profit margin, and unique combinations of brands and product lines with the highest profitability.

cohort-analysis data-analysis data-visualization excel jupyter-notebook powerbi python3 sql

Last synced: 18 May 2026

https://github.com/ivanayala96/end-to-end-business-intelligence-solution-logistics-financial-performance-dashboard

Project Overview: This project features a comprehensive Power BI solution developed for Ayala's Consultancy. It transforms raw operational data (generated via Python) into a strategic decision-making tool, managing a dataset of $7.71M in total sales and over 2,500 transactions.

anlytics bussines-report bussiness-intelligence data-analysis dax power-bi powerbi python

Last synced: 22 Apr 2026

https://github.com/berkekaragoz/media-investments-data-analysis

Advertisement Investments Distribution of Turkey by Medium

data-analysis r

Last synced: 19 Aug 2025

https://github.com/dacosmicgiant/marketing-sms-analyser

Mini project for R language SEM - V

data-analysis r shiny

Last synced: 21 Mar 2025

https://github.com/kammarah/studentdata

I created & deployed a Streamlit app to store, manage & analyze student data. 📊🎓

connection data data-analysis data-visualization deploy deployments libraries python streamlit streamlit-webapp webapp

Last synced: 18 May 2026

https://github.com/stefagnone/unsupervised-analysis-project

This project investigates the impact of video content on social media engagement using advanced analytics techniques like PCA, k-means clustering, and logistic regression. It provides actionable insights for optimizing social media strategies for Thai fashion and cosmetics retailers.

data-analysis data-visualization engagement-metrics facebook-live-sellers k-means-clustering logistic-regression marketing-insights pca-analysis python social-media-analytics

Last synced: 05 Apr 2025

https://github.com/stefagnone/data_storyboarding_visualization

Data Storyboarding and Visualization Techniques for Effective Communication

data-analysis data-visualization ggplot2-analysis r tableau-dashboards

Last synced: 05 Apr 2025

https://github.com/stefagnone/-wedding-vendor-pricing-and-customer-satisfaction-analysis

Data-driven analysis of wedding vendor pricing and customer satisfaction, with database design, SQL optimization, and cost breakdown generation.

business-intelligence cost-optimization customer-satisfaction data-analysis database-design python sql vision-board-analysis wedding-planning wedding-vendor-pricing

Last synced: 03 May 2026

https://github.com/stefagnone/moneyball_project

Data-driven analysis inspired by the Moneyball approach, identifying affordable replacements for key Oakland A's players using R and sabermetrics to support cost-effective recruitment.

baseball-statistics data-analysis data-driven-decision-making player-replacement-strategy r-programming sabermetrics sports-analytics

Last synced: 05 Apr 2025

https://github.com/rorrell/rightwhaledata

A Jupyter Notebook where I wrangle some data on right whale sightings and create a visualization

data-analysis data-visualization jupyter-notebook python3

Last synced: 11 May 2026

https://github.com/nour-zayed/shopping-trends-analytics-sql-python-power-bi

"End-to-end Shopping Trends analytics project using SQL, Python, Excel & Power BI — data cleaning, EDA, KPI generation, and interactive dashboards with DAX for actionable business insights."

business-intelligence data-analysis data-visualization dax powerbi python sql

Last synced: 18 May 2026

https://github.com/tarasbln/big-quant

Official public repository of the Berlin Investment Group (BIG) Quant Team, featuring quantitative finance research, algorithmic trading strategies, market analyses, educational materials, and open-source projects.

data-analysis education finance investment investment-club python3 quantative-finance quantative-trading quantitative-research research

Last synced: 21 Mar 2025

https://github.com/jayita11/exploring-most-streamed-songs-for-last-four-decades-eda

Perform EDA to uncover trends in streaming patterns, likes, and artists over the last four decades.

data-analysis eda hypothesis-testing matplotlib most-streamed-songs pandas python seaborn

Last synced: 07 Apr 2026

https://github.com/ahadly/sql-data-analytics-project

This repository contains a collection of SQL scripts demonstrating various analytical techniques, such as changes over time, cumulative, performance, data segmentation, part-to-whole analysis.

analytics business-analytics business-intelligence data data-analysis data-analyst data-analytics data-engineering data-science data-scientist database datascience query reporting sql sql-queries sql-query sql-server window-functions window-functions-in-sql

Last synced: 18 May 2026

https://github.com/satyacoder29/crm-analytics

CRM Analytics Dashboard – An interactive dashboard using Tableau, SQL, and Salesforce CRM Analytics (CRMA) to analyze sales performance, customer segmentation, and churn prediction. Features automated ETL pipelines, predictive analytics, and real-time insights for data-driven decision-making. 🚀📊

advanced-excel data-analysis data-cleaning data-collection data-transformation data-visualization matplotlib numpy pandas powerbi python seaborn sql tableau

Last synced: 03 Mar 2025

https://github.com/enayar478/nomad_machine_learning_dash_app

An interactive Machine Learning app built with Dash and Plotly, developed as part of the Data Analytics Bootcamp at Le Wagon Bordeaux. It allows users to visualize data, make real-time predictions, and explore various model insights.

analytics cachetools dash dashboard-application data-analysis data-science deployment gunicorn interactive-visualization machine-learning pandas plotly plotly-dash prediction-model python python3 render scikit-learn web-application

Last synced: 02 Jan 2026

https://github.com/oshinrathor/Data-Science-Systems-and-Analytics-Projects

Dive into my Data Science Projects Repository, featuring a Spam SMS Classifier, NIA Dashboard, H1N1 Vaccine Prediction, and NYC Taxi Fare Prediction. Each project showcases my skills in data cleaning, exploratory analysis, modeling, and visualization, offering valuable insights and methodologies for data enthusiasts and practitioners.

dashboard data-analysis data-driven-decisions data-presentation data-science data-visualization dataexploration eda insights nia webanalytics

Last synced: 12 Sep 2025

https://github.com/lucalullo/italian-justice-workload

Multidimensional analysis of the Italian justice system workload (2003–2024). A study of civil and criminal proceedings using judicial pressure and litigation indicators.

data-analysis italy judicial-workload justice-system kaggle legal-analytics pandas python time-series

Last synced: 24 May 2026

https://github.com/oshinrathor/data-science-systems-and-analytics-projects

Dive into my Data Science Projects Repository, featuring a Spam SMS Classifier, NIA Dashboard, H1N1 Vaccine Prediction, and NYC Taxi Fare Prediction. Each project showcases my skills in data cleaning, exploratory analysis, modeling, and visualization, offering valuable insights and methodologies for data enthusiasts and practitioners.

dashboard data-analysis data-driven-decisions data-presentation data-science data-visualization dataexploration eda insights nia webanalytics

Last synced: 02 Mar 2025

https://github.com/akash1070/predicting-zomato-restaurant-ratings

Perform extensive Exploratory Data Analysis(EDA) on the Zomato Dataset. Building an appropriate Machine Learning Model that will help various Zomato Restaurants to predict their respective Ratings based on certain features deploy the Machine learning model via Flask

data-analysis extratreesregressor flask linear-regression machine-learning random-forest zomato-bangalore zomato-data-analysis

Last synced: 18 May 2026

https://github.com/huynhtanphatt/diagnosing-uk-railway-performances

This project analyzes UK railway ticket and operation data to show how revenue, passenger demand, and on-time performance are connected.

data-analysis data-visualization datastorytelling python railway sql ticketing transportation

Last synced: 24 Apr 2026

https://github.com/sbera01/credit-card-approval-predictor

End-to-end Machine Learning project to predict credit card approval decisions using real-world financial features. Includes EDA, model training, and deployment-ready architecture

credit-card-approval-prediction data-analysis machine-learning python scikit-learn streamlit

Last synced: 24 Dec 2025

https://github.com/sebastianurdaneguibisalaya/enfermedades-fissal

Análisis holístico de atenciones por enfermedades raras, huérfanas y transplantes coberturados por FISSAL en el Perú.

data-analysis data-visualization python

Last synced: 24 Feb 2025

https://github.com/dinamohsin/toman-bikeshare-data-analysis-sql-power-bi

This project involves data analysis using SQL, Power BI, and CSV datasets to extract insights and visualize key business metrics.

csv-files data-analysis data-visualization database powerbi sql sql-server

Last synced: 22 Apr 2026

https://github.com/jerinpious/house-price-prediction

This project is a machine learning-based application to predict house prices. A frontend interface has been developed using Streamlit to make the prediction process user-friendly for regular customers. The project is structured

data-analysis data-engineering data-science eda machine-learning pandas python random-forest scikit-learn streamlit

Last synced: 05 Apr 2026

https://github.com/sreejabethu/smart-report-analyzer

An AI-powered app to analyze and summarize Excel, CSV, and PDF reports using Hugging Face language models. Built with Streamlit.

data-analysis huggingface llm nlp pdf-analysis python question-answering streamlit summarization

Last synced: 18 May 2026

https://github.com/aalkiyumi/project-4-big-data-analysis-with-pyspark-on-weather-data

In this project, I analyzed weather data from the NCEI Global Surface Summary of Day dataset using PySpark in Jupyter Notebook. Tasks included data cleaning, statistical analysis, and forecasting for temperature, wind speed, precipitation, and extreme weather events. The project also predicts future weather patterns for Cincinnati and Florida.

big-data-analytics cs5165 data-analysis data-cleaning data-engineering data-science introduction-to-cloud-computing jupyter-notebook machine-learning precipitation-analysis predictive-modeling pyspark statistical-analysis temperature-forecasting time-series-forecasting uc uc2026 university-of-cincinnati wind-speed-data

Last synced: 17 Mar 2025

https://github.com/cowboymrzamo2380/json-to-excel-converter

This repository provides a tool to convert JSON data to Excel format (.xlsx). It allows you to easily transform structured JSON data into a well-organized spreadsheet for better analysis and visualization.

automation-script automation-tools data-analysis data-converter data-export data-formatting data-tools data-visualization excel excel-automation excel-converter excel-tools json json-exporter json-parser json-processing json-to-csv json-to-excel programming-tools spreadsheet-tools

Last synced: 05 Apr 2025

https://github.com/cyberoctane29/deutsche-bank-customer-churn-prediction-end-to-end-analysis-and-modeling

In this project, I aim to predict customer churn for Deutsche Bank using supervised machine learning. It involves data exploration, feature engineering, and building Naive Bayes, Decision Tree, Random Forest, and XGBoost models. Models are tuned, evaluated, and compared to identify the best approach for churn prediction.

bank-customer-churn churn-analysis churn-prediction customer-churn-analytics data-analysis data-analytics data-visualization decision-tree eda gaussian-naive-bayes machine-learning random-forest supervised-learning xgboost

Last synced: 11 Oct 2025

https://github.com/clarajacintho/ig4-ds

The final project for the Multidimensional Data Analysis and Data Mining courses, where we analyze data from motorcyclists to determine what causes accidents

data-analysis data-science shiny-apps

Last synced: 11 May 2025

https://github.com/saadhaniftaj/logistic--lasso-regression-data-analysis

Iris dataset analysis with logistic and Lasso regression, using coordinate descent for feature selection and binary classification. Includes preprocessing and data visualizations

data-analysis lasso-regression-model logistic-regression python statistics

Last synced: 18 May 2026

https://github.com/thoratstuti/power-bi-dashboards-for-finance-analysis

Power BI can group and gather information from multiple systems to present the whole picture of business data analytics in one “single view”. It made the staff of the financial institution work in a collective digital platform, where they can compute and share relevant data.

data-analysis data-visualizations excel graph pie-chart powerbi

Last synced: 07 Mar 2026

https://github.com/steviecurran/multi-dish

Scripts to reduce data from large radio telescopes (GMRT, VLA)

data-analysis interferometer pipeline radio-astronomy telescopes

Last synced: 09 May 2026

https://github.com/ljadhav25/django-data-analyzer

Django Data Analyzer is a web application built using the Django framework, designed to streamline data analysis tasks. Users can upload CSV files containing data for analysis. The application utilizes the powerful data manipulation capabilities of Python libraries like pandas and numpy to perform various analyses on the uploaded data.

data-analysis data-visualization django-application matplotlib numpy pandas python seaborn

Last synced: 01 Mar 2026

https://github.com/lucycatherine/healthinsuranceproject

This repository contains a machine learning project that analyzes the factors influencing health insurance charges, such as age, smoking status, and medical conditions.

data-analysis data-science data-visualization jupyter-notebook machine-learning python

Last synced: 18 May 2026

https://github.com/gui-sitton/y.music

In this project I compared the musical preferences of the citizens of Springfild and Shelbyville. I examined real Y.Music data to test hypotheses and compare the behavior of users in these two cities.

data data-analysis data-analysis-python data-science data-visualization python

Last synced: 18 May 2026

https://github.com/niniola-creator/niniola-creator

This is a repository that I have created to show my skills, share my projects and track my progress in my data science/web development journey.

bootstrap5 css3 data-analysis data-science data-visualization database html5 javascipt javascript matplotlib pandas powerbi python spreadsheets sql

Last synced: 07 Apr 2026

https://github.com/cosmoduende/r-ggcats

StrangeR things: Adding… Cats? to your plots on R. How to analyze and visualize data with the help of funny cats with the ”ggcat” package.

data-analysis data-analytics data-science data-visualisation data-visualization data-viz dataviz ggcats r-language r-library r-package r-programming r-scripts r-studio rstats rstudio

Last synced: 22 Jul 2025

https://github.com/a19xys/dm-csgo_analysis

Analysis to address the most important aspects of the knowledge discovery process from data.

data-analysis data-mining data-science dataset jupyter-notebook python

Last synced: 18 May 2026

https://github.com/datalopes1/bankabc_churn

Neste projeto será realizado o processo de EDA (Exploratory Data Analysis) com foco na análise de Churn a partir do datas ser Bank Customer Churn Dataset, que pode ser encontrado no Kaggle e disponibilizado por Gaurav Topre.

churn-analysis data-analysis data-science eda python

Last synced: 18 May 2026

https://github.com/kfrural/dashboard_agro

Dashboard Agro is a technological platform that integrates several components to support Brazilian agribusiness through data analysis, visualization and forecasts. This innovative solution was developed to serve three main groups: farmers, researchers and public managers.

big-data data-analysis predictive-analytics python

Last synced: 15 May 2026

https://github.com/1adityakadam/carnegie-classifications-ancestry-grid

A concise, interactive tool for exploring the historical lineage of U.S. higher education institutions using Carnegie Classification data from 1973–2021.

dash data-analysis html javascript pandas python

Last synced: 25 Jun 2025

https://github.com/dylanbk/exploring-data

A collection of programs that explore data engineering and analysis.

data-analysis data-engineering matplotlib pandas python

Last synced: 02 Mar 2025

https://github.com/andersoncrs/analisis-de-texto-tweets

En este proyecto exploro el análisis de texto de tweets para descubrir tendencias, opiniones y temas relevantes en redes sociales. Usando herramientas de procesamiento de lenguaje natural, convierto grandes volúmenes de mensajes en información clara y visualmente atractiva.

data-analysis data-visualization eda text-mining

Last synced: 21 Jul 2025

https://github.com/artemzarubin/xml-document-processor

XML processing tool using the Strategy design pattern.

csharp data-analysis data-transformation design-patterns strategy xml

Last synced: 21 Jul 2025

https://github.com/BingyanStudio/github-analyzer

锐评一下你都在 GitHub 写了什么

data-analysis github llm reports selfhosted typescript

Last synced: 12 May 2025

https://github.com/1adityakadam/Carnegie_classifications_website

A comprehensive data analytics platform analyzing 50+ years of U.S. higher education trends through interactive visualizations and historical institution tracking.

css data-analysis html javascript python ui-design web-development

Last synced: 25 Jun 2025

https://github.com/nadamarei/data-analyzer

The Qualitative Data Analysis Tool is a powerful Streamlit application designed for researchers to analyze word frequencies in corporate documents. This tool processes PDF reports, identifies target words and their contextually relevant synonyms, and generates comprehensive reports with document statistics, summary analysis, and per-file breakdowns

data-analysis data-visualization python-3 streamlit

Last synced: 18 May 2026

https://github.com/rohansoni45/movie-recommendation-system

This project is a Content-Based Recommender System that suggests movies to users based on their preferences and watched history. The system leverages cosine similarity to find and recommend movies similar to a selected title. It is built using Python and libraries like Pandas, NumPy, and Scikit-learn.

content-based-filtering cosine-similarity data-analysis data-science machine-learning numpy pandas python recommender-system render scikit-learn

Last synced: 17 Apr 2026

https://github.com/ddihora1604/iitk_task

A comprehensive financial data analysis system that collects, processes, and analyzes data from approximately 500 tickers in the S&P Global Index. It provides detailed financial information, ESG metrics, and various financial statements for comprehensive market analysis.

beautifulsoup4 data-analysis data-visualization datamodelling dataset esg machine-learning python yahoo-finance

Last synced: 29 Oct 2025

https://github.com/pronzzz/diabetes-prediction

Diabetes prediction using a KNN model and Pima Indian Diabetes Dataset

data-analysis data-manipulation data-preprocessing data-visualization knn machine-learning outlier-detection seaborn

Last synced: 13 Apr 2025

https://github.com/jelhamm/model-ensembles-boosting-in-machine-learning

"This repository contains implementations of Boosting method, popular techniques in Model Ensembles, aimed at improving predictive performance by combining multiple models. by using titanic database."

boosting boosting-algorithms boosting-ensemble boosting-machine data-analysis database-analysis datamining datamining-algorithms jupyter-notebook machine-learning machine-learning-models machine-learning-projects matplotlib-python model-ensemble module numpy-library pandas-library python sklearn-library

Last synced: 16 May 2026

https://github.com/ireneflorez/exploration_r

Data exploration on the 'White Wine Quality' dataset using R

data-analysis data-visualization r

Last synced: 16 Jun 2026

https://github.com/jelhamm/singular-value-decomposition-data-mining

"This repository hosts an implementation of the Singular Value Decomposition (SVD) algorithm tailored for data mining tasks. SVD is utilized for efficient dimensionality reduction, aiding in the extraction of key patterns and features from large and complex datasets."

data-analysis dimension-reduction jyputer-notebook machine-learning matplotlib numpy-library pandas-library preprocessing python scipy-library singular-value-decomposition sklearn-library standardscaler svd svd-matrix-factorisation

Last synced: 18 May 2026

https://github.com/carmoreno/nobelprizes

Final project of Big Data Module.

data-analysis mongodb

Last synced: 29 Apr 2026

https://github.com/dineshdhamodharan24/singapore_flat_resale_

This project focuses on developing a machine learning model to predict the resale values of apartments in Singapore. The goal is to create a user-friendly online application that enables users to obtain accurate predictions for the resale values of specific properties.

data-analysis flat json numpy pandas pickle project python streamlit

Last synced: 07 Apr 2026