An open API service indexing awesome lists of open source software.

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/sharmas1ddharth/data-analysis-with-python

Freecodecamp's Data Analysis with Python Projects Code

data-analysis data-analysis-with-python freecodecamp-project

Last synced: 03 Jun 2026

https://github.com/nathaliacosim/migration-patrim

Automação para extração, conversão e migração de dados patrimoniais para o sistema patrimônio cloud da betha sistemas. O projeto garante um fluxo estruturado e seguro de transferência de informações, utilizando C# (.NET Framework), PostgreSQL e integração via API.

conversion-tool data-analysis data-conversion data-transformation dotnet dotnet-code dotnet-console-app migration-tool

Last synced: 17 Apr 2026

https://github.com/rishisolanke/pdf_query_langchain

PDF Query LangChain is a tool that extracts and queries information from PDF documents using advanced language processing. Leveraging LangChain, OpenAI, and Cassandra, this app enables efficient, interactive querying of PDF content. Ideal for data analysis, research, and automated reporting, it simplifies detailed document analysis with ease.

artificial-intelligence data-analysis document-query langchain natural-language-processing nlp openai pdf-analysis pdf-extraction python research-tool

Last synced: 17 Apr 2026

https://github.com/victoorv/criminalite_us

Une analyse de la criminalité en fonction de variables socio-économiques a été menée, incluant la sélection et la comparaison de modèles de régression multiple ainsi que des tests d'hypothèses sur les coefficients et la significativité des modèles.

data-analysis data-science r regression regression-analysis regression-models statistical-analysis statistical-tests statistics

Last synced: 04 Apr 2026

https://github.com/royungar/sql_chicago_data_analysis_project

SQL-based data analysis project using SQLite, pandas, and Jupyter SQL magic commands. Analyzes crime, school, and census data from Chicago to explore socioeconomic patterns using filtering, joins, aggregation, and subqueries.

aggregation census-data chicago crime-data data-analysis data-engineering education-data ibm jupyter-notebook pandas sql sqlite subqueries

Last synced: 04 Jun 2026

https://github.com/imrandil/sql_practice_with_analysis

SQL practice using postgres db and docker as a tool to setup postgres, loving the sql way

data-analysis docker markdown postgres sql

Last synced: 10 May 2026

https://github.com/virajbhutada/diamond-price-estimator

This project develops a predictive model to estimate diamond prices based on characteristics like carat, cut, color, and clarity. It covers data preprocessing, feature engineering, model selection, training, and evaluation. The final product is a web app where users can input diamond attributes to get accurate and instant price predictions.

cross-validation css data-analysis data-science-projects data-visualization eda feature-engineering html hyperparameter-tuning jupyter-notebooks machine-learning ml-algorithms model-deployment model-selection performance-optimization predictive-modeling python python-app user-interface

Last synced: 14 Apr 2026

https://github.com/vitornegromonte/eda_stroke

Exploratory data analysis in the stroke prediction dataset

data-analysis data-science exploratory-data-analysis kaggle-dataset visualization

Last synced: 17 Apr 2026

https://github.com/virajbhutada/hr-analytics-excel-sql-tableau-powerbi

Explore a comprehensive HR Analytics portfolio showcasing data analysis and visualization skills. Featuring dashboards in Power BI, Excel, and Tableau, along with SQL queries for deeper insights. A holistic view of expertise in HR analytics, data visualization, and database management. Let's dive into the game of data insights!

data-analysis data-management data-visualization excel hr-analytics interactive-dashboards portfolio-project postgresql powerbi powerbi-visuals sql sql-queries tableau tableau-public

Last synced: 02 Aug 2025

https://github.com/ahmad-ali-rafique/decision-tree-regressor-modeling

Comprehensive exploration of decision tree regressors, including data cleaning, model building, and performance evaluation on various datasets.

artificial-intelligence data data-analysis dataanalytics decision-trees decisiontreeregressor modeling models regression-models

Last synced: 17 Apr 2026

https://github.com/sevilaymuni/project-no.3-seaborn-plots

Pandas and Seaborn Mediated Comprehensive Analysis on Differentiated Thyroid Cancer

data-analysis data-structures data-visualization mathplotlib pandas python seaborn

Last synced: 18 Apr 2026

https://github.com/marknature/machine-learning-intern

Machine Learning tasks involving the Titanic Dataset and Breast Cancer Wisconsin (Diagnostic) dataset

data-analysis github jupiter-notebook machine-learning matplotlib numpy pandas python scikit-learn sklearn

Last synced: 10 Apr 2026

https://github.com/sanam2405/ahs

This contains the analysis of result of AHS Madhyamik Examination 2022

data-analysis data-visualization jupyter-notebook python

Last synced: 18 Apr 2026

https://github.com/evanwporter/sloth

Faster Pandas Dataframe

cython data-analysis dataframe pandas

Last synced: 14 Mar 2025

https://github.com/prangonghose/wikipedia-blocking-policies

This study investigates the relationship between editors’ disruptive behavior and regulation policies on English Wikipedia, focusing on the Blocking Policy page. The study collects and analyzes data from 2004 to 2022 using the Wikipedia API, page statistics, and keyword extraction.

data-analysis data-visualization matplotlib open-source pandas python3 seaborn

Last synced: 18 Apr 2026

https://github.com/zeraphim/streamlit-iris-classification-dashboard

A Streamlit web application that performs Exploratory Data Analysis (EDA), Data Preprocessing, and Supervised Machine Learning to classify Iris species from the Iris dataset (Setosa, Versicolor, and Virginica) using Decision Tree Classifier and Random Forest Regressor.

classification dashboard data-analysis data-science decision-tree-classifier eda machine-learning python3 random-forest-regressor streamlit supervised-learning

Last synced: 18 Apr 2026

https://github.com/vansh-py04/data-analysis-questions-pandas-numpy-sql

Solution to 450+ Data Science Tech Stack questions essential for Data Analysts and Scientists!

data-analysis data-science deepnote machine-learning numpy pandas python sql

Last synced: 18 Apr 2026

https://github.com/vvhacker007/technocolabs

This repo contains the projects that were assigned to me during the internship.

data-analysis data-science flask heroku-deployment internship machine-learning project streamlit website

Last synced: 18 Apr 2026

https://github.com/akhundmuzzammil/energyconsumptionprediction

This repository contains code and resources for training a linear regression model to predict energy consumption based on various building parameters.

data-analysis energy-consumption linear-regression machine-learning python scikit-learn streamlit visualization

Last synced: 18 Apr 2026

https://github.com/danpoynor/data-analysis-of-video-game-sales-2000-2015

This analysis reviews sales for the top 100 video games from the years 2000-2015 to gather insights. Within the notebook I use Python’s Pandas, Matplotlib, and Seaborn libraries to interact with the data and create graphs.

data-analysis jupyter-notebook matplotlib pandas-dataframe python3 seaborn-plots video-game-sales

Last synced: 18 Apr 2026

https://github.com/amoneva/cacc

An R Package to compute Conjunctive Analysis of Case Configurations (CACC), Situational Clustering Tests, and Main Effects

criminology data-analysis r social-science

Last synced: 15 May 2025

https://github.com/greenpau/esqrunner

Run Elasticsearh queries and create metrics based on the result of the queries in Elasticsearch database.

data-analysis elasticsearch query-builder querydsl

Last synced: 10 May 2026

https://github.com/manalisbhavsar/mall-customers-clustering

K-Means clustering to mall customer data, segmenting customers based on their annual income and spending score. To identify patterns and group customers for targeted marketing.

data-analysis data-visualization matplotlib numpy pandas python scikit-learn

Last synced: 18 Apr 2026

https://github.com/zimmi48/nixpkgs-issues

Analysis on nixpkgs issue lifetime.

data-analysis github-api nixpkgs

Last synced: 10 May 2026

https://github.com/mtimma001/clinical-trial-data-tool

Clinical Trial Data Analysis Tool is a Flask-based web app for healthcare professionals to manage and analyze clinical trial data. It features full CRUD functionality, interactive visualizations (Plotly/Matplotlib), a responsive Bootstrap UI, MySQL database integration, and Heroku deployment for accessible, scalable use.

bootstrap5 clinical-trials crud data-analysis data-visualization flask healthcare heroku mysql pandas plotly python

Last synced: 05 Apr 2026

https://github.com/al-ghaly/prosper-loans-analysis

A statistical Analysis Project, to analyze the data of a finance company’s loans Using Python packages (pandas – NumPy – seaborn – matplotlib)

data-analysis matplotlib numpy pandas python python-data-analysis seaborn statistical-analysis statistics

Last synced: 18 Apr 2026

https://github.com/jordanconallluthaiswright/purchase-behaviour-data-analysis

This project analyzes Black Friday purchase behavior for Company XYZ, uncovering trends by gender, age, and location. Using data cleaning, statistical analysis, and visualization, it evaluates spending patterns, confidence intervals, and category preferences to provide actionable insights for optimizing marketing strategies and targeting.

business-analytics data-analysis jupyter-notebook python

Last synced: 18 Apr 2026

https://github.com/mi7773/advanced_sql_data_analytics_project

A hands-on SQL project simulating data analysis using fact and dimension tables, covering trends over time, cumulative metrics, performance breakdowns, segmentation, and reporting via SQL.

analytics business-analytics business-intelligence data data-analysis data-analyst data-analytics database query reporting sql sql-queries sql-query sql-server window-functions window-functions-in-sql

Last synced: 18 Apr 2026

https://github.com/bolshovaelizaveta/covid19_spark_analysis

Учебный проект по дисциплине 'Базы данных для компьютерного зрения'. Разработка аналитической платформы для эпидемиологического мониторинга COVID-19 с использованием Apache Hadoop и Spark

apache-hadoop apache-spark covid-19 data-analysis jupyter-notebook machine-learning medical-imaging pyspark sql

Last synced: 18 Apr 2026

https://github.com/sarincr/training-on-artificial-intelligence

Entree Academy 10 Days free training on Artificial Intelligence. Course will be conducted in a Blended learning way with Daily one hour online training and 3 hour project based training

artificial-intelligence artificial-intelligence-algorithms data-analysis data-science data-visualization decision-trees deep-learning deeplearning logistic-regression machine-learning machine-learning-algorithms machinelearning num numpy pandas regression scikit-learn scipy sklearn

Last synced: 10 Apr 2026

https://github.com/stimulsoft/samples-dashboards.web-for-blazor-webassembly

Blazor WebAssembly (Wasm) samples for Reports.BLAZOR embedded components, Visual Studio C# projects, .NET 6, .NET 7, .NET 8 dashboards tool

blazor client-side converter dashboard data data-analysis data-sources database datagrid designer diagram dimension json net presentation print runtime viewer wasm webassembly

Last synced: 18 Apr 2026

https://github.com/robinmillford/sales-metrics-dashboard-streamlit

This Streamlit dashboard provides an interactive and comprehensive analysis of customer behavior, regional sales trends, and revenue insights. The dashboard enables businesses to identify key performance metrics, customer segments, and revenue drivers, supporting data-driven decision-making.

dashboard data-analysis data-visualization duckdb sales-analysis sales-dashboard streamlit-dashboard

Last synced: 19 Apr 2026

https://github.com/yuvrajsaraogi/unemployment-analysis-with-python

Unemployment is measured by the unemployment rate which is the number of people who are unemployed as a percentage of the total labour force. We have seen a sharp increase in the unemployment rate during Covid-19, so analyzing the unemployment rate can be a good data science project.

big-data big-data-analytics data-analysis data-science data-visualization engineering excel jupyter-notebook machine-learning mini-project natural-language-processing nlp project python3 sql

Last synced: 19 Apr 2026

https://github.com/robertochiosa/automatic-powerpoint-report-rmd

Automatically generate good looking powerpoint presentations from a csv dataset

data-analysis data-science medium medium-article python r

Last synced: 19 Apr 2026

https://github.com/pranjalya/hand-washing-data-visualisation

A small project of Data Visualization, where we analyze the effect of hand washing after introduced by Dr. Semmelweis to the nurses and midwives after giving birth.

data-analysis data-visualization jupyter-notebook pandas python3

Last synced: 06 May 2026

https://github.com/decepticon-ts/cap-ai-studio

Description: A modern, powerful web application for advanced image analysis and batch processing, featuring real-time AI-powered image captioning, comprehensive reporting, and an intuitive user interface. Built with Streamlit and Google's Gemini API.

artificial-intelligence batch-processing computer-vision data-analysis gemini-api image-processing image-processing-python python streamlit streamlit-webapp threading

Last synced: 19 Apr 2026

https://github.com/diegoglezsu/bulletin-fetcher

bulletin-py is python package to easily fetch bulletins and legal acts from a wide variety of sources of Eurpean Union.

data-analysis european-union legal-documents python sparql

Last synced: 19 Apr 2026

https://github.com/mlucifer27/bilateral-visualization

Streamlit app visualizes bilateral relationship scores between 100 countries from 1945 to 2024. It supports interactive heatmaps, network graphs, pairwise comparisons, and more.

d3blocks data-analysis data-visualization plotly-python python streamlit

Last synced: 04 Jun 2026

https://github.com/samwhaaa/superfoodsmax

A customer demographic & spending trend analysis on the fictional SuperFoodsMax grocery chain

data-analysis data-analytics data-visualization jupyter jupyter-notebook python

Last synced: 20 Apr 2026

https://github.com/montanaz0r/suicide-rate-analysis

Testing a significance of the correlation between a suicide rate and a number of psychiatrists and psychologists working in the mental health sector

analysis correlation data data-analysis data-science jupyter-notebook jupyter-notebooks matplotlib numpy pandas psychology python python-3 seaborn statistics suicide-rate

Last synced: 20 Apr 2026

https://github.com/nikolaos-mavromatis/etf-data-analysis-dashboard

Insights into SPY ETF performance with an interactive Streamlit dashboard powered by Alpha Vantage data.

api data-analysis data-visualization financial-analysis pandas plotly python streamlit

Last synced: 20 Apr 2026

https://github.com/dthung1602/goodread-bestbook-prediction

Data analysis - trying to predict the result of Goodreads Choice Adward

data-analysis goodreads pca python r xgboost

Last synced: 20 Apr 2026

https://github.com/misaghmomenib/soccer-match-analysis

This Project Predicts Football Match Outcomes (Home Win, Away Win, or Draw) Using Historical Match Data. It Involves Data Preprocessing, Exploratory Analysis, and Training a Random Forest Model to Predict Results Based on Features Like Shots, Possession, and Passes.

data-analysis git open-source python

Last synced: 20 Apr 2026

https://github.com/jayqi/data-analysis-tools

Presentation on Data Analysis Tools

data-analysis presentation-slides

Last synced: 06 Jan 2026

https://github.com/jbalooshie/school_district_analysis

Analysis of standardized testing results using NumPy and Pandas, executed in Jupyter Notebook. Summaries of the testing results are provided based on school, test type, and grade level.

data-analysis data-science dataframes jupyter-notebook numpy pandas python

Last synced: 20 Apr 2026

https://github.com/shellynagar27/merchandise-sales-analysis

Merchandise Sales Analysis explores the sales trends of influencer Lee Chatmen’s merchandise using Power BI, and Power Query. The project uncovers key insights on revenue, product performance, location impact, shipping trends, and customer reviews.

critical-thinking data-analysis data-visualization figma powerbi powerquery problem-solving

Last synced: 07 Apr 2025

https://github.com/akashprak/socialnetworkads

Predicting customer purchase behavior from the Social Network Ads dataset.

data-analysis machine-learning mlflow pandas python scikit-learn seaborn xgboost

Last synced: 30 Mar 2025

https://github.com/wtbates99/pandas-monday

Python library that provides seamless integration between pandas DataFrames and Monday.com boards. Easily read Monday.com board data into pandas DataFrames with support for subitems, pagination, and column filtering. Built with the Monday.com GraphQL API.

api-wrapper data-analysis data-integration dataframe graphql monday pandas productivity-tools python

Last synced: 20 Apr 2026

https://github.com/abinashsahoo007/project-bankruptcy-prevention

The project is to create a classification model that predicts the chances of a business facing bankruptcy based on the key feature like Industrial Risk, Management Risk, Financial Flexibility, Credibility, Competitiveness, Operating Risk.

data-analysis data-mining data-visualization deployments eda machine-learning pickle python statistics streamlit

Last synced: 20 Apr 2026

https://github.com/ak-pydev/python_practice

Documenting my learning journey from python -> ML -> DL -> LLM/GenAI -> Agents exercises solved daily from Udemy/Kaggle/YouTube.

data-analysis data-science feature-engineering llms machine-learning mlflow mlops-workflow modeling python3 streamlit uvicorn

Last synced: 20 Apr 2026

https://github.com/robinmillford/hr-analytics-employee-performance-analysis

HR Analytics: Unveiling Employee Performance - A comprehensive exploration of employee data using SQL and Power BI, uncovering key insights for strategic HR decision-making.

data-analysis data-visualization jupyter-notebook powerbi python3 sql

Last synced: 20 Apr 2026

https://github.com/profasem/logistics-performance-analysis

Power BI dashboard analyzing logistics performance, delivery delays, carrier efficiency, and regional risk.

business-intelligence dashboard data-analysis logistics powerbi python supply-chain

Last synced: 21 Apr 2026

https://github.com/docuvesta/la-mer-skincare-chicago-duty-free-analysis

Comparing La Mer product selection, availability and pricing from 3 different purchase locations ✈️

analytics cremedelamer data-analysis data-analytics data-science data-visualization lamer luxury plotly python seaborn skincare

Last synced: 21 Apr 2026

https://github.com/danpoynor/pet-shelter-data-analysis-notebook

Demonstration of skills analyzing data from a pet shelter. The CSV data contains tables detailing the incoming and outgoing animals and I use my knowledge of Pandas to gather and present the requested information.

csv data-analysis data-cleaning data-science jupyter-notebook matplotlib numpy pandas pet-shelter tabular-data

Last synced: 21 Apr 2026

https://github.com/nxion/sql-data-warehouse-project

Building a modern data warehouse with MS SQL server, ETL processes, data modeling and analyitics.

data data-analysis data-analytics data-engineering data-lakehouse data-warehouse datalake datascience etl etl-job medallion-architecture ms mssql sql sql-query sql-server

Last synced: 05 Jun 2026

https://github.com/martinkalema/power-distribution-modelling

Power Distribution Modelling for cea and cel algorithms

data-analysis python synthetic-dataset

Last synced: 21 Apr 2026

https://github.com/rdrahul123/ecommerce-sales-dashboard

This project focuses on analyzing e-commerce sales data to uncover actionable insights and improve business decision-making. Using interactive dashboards and data analysis techniques, the project evaluates key performance metrics, customer behavior, sales trends, and payment modes across different categories and regions.

data-analysis data-science excel powerbi

Last synced: 22 Mar 2025

https://github.com/rahulpatel0615/sales-analysis-project

Sales Data Analysis Dashboard with Python, Pandas, and Matplotlib. Features 12+ visualizations and comprehensive insights.

data data-analysis data-visualization matplotlib pandas portfolio python

Last synced: 21 Apr 2026

https://github.com/meerantajalli/networksecuritydefense

This Network Security defense systems acts as an indicator against SMP Floods, UDP Floods, ICMP Floods. This model is trained using packets from wireshark and can easily differentiate between normal network traffic and traffic that has been targetted on the machine by an attacker using the rate of packets transfer and using the source IP.

anomaly-detection classification cyber-security data-analysis ddos-detection icmp-flood intrusion-detection machine-learning network-security packet-analysis python random-forest security smp-flood udp-flood wireshark

Last synced: 21 Apr 2026

https://github.com/nikhilfuke1/a-b-testing-and-regression-analysis-python

Python Statistical Project involves data analysis, visualization, A/B testing, and regression analysis to determine the best-performing platform.

ab-testing data-analysis hypothesis-testing libraries python regression-analysis statistics visualization

Last synced: 21 Apr 2026

https://github.com/iness000/online-retail-customer-segmentation

This project performs comprehensive customer segmentation analysis on an online retail dataset using machine learning clustering techniques and RFM (Recency, Frequency, Monetary) analysis. The goal is to identify distinct customer segments to drive better customer relationship management strategies and business insights.

customer-segmentation data-analysis k-means

Last synced: 31 Aug 2025

https://github.com/tmmvn/analytics-notebooks

A bunch of data analytics notebooks done testing out JetBrains DataLore

ai algorithms data-analysis datalore elements-of-ai helsinki-university-mooc python

Last synced: 22 Apr 2026

https://github.com/prgermux/yield-reporter

This Python application provides a graphical user interface (GUI) for analyzing and visualizing production data from various machines. It uses the PyQt5 framework for the GUI and Matplotlib for plotting data.

automation data-analysis python reporting

Last synced: 22 Apr 2026

https://github.com/scailfin/benchmark-templates

Workflow Templates are parameterized workflow specifications for the Reproducible Open Benchmarks for Data Analysis Platform (ROB)

benchmarks data-analysis reproducibility

Last synced: 16 Jan 2026

https://github.com/rorrell/lifeexpectancy

A Jupyter Notebook where I create a chart with two line plots on it to check out the life expectancy of men vs. women from 1900-2018

data-analysis data-visualization jupyter-notebook python3

Last synced: 22 Apr 2026

https://github.com/leabrodyheine/california-schools-data-visualization

This front-end project provides interactive visualizations of learning models adopted by California schools during the pandemic. Using D3.js and Mapbox, it dynamically presents data through bar charts, bubble charts, heatmaps, and geographic maps, allowing users to explore trends across school types, sizes, and districts.

d3-visualization d3js data-analysis data-visualization mapbox openai plotly

Last synced: 22 Apr 2026

https://github.com/devexpress-examples/web-forms-pivot-grid-export-additional-captions-header-or-footer

This example illustrates how to add a custom header to the document exported to PDF in Pivot Grid for Web Forms.

asp-net-web-forms data-analysis dotnet pivot-grid pivot-grid-for-web-forms

Last synced: 22 Apr 2026

https://github.com/ayushi-gajendra/buenos-aires-subway-statistics

A comprehensive data analysis of the Buenos Aires subway system ridership using Python and Pandas. This project identifies peak-hour congestion patterns, explores hourly passenger distributions, and utilizes the 95th percentile to isolate extreme traffic conditions for urban mobility insights.

95th-percentile buenos-aires data-analysis data-science-portfolio data-visualization matplotlib pandas python statistical-analysis subway-ridership transit-data urban-mobility

Last synced: 05 Jun 2026

https://github.com/floffah/my-listening

Various ways to analyse your Spotify extended streaming history data

convex data-analysis listening-history spotify

Last synced: 23 Apr 2026

https://github.com/al-ogr/sf_pr1_job_analysis_hh

SkillFactory DataScience PROJECT-1. Анализ резюме из HeadHunter

data-analysis data-science ipynb plotly python

Last synced: 23 Apr 2026

https://github.com/thc1006/nycu_timtable_crawler

🎓 NYCU Course Data Crawler & Timetable System | 國立陽明交通大學課程爬蟲與選課系統 - Python web scraper for course schedules, syllabi & educational data analysis. Crawls 18K+ courses with 98% success rate. Features: interactive timetable, JSON API, Google Colab support, batch processing, resume capability.

academic course course-selection crawler data-analysis education educational-data google-colab json-api nycu open-data python schedule student-tools syllabus taiwan timetable university web-automation web-scraping

Last synced: 24 Apr 2026

https://github.com/mozeel-v/spam-detection

ML-powered SMS Spam Classifier using NLP and Scikit-learn. Detects and filters spam messages with interactive Streamlit UI.

classification data-analysis mnb streamlit

Last synced: 10 May 2026

https://github.com/arunabhagit/inventory-misalignment-and-revenue-loss-in-multi-store-bike-retail

This project focuses on identifying the inventory and demand mismatch causing stagnant sales and lost revenue in a bike retail chain. By analyzing store-level performance and regional customer preferences, the project aims to detect underperforming products.

data-analysis data-visualization powerbi python

Last synced: 24 Apr 2026

https://github.com/datalopes1/bank_marketing

Este projeto será baseado no Dataset Bank Marketing encontrado na UC Irvine - Machine Learning Repository e disponibilizado por S. Moro, R. Laureano e P. Cortez

data-analysis data-science data-visualization eda python

Last synced: 24 Apr 2026

https://github.com/szuzick/us-immigration-presidential-analysis

Power BI dashboard analyzing 40 years of U.S. immigration data across presidential administrations (1981-2020)

dashboard data-analysis data-visualization government-data immigration powerbi powerbi-dashboards powerbi-visuals presidential-analysis

Last synced: 10 Jun 2026

https://github.com/voidnire/redditviralmysteryposts

Análise de posts de subreddits de mistério. O que define um post viral neste tipo de sub?

data-analysis data-visualization mysteries mystery nlms python-3 reddit

Last synced: 24 Apr 2026

https://github.com/muthukumar0908/youtube-data-harvesting-and-warehousing-using-sql-mongodb-and-streamlit

Create a simple and intuitive user interface using Streamlit, From the youtube getting and extracting the data by using API key. That data stored in database.

data-analysis mongodb-atlas python sqldatabase streamlit-webapp youtube-api

Last synced: 24 Apr 2026

https://github.com/als8446/tripleten-data-science-projects

Projects Overview Projects made in the Data Scientist course from TripleTen LatAm

data data-analysis hypothesis-tests machine matplotlib numpy pandas python scipy sklearn

Last synced: 10 Apr 2026

https://github.com/luca-02/credit-card-fraud-detection

This is a small master's degree project for New Generation Data Models and DBMSs course (academic year 2024/25).

data-analysis database nosql python

Last synced: 10 Jun 2026

https://github.com/edwinrlambert/emomap-sentiment-analysis

To analyze public sentiment related to specific locations in a city (e.g., parks, transit stations, restaurants, neighborhoods) using geo-tagged social media posts, reviews, and comments. The goal is to visualize how people feel across different areas and times.

data-analysis jupyter-notebook python sentiment-analysis

Last synced: 24 Apr 2026

https://github.com/amlanmohanty1/zepto-sql-data-analysis-project

Complete Data Analysis on Zepto Inventory data using SQL

data-analysis database inventory-management postgresql sql zepto

Last synced: 24 Apr 2026

https://github.com/pipe199x/end-to-end-prediction-california

End-to-end prediction project using various technologies to predict housing prices in California.

california-housing data-analysis machine-learning python

Last synced: 11 May 2026

https://github.com/gnodux/adb-link

An MCP server that connects to multiple databases. Supports access control and dynamic SQL query tool registration and invocation.

agent ai-tools data-analysis database-gateway go mcp mcp-server

Last synced: 06 Jun 2026

https://github.com/nmelgar/birthday_sports_dataviz

We will analyze how the Matthew Effect has influenced in professional sports players.

analysis csv data data-analysis data-science data-visualization datavisualization dataviz probability research tableau

Last synced: 08 Jan 2026