Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2025-02-11 00:07:23 UTC
- JSON Representation
https://github.com/vipul2001/cousera-courses
This repo covers the solution to the assignments of various courses on algorithm,deep learning and data Analytics
coursera-courses data-analysis data-analytics-ibm deep-learning divide-and-conquer neural-network
Last synced: 17 Jan 2025
https://github.com/phomint/udacity_dataanalysis
All projects and activities
data-analysis python udacity-nanodegree
Last synced: 15 Jan 2025
https://github.com/dhairyac/customer-churn-prediction
Analyze, visualize and predict customer churn using Machine Learning
data-analysis data-visualization ensemble-classifier machine-learning performance-metrics python-3 random-forest-classifier softmax-regression svm-classifier
Last synced: 22 Jan 2025
https://github.com/alejo1630/sport_stats
Data analysis of information from the summer and winter Olympic games over the years. UC Davis SQL Specialization Final Project
data-analysis jupyter-notebook olympics-dataset plotly python seaborn sql
Last synced: 31 Dec 2024
https://github.com/narenkhatwani/arkouda-projects
This repository contains the source codes of the projects done using Arkouda (a software package that allows a user to interactively issue massive parallel computations on distributed data using functions and syntax that mimic NumPy, the underlying computational library used in most Python data science workflows.)
arkouda data-analysis data-analytics data-science high-performance high-performance-computing highperformancecomputing numpy pandas parallel-computing parallel-processing parallelization python
Last synced: 06 Feb 2025
https://github.com/chen0040/pyspark-advanced-algorithms
Samples of Advanced Algorithms and Data Analysis implemented in pyspark
advanced-algorithms data-analysis map-reduce pyspark
Last synced: 09 Feb 2025
https://github.com/luminati-io/airbnb-dataset-samples
A sample dataset of over 1000 Airbnb listings, extracted using the Bright Data API, ideal for competitor tracking, brand reputation, and market analysis.
airbnb airbnb-listings api data-analysis datasets web-scraper web-scraper-api web-scraping
Last synced: 23 Jan 2025
https://github.com/prernarohra/mental-health-prediction
This project focuses on predicting mental health outcomes using machine learning algorithms. By analyzing various psychological, social, and lifestyle factors, the model aims to identify individuals at risk, enabling early intervention and support.
data-analysis data-science data-visualization machine-learning mental-health python
Last synced: 23 Jan 2025
https://github.com/antononcube/wl-quantileregression-paclet
Wolfram Language (aka Mathematica) paclet that provides various Quantile Regression functions.
data-analysis machine-learning quantile-regression time-series time-series-analysis
Last synced: 08 Feb 2025
https://github.com/prernarohra/quakeguard
QuakeGuard is an innovative project for reducing earthquake intensity and structural damage. It takes a proactive approach to seismic activity, by using complex algorithms and real-time data to improve safety and resilience for people in earthquake-prone areas.
artificial-intelligence backend data-analysis data-science earthquake-intensity final-year-project front-end geology machine-learning open-source python visualization
Last synced: 23 Jan 2025
https://github.com/akash1070/project---applied-statistics-
To dive deep into this data & find some valuable insights.
data-analysis data-science python statistics
Last synced: 29 Jan 2025
https://github.com/alejo1630/chicago_crimes
A Jupyter Notebook with the data analysis and data visualization of crimes in Chicago from 2017 to 2023 using libraries such as seaborn and folium
data-analysis data-visualization folium pandas python seaborn
Last synced: 31 Dec 2024
https://github.com/hariyebk/eplinsights
English Premier League 2018/2019 Data Analysis
class-composition data-analysis filesystem-library
Last synced: 25 Jan 2025
https://github.com/kmihajlo/dataprocessing_graduatesadmissionprediction
Statistical processing of a data set using R.
data-analysis data-processing r statistical-analysis
Last synced: 09 Feb 2025
https://github.com/mateibejan1/ai-masters
A repository for all the projects I have done during my AI MSc.
ai-masters bayesian-inference big-data computer-vision data-analysis data-mining data-visualization deep-learning machine-learning-algorithms natural-language-processing
Last synced: 09 Feb 2025
https://github.com/sayedgamal99/data-science
This is a repository for Data Science Projects.
data-analysis data-science deep-learning machine-learning python regression supervised-learning
Last synced: 31 Jan 2025
https://github.com/akash1070/freecodecamp-data-analysis-with-python-
contains study notes and assignments from freecodecamp of Data Analysis With Python
data-analysis demographic-analysis mean-variance-standard-calculator medical-data-visualisation numpy-library pandas-library python3 sea-level-predictor time-series-analysis
Last synced: 29 Jan 2025
https://github.com/shridhar1504/foreign-exchange-rate-time-series-datascience-project
This project will use time series analysis to forecast the exchange rate between the euro and the US dollar. The project will use a variety of statistical techniques, such as ARIMA to model the data and forecast the exchange rate.
data-analysis data-preprocessing data-science data-transformation data-visualization eda exploratory-data-analysis foreign-exchange-rates machine-learning model-fitting predictive-modeling python3 time-series time-series-analysis
Last synced: 23 Dec 2024
https://github.com/bcko/ud-da-eda-whitewinequality
Udacity Data Analyst Nanodegree Project : Exploratory Data Analysis : White Wine Quality dataset
data-analysis exploratory-data-analysis rmarkdown rstudio udacity udacity-data-analyst-nanodegree
Last synced: 25 Jan 2025
https://github.com/bcko/ud-da-stroopeffect
Udacity Data Analyst Nanodegree Project : Test a Perceptual Phenomenon (Stroop Effect)
data-analysis data-analyst-nanodegree stroop-effect udacity udacity-data-analyst-nanodegree
Last synced: 25 Jan 2025
https://github.com/sarincr/basics-of-julia-programming-language
Julia is a high-level, high-performance, dynamic programming language. While it is a general purpose language and can be used to write any application, many of its features are well-suited for high-performance numerical analysis and computational science.
data data-analysis data-mining data-science data-visualization dataanalysis dataanalytics datascience julia julia-language julia-library julia-package julialang machine-learning
Last synced: 21 Jan 2025
https://github.com/neerajcodes888/whatsapp-chat-analyzer
A Python tool for effortless analysis of WhatsApp conversations. Gain insights with basic statistics, word cloud visualizations, and URL statistics. Powered by pandas, urlextract, wordcloud, seaborn, and Streamlit. 📊📱
analyzer chat data-analysis data-visualization pandas python3 seaborn urlextract whatsapp wordcloud
Last synced: 31 Jan 2025
https://github.com/thecoderpinar/worldpopulationanalysis2024
World Population Analysis 2024: An In-Depth Exploration of Urban and Rural Populations and Infrastructure Accessibility
data-analysis data-science economic-indicators machine-learning population-growth prophet-forecasting
Last synced: 09 Feb 2025
https://github.com/meetup-python-grenoble/datasette-workshop
Exploration de données avec Datasette
data-analysis data-science data-visualization datasette exploratory-data-analysis python sql workshop
Last synced: 06 Feb 2025
https://github.com/sufiyanahmed4566/sql-musicmaven
"This Music Store Database Project showcases SQL skills through comprehensive database design, query optimization, and data analysis. Includes ER diagram, database file, query questions (Easy, Medium, Hard), answered queries, and CSV table data. Ideal for recruiters seeking skilled SQL developers for music store management and data analysis.
data-analysis database insights mysql-database oracle-database relational-databases sql
Last synced: 24 Jan 2025
https://github.com/shridhar1504/loan-clustering-datascience-project
This project uses Machine Learning to Cluster loan together based on their similarities. The project uses a dataset of loan application which includes information about the Loan amount and Balance. The project then use the clustering algorithm to group the loan together based on the similarities.
clustering-algorithm data-analysis data-science data-visualization datanalysis eda kmeans-clustering machine-learning python sql sql-server unsupervised-learning
Last synced: 23 Dec 2024
https://github.com/akash1070/data-science-virtual-internship-by-anz
Exploratory data analysis and prediction of annual salary for customers from the dataset provided by ANZ.
data-analysis data-science predictive-analytics presentation-slides
Last synced: 29 Jan 2025
https://github.com/shridhar1504/power-bi-visualization-project
This repository contains Visualization Projects which is visualized through Power BI Software, by using the visualization we can gain multiple insights and strategies which helps to develop the business for gaining high profit margins and by the insights we can reduce the damages by accidents & calamities.
dashboard data-analysis data-cleaning data-science data-visualization exploratory-data-analysis microsoft-excel microsoft-power-bi microsoft-powerpoint powerbi powerbi-report powerbi-visuals powerpoint-slides
Last synced: 23 Dec 2024
https://github.com/gher-uliege/bluecloud-plankton
Spatial interpolation of plankton data using a neural network
data data-analysis data-visualization neural-network oceanography
Last synced: 05 Feb 2025
https://github.com/tim-hub/python-course
A new Python Course, a new trial to offer MOOC style learning resources and content for python learners
Last synced: 23 Jan 2025
https://github.com/magnaopus1/synthron-cfd-trader-pro
SYNTHRON CFD Trader PRO is a cutting-edge trading platform featuring raw, custom-designed machine learning models. From reinforcement learning for dynamic strategies to predictive analytics, sentiment analysis, and optimization techniques, it empowers trading across stocks, forex, indices, commodities, futures, and crypto with precision.
ai backtesting cfd commodities data-analysis data-science data-structures forex futures indices machine-learning trading
Last synced: 05 Feb 2025
https://github.com/shreeparab1890/flipkart-laptops-analysis-eda
This ipython notebook is the Exploratory data analysis (EDA) of the Laptops listed on Flipkart.
data-analysis eda exploratory-data-analysis matplotlib-pyplot numpy pandas plotly
Last synced: 01 Jan 2025
https://github.com/gher-uliege/stareso-data-processing
A set of tools to read, plot and process data from STARESO
coastal corsica data-analysis data-processing ocean-sciences oceanography
Last synced: 05 Feb 2025
https://github.com/jfjlaros/spreadscript
SpreadScript: Use a spreadsheet as a function.
automation command-line data-analysis evaluation function interface spreadsheet
Last synced: 12 Jan 2025
https://github.com/shelton-beep/predicting-gpa-using-lifestyle-factors
Predicting student GPA using lifestyle factors like study habits, sleep, and stress levels. A machine learning model built to help students and educators understand the impact of lifestyle choices on academic performance.
data-analysis data-preprocessing data-science feature-engineering gpa-prediction machine-learning model-interpretability predictive-modeling python regression-analysis student-performance xgboost
Last synced: 29 Jan 2025
https://github.com/noodleslove/house-of-representative-analysis-i
This project uses public data about the stock trades made by members of the US House of Representatives.
data-analysis data-science eda kaggle-dataset matplotlib-pyplot pandas python stocks-trading
Last synced: 28 Jan 2025
https://github.com/jasontanx/capstone-project-machine-learning
A final semester project from my MSc Data Science course
data-analysis datascience machinelearningprojects tourism-data
Last synced: 01 Feb 2025
https://github.com/vijayjoshi16/credit-card-fraud-detection-using-ml-in-python
Credit Card Fraud Detection Using ML in Python
data-analysis jupyter-notebook logistic-regression machine-learning matplotlib-pyplot numpy pandas python regression seaborn
Last synced: 23 Jan 2025
https://github.com/juliusmarkwei/iris-dataset-analysis
Data analysis, data visualization and model training using the popular Iris Dataset
data-analysis data-visualisation linear-regression machine-learning
Last synced: 01 Jan 2025
https://github.com/juliusmarkwei/titanic-data-analysis
Data analysis, data visualization, feature scaling, feature transformation, model selection and model optimization.
data-analysis data-science data-visualization linear-regression model-selection regression
Last synced: 01 Jan 2025
https://github.com/sivas-2/coffee-sales-visualization
This repository contains data visualization scripts and notebooks analyzing coffee sales data from a vending machine, sourced from Kaggle. The visualizations explore sales trends, customer preferences, and product popularity over time.
data data-analysis data-science data-visualization python visualization
Last synced: 07 Feb 2025
https://github.com/sivas-2/food-demand
This project aims to predict food demand based on historical data, leveraging various statistical methods to achieve accurate forecasts.
data-analysis data-science dataanalysis food-demand-forecasting statistics
Last synced: 07 Feb 2025
https://github.com/jen-uis/la-crime-data-analysis
This repository contains project materials for the Fall 2023 MGT 256 class. This project is completed with assists from Professor Adem Orsdemir.
business-analytics crime-data crime-data-analysis data-analysis knn la-crimes-from-2020 la-safe r r-markdown r-studio report-generation rmd united-states visualization
Last synced: 21 Jan 2025
https://github.com/jatin-mehra119/bike-rentals-dataset
This repository focuses on optimizing bike rental availability during peak hours and days using machine learning techniques. Leveraging publicly available data from the UCI Machine Learning Repository, it includes scripts for data preprocessing, model training, and visualization, along with detailed observations and results.
data-analysis data-science ensemble-model pandas scikitlearn-machine-learning
Last synced: 17 Jan 2025
https://github.com/ankan24/machine-learning-data-analysis
This repository contains a collection of Jupyter Notebooks that demonstrate various machine learning and data analysis techniques. The project does not provide a detailed description or specific use cases, but the notebooks cover a range of topics related to machine learning and data analysis.
data-analysis jupiter-notebook machine-learning
Last synced: 01 Feb 2025
https://github.com/mengyaohuang/data-manipulation-and-analysis
Data processing implementation with tools in Python
data-analysis nlp-machine-learning pandas-dataframe python
Last synced: 01 Feb 2025
https://github.com/rekha0suthar/e-commerce-shopper-s-behaviour-understanding
Understand the online shopper purchasing pattern through Machine learning
data-analysis data-preprocessing data-visualization logistic-regression machine-learning numpy pandas python3 scikit-learn seaborn-plots
Last synced: 01 Feb 2025
https://github.com/sermonzagoto/data_manipulation_with_pandas
Data Manipulation with Pandas - Part 1
data-analysis data-science jupyter-notebook pandas-python python
Last synced: 28 Jan 2025
https://github.com/thecoderpinar/telecommunication-customer-churn-analysis-and-prediction
📊 This project focuses on customer churn analysis and prediction in the telecommunications sector. Using data analysis, modeling, and predictive techniques, it aims to understand and mitigate customer loss by developing strategies.
churn churn-prediction classification customer data-analysis data-science deep-learning machine-learning neural-network telecom
Last synced: 09 Feb 2025
https://github.com/viper373/163-buff
爬取网易BUFF平台CS:GO武器皮肤交易数据
163 arima crawler-python csgo data-analysis prediction python
Last synced: 05 Feb 2025
https://github.com/supersjgk/data-analysis-dns-over-https
A Data Analytics + ML project to classify Benign and Malicious DNS-over-HTTPS traffic
classification-model data-analysis data-analysis-python data-analytics datamining decision-trees dns dns-over-https doh gradient-boosting knn machine-learning random-forest
Last synced: 25 Jan 2025
https://github.com/sermonzagoto/data_cleansing_in_telco
Data Cleansing in Python
data-analysis data-science machine-learning matplotlib-figures pandas-python seaborn-plots
Last synced: 28 Jan 2025
https://github.com/andryadsm/ibrd-statement-loans
🏦 Project IBRD Statement of Loans (Python, SQL, Excel, Power BI, Tableau)
bank bank-loans dashboard data-analysis data-transformation data-visualization database-management excel finance international-development loans mssqlserver mysql powerbi python sql tableau
Last synced: 07 Feb 2025
https://github.com/ibensusan/wine-properties-assessment
Wine Properties Assessment using Microsoft Excel
data-analysis data-visualization excel
Last synced: 07 Feb 2025
https://github.com/nomadsdev/financial-trend-analyzer
FinancialTrendAnalyzer helps analyze and visualize sales data to uncover financial trends. It uses Python to calculate total sales, track changes, and generate insightful charts for better decision-making.
business-intelligence data-analysis data-visualization financial-analysis matplotlib numpy pandas python revenue-trends sales-data seaborn time-series-analysis
Last synced: 19 Dec 2024
https://github.com/ifibla/adsdb-project
Algorithms, Data Structures and Databases Project
data-analysis data-engineering python
Last synced: 28 Jan 2025
https://github.com/sarathchandranpm/vehicle_theft_analysis
This project is a comprehensive data analysis of vehicle theft patterns, utilizing advanced SQL techniques to explore when, which, and where vehicles are most likely to be stolen. The analysis provides deep insights into vehicle theft characteristics through systematic, multi-dimensional exploration.
Last synced: 09 Feb 2025
https://github.com/cworld1/novel-analysis
A simple project for analyzing Chinese novels
Last synced: 23 Jan 2025
https://github.com/kinshuk-code-1729/data-visualisation-using-python
This Repository consists of several python snippets for creating Two-Dimensional (2D) Graphics
data-analysis data-science data-visualization matplotlib visualization
Last synced: 12 Jan 2025
https://github.com/priyanka7411/redbus-data-scraping-streamlit
A web scraping and filtering application for Redbus data using Selenium and Streamlit
automation bus-data data-analysis data-scraping data-visualization mysql open-source pandas plotly python selenium streamlit web-automation web-scraping webdriver-manager
Last synced: 01 Feb 2025
https://github.com/wiseaidev/corona-virus-data-analysis-modeling-and-visualization
Data analysis of covid-19 and SEIRD model implementation.
coronavirus coronavirus-tracking covid-19 data-analysis data-analysis-python data-visualization folium-maps modeling-dynamic-systems numpy ploty population python3 science science-research seird-model seird-simulator simulation
Last synced: 10 Dec 2024
https://github.com/evardnk/dataanalyticsportfolio
Собрание моих проектов по аналитике данных
api automation data-analysis etl-pipeline jupyter-notebook jupyterlab kpis numpy pandas pipeline postgresql powerbi python sql visualization
Last synced: 19 Dec 2024
https://github.com/m-faizan-mahmood/house-price-prediction-machine-learning-model
Implemented a Multiple Linear Regression model to predict house prices based on square footage, number of bedrooms, and age of the house.
artificial-intelligence data-analysis data-science data-visualization machine-learning machine-learning-algorithms matplotlib neural-network numpy pandas predictive-modeling python regression-models seaborn sklearn
Last synced: 18 Jan 2025
https://github.com/al-ghaly/power-bi-dashboard
A dashboard to analyze data specializations job market.
dashboard data-analysis powerbi
Last synced: 22 Jan 2025
https://github.com/rupav/fifa17-detailed-analysis
⚽ FIFA 17 data analysis using various Machine Learning Algorithms. ⚽
data-analysis data-visualization fifa17 machine-learning-algorithms radar-chart
Last synced: 09 Feb 2025
https://github.com/aekanshd/crazytics-suicidesindia
Basic interpretation of the Suicides in India data-set using R.
data-analysis data-science graph india r suicides
Last synced: 15 Jan 2025
https://github.com/nishumehta/coffee-sales-analysis
dashboard data-analysis data-visualization excel
Last synced: 07 Feb 2025
https://github.com/john-science/data_science_by_example
Examples of Data Science Tools & Libraries
data-analysis data-science ipython pandas
Last synced: 18 Nov 2024
https://github.com/akash1070/data-science-virtual-internship-by-accenture
data merging and data cleaning in python as well as data visulaisation with dashboard in Tableau.
data-analysis data-cleaning data-science python3 tableau visualization
Last synced: 29 Jan 2025
https://github.com/noeyislearning/netflix-movie-analysis
Explore movie duration trends on Netflix and assess the impact of non-feature film genres in this data-driven analysis.
data-analysis data-science data-visualization datacamp-projects jupyter-notebook netflix-analysis python3
Last synced: 01 Feb 2025
https://github.com/noeyislearning/e-commerce-sales-analysis
E-Commerce Sales Analysis, repository contains code and analysis for an e-commerce transaction dataset from Kaggle. The goal is to uncover insights from the data that could help drive business strategy and decisions.
data-analysis data-science jupyter-notebook nextjs python typescript
Last synced: 01 Feb 2025
https://github.com/noeyislearning/cancer-linear-regression-model
The correlation between socioeconomic status and lung cancer incidence and mortality rates among low-income populations in the United States.
cancer-research data-analysis data-science data-visualization jupyter-notebook linear-regression-models matplotlib numpy python seaborn statsmodels
Last synced: 01 Feb 2025
https://github.com/noeyislearning/customer-shopping-trends
An invaluable resource for businesses aiming to optimize strategies and enhance customer satisfaction. Analyze customer attributes, purchase history, and preferences to make data-driven decisions.
business-analytics data-analysis data-science data-visualization jupyter-notebook matplotlib pandas python3 seaborn
Last synced: 01 Feb 2025
https://github.com/solrikk/pictrace-web
PicTraceV2 is a highly efficient image matching platform that leverages computer vision using OpenCV, deep learning with TensorFlow and the ResNet50 model, asynchronous processing with aiohttp, and Selenium for browser automation. PicTraceV2 allows users to upload images directly or provide URLs, quickly scanning a vast database to find image
automation computer-vision data-analysis data-extraction deep-learning image-processing image-search machine-learning natural-language-processing opencv openpyxl pandas python selenium tensorflow web-scraping yandex yandex-api
Last synced: 09 Jan 2025
https://github.com/nafiealhilaly/analyzing-sa-schools-data
A simple python streamlit app to explore and analyze Saudi Arabia schools dataset from data.gov.sa
data-analysis data-visualization eda python streamlit
Last synced: 08 Feb 2025
https://github.com/birkkarlsen/beam_dynamics_tools
Repository filled with functions related to the analysis of longitudinal beam dynamics measurements and simulations
accelerator-physics beam-dynamics data-analysis
Last synced: 12 Jan 2025
https://github.com/anas436/extreme-weather-forecasts-ai
Improving Extreme Weather Forecasts Using AI
artificial-intelligence data-analysis data-science data-visualization keras-tensorflow machine-learning neural-network time-series-forecasting
Last synced: 01 Feb 2025
https://github.com/tjpalanca/ph-elections-2016-analysis
Analysis of Philippines Election Results 2016
analysis data-analysis data-science philippines-election voter-turnout
Last synced: 29 Jan 2025
https://github.com/johnsesana/eda-video-game-sales
Exploratory Data Analysis on Public Datasets
data-analysis data-visualization excel
Last synced: 17 Jan 2025
https://github.com/olekscode/covidanalysis
A setup for COVID-19 data analysis in Pharo
coronavirus covid-19 data-analysis pharo
Last synced: 18 Dec 2024
https://github.com/eesunmoon/on-device_multimodal_er
[Research] Multimodal Emotion Recognition for On-device AI
artificial-intelligence data-analysis deep-learning embedded-systems emotion-recognition heart-rate-analysis multimodal-fusion npu on-device python speech-processing speech-recognition tensorflow wearable-devices
Last synced: 22 Dec 2024
https://github.com/shriram-vibhute/digit_classification
This project demonstrates various machine learning techniques for classifying handwritten digits from the MNIST dataset. It covers data preprocessing, model training, evaluation, and advanced classification strategies.
classification data-analysis data-visualization machine-learning matplotlib numpy pandas sk-learn
Last synced: 15 Jan 2025
https://github.com/shriram-vibhute/data-analysis
This repository offers a comprehensive collection of data analysis techniques using NumPy Pandas, Matplotlib and Seaborn.
data-aggregation data-analysis data-visualization data-wrangling matplotlib numpy pandas seaborn
Last synced: 15 Jan 2025
https://github.com/rayyan9477/diamond-price-forecasting
This is a comprehensive machine learning project focused on predicting diamond prices. Using a dataset of diamond attributes, the project implements various machine learning models to forecast prices. Key features include data preprocessing, exploratory data analysis (EDA), and model training with algorithms such as Linear Regression, Decision Tree
data-analysis data-science decision-trees eda linear-regression machine-learning
Last synced: 10 Jan 2025
https://github.com/rayyan9477/multiple-disease-prediction-system
This repository contains a Multiple Disease Prediction System leveraging machine learning techniques for accurate predictions. It utilizes Python, Pandas, Scikit-learn, and Flask for data preprocessing, model building, and web deployment. Explore the project and connect on LinkedIn for collaborations.
data-analysis data-science machine-learning python streamlit
Last synced: 10 Jan 2025
https://github.com/rayyan9477/coin-detection-project
This Coin Detection Project leverages machine learning techniques to identify coins using a dataset from Kaggle. Key libraries utilized include OpenCV for image processing, TensorFlow for model training, and Pandas for data manipulation. The project also employs NumPy for numerical operations and Matplotlib for visualization.
computer-vision data-analysis data-science data-visualization machine-learning notebook python
Last synced: 10 Jan 2025
https://github.com/rayyan9477/youtube-spam-detection-with-flask-and-machine-learning
This is a web application built using Flask that detects spam comments on YouTube using a Naive Bayes classifier. It leverages techniques such as CountVectorizer for feature extraction and scikit-learn for machine learning. The application reads data from a CSV file and predicts whether a comment is spam or not.
data-analysis data-science machine-learning nlp-machine-learning spam-detection
Last synced: 10 Jan 2025
https://github.com/rayyan9477/household-transactions-analysis-and-clustering
This project involves analyzing household transaction data to gain insights into spending patterns and behaviors. The analysis includes data cleaning, exploratory data analysis (EDA), clustering using K-Means, and visualization of customer segments.
customer-segmentation data-analysis data-cleaning data-science exploratory-data-analysis kmeans-clustering machine-learning
Last synced: 10 Jan 2025
https://github.com/seabbs/explorebcgonoutcomes
Analysis to explore the association of BCG vaccination and TB outcomes.
bcg data-analysis regression rstats tuberculosis
Last synced: 01 Jan 2025
https://github.com/roberto-butti/fit_explorer
FIT File Explorer, in GO Lang
data-analysis fitness geospatial golang
Last synced: 24 Dec 2024
https://github.com/akash1070/data-science-advanced-analytics-virtual-experience-program
The BCG Open-Access Data Science & Advanced Analytics Virtual Experience Program
data-analysis data-science machine-learning-algorithms
Last synced: 29 Jan 2025
https://github.com/emredurukn/data-analysis
Example notebooks for analyzing data
data-analysis data-visualization python
Last synced: 10 Jan 2025
https://github.com/discdiver/new-belgium-ratings
Find the most popular New Belgium beers of all time!
beautifulsoup data-analysis pandas python seaborn webscraping
Last synced: 10 Jan 2025