Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2025-01-06 00:07:22 UTC
- JSON Representation
https://github.com/thennen/py-ivtools
This is a package for measurement and analysis of current-voltage characteristics of electronic devices.
current-voltage data-analysis data-visualization electrical-engineering emerging-technology instrumentation measurements
Last synced: 24 Nov 2024
https://github.com/patriloto/reinventartec_2021
Material para el taller de Primeros pasos en R para el análisis de datos
Last synced: 29 Nov 2024
https://github.com/yard1/linearordering
An R package. Provides various methods of linear ordering of data. Supports weights and positive/negative impacts.
data-analysis data-analysis-in-r data-analysis-r data-science r
Last synced: 18 Nov 2024
https://github.com/alejo1630/sport_stats
Data analysis of information from the summer and winter Olympic games over the years. UC Davis SQL Specialization Final Project
data-analysis jupyter-notebook olympics-dataset plotly python seaborn sql
Last synced: 31 Dec 2024
https://github.com/bgr8/bokeh-ile-veri-gorsellestirme
Data visualization with Bokeh Library
bokeh color data-analysis data-visualization hbar html python vbar
Last synced: 15 Dec 2024
https://github.com/geo-y20/loan-approval-automation-using-mongodb-and-pymongo
This project demonstrates the implementation of a loan approval system that utilizes MongoDB for distributed data storage and management, and PyMongo for database operations. The project aims to automate the assessment of loan eligibility using customer details from online applications.
crud-application data data-analysis data-science data-visualization deployment jupyter-notebook loan-default-prediction loan-prediction-analysis machine-learning machine-learning-algorithms matplotlib mongodb pymongo streamlit web
Last synced: 08 Jan 2025
https://github.com/nicholaskross/yt-pscore-analysis
Analysis of the Oct 2019 p-score dataset
analytics data-analysis data-cleaning social-media-analysis youtube youtube-channel
Last synced: 15 Nov 2024
https://github.com/zeinhasan/eksploration-and-data-visualization-course-material
Exploratory Data Analysis (EDA) Laboratory Assistant Teaching Materials
data-analysis data-visualization statistics
Last synced: 02 Jan 2025
https://github.com/okwilkins/retailanalysis
A comprehensive exploratory analysis and implementation of kmeans/hierarchical clustering on online retail data.
data-analysis data-science machine-learning statistics
Last synced: 20 Nov 2024
https://github.com/dogan-the-analyst/exploring_api_with_python
Data analysis with Python.
api data-analysis jupyter-notebook python
Last synced: 08 Jan 2025
https://github.com/pheithar/socialdata_madridcentral
Social data and visualization course at DTU - 2022. Effectiveness of Madrid Central
data-analysis data-visualization jupyer-notebook madrid python
Last synced: 30 Nov 2024
https://github.com/alejo1630/chicago_crimes
A Jupyter Notebook with the data analysis and data visualization of crimes in Chicago from 2017 to 2023 using libraries such as seaborn and folium
data-analysis data-visualization folium pandas python seaborn
Last synced: 31 Dec 2024
https://github.com/shadan100/stroke-prediction-analysis
A web based application to predict whether a patient is likely to get stroke based on the input parameters like gender, age, various diseases, and smoking status. Each row in the data provides relevant information about the patient.
artificial-intelligence data-analysis data-science django django-framework jupyter-notebook machine-learning matplotlib pandas predictive-modeling python stroke-prediction web-application
Last synced: 11 Oct 2024
https://github.com/sevdanurgenc/python-for-data-science-lecture-notes
In this repo, I have the course contents of Python for Data Science training, which will be given to Siemens by the cooperation of Academy Peak Information Technologies Training and Consultancy between 28 June - 1 July 2022.
data-analysis data-mining data-modeling data-science data-structure data-visualization matplotlib-tutorial numpy-tutorial pandas-tutorial
Last synced: 30 Nov 2024
https://github.com/pranavarora1895/proteintypeprediction
Data Analysis on Protein Type Prediction
bioinformatics data-analysis supervised-learning
Last synced: 18 Nov 2024
https://github.com/sevdanurgenc/data-modeling-techniques-lecture-notes
In this repo, I have the course contents of Data Modelling Techniques training, which will be given to Innova Technology by the cooperation of Academy Peak Information Technologies Training and Consultancy between 25 - 26 January 2022.
data-analysis data-mining data-modeling data-science data-structure data-visualization
Last synced: 30 Nov 2024
https://github.com/ajwad-shaikh/sristi-sanshodh-collect
SRISTI Sanshodh Collect is an Android app for filling out forms. It's been used to collect billions of data points in challenging environments. Contribute and make the world a better place! ✨📋✨ https://docs.opendatakit.org/collect-…
collect data-analysis data-collection javarosa odk opendatakit
Last synced: 17 Dec 2024
https://github.com/ansh420/mcdonald_case-study
It is basically depend on the market Segment Analysis. It is a case study of mcDonald.
algorithms-implemented data-analysis python3 segmentation
Last synced: 18 Nov 2024
https://github.com/christianrcanlas/christianrcanlas.github.io
e-Portfolio showcasing my personal projects.
arima classification-algorithims crostons-method data-analysis data-visualization data-warehousing etl-pipelines hierarchical-forecasting holt-winters long-short-term-memory machine-learrning ms-sql-server predictive-analytics python r-markdown support-vector-regression t-sql tableau time-series-decomposition time-series-forecasting
Last synced: 17 Nov 2024
https://github.com/17bit0216/machine-learning
All of my data analysis and Machine learning Projects.
analysis data-analysis linearr logistic logisticregression machine-learning python3 random-forest
Last synced: 17 Nov 2024
https://github.com/phillbertnevinemmanuel/automotivesalesdataanalysis
This marks my inaugural venture into personal data analysis, employing SQL and Python for Correlation Analysis. I've sourced the dataset from Kaggle, specifically focusing on automotive sales. You can find the dataset linked on my website below. I'm excited to share that I've independently managed the majority of tasks involved in this project.
data-analysis dataset microsoft-sql-server python python-lambda sql ssms tsql
Last synced: 17 Nov 2024
https://github.com/greed2411/ndl
Numbers Don't Lie, attempt on Data Analysis using pandas and matplotlib.
cities data-analysis data-science data-visualization india kaggle
Last synced: 17 Nov 2024
https://github.com/mynenik/xyplot-win32
XYPLOT Plotting and Data Analysis Program for 32-bit Windows
cpp data-analysis data-manipulation data-visualization forth mfc windows-app
Last synced: 24 Nov 2024
https://github.com/harshmule1/school-data-analysis-
School Data Analysis Using SQL
Last synced: 17 Nov 2024
https://github.com/aisurjyasamantaray/sales-perfomance-analysis-dashboard
A comprehensive sales performance analysis dashboard built using Python, and visualization tools. This project includes data cleaning, descriptive statistics, correlation analysis, and insights into sales trends, profitability, and the impact of discounts. Key features include interactive visualizations using Seaborn, and Matplot
analytics annova data data-analysis data-visualization-project dataproject eda hypothesis-testing pandas-dataframe python sales-performance-analysis statistics
Last synced: 18 Nov 2024
https://github.com/prernarohra/heart-disease-prediction
This project develops a machine learning model to predict heart disease risk based on symptoms and medical history. The model achieved the best accuracy with Logistic Regression, as it works well for binary classification problems.
artificial-intelligence data-analysis data-science dataset heartdisease-prediction machine-learning models
Last synced: 27 Dec 2024
https://github.com/supertetelman/kaggle-public
A collection of Python and Matlab projects aimed at utilizing various machine learning techniques to solve big data problems.
cnn data-analysis deep-learning machine-learning matlab python
Last synced: 30 Nov 2024
https://github.com/sermonzagoto/data_cleansing_in_telco
Data Cleansing in Python
data-analysis data-science machine-learning matplotlib-figures pandas-python seaborn-plots
Last synced: 30 Nov 2024
https://github.com/sermonzagoto/data_manipulation_with_pandas
Data Manipulation with Pandas - Part 1
data-analysis data-science jupyter-notebook pandas-python python
Last synced: 30 Nov 2024
https://github.com/shibam120302/heart-disease-data-analysis-by-shibam
You can read more on the heart disease statistics and causes for self-understanding. This project covers manual exploratory data analysis
analysis data-analysis scraper
Last synced: 20 Nov 2024
https://github.com/burhanahmed1/data-analysis-with-python
Data-Acquisition and Basic Insights, Data Wrangling, Exploratory Data Analysis (EDA), and Training Prediction Models(Machine Learning) on two datasets.
data-analysis data-aquisition data-insights data-science data-wrangling dataanalytics datascience-machinelearning eda exploratory-data-analysis machine-learning-models matlpotlib numpy pandas practice-programming prediction-model python scikit-learn scikitlearn-machine-learning seaborn
Last synced: 06 Jan 2025
https://github.com/hevalhazalkurt/exploring_the_data_of_lego_history
A data exploration project on LEGO history in Python with pandas, matplotlib etc. (WIP)
data data-analysis data-science data-visualization datascience datasets lego lego-history matplotlib pandas python python3
Last synced: 20 Nov 2024
https://github.com/manikantasanjay/time_series_data_analysis_on_stocks
Time Series Data Analysis project on Daily Stock Prices of the following companies(Apple, Microsoft, Google, Amazon) for a span of 5 years.
data-analysis pandas stock time-series time-series-analysis
Last synced: 22 Dec 2024
https://github.com/misszeferino/us-traffic-accidents-analysis
Exploratory Data Analysis using Python
data-analysis matplotlib numpy pandas python seaborn
Last synced: 15 Nov 2024
https://github.com/ryanfranklin237/data-visualization-spreadsheets
Data visualization done with microsoft excel and google spreadsheets
data-analysis data-science data-visualization google-spreadsheets microsoft-excel
Last synced: 12 Nov 2024
https://github.com/abhi-lab2/ipl-data-analysis
IPL data analysis for future predictions
data-analysis data-science python
Last synced: 29 Dec 2024
https://github.com/shipyardapp/amazonathena-blueprints
Simplified blueprints for building data pipelines with Amazon Athena.
amazon-athena athena cli data-analysis data-engineering data-science elt etl
Last synced: 04 Dec 2024
https://github.com/andr3w03/employee-attrition-problem
Employee Attrition Problem Analysis and Prediction
data-analysis data-science data-visualization dicoding gradient-boosting-classifier machine-learning problem-solving python sklearn streamlit
Last synced: 20 Nov 2024
https://github.com/mohnoor94/datasciencefundementalsusingpython
My journey to learn Data Science with Python
data data-analysis data-science data-visualization learning learning-by-doing python python3
Last synced: 21 Nov 2024
https://github.com/misszeferino/bellabeat-data-analysis
Bellabeat Data Analysis using R
analytics data-analysis ggplot2 lubridate r r-programming tidyverse
Last synced: 15 Nov 2024
https://github.com/ryanfranklin237/data-cleansing
A group of python scripts that clean large data sets by removing duplicate data, putting data in correct formats, and removing redundant cells
data-analysis data-cleaning data-science extract-transform-load pandas-dataframe python
Last synced: 12 Nov 2024
https://github.com/ryanfranklin237/data-visualization-python
A tool that allows you to visualize data from a csv or excel file in a graph or charts form
data-analysis data-science data-visualization matplotlib pandas-dataframe python
Last synced: 12 Nov 2024
https://github.com/leocornus/leocornus-visualdata
JavaScript libraries to make data visualization simpler and easier.
data-analysis data-mining data-visualization data-visualization-simpler javascript-library
Last synced: 08 Jan 2025
https://github.com/misszeferino/nashville-housing-data-cleaning
Data cleaning using SQL
data-analysis data-cleaning sql
Last synced: 15 Nov 2024
https://github.com/hayatiyrtgl/data_analysis_project
Financial data analysis: preprocess, visualize, calculate technical indicators.
data-analysis data-analysis-python data-science dataframe numpy pandas python python3 stock-price-prediction talib trade-analysis
Last synced: 22 Dec 2024
https://github.com/shrawans007/google_cyclistic_2023
Google Data Analytics Capstone Case Study (SQL and Tableau)
big-query bigquery coursera-assignment cyclistic cyclistic-bike-share-analysis-case-study cyclistic-bikshare data-analysis data-analysis-project data-analytics data-cleaning data-combination data-exploration data-science google-data-analytics sql tableau tableau-dashboard tableau-public
Last synced: 08 Jan 2025
https://github.com/lightbridge-ks/zoominterface
A data analysis Shiny app of program Zoom report files.
data-analysis r shiny-apps zoom-class zoom-meetings
Last synced: 15 Nov 2024
https://github.com/misszeferino/erp-data-analysis
Data Analysis - ERP Data (merge and outliers)
data-analysis data-visualization matplotlib merge numpy outlier-detection python scipy
Last synced: 15 Nov 2024
https://github.com/noeyislearning/intro-to-data-analysis
The repository teaches skills for cleaning, exploring, analyzing, and visualizing data in Python to gain insights and make data-driven decisions.
data-analysis jupyter-notebook lecture-notes python
Last synced: 06 Dec 2024
https://github.com/misszeferino/data-analysis-using-mysql
Data Analysis using SQL
Last synced: 15 Nov 2024
https://github.com/fx2y/datanarrate
[WIP] LLM-powered agent for adaptive data analysis across multiple sources. Uses natural language for complex queries, visualizations, and insights. Features autonomous planning, SQL/Elasticsearch generation, and AI storytelling. Built with LangChain, GPT-4, FastAPI, and React.
ai data-analysis data-visualization elasticsearch fastapi gpt-4 langchain machine-learning nlp react sql
Last synced: 15 Nov 2024
https://github.com/famarks/grafarg
Grafarg is an interactive data analytics and graphical data visualization application. Grafarg being a progressive fork of Grafana 7.5.17 continues to be available under open source Apache 2.0 License
analytics charts data data-analysis data-science data-visualization grafana grafarg graph
Last synced: 18 Dec 2024
https://github.com/moindalvs/learn_eda_house_price_dataset
Data Set: House Prices: Advanced Regression Techniques Exploratory Data Analysis on more than 80 features
cardinality data-analysis data-science data-structures data-visualization missing-values
Last synced: 17 Nov 2024
https://github.com/archtaqi/data-science-and-machine-learning
My Courses and Practice material for Data science and Machine Learning
data-analysis data-science data-visualization machine-learning machine-learning-algorithms python3
Last synced: 15 Nov 2024
https://github.com/misszeferino/netflix-exploratory-analysis
Netflix exploratory analysis using python
data-analysis data-visualization pandas plotly python
Last synced: 15 Nov 2024
https://github.com/mirokeimioniemi/optimizing-insulin-injection-timing
Data processing and analysis for "Determining the optimal timing for insulin injection to minimize glucose level variability after a meal in ideal conditions" - a research project for the IB Standard Level Mathematics Analysis and Approaches course inspired by my type 1 diabetes.
cgm data-analysis data-science dexcom dexcom-g6 diabetes exploration ib insulin insulin-timing international-baccalaureate mathematics optimization python type-1-diabetes
Last synced: 12 Nov 2024
https://github.com/rayyan9477/coin-detection-project
This Coin Detection Project leverages machine learning techniques to identify coins using a dataset from Kaggle. Key libraries utilized include OpenCV for image processing, TensorFlow for model training, and Pandas for data manipulation. The project also employs NumPy for numerical operations and Matplotlib for visualization.
computer-vision data-analysis data-science data-visualization machine-learning notebook python
Last synced: 11 Nov 2024
https://github.com/sarincr/data-analytics-with-knime
Data Analytics with KNIME (Konstanz Information Miner), a free and open-source data analytics, reporting and integration platform. KNIME integrates various components for machine learning and data mining through its modular data pipelining concept. A graphical user interface and use of JDBC allows assembly of nodes blending different data sources, including preprocessing (ETL: Extraction, Transformation, Loading), for modeling, data analysis and visualization without, or with only minimal, programming.
ai artificial-intelligence artificial-intelligence-algorithms artificial-neural-networks data-analysis data-mining data-science data-structures data-visualization database datascience deep-learning machine-intelligence machine-learning machine-learning-algorithms machinelearning mining mining-software
Last synced: 20 Nov 2024
https://github.com/kirkalyn13/opensignal_autogenerate_report
Script used to generate results/summary, including the trends of flagged provinces, from the raw excel data file,
data-analysis data-science data-visualization matplotlib numpy pandas python
Last synced: 15 Nov 2024
https://github.com/sarincr/basics-of-julia-programming-language
Julia is a high-level, high-performance, dynamic programming language. While it is a general purpose language and can be used to write any application, many of its features are well-suited for high-performance numerical analysis and computational science.
data data-analysis data-mining data-science data-visualization dataanalysis dataanalytics datascience julia julia-language julia-library julia-package julialang machine-learning
Last synced: 20 Nov 2024
https://github.com/rayyan9477/diamond-price-forecasting
This is a comprehensive machine learning project focused on predicting diamond prices. Using a dataset of diamond attributes, the project implements various machine learning models to forecast prices. Key features include data preprocessing, exploratory data analysis (EDA), and model training with algorithms such as Linear Regression, Decision Tree
data-analysis data-science decision-trees eda linear-regression machine-learning
Last synced: 11 Nov 2024
https://github.com/garciparedes/castile-and-leon-crops
Data Analysis of Castile and Leon Crops Area over the last years
castile-and-leon crops data-analysis data-science jupyter jupyter-notebook notebook spain
Last synced: 15 Nov 2024
https://github.com/jen-uis/la-crime-data-analysis
This repository contains project materials for the Fall 2023 MGT 256 class. This project is completed with assists from Professor Adem Orsdemir.
business-analytics crime-data crime-data-analysis data-analysis knn la-crimes-from-2020 la-safe r r-markdown r-studio report-generation rmd united-states visualization
Last synced: 20 Nov 2024
https://github.com/haloapping/pisangijo
Kumpulan library dan framework untuk analisa data, data science, machine learning, deep learning dan masih banyak lagi berbasis bahasa pemrograman Python 🐍.
belajar data-analysis data-science deep-learning forecasting libraries machine-learning perkakas pustaka python3 recommender-system referensi tools
Last synced: 06 Jan 2025
https://github.com/karatechop/noaa-storm-database-data-analysis
Analysis of population health and economic consequences of events documented in the U.S. National Oceanic and Atmospheric Administration’s (NOAA) storm database.
data-analysis knitr r rmarkdown
Last synced: 20 Nov 2024
https://github.com/misszeferino/cyclistic-bike-share-analysis
Data Analysis using R
data-analysis ggplot2 lubridate r r-programming tidyverse
Last synced: 15 Nov 2024
https://github.com/mohamedhany99/human-voice-identifier-counter
the application developed in (KIVY) it can identify the users imported into the dataset based on the support vector machine training model it has two features ( Importing new voice - Detection to detect the human voices and count them)
android android-app android-application automation automation-framework data data-analysis data-mining data-science data-visualization datascience kivy kivy-framework machine-learning python
Last synced: 17 Nov 2024
https://github.com/aryansharma5/data-visualization-and-thorough-analysis
comprehensive guide for data analysis and visualization
data-analysis data-visualization
Last synced: 24 Nov 2024
https://github.com/shahaf-f-s/feature-space
A modular framework for combining pandas series features
data-analysis data-science feature-engineering
Last synced: 08 Jan 2025
https://github.com/shriram-vibhute/data-analysis
This repository offers a comprehensive collection of tools and scripts for data science, encompassing essential tasks such as data cleaning, wrangling, and aggregation. It includes practical examples and utilities for numerical computations with NumPy, data manipulation with Pandas, and effective data visualization techniques.
data-aggregation data-analysis data-visualization data-wrangling matplotlib numpy pandas
Last synced: 15 Nov 2024
https://github.com/asifdotexe/quickvu
Quick VU: No-code, data cleaning analysis and visualization tool built on Streamlit. Quickly clean, visualize, explore, and understand data relationships and correlations with ease. Perfect for analysts, business users, and anyone looking to gain data insights—without writing a single line of code.
automation data-analysis data-cleaning data-visualization python3 streamlit-application toolkit
Last synced: 15 Nov 2024
https://github.com/mindgamesnl/yanderestats
https://mindgamesnl.github.io/YandereStats/
data-analysis reporting-pipeline yandere yandere-sim
Last synced: 01 Jan 2025
https://github.com/asifdotexe/timeseriesanalysis
This repository serves as a central hub for all of my projects related to time series analysis. Here, you'll find a collection of projects, code samples, and resources that explore various aspects of time series data and its analysis.
data-analysis feature-engineering jupyter-notebook pandas python time-series-analysis visualization
Last synced: 15 Nov 2024
https://github.com/verbasik/yandex.practicum.datascience
Портфолио проектов Data Science, выполненных в рамках профессиональной переподготовки в Яндекс.Практикум. Включает исследования в области финансов, недвижимости, кинопроката и других, с использованием статистики, машинного обучения и анализа данных.
data-analysis data-science machine-learning yandex-praktikum
Last synced: 11 Nov 2024
https://github.com/jamiemagee/rhi
Collating the data on the Renewable Heat Incentive scheme, and presenting it in a more readable format.
data-analysis open-data open-government rhi
Last synced: 08 Jan 2025
https://github.com/jpcadena/car-sales-etl
ETL process for a Car Sales project.
asyncpg car-sales data-analysis data-engineering data-visualization database etl etl-pipeline postgresql python sqlalchemy
Last synced: 15 Nov 2024
https://github.com/jpcadena/solid-principles-machine-learning
S.O.L.I.D. Principles for Machine Learning project.
clean-code data-analysis data-engineering data-science deep-learning dependency-inversion-principle design-patterns design-principles interface-segregation-principle liskov-substitution-principle machine-learning machine-learning-models mlops models open-closed-principle pylint python single-responsibility-principle software-engineering solid-principles
Last synced: 15 Nov 2024
https://github.com/emso-exe/reclamacoes_de_consumidores_com_empresa_de_telecomunicacoes
Projeto de análise de reclamações de consumidores com empresa de telecomunicações no 1º semestre de 2021 com base nos dados do site consumidor.gov.br.
analise-de-dados ciencia-de-dados data-analysis data-science datascience python python-3 python3
Last synced: 15 Nov 2024
https://github.com/mdaffailhami/data_science_speedrun_journey
This repository contains notebooks and projects related to my data science speedrun journey.
algebra artificial-intelligence data-analysis data-analyst data-science data-scientist jupyter-notebook machine-learning math mathematics numpy pandas postgresql probability python statistics
Last synced: 27 Dec 2024
https://github.com/akash1070/data-science-virtual-internship-by-anz
Exploratory data analysis and prediction of annual salary for customers from the dataset provided by ANZ.
data-analysis data-science predictive-analytics presentation-slides
Last synced: 01 Dec 2024
https://github.com/jpcadena/classification-tweets-national-security-ecuador
Classification of Tweets about national security at Ecuador 2022
classification classification-model data-analysis data-science ecuador insecurity machine-learning natural-language-processing nlp nltk numpy pandas python pytorch scikit-learn snscrape supervised-learning tensorflow tweet twitter
Last synced: 15 Nov 2024
https://github.com/netcodez/data-science-projects
Data Science Projects completed on DataCamp Data Scientist with Python Career Track
data data-analysis data-visualization datacleaning feature-engineering feature-extraction machine-learning predictive-analytics predictive-modeling python scikit-learn-python scikitlearn-machine-learning statistical-analysis statistical-models
Last synced: 15 Nov 2024
https://github.com/rayyan9477/multiple-disease-prediction-system
This repository contains a Multiple Disease Prediction System leveraging machine learning techniques for accurate predictions. It utilizes Python, Pandas, Scikit-learn, and Flask for data preprocessing, model building, and web deployment. Explore the project and connect on LinkedIn for collaborations.
data-analysis data-science machine-learning python streamlit
Last synced: 11 Nov 2024
https://github.com/mokeddembillel/student-performance-prediction
Using Machine learning to predict a student final grade
data-analysis data-exploration feature-extraction feature-importance feature-selection linear-regression machine-learning power-bi principal-component-analysis regression spyder student-performance-prediction svm-regressor
Last synced: 20 Nov 2024
https://github.com/ysayaovong/car-sales
An analysis of car sales data to uncover market trends and insights through data cleaning, analysis, and visualization.
automotive business-analysis data-analysis data-cleaning data-visualization market-trends matplotlib pandas python sales-data scikit-learn seaborn
Last synced: 23 Nov 2024
https://github.com/jpcadena/onemetric-plus
OneMetric+ project for analytical tool on demand forecast and outlier detection
black-formatter data-analysis data-analytics data-science data-visualization demand-forecasting isort machine-learning matplotlib mypy numpy outlier-detection pandas pre-commit-hook pydantic python ruff scikit-learn seaborn solid-principles
Last synced: 15 Nov 2024
https://github.com/busraozdemir0/datascienceproject
Youtube Trend Video İstatistiklerinin Analizi
classification-algorithm data-analysis data-analysis-python data-science jupyter-notebook linear-regression-algorithm lineer-regresyon machine-learning machine-learning-algorithms matplotlib nonlinear-regression numpy pandas python seaborn unsupervised-learning
Last synced: 07 Dec 2024
https://github.com/hafeez-urrehman/mental-health-analyzer
Mental-Health-Analyzer is an AI-Based project for predicting mental health disorders such as stress, anxiety, depression, and loneliness. By applying machine learning techniques, this project analyzes user inputs and behavioral data to provide accurate predictions, aiming to support mental well-being and early intervention.
data-analysis data-science early-diagnonosis machine-learning mental-health mental-wellbeing predictive-modeling python
Last synced: 08 Jan 2025
https://github.com/semasuka/income-classification
Predicting if an individual make more than 50K using different features
aws-s3 binary-classification data-analysis data-science data-visualization eda finance-analytics machine-learning precision python random-forest-classifier scikit-learn streamlit
Last synced: 11 Nov 2024
https://github.com/rayyan9477/youtube-spam-detection-with-flask-and-machine-learning
This is a web application built using Flask that detects spam comments on YouTube using a Naive Bayes classifier. It leverages techniques such as CountVectorizer for feature extraction and scikit-learn for machine learning. The application reads data from a CSV file and predicts whether a comment is spam or not.
data-analysis data-science machine-learning nlp-machine-learning spam-detection
Last synced: 11 Nov 2024
https://github.com/riju18/advanced-data-analysis-and-visualization
Advanced level of data preparation, level of detail calculation, animation, table calculation etc for data analysis & visualization.
data-analysis data-science data-visualization tableau
Last synced: 30 Nov 2024
https://github.com/dannyben/datamix
DSL for manipulating tabular data
csv data data-analysis data-engineering gem ruby tabular-data
Last synced: 07 Dec 2024
https://github.com/mathieu2301/pbsc-tracker
Expérience de tracking des vélos en libre service fonctionnants avec PBSC
ai data-analysis data-mining data-science data-visualization libelo machine-learning pbsc valence velib-tracker
Last synced: 15 Nov 2024
https://github.com/sandk21/detection_faux_billets
Algorithme de détection de faux billets selon leurs dimensions géométriques et application web pour générer les prédictions
data-analysis data-science data-visualization machine-learning pandas python scipy sklearn streamlit
Last synced: 07 Dec 2024
https://github.com/johnsesana/eda-liquor-sales
Exploratory Data Analysis on Public Datasets
data-analysis data-visualization sql tableau-dashboards
Last synced: 16 Nov 2024
https://github.com/aekanshd/crazytics-suicidesindia
Basic interpretation of the Suicides in India data-set using R.
data-analysis data-science graph india r suicides
Last synced: 15 Nov 2024
https://github.com/rayyan9477/household-transactions-analysis-and-clustering
This project involves analyzing household transaction data to gain insights into spending patterns and behaviors. The analysis includes data cleaning, exploratory data analysis (EDA), clustering using K-Means, and visualization of customer segments.
customer-segmentation data-analysis data-cleaning data-science exploratory-data-analysis kmeans-clustering machine-learning
Last synced: 11 Nov 2024
https://github.com/faezeh-gholamrezaie/visual-google-scholar-search
A Python script that searches Google Scholar for specific keywords and visually presents the results in various chart formats, enabling researchers to analyze trends and insights in academic literature.
academic academic-research academic-trends ai ai-research bibliometrics data-analysis data-visualization google-scholar publication-analysis python research-trends scholarly scholarly-data word-cloud
Last synced: 28 Dec 2024
https://github.com/phomint/udacity_dataanalysis
All projects and activities
data-analysis python udacity-nanodegree
Last synced: 15 Nov 2024
https://github.com/johnsesana/eda-video-game-sales
Exploratory Data Analysis on Public Datasets
data-analysis data-visualization excel
Last synced: 16 Nov 2024