Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2025-01-13 00:07:19 UTC
- JSON Representation
https://github.com/manikantasanjay/time_series_data_analysis_on_stocks
Time Series Data Analysis project on Daily Stock Prices of the following companies(Apple, Microsoft, Google, Amazon) for a span of 5 years.
data-analysis pandas stock time-series time-series-analysis
Last synced: 22 Dec 2024
https://github.com/ayobami6/tweet-data-analysis
WeRateDogs Tweets Scrape using twitter Api
data-analysis data-science twitter webscraping
Last synced: 13 Jan 2025
https://github.com/hevalhazalkurt/exploring_the_data_of_lego_history
A data exploration project on LEGO history in Python with pandas, matplotlib etc. (WIP)
data data-analysis data-science data-visualization datascience datasets lego lego-history matplotlib pandas python python3
Last synced: 20 Nov 2024
https://github.com/andr3w03/employee-attrition-problem
Employee Attrition Problem Analysis and Prediction
data-analysis data-science data-visualization dicoding gradient-boosting-classifier machine-learning problem-solving python sklearn streamlit
Last synced: 20 Nov 2024
https://github.com/nafisalawalidris/northwind-traders-sales-analysis
Northwind Traders Sales Analysis project, which analyses sales data for a fictitious company. It utilises the Northwind Database and includes SQL queries to provide insights on employees, products, suppliers and revenue. The project aims to help the company gain valuable information for business decision-making.
business-insights data-analysis database northwind-traders sales sql
Last synced: 22 Nov 2024
https://github.com/hayatiyrtgl/data_analysis_project
Financial data analysis: preprocess, visualize, calculate technical indicators.
data-analysis data-analysis-python data-science dataframe numpy pandas python python3 stock-price-prediction talib trade-analysis
Last synced: 22 Dec 2024
https://github.com/supertetelman/kaggle-public
A collection of Python and Matlab projects aimed at utilizing various machine learning techniques to solve big data problems.
cnn data-analysis deep-learning machine-learning matlab python
Last synced: 30 Nov 2024
https://github.com/dsrodrigovieira/houserocketsales
Este repositório contém um projeto desenvolvido para praticar habilidades de análise de dados utilizando Python
data-analysis data-visualization heroku kaggle-dataset python
Last synced: 30 Dec 2024
https://github.com/misszeferino/bellabeat-data-analysis
Bellabeat Data Analysis using R
analytics data-analysis ggplot2 lubridate r r-programming tidyverse
Last synced: 15 Nov 2024
https://github.com/virajbhutada/global-universities-success-analysis-powerbi-sql-excel
This capstone project conducts in-depth analysis using Power BI, SQL, and Excel to explore complex dynamics shaping global university success. Integrating data from diverse ranking systems and criteria, our aim is to unravel the factors influencing universities worldwide.
capstone capstoneproject data-analysis data-analytics data-insights data-science data-science-projects data-visualization excel exploratory-data-analysis mece mysql powerbi powerpoint sql
Last synced: 10 Jan 2025
https://github.com/vishal-038/real_estate_price_prediction
The Real Estate Price Prediction project aims to develop a machine learning model to predict house prices based on various features
data-analysis data-science data-visualization machine-learning python
Last synced: 22 Nov 2024
https://github.com/misszeferino/nashville-housing-data-cleaning
Data cleaning using SQL
data-analysis data-cleaning sql
Last synced: 15 Nov 2024
https://github.com/misszeferino/data-analysis-using-mysql
Data Analysis using SQL
Last synced: 15 Nov 2024
https://github.com/misszeferino/erp-data-analysis
Data Analysis - ERP Data (merge and outliers)
data-analysis data-visualization matplotlib merge numpy outlier-detection python scipy
Last synced: 15 Nov 2024
https://github.com/misszeferino/netflix-exploratory-analysis
Netflix exploratory analysis using python
data-analysis data-visualization pandas plotly python
Last synced: 15 Nov 2024
https://github.com/misszeferino/cyclistic-bike-share-analysis
Data Analysis using R
data-analysis ggplot2 lubridate r r-programming tidyverse
Last synced: 15 Nov 2024
https://github.com/misszeferino/us-traffic-accidents-analysis
Exploratory Data Analysis using Python
data-analysis matplotlib numpy pandas python seaborn
Last synced: 15 Nov 2024
https://github.com/2003harsh/house-price-prediction-using-machine-learning
This project features a web app that predicts house prices using a linear regression model. Users can input details like location, square footage, bathrooms, and bedrooms through an HTML form. I've added a CI/CD pipeline with GitHub Actions, unit testing with pytest, and automated Docker containerization to improve deployment and robustness.
ci-cd data-analysis docker-image flask linear-regression machine-learning matplotlib mlops-workflow requests scikit-learn
Last synced: 10 Oct 2024
https://github.com/archtaqi/data-science-and-machine-learning
My Courses and Practice material for Data science and Machine Learning
data-analysis data-science data-visualization machine-learning machine-learning-algorithms python3
Last synced: 15 Nov 2024
https://github.com/aadityatamrakar/futures_spread_chart
Cash Market & Futures Daily Spread Chart - NSE Stocks
data data-analysis data-mining expressjs nodejs requests
Last synced: 28 Nov 2024
https://github.com/lightbridge-ks/zoominterface
A data analysis Shiny app of program Zoom report files.
data-analysis r shiny-apps zoom-class zoom-meetings
Last synced: 15 Nov 2024
https://github.com/fx2y/datanarrate
[WIP] LLM-powered agent for adaptive data analysis across multiple sources. Uses natural language for complex queries, visualizations, and insights. Features autonomous planning, SQL/Elasticsearch generation, and AI storytelling. Built with LangChain, GPT-4, FastAPI, and React.
ai data-analysis data-visualization elasticsearch fastapi gpt-4 langchain machine-learning nlp react sql
Last synced: 15 Nov 2024
https://github.com/mg380/ibm-applied-data-science-capstone
This Capstone is the 10th (final) course in IBM Data Science Professional Certificate specialization, and it actually summarises in the form of project all materials that have been learned during this specialization
capstone data data-analysis data-science datascience ibm machine-learning plotly python scikit-learn sql
Last synced: 10 Oct 2024
https://github.com/nirmalvatsyayan/data-analyst-nanodegree
Udacity data analyst nanodegree project submissions and learning
data-analysis numpy pandas python statistics udacity-data-analyst-nanodegree
Last synced: 12 Jan 2025
https://github.com/akshat0427/python_youtube_history
a bunch of data science operations performed on youtube history data
data-analysis data-science extracting-features
Last synced: 11 Jan 2025
https://github.com/sarincr/basics-of-julia-programming-language
Julia is a high-level, high-performance, dynamic programming language. While it is a general purpose language and can be used to write any application, many of its features are well-suited for high-performance numerical analysis and computational science.
data data-analysis data-mining data-science data-visualization dataanalysis dataanalytics datascience julia julia-language julia-library julia-package julialang machine-learning
Last synced: 20 Nov 2024
https://github.com/kirkalyn13/opensignal_autogenerate_report
Script used to generate results/summary, including the trends of flagged provinces, from the raw excel data file,
data-analysis data-science data-visualization matplotlib numpy pandas python
Last synced: 15 Nov 2024
https://github.com/abhinavsharma07/fraud_analytics-credit_card_fraud_detection
The aim of this project is to predict fraudulent credit card transactions with the help of different machine learning models.
banking data-analysis decision-trees hyperparameter-optimization machine-learning-algorithms pipelines random-forest-classifier svm-classifier xgboost-classifier
Last synced: 10 Jan 2025
https://github.com/sarincr/data-analytics-with-knime
Data Analytics with KNIME (Konstanz Information Miner), a free and open-source data analytics, reporting and integration platform. KNIME integrates various components for machine learning and data mining through its modular data pipelining concept. A graphical user interface and use of JDBC allows assembly of nodes blending different data sources, including preprocessing (ETL: Extraction, Transformation, Loading), for modeling, data analysis and visualization without, or with only minimal, programming.
ai artificial-intelligence artificial-intelligence-algorithms artificial-neural-networks data-analysis data-mining data-science data-structures data-visualization database datascience deep-learning machine-intelligence machine-learning machine-learning-algorithms machinelearning mining mining-software
Last synced: 20 Nov 2024
https://github.com/karatechop/noaa-storm-database-data-analysis
Analysis of population health and economic consequences of events documented in the U.S. National Oceanic and Atmospheric Administration’s (NOAA) storm database.
data-analysis knitr r rmarkdown
Last synced: 20 Nov 2024
https://github.com/garciparedes/castile-and-leon-crops
Data Analysis of Castile and Leon Crops Area over the last years
castile-and-leon crops data-analysis data-science jupyter jupyter-notebook notebook spain
Last synced: 15 Nov 2024
https://github.com/jen-uis/la-crime-data-analysis
This repository contains project materials for the Fall 2023 MGT 256 class. This project is completed with assists from Professor Adem Orsdemir.
business-analytics crime-data crime-data-analysis data-analysis knn la-crimes-from-2020 la-safe r r-markdown r-studio report-generation rmd united-states visualization
Last synced: 20 Nov 2024
https://github.com/asifdotexe/timeseriesanalysis
This repository serves as a central hub for all of my projects related to time series analysis. Here, you'll find a collection of projects, code samples, and resources that explore various aspects of time series data and its analysis.
data-analysis feature-engineering jupyter-notebook pandas python time-series-analysis visualization
Last synced: 15 Nov 2024
https://github.com/asifdotexe/quickvu
Quick VU: No-code, data cleaning analysis and visualization tool built on Streamlit. Quickly clean, visualize, explore, and understand data relationships and correlations with ease. Perfect for analysts, business users, and anyone looking to gain data insights—without writing a single line of code.
automation data-analysis data-cleaning data-visualization python3 streamlit-application toolkit
Last synced: 15 Nov 2024
https://github.com/yash-kavaiya/ai-analytics
This is a Streamlit app that uses Pandas and AI to perform data analytics on uploaded CSV files.
data-analysis generative-ai pandas streamlit
Last synced: 24 Dec 2024
https://github.com/gholamrezadar/favourite-youtube-channels
this program goes through your youtube watch history and sorts channels based how many of their videos you have watched!
data-analysis data-visualization python
Last synced: 30 Dec 2024
https://github.com/gholamrezadar/most-profitable-actors
Finds the list of actors with the most boxoffice profit using TMDB API.
Last synced: 30 Dec 2024
https://github.com/mafesan/2021-tfm-code
Revelio: Machine-Learning classifier to identify Bots integrable with GrimoireLab
bot-accounts data-analysis data-analytics data-science grimoirelab machine-learning metrics open-source open-source-community project-health python scikit-learn
Last synced: 22 Dec 2024
https://github.com/busraozdemir0/datascienceproject
Youtube Trend Video İstatistiklerinin Analizi
classification-algorithm data-analysis data-analysis-python data-science jupyter-notebook linear-regression-algorithm lineer-regresyon machine-learning machine-learning-algorithms matplotlib nonlinear-regression numpy pandas python seaborn unsupervised-learning
Last synced: 07 Dec 2024
https://github.com/mattdelaune/retail_rfm_analysis
Power BI multi-page report leveraging advanced data visualization for RFM analysis. Delivers deep analytical insights into customer behavior, engagement, and spending patterns, driving strategic business decisions.
data-analysis dax powerbi report rfm-analysis sales-data visualization
Last synced: 30 Dec 2024
https://github.com/aryansharma5/data-visualization-and-thorough-analysis
comprehensive guide for data analysis and visualization
data-analysis data-visualization
Last synced: 24 Nov 2024
https://github.com/shriram-vibhute/data-analysis
This repository offers a comprehensive collection of tools and scripts for data science, encompassing essential tasks such as data cleaning, wrangling, and aggregation. It includes practical examples and utilities for numerical computations with NumPy, data manipulation with Pandas, and effective data visualization techniques.
data-aggregation data-analysis data-visualization data-wrangling matplotlib numpy pandas
Last synced: 15 Nov 2024
https://github.com/solrikk/pictrace-web
PicTraceV2 is a highly efficient image matching platform that leverages computer vision using OpenCV, deep learning with TensorFlow and the ResNet50 model, asynchronous processing with aiohttp, and Selenium for browser automation. PicTraceV2 allows users to upload images directly or provide URLs, quickly scanning a vast database to find image
automation computer-vision data-analysis data-extraction deep-learning image-processing image-search machine-learning natural-language-processing opencv openpyxl pandas python selenium tensorflow web-scraping yandex yandex-api
Last synced: 09 Jan 2025
https://github.com/moindalvs/learn_eda_house_price_dataset
Data Set: House Prices: Advanced Regression Techniques Exploratory Data Analysis on more than 80 features
cardinality data-analysis data-science data-structures data-visualization missing-values
Last synced: 17 Nov 2024
https://github.com/jpcadena/solid-principles-machine-learning
S.O.L.I.D. Principles for Machine Learning project.
clean-code data-analysis data-engineering data-science deep-learning dependency-inversion-principle design-patterns design-principles interface-segregation-principle liskov-substitution-principle machine-learning machine-learning-models mlops models open-closed-principle pylint python single-responsibility-principle software-engineering solid-principles
Last synced: 15 Nov 2024
https://github.com/jpcadena/classification-tweets-national-security-ecuador
Classification of Tweets about national security at Ecuador 2022
classification classification-model data-analysis data-science ecuador insecurity machine-learning natural-language-processing nlp nltk numpy pandas python pytorch scikit-learn snscrape supervised-learning tensorflow tweet twitter
Last synced: 15 Nov 2024
https://github.com/wiseaidev/truth-guard
Analyzing a 79k Dataset of Misinformation and Fake News
data-analysis fastapi lstm machine-learning python supervised-learning
Last synced: 20 Dec 2024
https://github.com/jpcadena/onemetric-plus
OneMetric+ project for analytical tool on demand forecast and outlier detection
black-formatter data-analysis data-analytics data-science data-visualization demand-forecasting isort machine-learning matplotlib mypy numpy outlier-detection pandas pre-commit-hook pydantic python ruff scikit-learn seaborn solid-principles
Last synced: 15 Nov 2024
https://github.com/jpcadena/car-sales-etl
ETL process for a Car Sales project.
asyncpg car-sales data-analysis data-engineering data-visualization database etl etl-pipeline postgresql python sqlalchemy
Last synced: 15 Nov 2024
https://github.com/blagojeblagojevic/lol_data_analysis
classification data-analysis data-science jupyter-notebook kaggle python3
Last synced: 21 Dec 2024
https://github.com/cworld1/novel-analysis
A simple project for analyzing Chinese novels
Last synced: 23 Nov 2024
https://github.com/cworld1/da-learning
Some notes and code about CWorld learning Database Analysis
data-analysis data-science jupyter-book jupyter-notebook python r
Last synced: 23 Nov 2024
https://github.com/mokeddembillel/student-performance-prediction
Using Machine learning to predict a student final grade
data-analysis data-exploration feature-extraction feature-importance feature-selection linear-regression machine-learning power-bi principal-component-analysis regression spyder student-performance-prediction svm-regressor
Last synced: 20 Nov 2024
https://github.com/netcodez/data-science-projects
Data Science Projects completed on DataCamp Data Scientist with Python Career Track
data data-analysis data-visualization datacleaning feature-engineering feature-extraction machine-learning predictive-analytics predictive-modeling python scikit-learn-python scikitlearn-machine-learning statistical-analysis statistical-models
Last synced: 15 Nov 2024
https://github.com/wildanmujjahid29/books-sales-analytics-python
Books Sales Analytics With Pyhton
data data-analysis data-science data-visualization
Last synced: 07 Jan 2025
https://github.com/revan-alqahmi/summarize-talabat-company-reviews
Natural Language Processing Project, which is a program that analyzes Arabic comments at Talabat Company and classifies them into positive, negative, and neutral using machine learning algorithms and natural language processing techniques.
artificial-intelligence data-analysis machine-learning-algorithms natural-language-processing python
Last synced: 29 Dec 2024
https://github.com/mathieu2301/pbsc-tracker
Expérience de tracking des vélos en libre service fonctionnants avec PBSC
ai data-analysis data-mining data-science data-visualization libelo machine-learning pbsc valence velib-tracker
Last synced: 15 Nov 2024
https://github.com/johnsesana/eda-video-game-sales
Exploratory Data Analysis on Public Datasets
data-analysis data-visualization excel
Last synced: 16 Nov 2024
https://github.com/allanccwang/electronic_projects
implement the circuit with microcontroller
arduino circuit-analysis circuit-simulations circuits-and-electronics cpp data-analysis microcontroller physics python wemos
Last synced: 17 Dec 2024
https://github.com/johnsesana/eda-liquor-sales
Exploratory Data Analysis on Public Datasets
data-analysis data-visualization sql tableau-dashboards
Last synced: 16 Nov 2024
https://github.com/sandk21/detection_faux_billets
Algorithme de détection de faux billets selon leurs dimensions géométriques et application web pour générer les prédictions
data-analysis data-science data-visualization machine-learning pandas python scipy sklearn streamlit
Last synced: 07 Dec 2024
https://github.com/aekanshd/crazytics-suicidesindia
Basic interpretation of the Suicides in India data-set using R.
data-analysis data-science graph india r suicides
Last synced: 15 Nov 2024
https://github.com/phomint/udacity_dataanalysis
All projects and activities
data-analysis python udacity-nanodegree
Last synced: 15 Nov 2024
https://github.com/yash22222/data-analysis-with-python
This repository provides a practical introduction to data acquisition and analysis using Pandas. It covers loading datasets, exploring data, manipulating data, and gaining insights through statistical summaries. Ideal for beginners, it offers code examples and explanations to enhance your data manipulation skills using Pandas for Python.
binning data data-acquisition data-analysis data-binning data-cleaning data-formatting data-integration data-normalization data-preprocessing data-science data-transformation data-wrangling dataframe description numpy pandas pandas-dataframe python python3
Last synced: 05 Jan 2025
https://github.com/vatshayan/list-of-animals-data-classification-
Classification & Visualization of List of Animals Data set using Machine Learning Algorithm
animal-behavior animal-data animals artificial-intelligence classification data data-analysis data-mining data-science data-visualization dataset jupyter-notebook machine-learning python supervised-learning
Last synced: 15 Nov 2024
https://github.com/yash22222/tata-data-visualisation-virtual-internship
Data Visualisation: Empowering Business with Effective Insights Gain insights into leveraging data visualisations as a tool for making informed business decisions.
basics ceo charts cmo data-analysis data-interpretation data-science data-visualization graphs machine-learning mcq microsoft-excel microsoft-power-bi microsoft-word powerpoint-presentations python tableau tata tata-data-visualisation
Last synced: 05 Jan 2025
https://github.com/ahmednasef3/heart-attack-full-eda
Simple EDA for Heart Attack Dataset.
data-analysis data-science data-visualization eda exploratory-data-analysis heartattack matplotlib pandas seaborn
Last synced: 15 Nov 2024
https://github.com/mengyaohuang/data-manipulation-and-analysis
Data processing implementation with tools in Python
data-analysis nlp-machine-learning pandas-dataframe python
Last synced: 05 Dec 2024
https://github.com/bilal-belli/personalacademicdocuments
This repository contains some personal academic assignments, maybe it will help someone!
compilation computer-architecture data-analysis data-structures-and-algorithms database front-end hpc networking operating-systems signal-processing
Last synced: 16 Nov 2024
https://github.com/vatshayan/hospital-discharge-analysis
Analysis of Hospitalization Discharge Rates in Lake County, Illinois of various attributes like Anxiety, Alcohol, mood, Diabetes, Asthma, etc
data-analysis data-visualization jupyter-notebook machine machine-learning machine-learning-algorithms scikit-learn
Last synced: 15 Nov 2024
https://github.com/hayesall/babybear
🐼 It's like pandas, but tiny.
data-analysis data-analysis-python data-science dataframe python teaching teaching-tool
Last synced: 15 Nov 2024
https://github.com/seabbs/explorebcgonoutcomes
Analysis to explore the association of BCG vaccination and TB outcomes.
bcg data-analysis regression rstats tuberculosis
Last synced: 01 Jan 2025
https://github.com/turquetti/projeto5-vamoai
Projeto final da Resilia + iFood <3
Last synced: 15 Nov 2024
https://github.com/ahmednasef3/titanic-full-eda
Simple EDA for Titanic Dataset.
data-analysis data-visualization eda exploratory-data-analysis matplotlib pandas seaborn titanic titanic-data-analytics
Last synced: 15 Nov 2024
https://github.com/virajbhutada/uk-road-traffic-analytics-excel-sql-powerbi-tableau
This portfolio project presents comprehensive analysis of road accidents data using Excel, SQL queries, Power BI visualizations, and Tableau dashboards. This repository showcases the integration of multiple analytical tools, offering actionable insights to enhance road safety and mitigate accidents.
analytics data-analysis data-science data-visualization excel microsoft-sql-server powerbi powerbi-visuals road-safety sql tableau tableau-public
Last synced: 17 Nov 2024
https://github.com/ahmednasef3/udemy-courses-full-eda
Exploratory Data Analysis on the factors that can affect the promotions and earnings in Udemy Courses and the perfect way to make a good saled course in Udemy.
data-analysis data-science data-visualization eda exploratory-data-analysis matplotlib pandas seaborn udemy-course-project
Last synced: 15 Nov 2024
https://github.com/daniel1kp/openrtb-dashboard
This is a demo project designed to illustrate using Rill to analyze programmatic bid logs using the canonical open RTB framework.
data-analysis openrtb real-time-bidding rill
Last synced: 15 Nov 2024
https://github.com/ahmednasef3/store-sales-full-eda
Simple EDA for Store Sales.
data-analysis data-visualization eda exploratory-data-analysis matplotlib pandas plotly seaborn store
Last synced: 15 Nov 2024
https://github.com/virajbhutada/google-stock-price-forecasting-lstm
Analyzing and predicting Google's stock prices through detailed data exploration and advanced LSTM models. This project involves data preprocessing, creating time-series sequences, constructing and training LSTM networks, and evaluating their performance to forecast future stock prices utilizing Python and Machine Learning libraries.
data-analysis data-science data-visualization future-prediction google-dataset google-stock-price-prediction google-stocks lstm-model lstm-neural-network machine-learning machine-learning-models matplotlib model-building model-training numpy python stock-forecasting
Last synced: 10 Jan 2025
https://github.com/dannyben/datamix
DSL for manipulating tabular data
csv data data-analysis data-engineering gem ruby tabular-data
Last synced: 07 Dec 2024
https://github.com/virajbhutada/movie-rental-store-analytics-sql-powerbi-excel
Dive into the DVD rental industry with my Capstone project, Movie Rental Analytics. Analyzing the Sakila DVD Rental Store Database, I extract insights through exploratory data analysis (EDA) and Power BI visualizations. Findings inform strategies for optimizing film inventory, enhancing business operations, and customer experiences.
business-intelligence capstone-project customer-behavior-analysis data-analysis data-science excel exploratory-data-analysis film-ratings mece movie-database movie-rental mysql powerbi powerbi-visuals revenue-analysis sql sql-database
Last synced: 10 Jan 2025
https://github.com/victor-lis/regression-ai-model
ai data-analysis python regression-model
Last synced: 14 Dec 2024
https://github.com/prithivsakthiur/data-board
Data Boards - Visualization of various plots ( Analysis )
data-analysis gradio huggingface keras mathplotlib pandas plots pyplot scikit-learn seaborn spaces
Last synced: 21 Dec 2024
https://github.com/seekinginfiniteloop/fedcal
A feature-rich Python calendar that enables time series analyses of changes in federal workforce schedules and shifts in executive department funding status.
data-analysis data-science econometrics economic-data economics federal federal-government hr pandas pandas-library pandas-python pydata python
Last synced: 14 Oct 2024
https://github.com/zen204/airbnb_availability
A machine learning model that predicts Airbnb listing availability, utilizing feature engineering and supervised learning techniques to improve guest experience and optimize host management.
binary-classification data-analysis data-preprocessing data-visualization feature-engineering machine-learning matplotlib model-evaluation nlp pandas predictive-modeling python scikit-learn seaborn supervised-learning
Last synced: 03 Nov 2024
https://github.com/virajbhutada/walmart-retail-analyzer
Gain valuable insights into retail sales with the "Walmart Retail Performance Dashboard" in MS Excel. This user-friendly tool facilitates an in-depth analysis of key sales metrics, providing a comprehensive view of Walmart's performance. Make data-driven decisions for informed and strategic business outcomes.
analytics data-analysis data-science data-visualization excel insights interactive-visualizations performance-analysis retail-sales walmart
Last synced: 10 Jan 2025
https://github.com/saltiola7/data-analysis-portfolio
Data engineering & analysis portfolio, which showcases my use of Python & SQL
airflow airtable-block anaconda automation back4app chatgpt csv-parser data-analysis data-engineering docker-compose gcp graphql-api jupyter-notebook nosql prefect python rest-api sql streamlit web-scraping
Last synced: 21 Dec 2024
https://github.com/jubinjacob03/heartdiseaseclassify-ml
Heart Disease Dataset Analysis & Classification using ML models such as linear, support vector machine, k-means, k-nearest neighbors and logistic regression.
data-analysis data-science data-visualization ipython-notebook kaggle-dataset kmeans knn linear-regression logistic-regression machine-learning matplotlib python seaborn support-vector-machine
Last synced: 11 Oct 2024
https://github.com/vkbo/osirisanalysis
Matlab toolbox for analysing simulation results from Osiris 3
data-analysis matlab matlab-gui physics-simulation
Last synced: 16 Nov 2024
https://github.com/jxareas/de-zoomcamp-2024
Solutions for @datatalksclub's Data Engineering Zoomcamp 2024.
data-analysis data-engineering data-science database datascience de-zoomcamp docker docker-compose etl etl-pipeline mage-ai orchestration python workflow
Last synced: 19 Nov 2024
https://github.com/airdac/sim-telco_customer_churn
Prediction of customer churn with logistic regression in R. Team project from UPC's Master's Degree in Data Science
classification data-analysis data-science logistic-regression r statistical-models upc
Last synced: 14 Nov 2024
https://github.com/talha-1010/imdb-data-analysis
A data analysis project made with python using pandas
data-analysis data-visualization jupyter-notebook pandas pandas-dataframe
Last synced: 14 Nov 2024
https://github.com/lykmapipo/scala-spark-product-sales-analysis
Scala application to process, and analyze product sales using Spark
anomaly-detection apache-spark apache-spark-sql customer-segmentation data-analysis data-processing lykmapipo market-basket-analysis product-sales product-sales-analysis rolling-average running-total sbt scala summary-statistics time-series-analysis
Last synced: 21 Dec 2024
https://github.com/anilkumarteegala/aspiration.ai-ml-internship
This repo contains the internship project by Career Launcher.
data-analysis data-science financial internship machine-learning python3 stock-analysis stock-market visualization
Last synced: 13 Nov 2024
https://github.com/mardavsj/weather-prediction
Weather prediction model which mainly focuses on visualization.
data-analysis data-visualization matplotlib numpy pandas pandas-dataframe
Last synced: 21 Dec 2024
https://github.com/dual-points/dplearn
A Python package for data analysis.
data-analysis data-science python python-package
Last synced: 13 Nov 2024
https://github.com/shadan100/sales-prediction-analysis
The aim is to build a predictive model and find out the sales of each product at a particular store. Using this model, BigMart will try to understand the properties of products and stores which play a key role in increasing sales.
artificial-intelligence data-analysis data-science django django-framework jupyter-notebook machine-learning matplotlib pandas predictive-modeling python sales-prediction
Last synced: 11 Oct 2024
https://github.com/aravind-selvam/bikeshare-company-analysis
Google Data Analytics Professional Certificate program's Capstone project, of a bike sharing company
analytics business-analytics business-intelligence data data-analysis data-visualization dataanalytics google-data-analytics postgresql sql sql-server
Last synced: 14 Nov 2024
https://github.com/mohammadreza-mohammadi94/data-analysis-and-machine-learning-projects
A comprehensive collection of data analysis and machine learning projects, showcasing techniques and models for various data challenges. Dive in to explore code examples, analyses, and machine learning workflows.
data-analysis data-science dataframes exploratory-data-analysis pandas python scikit-learn visualization
Last synced: 07 Nov 2024