Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2024-11-14 00:06:25 UTC
- JSON Representation
https://github.com/geo-y20/loan-approval-automation-using-mongodb-and-pymongo
This project demonstrates the implementation of a loan approval system that utilizes MongoDB for distributed data storage and management, and PyMongo for database operations. The project aims to automate the assessment of loan eligibility using customer details from online applications.
crud-application data data-analysis data-science data-visualization deployment jupyter-notebook loan-default-prediction loan-prediction-analysis machine-learning machine-learning-algorithms matplotlib mongodb pymongo streamlit web
Last synced: 10 Nov 2024
https://github.com/zpreisler/modules
Python libraries and modules for processing simulation outputs
data-analysis python scripts tensorflow
Last synced: 12 Nov 2024
https://github.com/rayyan9477/coin-detection-project
This Coin Detection Project leverages machine learning techniques to identify coins using a dataset from Kaggle. Key libraries utilized include OpenCV for image processing, TensorFlow for model training, and Pandas for data manipulation. The project also employs NumPy for numerical operations and Matplotlib for visualization.
computer-vision data-analysis data-science data-visualization machine-learning notebook python
Last synced: 11 Nov 2024
https://github.com/markmusic27/data-statistics-calculator
💣 This method (made in JavaScript / Python) can find the mean, median, mode, range, and standard deviation.
data-analysis standard-deviation statistics statistics-calculator
Last synced: 09 Nov 2024
https://github.com/carmoreno/analisisaccidentalidadbogota
Data Analysis about traffic accidents at Bogotá, Colombia.
data-analysis data-science jupyer-notebook matplotlib numpy pandas scikit-learn
Last synced: 09 Nov 2024
https://github.com/leocornus/leocornus-visualdata
JavaScript libraries to make data visualization simpler and easier.
data-analysis data-mining data-visualization data-visualization-simpler javascript-library
Last synced: 10 Nov 2024
https://github.com/vkbo/osirisanalysis
Matlab toolbox for analysing simulation results from Osiris 3
data-analysis matlab matlab-gui physics-simulation
Last synced: 12 Oct 2024
https://github.com/luiscib3r/streamlit-examples
Streamlit examples.
data-analysis data-science machine-learning python streamlit
Last synced: 05 Nov 2024
https://github.com/shahaf-f-s/feature-space
A modular framework for combining pandas series features
data-analysis data-science feature-engineering
Last synced: 10 Nov 2024
https://github.com/victoriapm/analyze_a-b_test_results
Understand the results of an A/B test run by an e-commerce website.
ab-testing data-analysis ecommerce-website
Last synced: 12 Oct 2024
https://github.com/sunnybibyan/random_data_generation
A project that generates a dataset using various statistical distributions (Normal, Uniform, Exponential, Random Integers, and Binomial) and performs data analysis. Includes visualizations and an option to export the data as a CSV file.
data-analysis data-visualization python random-data-generation statistics streamlit-webapp
Last synced: 09 Nov 2024
https://github.com/abhi18av/innovation-competition
Submission for a programming challenge
clojure clojurescript data-analysis
Last synced: 09 Nov 2024
https://github.com/discdiver/new-belgium-ratings
Find the most popular New Belgium beers of all time!
beautifulsoup data-analysis pandas python seaborn webscraping
Last synced: 11 Nov 2024
https://github.com/souravsuvarna/whatsapp-chat-analyzer-api
The WhatsApp Chat Analyzer API is a public api specifically designed for frontend enthusiasts who are interested in building a WhatsApp Chat Data Visualizer project. Built on FastAPI, this API offers a seamless and efficient method to process chat data and returns the processed result data in JSON format.
api data-analysis data-science fastapi publicapi python
Last synced: 09 Nov 2024
https://github.com/fx2y/datanarrate
[WIP] LLM-powered agent for adaptive data analysis across multiple sources. Uses natural language for complex queries, visualizations, and insights. Features autonomous planning, SQL/Elasticsearch generation, and AI storytelling. Built with LangChain, GPT-4, FastAPI, and React.
ai data-analysis data-visualization elasticsearch fastapi gpt-4 langchain machine-learning nlp react sql
Last synced: 11 Oct 2024
https://github.com/lightbridge-ks/zoominterface
A data analysis Shiny app of program Zoom report files.
data-analysis r shiny-apps zoom-class zoom-meetings
Last synced: 11 Oct 2024
https://github.com/wiseaidev/corona-virus-data-analysis-modeling-and-visualization
Data analysis of covid-19 and SEIRD model implementation.
coronavirus coronavirus-tracking covid-19 data-analysis data-analysis-python data-visualization folium-maps modeling-dynamic-systems numpy ploty population python3 science science-research seird-model seird-simulator simulation
Last synced: 22 Oct 2024
https://github.com/jamiemagee/rhi
Collating the data on the Renewable Heat Incentive scheme, and presenting it in a more readable format.
data-analysis open-data open-government rhi
Last synced: 10 Nov 2024
https://github.com/akshat0427/python_youtube_history
a bunch of data science operations performed on youtube history data
data-analysis data-science extracting-features
Last synced: 12 Nov 2024
https://github.com/arhcoder/base-hackathon-2022
💸 Sistema que analiza las facturas de compra-venta de una empresa de importaciones y exportaciones, y crea una base de conocimiento con la que crea sugerencias de abastecimiento para las empresas clientes de Banco BASE, con el fin de ahorrarles dinero.
algorithms bank companies data-analysis decision-making exportation hackaton importation javascript mysql python suggestions
Last synced: 11 Nov 2024
https://github.com/emaasit/pydata-book
Learning data analysis with python
data-analysis jupyter pandas python
Last synced: 13 Oct 2024
https://github.com/semasuka/income-classification
Predicting if an individual make more than 50K using different features
aws-s3 binary-classification data-analysis data-science data-visualization eda finance-analytics machine-learning precision python random-forest-classifier scikit-learn streamlit
Last synced: 11 Nov 2024
https://github.com/wiseaidev/truth-guard
Analyzing a 79k Dataset of Misinformation and Fake News
data-analysis fastapi lstm machine-learning python supervised-learning
Last synced: 22 Oct 2024
https://github.com/emso-c/stream-analyser
A tool that analyses YouTube live streams.
cli data-analysis guessing highlights python youtube-video
Last synced: 11 Oct 2024
https://github.com/nicholaskross/yt-pscore-analysis
Analysis of the Oct 2019 p-score dataset
analytics data-analysis data-cleaning social-media-analysis youtube youtube-channel
Last synced: 15 Nov 2024
https://github.com/shadan100/stroke-prediction-analysis
A web based application to predict whether a patient is likely to get stroke based on the input parameters like gender, age, various diseases, and smoking status. Each row in the data provides relevant information about the patient.
artificial-intelligence data-analysis data-science django django-framework jupyter-notebook machine-learning matplotlib pandas predictive-modeling python stroke-prediction web-application
Last synced: 11 Oct 2024
https://github.com/burhanahmed1/data-analysis-with-python
Data-Acquisition and Basic Insights, Data Wrangling, Exploratory Data Analysis (EDA), and Training Prediction Models(Machine Learning) on two datasets.
data-analysis data-aquisition data-insights data-science data-wrangling dataanalytics datascience-machinelearning eda exploratory-data-analysis machine-learning-models matlpotlib numpy pandas practice-programming prediction-model python scikit-learn scikitlearn-machine-learning seaborn
Last synced: 10 Nov 2024
https://github.com/metalwarrior665/actor-results-checker
apify data-analysis json-schema-checker
Last synced: 09 Nov 2024
https://github.com/foxriver76/iobroker.intelliflow
Stream data analysis adapter for ioBroker.
data-analysis iobroker machine-learning streaming-data
Last synced: 11 Oct 2024
https://github.com/mohnoor94/datasciencefundementalsusingpython
My journey to learn Data Science with Python
data data-analysis data-science data-visualization learning learning-by-doing python python3
Last synced: 12 Oct 2024
https://github.com/rayyan9477/diamond-price-forecasting
This is a comprehensive machine learning project focused on predicting diamond prices. Using a dataset of diamond attributes, the project implements various machine learning models to forecast prices. Key features include data preprocessing, exploratory data analysis (EDA), and model training with algorithms such as Linear Regression, Decision Tree
data-analysis data-science decision-trees eda linear-regression machine-learning
Last synced: 11 Nov 2024
https://github.com/dcs-training/introcausalinference
This is a repository for the Introduction to Causal Inference course provided by Chris Oldnall for the CDCS. Go to the readme file
data-analysis python r statistics
Last synced: 10 Nov 2024
https://github.com/drill-n-bass/data-analysis-projects
Projects related to my Data Analyst path.
analysis data-analysis data-visualization matplotlib matplotlib-pyplot mysql mysql-database numpy pandas pandas-dataframe pandas-library pandas-python python python3 seaborn seaborn-plots static-analysis statistics
Last synced: 07 Nov 2024
https://github.com/gf712/abpytools-qt
Qt interface of AbPyTools
antibody-numbering antibody-sequences cpp11 data-analysis python3 qt5
Last synced: 14 Oct 2024
https://github.com/famarks/grafarg
Grafarg is an interactive data analytics and graphical data visualization application. Grafarg being a progressive fork of Grafana 7.5.17 continues to be available under open source Apache 2.0 License
analytics charts data data-analysis data-science data-visualization grafana grafarg graph
Last synced: 12 Oct 2024
https://github.com/grypesc/graduateadmissions
Visualization, analysis and predictive modeling of a Kaggle graduate admissions dataset.
data-analysis data-mining data-science data-visualization dataset
Last synced: 28 Oct 2024
https://github.com/rayyan9477/multiple-disease-prediction-system
This repository contains a Multiple Disease Prediction System leveraging machine learning techniques for accurate predictions. It utilizes Python, Pandas, Scikit-learn, and Flask for data preprocessing, model building, and web deployment. Explore the project and connect on LinkedIn for collaborations.
data-analysis data-science machine-learning python streamlit
Last synced: 11 Nov 2024
https://github.com/dr-saad-la/r-distilled
R Programming Language distilled
data data-analysis learning programming-language r rlanguage rprogramming statistical-analysis
Last synced: 09 Nov 2024
https://github.com/raad07/sql_project-world_layoffs_dataset
This is a SQL project which comprises the Data Cleaning in the first part and Exploratory Data Analysis (EDA) in the second part.
data-analysis database mysql sql
Last synced: 13 Oct 2024
https://github.com/gappeah/nike_web_crawler
This project involves web scraping Nike's product pages to extract product names, prices, and links. The project showcases three different implementations of the web crawler using Selenium and BeautifulSoup. It also includes visualisation of the scraped data using Matplotlib and Seaborn.
beautifulsoup data-analysis data-visualization python selenium web-crawler web-scraper webcrawler webscraper webscraping webscraping-beautifulsoup
Last synced: 10 Nov 2024
https://github.com/csoren66/diabetics_prediction
Predicting that whether the patient has diabetes or not on the basis of the features we will provide to our machine learning model.
data-analysis machine-learning python svm
Last synced: 14 Nov 2024
https://github.com/mindgamesnl/yanderestats
https://mindgamesnl.github.io/YandereStats/
data-analysis reporting-pipeline yandere yandere-sim
Last synced: 08 Nov 2024
https://github.com/sumidcyber/dataviz-master
This Python application provides a user-friendly interface to load and visualize the contents of a CSV file. Users can choose from various types of graphs and perform analyses on the dataset.
data-analysis data-analysis-project data-analysis-python database databases python python3
Last synced: 13 Oct 2024
https://github.com/verbasik/yandex.practicum.datascience
Портфолио проектов Data Science, выполненных в рамках профессиональной переподготовки в Яндекс.Практикум. Включает исследования в области финансов, недвижимости, кинопроката и других, с использованием статистики, машинного обучения и анализа данных.
data-analysis data-science machine-learning yandex-praktikum
Last synced: 11 Nov 2024
https://github.com/alexandregazagnes/rica-analysis
This repository contains the code to download, analyse, and modelize the RICA dataset from the french ministry of agriculture.
analysis argiculture business data data-analysis data-analytics food python
Last synced: 09 Nov 2024
https://github.com/bishtrishu/pizza_sales_data_analysis_sql
This project is a comprehensive data analysis of pizza sales, aimed at uncovering key insights and trends to inform business decisions. Using a combination of SQL, Python, and data visualization tools, the project analyzes sales data to understand customer preferences, peak sales periods, and the most popular pizza types.
cloud data data-analysis data-science data-visualization dataanalytics database mysql oracle-database
Last synced: 09 Nov 2024
https://github.com/walkerdustin/vergleich-von-messmethoden-fuer-punktwolken
Bei der Vermessung eines physischen Raumes ist das Ergebnis eine Punktwolke. Diese Punktwolke beschreibt dann ausgewählte Punkte im Raum, zum Beispiel auf den Wänden und der Decke. Wenn diese Punkte in zwei seperaten Messungen gemessen werden, vielleicht sogar von unterschiedlichen Geräten, soll hinterher herausgefunden werden wie genau diese Punktwolken übereinstimmen. Dafür gibt es zwei grundsätzlich verschiedene Methoden. Diese sollen hier verglichen werden.
3d-models accuracy-metrics data-analysis data-visualization kaggle measure-distance numpy point-cloud pointcloudprocessing punkte python science-research simulation statistics
Last synced: 08 Nov 2024
https://github.com/dannyben/datamix
DSL for manipulating tabular data
csv data data-analysis data-engineering gem ruby tabular-data
Last synced: 17 Oct 2024
https://github.com/rohithsaji97/open_gate_dip
An automatic gate opening system with an additional parking system (using Raspberry PI).
automated data-analysis digital-image-processing opencv python3 raspberry-pi-3 trained-models
Last synced: 07 Nov 2024
https://github.com/sandk21/detection_faux_billets
Algorithme de détection de faux billets selon leurs dimensions géométriques et application web pour générer les prédictions
data-analysis data-science data-visualization machine-learning pandas python scipy sklearn streamlit
Last synced: 17 Oct 2024
https://github.com/rayyan9477/youtube-spam-detection-with-flask-and-machine-learning
This is a web application built using Flask that detects spam comments on YouTube using a Naive Bayes classifier. It leverages techniques such as CountVectorizer for feature extraction and scikit-learn for machine learning. The application reads data from a CSV file and predicts whether a comment is spam or not.
data-analysis data-science machine-learning nlp-machine-learning spam-detection
Last synced: 11 Nov 2024
https://github.com/gesiscss/wikipedia-language-olga-master
Measuring Gender Inequalities of German Professions on Wikipedia
bias crowdflower data-analysis data-science gender images python statistics wikipedia
Last synced: 09 Nov 2024
https://github.com/techshot25/baltimore-911-calls
Analysis of 911 calls provided by the city of Baltimore.
data-analysis data-science decision-tree-classifier logistic-regression machine-learning machine-learning-algorithms statistics
Last synced: 10 Nov 2024
https://github.com/serhatderya/medical_examination_research
This repository contains a research about medical examinations (such as body measurements, results from various blood tests, and lifestyle choices).
catplot data-analysis data-analytics data-cleaning data-preparation data-preprocessing data-science data-visualization eda exploratory-data-analysis exploratory-data-visualizations heatmap jupyter-notebook medical preprocessing python research seaborn
Last synced: 09 Nov 2024
https://github.com/misszeferino/sql-projects
bigquery data-analysis mysql queries sql sqlite3
Last synced: 12 Oct 2024
https://github.com/rayyan9477/household-transactions-analysis-and-clustering
This project involves analyzing household transaction data to gain insights into spending patterns and behaviors. The analysis includes data cleaning, exploratory data analysis (EDA), clustering using K-Means, and visualization of customer segments.
customer-segmentation data-analysis data-cleaning data-science exploratory-data-analysis kmeans-clustering machine-learning
Last synced: 11 Nov 2024
https://github.com/samuelsoaress/python-study-datascience-ia
My data science and AI studies
data-analysis data-crawler data-mining data-science deep-learning machine-learning-algorithms
Last synced: 10 Nov 2024
https://github.com/matthewgrosman/messenger-analytics
Project that ingests Facebook Messenger conversations and generates analytics.
analytics data-analysis excel facebook facebook-messenger java mongodb
Last synced: 08 Nov 2024
https://github.com/cano1998/eda-survival-of-the-titanic
This project focuses on Exploratory Data Analysis (EDA) to identify the key determinants that influenced survival during the infamous Titanic accident.
data-analysis data-cleaning data-preprocessing data-visualization exploratory-data-analysis jupyter-notebook titanic-survival-exploration
Last synced: 09 Nov 2024
https://github.com/1ayanabil1/healthcare-machine-learning
Explore our open-source repository focused on healthcare machine learning. We've developed predictive models for cardiovascular disease, diabetes, breast cancer, and more. Our projects employ diverse machine learning algorithms and data science techniques, enhancing early detection, diagnosis, and patient outcomes.
data-analysis data-science deep-learning disease disease-detection disease-modeling disease-prediction eda healthcare-application heathcare jupyter-notebook machine-learning machine-learning-algorithms machinelearning-python python
Last synced: 10 Nov 2024
https://github.com/sumitgirwal/procoder-public
"ProCoder", which is a web-based application providing massive open online courses for both professionals and students. It aims to offer a platform for learning coding skills online, accessible to anyone who is interested in learning programming or enhancing their coding knowledge. ProCoder provides courses on various programming languages, tools.
blog-platform bootstrap-4 chat-application css3 data-analysis django-crud django-project html5 javascript numpy-library pandas-library python3
Last synced: 10 Nov 2024
https://github.com/BigBangData/TimesheetAnalysis
R shiny app to help analyze a bookkeeper's business - or anyone with a timesheet and some time.
bookkeeping data-analysis data-viz r-programming shiny-apps shiny-r timesheet-management
Last synced: 13 Aug 2024
https://github.com/alexandrelamarre/fission
Data analytics & Structured streaming optimized for the Edge
data-analysis data-engineering rust structured-data unstructured-data
Last synced: 12 Nov 2024
https://github.com/saltiola7/data-analysis-portfolio
Data engineering & analysis portfolio, which showcases my use of Python & SQL
airflow airtable-block anaconda automation back4app chatgpt csv-parser data-analysis data-engineering docker-compose gcp graphql-api jupyter-notebook nosql prefect python rest-api sql streamlit web-scraping
Last synced: 04 Nov 2024
https://github.com/shipyardapp/amazonathena-blueprints
Simplified blueprints for building data pipelines with Amazon Athena.
amazon-athena athena cli data-analysis data-engineering data-science elt etl
Last synced: 13 Aug 2024
https://github.com/shadan100/sales-prediction-analysis
The aim is to build a predictive model and find out the sales of each product at a particular store. Using this model, BigMart will try to understand the properties of products and stores which play a key role in increasing sales.
artificial-intelligence data-analysis data-science django django-framework jupyter-notebook machine-learning matplotlib pandas predictive-modeling python sales-prediction
Last synced: 11 Oct 2024
https://github.com/2003harsh/house-price-prediction-using-machine-learning
This project features a web app that predicts house prices using a linear regression model. Users can input details like location, square footage, bathrooms, and bedrooms through an HTML form. I've added a CI/CD pipeline with GitHub Actions, unit testing with pytest, and automated Docker containerization to improve deployment and robustness.
ci-cd data-analysis docker-image flask linear-regression machine-learning matplotlib mlops-workflow requests scikit-learn
Last synced: 10 Oct 2024
https://github.com/shipyardapp/postgresql-blueprints
Simplified blueprints for building data pipelines with PostgreSQL.
cli data-analysis data-engineering data-pipeline data-science database elt etl postgres postgresql
Last synced: 13 Aug 2024
https://github.com/kaz-yos/distributed
Comparison of Privacy-Protecting Analytic and Data-sharing Methods: a Simulation Study (Pharmacoepidemiol Drug Saf 2018)
data-analysis epidemiology statistics
Last synced: 12 Nov 2024
https://github.com/mg380/ibm-applied-data-science-capstone
This Capstone is the 10th (final) course in IBM Data Science Professional Certificate specialization, and it actually summarises in the form of project all materials that have been learned during this specialization
capstone data data-analysis data-science datascience ibm machine-learning plotly python scikit-learn sql
Last synced: 10 Oct 2024
https://github.com/alejo1630/chicago_crimes
A Jupyter Notebook with the data analysis and data visualization of crimes in Chicago from 2017 to 2023 using libraries such as seaborn and folium
data-analysis data-visualization folium pandas python seaborn
Last synced: 08 Nov 2024
https://github.com/alejo1630/sport_stats
Data analysis of information from the summer and winter Olympic games over the years. UC Davis SQL Specialization Final Project
data-analysis jupyter-notebook olympics-dataset plotly python seaborn sql
Last synced: 08 Nov 2024
https://github.com/mohammadreza-mohammadi94/data-analysis-and-machine-learning-projects
A comprehensive collection of data analysis and machine learning projects, showcasing techniques and models for various data challenges. Dive in to explore code examples, analyses, and machine learning workflows.
data-analysis data-science dataframes exploratory-data-analysis pandas python scikit-learn visualization
Last synced: 07 Nov 2024
https://github.com/gauravcodepro/numpy-builder
A numpy shell builder to extract and how to use the numpy across the arrays.I am putting the entire manual for those who like to search immediately rather than looking here and there.
bash-prompt bash-script bash-scripting data-analysis data-mining data-science numpy numpy-arrays shell-prompt shell-script
Last synced: 09 Nov 2024
https://github.com/cr-mao/machine-learning
机器学习笔记
data-analysis data-handling machine-learning math numpy pandas
Last synced: 09 Nov 2024
https://github.com/jen-uis/loan-status-prediction
This repository contains project materials for the Winter STAT 206 class, University of California, Riverside, A. Gary Anderson School of Management.
data data-analysis data-analytics data-cleaning data-visualization descriptive-analytics julia julia-language jupyter-notebook predictive-analytics predictive-modeling team-collaboration
Last synced: 12 Oct 2024
https://github.com/odeyiany2/flit-apprenticeship-data-science-projects
This repo contains all my projects for my FLiT Apprenticeship
data-analysis data-science data-visualization machine-learning sql
Last synced: 09 Nov 2024
https://github.com/christianrcanlas/christianrcanlas.github.io
e-Portfolio showcasing my personal projects.
arima classification-algorithims crostons-method data-analysis data-visualization data-warehousing etl-pipelines hierarchical-forecasting holt-winters long-short-term-memory machine-learrning ms-sql-server predictive-analytics python r-markdown support-vector-regression t-sql tableau time-series-decomposition time-series-forecasting
Last synced: 12 Oct 2024
https://github.com/giordano-lucas/tesco-extension
Products clustering and interactive visualization
clustering data-analysis data-visualization tesco
Last synced: 09 Nov 2024
https://github.com/nelsonkariuki/dataanalysis
This project involves data analysis of vido game sales from https://www.kaggle.com/gregorut/videogamesales/download
data-analysis data-visualization python
Last synced: 12 Nov 2024
https://github.com/ehopperdietzel/billionaires-analysis
Análisis de la cantidad de billonarios por país. Inspirado en el artículo "Russian Billionaires"
bootstrap data-analysis poisson-distribution prediction
Last synced: 30 Oct 2024
https://github.com/x1ao4/doc-merger
通过 python 脚本将两个相对不完整的文档合并为一个完整的文档 / merge two relatively incomplete documents into one complete document via python script
data-analysis data-merging document-analysis document-comparison document-processing documents filtering filtering-data merge merge-documents
Last synced: 08 Nov 2024
https://github.com/jubinjacob03/heartdiseaseclassify-ml
Heart Disease Dataset Analysis & Classification using ML models such as linear, support vector machine, k-means, k-nearest neighbors and logistic regression.
data-analysis data-science data-visualization ipython-notebook kaggle-dataset kmeans knn linear-regression logistic-regression machine-learning matplotlib python seaborn support-vector-machine
Last synced: 11 Oct 2024
https://github.com/virajbhutada/uk-road-traffic-analytics-excel-sql-powerbi-tableau
This portfolio project presents comprehensive analysis of road accidents data using Excel, SQL queries, Power BI visualizations, and Tableau dashboards. This repository showcases the integration of multiple analytical tools, offering actionable insights to enhance road safety and mitigate accidents.
analytics data-analysis data-science data-visualization excel microsoft-sql-server powerbi powerbi-visuals road-safety sql tableau tableau-public
Last synced: 12 Oct 2024
https://github.com/chayandatta/got_script_manipulation
Game of Thrones Script - String & file manipulation
data-analysis data-science pandas python3
Last synced: 19 Oct 2024
https://github.com/mdaffailhami/customer-data-analysis
This repository contains code and analysis for exploring customer data, focusing on profiling and contact preferences. The project includes various stages of data processing, from raw data preparation to final cleaned datasets, and employs Python and popular data analysis libraries to uncover insights and trends.
data-analysis data-cleaning data-science data-visualization jupyter jupyter-notebook pandas plotly python
Last synced: 13 Nov 2024
https://github.com/zeinhasan/eksploration-and-data-visualization-course-material
Exploratory Data Analysis (EDA) Laboratory Assistant Teaching Materials
data-analysis data-visualization statistics
Last synced: 08 Nov 2024
https://github.com/mattdelaune/retail_rfm_analysis
Power BI multi-page report leveraging advanced data visualization for RFM analysis. Delivers deep analytical insights into customer behavior, engagement, and spending patterns, driving strategic business decisions.
data-analysis dax powerbi report rfm-analysis sales-data visualization
Last synced: 08 Nov 2024
https://github.com/hordiales/redpanal-db-analysis
Analysis of the RedPanal.org music database
creative-commons data-analysis dataset etl machine-learning music music-information-retrieval statistical-analysis
Last synced: 23 Oct 2024
https://github.com/victor-lis/regression-ai-model
ai data-analysis python regression-model
Last synced: 26 Oct 2024
https://github.com/sahaavi/uber-vs-lyft
Advance Predictive Modeling in R
data-analysis data-science eda machine-learning predictive-modeling r
Last synced: 07 Nov 2024
https://github.com/ryanfranklin237/data-cleansing
A group of python scripts that clean large data sets by removing duplicate data, putting data in correct formats, and removing redundant cells
data-analysis data-cleaning data-science extract-transform-load pandas-dataframe python
Last synced: 12 Nov 2024
https://github.com/abhi-lab2/ipl-data-analysis
IPL data analysis for future predictions
data-analysis data-science python
Last synced: 07 Nov 2024
https://github.com/ryanfranklin237/data-visualization-python
A tool that allows you to visualize data from a csv or excel file in a graph or charts form
data-analysis data-science data-visualization matplotlib pandas-dataframe python
Last synced: 12 Nov 2024
https://github.com/ryanfranklin237/data-visualization-spreadsheets
Data visualization done with microsoft excel and google spreadsheets
data-analysis data-science data-visualization google-spreadsheets microsoft-excel
Last synced: 12 Nov 2024
https://github.com/lykmapipo/scala-spark-product-sales-analysis
Scala application to process, and analyze product sales using Spark
anomaly-detection apache-spark apache-spark-sql customer-segmentation data-analysis data-processing lykmapipo market-basket-analysis product-sales product-sales-analysis rolling-average running-total sbt scala summary-statistics time-series-analysis
Last synced: 04 Nov 2024
https://github.com/quantumudit/alteryx-weekly-challenges
This repository contains Alteryx solutions to the weekly challenges published in Alteryx Community
alteryx alteryx-workflow data-analysis data-science data-transformation data-visualization etl
Last synced: 06 Nov 2024