Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-20 00:07:30 UTC
https://github.com/luabagg/worldwide-trends
Worldwide Google Trends visualization and classification
data-analysis data-visualization google-trends trends
Last synced: 03 Feb 2026
https://github.com/karthikmprakash/911-call-dataanalysis
Data Analysis of Emergency (911) Calls: Fire, Traffic, EMS for Montgomery County, PA
911-call-analysis data-analysis data-visualization python3 united-states-data
Last synced: 10 May 2026
https://github.com/tqhungdev0605/crawl_200_jd_dataanalyst
Automate job data scraping for 200 Data Analyst postings on https://vn.indeed.com using Python
data-analysis jupyter-notebook python3 scraping selenium
Last synced: 11 Apr 2026
https://github.com/renatomaynard/statistical-modeling-and-regression-analysis-life-expectancy
Statistical Modeling and Regression Analysis for Life Expectancy
data-analysis healthcare linear-regression machine-learning predictive-modeling r regression-analysis statistical-models statitics
Last synced: 23 Mar 2025
https://github.com/zeinhasan/eksploration-and-data-visualization-course-material
Exploratory Data Analysis (EDA) Laboratory Assistant Teaching Materials
data-analysis data-visualization statistics
Last synced: 11 May 2026
https://github.com/shadan100/stroke-prediction-analysis
A web-based application that predicts whether a patient is likely to have a stroke based on input parameters such as gender, age, various diseases, and smoking status. Each row in the data provides relevant information about the patient.
artificial-intelligence data-analysis data-science django django-framework jupyter-notebook machine-learning matplotlib pandas predictive-modeling python stroke-prediction web-application
Last synced: 08 Mar 2026
https://github.com/narenkhatwani/arkouda-projects
This repository contains the source code of projects built with Arkouda, a software package that lets a user interactively issue massively parallel computations on distributed data using functions and syntax that mimic NumPy, the underlying computational library used in most Python data science workflows.
arkouda data-analysis data-analytics data-science high-performance high-performance-computing highperformancecomputing numpy pandas parallel-computing parallel-processing parallelization python
Last synced: 17 Apr 2026
https://github.com/agungbudiwirawan/sales-analysis-using-excel-formulas
The objective of this project is to analyze supermarket sales data using formulas in Microsoft Excel.
data-analysis excel excel-formulas microsoft-excel spreadsheet
Last synced: 08 Jan 2026
https://github.com/equicirco/cirquant
Code and data delivery for quantifying circularity through open data and digital innovation.
circular-economy data-analysis database julialang official-statistics
Last synced: 13 Jan 2026
https://github.com/misszeferino/sql-projects
bigquery data-analysis mysql queries sql sqlite3
Last synced: 29 Jan 2026
https://github.com/gracysapra/r-in-data-science
This repository contains essential guides for data analysis using R, covering topics like data preparation, data reshaping, and data visualization. Each file focuses on fundamental techniques to manipulate, clean, and visualize data effectively using R programming.
data-analysis data-preparation data-reshaping data-science data-visualization data-visualizations ggplot r r-for-data-science
Last synced: 19 Apr 2026
https://github.com/harshals499/ecosecure-visualization
Data visualization project using Qlik to analyze sales performance for EcoSecure Systems.
business-intelligence data-analysis data-visualization qlik-sense sales-analysis
Last synced: 12 Jun 2026
https://github.com/aymane-maghouti/sentiment-analysis-for-jumia-reviews-and-smartphone-price-prediction-system
The project focuses on customer sentiment analysis for Jumia, aiding informed online decisions. It collects and analyzes product comments to determine sentiments and implements a decision-making algorithm. It also includes a product price prediction system using regression techniques.
beutifulsoup data-analysis data-cleaning data-collection data-preprocessing data-scraping data-visualization eda falsk machine-learning python web-application
Last synced: 18 Apr 2026
https://github.com/bilal-belli/personalacademicdocuments
This repository contains some personal academic assignments, maybe it will help someone!
compilation computer-architecture data-analysis data-structures-and-algorithms database front-end hpc networking operating-systems signal-processing
Last synced: 20 Apr 2026
https://github.com/rcv911/lyapunov-indicators
Calculating Lyapunov indicators with multiprocessing in Python
data-analysis lyapunov lyapunov-indicators multiprocessing
Last synced: 18 Jan 2026
https://github.com/babak2/synthea-data-analysis
Synthea Data Analysis
data-analysis data-visualization jupyter-notebook jupytext matplotlib numpy pandas python3 seaborn synthea
Last synced: 11 Apr 2026
https://github.com/dcs-training/good-data-visualisation-with-r
Our guide on how we create data visualisations through R. Go to the readme file
data-analysis data-visualisation r rmarkdown
Last synced: 16 Jun 2026
https://github.com/avijit-jana/redbus-data_scraping_and_filtering_with_streamlit_app
A Streamlit-based application leveraging Selenium to automate data scraping from Redbus, enabling efficient collection, analysis, and visualization of bus travel data for improved operational efficiency and strategic planning in the transportation industry.
automation dashboard data-analysis data-visualization datadrivendecisions python3 redbus selenium-python streamlit-application webscrapping
Last synced: 15 Mar 2025
https://github.com/sermonzagoto/data_cleansing_in_telco
Data Cleansing in Python
data-analysis data-science machine-learning matplotlib-figures pandas-python seaborn-plots
Last synced: 23 Mar 2025
https://github.com/andr3w03/employee-attrition-problem
Employee Attrition Problem Analysis and Prediction
data-analysis data-science data-visualization dicoding gradient-boosting-classifier machine-learning problem-solving python sklearn streamlit
Last synced: 11 Apr 2026
https://github.com/antonioscardace/mri-brainage
Showing Accelerated Brain Ageing in Alzheimer's Patients.
alzheimers-disease brain-age classification data-analysis medical-analysis predictive-modeling regression
Last synced: 18 Jan 2026
https://github.com/samuelsoaress/python-study-datascience-ia
My data science and AI studies
data-analysis data-crawler data-mining data-science deep-learning machine-learning-algorithms
Last synced: 13 May 2026
https://github.com/mohamedhany99/collecting-sound-features-frequency-from-an-audio-file-and-save-it-in-excel
This script takes a single audio file, collects its sound features (frequency, mean, variance, minimum value, maximum value, median, kurtosis, skewness), saves them in an array, and writes that array as a row in the datasheet.
automation data-analysis data-science dataset-generation excel-import signal-processing
Last synced: 18 Apr 2026
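The summary statistics named in this entry can be sketched in a few lines of standard-library Python. This is an illustrative sketch only, not the repository's code: the function name `summarize_samples` is hypothetical, the input is assumed to be an already decoded list of numeric samples, and population moments with excess kurtosis are assumed conventions.

```python
import statistics


def summarize_samples(samples):
    """Return one row of summary features for a list of numeric samples.

    Assumptions (not taken from the repository): population variance,
    moment-based skewness, and excess kurtosis (normal distribution = 0).
    """
    n = len(samples)
    mean = statistics.fmean(samples)
    variance = statistics.pvariance(samples, mu=mean)
    # Third and fourth central moments, used for skewness and kurtosis.
    m3 = sum((x - mean) ** 3 for x in samples) / n
    m4 = sum((x - mean) ** 4 for x in samples) / n
    skewness = m3 / variance ** 1.5 if variance > 0 else 0.0
    kurtosis = m4 / variance ** 2 - 3.0 if variance > 0 else 0.0
    return {
        "mean": mean,
        "variance": variance,
        "min": min(samples),
        "max": max(samples),
        "median": statistics.median(samples),
        "kurtosis": kurtosis,
        "skewness": skewness,
    }
```

Each returned dictionary corresponds to one row; writing it to a spreadsheet is left out because the entry does not say which library the script uses.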
https://github.com/vidhi1290/zomato-data-analysis
Zomato Data Analysis - Explore the world of Zomato restaurant data through Python and data analysis. Uncover trends and insights using Pandas for data manipulation and Matplotlib for visualization. Join us in this journey to reveal the hidden stories within the data!
data-analysis data-analysis-python data-science data-visualization dataprocessing machine-learning machine-learning-algorithms matplotlib numpy pandas python scikit-learn zomato-data-analysis
Last synced: 11 Apr 2026
https://github.com/arnikz/piqmie
Proteomics Identifications & Quantitations Data Management & Integration Service
data-analysis data-management data-visualisation mass-spectrometry peptide-identification protein-inference protein-quantification proteomics silac web-application
Last synced: 03 Feb 2026
https://github.com/mathpfreitas/top-hits-spotify-2000-to-2019-
🎧 Top Hits Spotify (2000 - 2019): Explore music trends from 2000 to 2019 with this dataset of songs, artists, and genres. Use the insights to understand what makes a hit in today's music landscape. 🐙💻
analysis analytics chart data-analysis data-visualization exploratory-data-analysis hits interactivedashboards jupyter-notebook matplotlib music musical numpy pandas plotly python timeseries track-hits
Last synced: 01 Jul 2025
https://github.com/carterlasalle/sportsarbfinder
Sports Betting Arbitrage Finder: Python tool for identifying profitable arbitrage opportunities across bookmakers. Features multi-region support, customizable profit margins, interactive calculator, and web interface. Uses real-time odds data from The Odds API. Ideal for betting enthusiasts, analysts, and educational purposes.
arbitrage-betting betting-strategy data-analysis finance gambling odds-api python sports-analytics sports-betting
Last synced: 31 Mar 2025
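For readers unfamiliar with the arbitrage check this entry refers to, here is a minimal sketch, assuming decimal odds and taking the best available price for each outcome across bookmakers. It is not code from the repository, and the function names `arbitrage_margin` and `stake_split` are hypothetical. An arbitrage exists when the inverse odds sum to less than 1.

```python
def arbitrage_margin(best_odds):
    """Return the guaranteed profit as a fraction of total stake, or None.

    best_odds holds the best decimal odds found for each mutually
    exclusive outcome (one entry per outcome).
    """
    implied = sum(1.0 / o for o in best_odds)
    if implied >= 1.0:
        return None  # the implied probabilities leave no room for arbitrage
    return 1.0 / implied - 1.0


def stake_split(best_odds, total_stake):
    """Split total_stake so that every outcome pays out the same amount."""
    implied = sum(1.0 / o for o in best_odds)
    return [total_stake * (1.0 / o) / implied for o in best_odds]
```

With odds of 2.1 on both sides of a two-way market, the inverse odds sum to about 0.952, so an even split of the stake returns roughly 5% whichever outcome occurs.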
https://github.com/boardgameanalytics/bga-notebooks
Exploratory notebooks using the BGG dataset
data-analysis data-exploration data-visualization ipython-notebook python
Last synced: 28 Jan 2026
https://github.com/chelseammatta/nopd-cad-data-analysis
Analysis of 911 call data from New Orleans' 3rd & 4th police districts (2019-2022) using BigQuery
911-calls 911-data bigquery cad-data crime-analysis data-analysis emergency-response new-orleans public-safety sql
Last synced: 01 Jul 2025
https://github.com/virajbhutada/google-stock-price-forecasting-lstm
Analyzing and predicting Google's stock prices through detailed data exploration and advanced LSTM models. This project involves data preprocessing, creating time-series sequences, constructing and training LSTM networks, and evaluating their performance to forecast future stock prices utilizing Python and Machine Learning libraries.
data-analysis data-science data-visualization future-prediction google-dataset google-stock-price-prediction google-stocks lstm-model lstm-neural-network machine-learning machine-learning-models matplotlib model-building model-training numpy python stock-forecasting
Last synced: 27 Feb 2025
https://github.com/savinrazvan/heredity
An AI that assesses the likelihood of genetic traits in individuals using a Bayesian Network to analyze family genetic data, modeling genetic inheritance and mutations to infer probabilities of gene presence and trait expression.
ai bayesian-network biological-data-analysis data-analysis educational-project family-genetics genetic-inheritance genetic-traits heredity mutation-modeling probability-calculation python
Last synced: 27 Feb 2025
https://github.com/easonlai/eda_for_prudential_life_insurance_sample_data
Notebook sample of Exploratory Data Analysis (EDA) for Prudential Life Insurance Sample Data
azure-databricks azuredatabricks data-analysis data-analysis-python data-analytics databricks databricks-notebooks eda exploratory-data-analysis insurance insurance-sample-data jupyter-notebook python python3
Last synced: 14 May 2026
https://github.com/amstuta/cpp-neural-network
Simple implementation of a feedforward neural network in c++
data-analysis deep-learning machine-learning neural-network
Last synced: 08 Apr 2025
https://github.com/priboy313/pandasflow
A set of custom python modules for friendly workflow on pandas
catboost data-analysis data-science pandas phik python scikit-learn shap
Last synced: 20 Jan 2026
https://github.com/vre-hub/science-projects
VRE example science projects
dark-matter data-analysis docker extreme-universe jupyter-notebook
Last synced: 18 Jan 2026
https://github.com/muhammadhilmyputrarisma/ab-test
Python code for A/B testing on Cookie Cats game data. This project analyzes the impact of moving the first gate from level 30 to level 40 on player retention and game rounds, helping to evaluate if delaying the gate improves player engagement and gameplay experience.
ab-testing cookie-cats data-analysis data-visualization game-analytics python statistics
Last synced: 18 May 2026
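One common way to compare retention between two groups in an A/B test like this is a two-proportion z-test. The sketch below assumes that technique; the entry does not say which method the project uses, and the function name and any counts passed to it are hypothetical.

```python
import math


def two_proportion_ztest(success_a, n_a, success_b, n_b):
    """Two-sided z-test for the difference between two proportions.

    Returns (z, p_value). Uses the pooled proportion under the null
    hypothesis that both groups share the same underlying rate.
    """
    p_a = success_a / n_a
    p_b = success_b / n_b
    pooled = (success_a + success_b) / (n_a + n_b)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    z = (p_a - p_b) / se
    # Two-sided p-value from the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value
```

Here group A could be the players with the gate at level 30 and group B those with the gate at level 40, with `success_*` counting retained players in each group.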
https://github.com/silvano315/med-physics
This is intended to be a repository about medical physics. It will be based on 4 paths: medical data to analyse, SOTA programs for medical purposes, computer vision, and eXplainability.
computer-vision data-analysis data-science explainable-ai medical-imaging medical-physics medical-tool
Last synced: 24 Mar 2025
https://github.com/bretsw/beds
Bookdown project for an open education resource (OER) book: Becoming Educational Data Scientists
analytics data-analysis data-analytics data-science
Last synced: 31 Mar 2025
https://github.com/robcyberlab/linear-regression-application
🔢Linear Regression Application💻
artificial-intelligence data-analysis data-science data-visualization linear-regression machine-learning python python-programming regression-analysis statistics
Last synced: 31 Mar 2025
https://github.com/robcyberlab/machine-learning-classifier
🤖Machine Learning Classifier⚙️
ai artificial-intelligence classifiers data-analysis data-science deep-learning digit-recognition machine-learning pca-algorithm python svm-classifier
Last synced: 31 Mar 2025
https://github.com/m-faizan-mahmood/detailed-exploratory-data-analysis-eda-marketing-recomendations.
This project focuses on cleaning, preprocessing, and analyzing data using Pandas and NumPy. Key steps include handling missing values, removing outliers, feature engineering, and exploratory data analysis (EDA). Visualizations with Matplotlib and Seaborn highlight trends in customer spending, campaign performance, and product sales.
big-data data-analysis data-processing data-science eda exploratory-data-analysis numpy pandas python
Last synced: 11 Apr 2026
https://github.com/oguzgn/fully-automated-performance-marketing-dashboard
This project integrates data from multiple ad platforms with Google Analytics to track marketing campaigns. It uses a structured naming system and UTM tags. Data is visualized in Looker Studio dashboards to analyze campaign performance and ad spend.
bigquery data-analysis data-engineering data-modeling marketing-analytics marketing-automation marketing-data-science marketingdata sql
Last synced: 24 Mar 2025
https://github.com/akash1070/freecodecamp-data-analysis-with-python-
Contains study notes and assignments from freeCodeCamp's Data Analysis with Python course.
data-analysis demographic-analysis mean-variance-standard-calculator medical-data-visualisation numpy-library pandas-library python3 sea-level-predictor time-series-analysis
Last synced: 01 May 2026
https://github.com/akash1070/data-science-virtual-internship-by-anz
Exploratory data analysis and prediction of annual salary for customers from the dataset provided by ANZ.
data-analysis data-science predictive-analytics presentation-slides
Last synced: 24 Mar 2025
https://github.com/emredurukn/data-analysis
Example notebooks for analyzing data
data-analysis data-visualization python
Last synced: 12 May 2026
https://github.com/eshaagarwa/sales_insight_project
Sales insights project using Power BI and SQL
data-analysis data-visualization databse datacleaning datamodeling microsoft-power-bi mysql-database powerbi sales-insights sql
Last synced: 08 Aug 2025
https://github.com/beallio/wherewolf
Wherewolf is a production-grade, local SQL workbench designed for data engineers and analysts to query local files (CSV, Parquet, JSON) with ease. Built with Streamlit, it provides a unified interface to execute SQL against either DuckDB or PySpark engines without requiring complex setup.
big-data data-analysis data-engineering etl parquet performance pyspark python spark-sql sql uv
Last synced: 28 Apr 2026
https://github.com/mathieu2301/pbsc-tracker
Experiment in tracking self-service bikes that run on PBSC
ai data-analysis data-mining data-science data-visualization libelo machine-learning pbsc valence velib-tracker
Last synced: 10 Jun 2025
https://github.com/rayyan9477/household-transactions-analysis-and-clustering
This project involves analyzing household transaction data to gain insights into spending patterns and behaviors. The analysis includes data cleaning, exploratory data analysis (EDA), clustering using K-Means, and visualization of customer segments.
customer-segmentation data-analysis data-cleaning data-science exploratory-data-analysis kmeans-clustering machine-learning
Last synced: 27 Feb 2025
https://github.com/tjpalanca/ph-elections-2016-analysis
Analysis of Philippines Election Results 2016
analysis data-analysis data-science philippines-election voter-turnout
Last synced: 11 Jun 2025
https://github.com/timbeechey/opa
Ordinal pattern analysis R package
cran data-analysis hypothesis-testing longitudinal ordinal r r-package rcpp repeated-measures rstats statistics
Last synced: 21 Feb 2026
https://github.com/solrikk/pictrace-web
PicTraceV2 is a highly efficient image matching platform that leverages computer vision using OpenCV, deep learning with TensorFlow and the ResNet50 model, asynchronous processing with aiohttp, and Selenium for browser automation. PicTraceV2 allows users to upload images directly or provide URLs, quickly scanning a vast database to find image
automation computer-vision data-analysis data-extraction deep-learning image-processing image-search machine-learning natural-language-processing opencv openpyxl pandas python selenium tensorflow web-scraping yandex yandex-api
Last synced: 12 Apr 2026
https://github.com/saeun-park/lg-aimers-4th
Development of a predictive model for B2B sales opportunity creation based on MQL data
b2b data-analysis data-science machine-learning mql
Last synced: 08 Apr 2025
https://github.com/aaryan-agr/canadian-energy
This project analyzes Canada's energy trade, focusing on imports, exports, and market trends in the energy sector.
data-analysis data-cleaning data-manipulation data-processing data-science data-vizualisation energy-sector time-series-analysis
Last synced: 10 Jun 2025
https://github.com/fatma-moanes/machine-learning-labs
My implementation for the labs of the Machine Learning course that I studied in my university, Zewail City.
bootstrap data-analysis data-science deep-learning keras knn-classification linear-regression logistic-regression machine-learning machine-learning-algorithms matplotlib ml neural-networks numpy pandas pca preprocessing python seaborn svm-classifier
Last synced: 12 Apr 2026
https://github.com/jinkogule/multi-analyst
Multi Analyst is a data analysis tool with simple usability that uses artificial intelligence to interpret the results of the analyses performed, returning useful insights to users.
apriori-algorithm bootstrap css data-analysis django html numpy open-ai pandas python web-application
Last synced: 12 Apr 2026
https://github.com/guruakaashjn/te_project_microsoft_ai
AI based statistical analysis of land-use plastic pollution in India using AI/ML techniques.
artificial-intelligence data-analysis data-analytics data-science data-visualization machine-learning powerbi
Last synced: 27 Feb 2025
https://github.com/hamada-khairi/pfda-hamada
A comprehensive R-based data analysis project that examines housing rental patterns across multiple cities, utilizing statistical methods and visualization techniques to analyze 4,746 properties' data points including rent prices, locations, and amenities. The project employs various R libraries to clean, process, and visualize rental market trends
apu data-analysis data-analysis-in-r data-cleaning-and-preprocessing data-processing-and-analysis data-science data-visualization-project ggplot2 house-rent-prediction r-programming-projects r-statistics r-studio real-estate-analytics
Last synced: 16 Mar 2025
https://github.com/nirmalvatsyayan/data-analyst-nanodegree
Udacity data analyst nanodegree project submissions and learning
data-analysis numpy pandas python statistics udacity-data-analyst-nanodegree
Last synced: 12 Apr 2026
https://github.com/noturlee/imdb-dataanalysis
A data model that predicts the IMDb rating of a movie based on features like genre, director, and actors. Using regression techniques to tackle this problem.
data-analysis data-cleaning data-modeling data-science data-visualization
Last synced: 08 Apr 2025
https://github.com/hordiales/redpanal-db-analysis
Analysis of the RedPanal.org music database
creative-commons data-analysis dataset etl machine-learning music music-information-retrieval statistical-analysis
Last synced: 10 Mar 2025
https://github.com/abdelhakim-gh/machine-learning_data-analysis_project
Recognizing handwritten numbers and comparing life expectancy vs. fertility across regions in 1960 and 2013
data-analysis jupyter-notebook machine-learning python r r-studio
Last synced: 12 Apr 2026
https://github.com/willie-conway/datavista
DataVista is a comprehensive, production-grade data analysis and machine learning platform that combines real-time data ingestion from live APIs, interactive visualizations, statistical analysis, hypothesis testing, and machine learning model training — all in a unified, professional-grade interface. Built with React and Recharts.
analytics-platform api-integration classification coingecko-api csv-import data-analysis data-cleaning-and-preprocessing data-pipeline data-science data-visualizations etl hypothesis-testing json-export machine-learning-models open-meteo react recharts regression statistics world-bank
Last synced: 30 May 2026
https://github.com/sevdanurgenc/python-for-data-science-lecture-notes
In this repo, I have the course contents of Python for Data Science training, which will be given to Siemens by the cooperation of Academy Peak Information Technologies Training and Consultancy between 28 June - 1 July 2022.
data-analysis data-mining data-modeling data-science data-structure data-visualization matplotlib-tutorial numpy-tutorial pandas-tutorial
Last synced: 23 Mar 2025
https://github.com/ultrasage-danz/weather-data-analysis
Weather Data Analysis notebook project. Created using Google Colab
collaboration data-analysis data-science dataset google google-colab-notebook project
Last synced: 24 Mar 2025
https://github.com/leosimoes/uerj-tcc-analisador-dados-texto
Text of the final course project (TCC) in computer engineering. Web application for data analysis.
data-analysis data-science data-visualization python streamlit
Last synced: 24 Mar 2025
https://github.com/leosimoes/udacity-starbucks
Project 3 of the Udacity Machine Learning Engineer Nanodegree Program. Data analysis and machine learning application to Starbucks data.
aws-iam aws-s3 aws-sagemaker data-analysis data-science machine-learning python
Last synced: 24 Mar 2025
https://github.com/olob0/badwords-pt-br
💬 Wordlist of profanity in pt-BR for data analysis, filters, or text considered "avoidable"
badword-filter badwords brasil data-analysis filter filter-lists filterlist portugues portuguese text-analysis wordlist
Last synced: 06 Jan 2026
https://github.com/mftnakrsu/crm-rfm-analysis
CRM-RFM-Analysis
ai crm data data-analysis data-science deep-learning machine-learning python rfm rfm-analysis
Last synced: 16 Mar 2025
https://github.com/john-science/data_science_by_example
Examples of Data Science Tools & Libraries
data-analysis data-science ipython pandas
Last synced: 12 May 2025
https://github.com/ifibla/adsdb-project
Algorithms, Data Structures and Databases Project
data-analysis data-engineering python
Last synced: 12 Apr 2026
https://github.com/dwidevelopes/database-input-pelanggran-mahasiswa
Enters data on students who commit violations, ready to be recorded and punished, and also subject to sanctions
aplikasi aplikasi-sekolah data data-analysis database input-method mahasiswa sekolah siswa siswi website
Last synced: 02 May 2026
https://github.com/priyanshubiswas-tech/aws-mwaa-elt-airflow-sql-dbt-superset-project
This project was created as part of an assessment for DigitalXC AI. It demonstrates a cloud-based ELT pipeline using AWS MWAA, Airflow, dbt, PostgreSQL, and Superset. The pipeline automates data ingestion from S3, transformation with dbt, and visualization through Superset, following modern data engineering practices on a scalable AWS architecture.
apache-airflow apache-superset aws-s3 dag data-analysis data-engineering-pipeline data-visualization dbt elt-pipeline python rds-postgres
Last synced: 03 Jul 2025
https://github.com/sing-group/bew
Public repository for Biofilms Experiment Workbench (BEW).
aibench data-analysis data-management java jfreechart workbench
Last synced: 03 Jul 2025
https://github.com/gher-uliege/bluecloud-plankton
Spatial interpolation of plankton data using a neural network
data data-analysis data-visualization neural-network oceanography
Last synced: 30 Mar 2025
https://github.com/suhas-005/power-bi-dashboard
Power BI Dashboard Projects
data-analysis data-visualization dataset power-bi-project powerbi
Last synced: 01 Apr 2025
https://github.com/rekha0suthar/e-commerce-shopper-s-behaviour-understanding
Understand the online shopper purchasing pattern through Machine learning
data-analysis data-preprocessing data-visualization logistic-regression machine-learning numpy pandas python3 scikit-learn seaborn-plots
Last synced: 12 Apr 2026
https://github.com/mayankagg9722/movie-recommendation
Collaborative Filtering is performed over Movie Lens Dataset.
collaborative-filtering data-analysis jupyter-notebook movie-recommendation python-script website
Last synced: 29 Jul 2025
https://github.com/m-faizan-mahmood/house-price-prediction-machine-learning-model
Implemented a Multiple Linear Regression model to predict house prices based on square footage, number of bedrooms, and age of the house.
artificial-intelligence data-analysis data-science data-visualization machine-learning machine-learning-algorithms matplotlib neural-network numpy pandas predictive-modeling python regression-models seaborn sklearn
Last synced: 12 Apr 2026
https://github.com/thomascenni/anfavea-data-analysis
Data analysis with Pandas and Datapane.
Last synced: 06 May 2026
https://github.com/rani-sikdar/python-data-structures
A repository for sharing Python implementations of common data structures and algorithms, including arrays, linked lists, stacks, queues, trees, graphs, and sorting/searching techniques. Perfect for learning, revisiting fundamentals, and collaborating on computer science concepts. Contributions welcome! 🚀
data-analysis data-structure data-visualization numpy-library pandas-dataframe python python-3
Last synced: 30 Mar 2025
https://github.com/neerajcodes888/whatsapp-chat-analyzer
A Python tool for effortless analysis of WhatsApp conversations. Gain insights with basic statistics, word cloud visualizations, and URL statistics. Powered by pandas, urlextract, wordcloud, seaborn, and Streamlit. 📊📱
analyzer chat data-analysis data-visualization pandas python3 seaborn urlextract whatsapp wordcloud
Last synced: 12 Apr 2026
https://github.com/rajnish93/jpandas
A lightweight JavaScript library for working with tabular data, inspired by Pandas in Python. Built with TypeScript, it provides an intuitive API for data manipulation and analysis.
data-analysis data-analytics data-manipulation data-science dataframe javascript pandas stream-processing table typescript
Last synced: 11 Jun 2025
https://github.com/supersjgk/data-analysis-dns-over-https
A Data Analytics + ML project to classify Benign and Malicious DNS-over-HTTPS traffic
classification-model data-analysis data-analysis-python data-analytics datamining decision-trees dns dns-over-https doh gradient-boosting knn machine-learning random-forest
Last synced: 19 Mar 2025
https://github.com/ansh420/mcdonald_case-study
A case study of McDonald's based on market segment analysis.
algorithms-implemented data-analysis python3 segmentation
Last synced: 12 Apr 2026
https://github.com/nikita-data/unit_economics_projects
unit economics & cohort analysis projects
cac churn-rate conversion create-function data-analysis data-visualization eda hypothesis-testing ltv math matplotlib numpy python retention-rate roi scipy seaborn segmentation statistics unit-economics
Last synced: 06 Jan 2026
https://github.com/azmainadel/twitter-data-neo4j
Playing with graph database on a large dataset of twitter data.
data-analysis data-visualization neo4j-database snap
Last synced: 06 Apr 2025
https://github.com/nikita-data/eda_projects
Exploratory data analysis projects
cac data-analysis data-visualization eda folium-maps hypothesis-testing ltv math matplotlib numpy plotly python regular roi scipy seaborn segmentation statistics unit-economics
Last synced: 06 Jan 2026
https://github.com/victor-lis/regression-ai-model
ai data-analysis python regression-model
Last synced: 01 Apr 2025
https://github.com/macdon112/layoff-analysis
SQL data cleaning & analysis of global layoffs
data-analysis data-cleaning data-exploration sql
Last synced: 21 Feb 2026
https://github.com/gholamrezadar/most-profitable-actors
Finds the list of actors with the most box office profit using TMDB API.
Last synced: 16 Jan 2026
https://github.com/vi/rendercsv
Tool to convert CSV table to a picture.
animation csv csv2pic csv2png data-analysis picture png table table-renderer visualization
Last synced: 01 Apr 2025
https://github.com/lobooooooo14/badwords-pt-br
💬 Wordlist of profanity in pt-BR for data analysis, filters, or text considered "avoidable"
badword-filter badwords brasil data-analysis filter filter-lists filterlist portugues portuguese text-analysis wordlist
Last synced: 25 Mar 2025
https://github.com/prangonghose/analysis_of_bangladesh_economic_complexity
In this project, our team briefly analyzes the export economy of Bangladesh over the past three decades.
data-analysis data-science data-visualization inequalipy matplotlib pandas plotly
Last synced: 22 May 2026
https://github.com/lmuffato/analise-de-diarias-prefeituas-do-es
This code is part of a project to uncover and fight corruption schemes by processing and cross-referencing open data made available by several municipalities in Espirito Santo through the transparency portal. Merging and analysis of several tables imported from CSV.
data-analysis personal-project r rstudio
Last synced: 12 Jun 2025
https://github.com/colburncodes/se_pudding_2023
This project is a React app designed to showcase research conducted by a team of data scientists and data analysts. The app uses React and React-Chartjs-2.
chartjs-2 data-analysis data-science data-visualization react-chartjs-2 reactjs
Last synced: 11 May 2026