Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-20 00:07:30 UTC
- JSON Representation
https://github.com/pythondeveloper6/supermarket-eda-seaborn-for-beginners
learn Seaborn basics using a simple EDA
data-analysis eda numpy pandas seaborn visualization
Last synced: 11 Apr 2025
https://github.com/kellyjadams/spotify-data-analyze
A serverless data pipeline that logs my Spotify listening history to BigQuery using Cloud Run, then visualizes trends with Looker Studio. Built with Python, Flask, Docker, and GCP..
data-analysis data-engineering
Last synced: 07 May 2025
https://github.com/rikard-helgegren/leverage_analysis_tool
Analyst tool for portfolio construction. How can levereged certificates be used to increase returns in a portfolio while keeping the risk as low as possible. Use the tool and find out.
cpp data-analysis investment kivy-framework python3
Last synced: 12 Apr 2025
https://github.com/super-lou/exstat
🌾 R package to provide an efficient and simple solution to aggregate and analyze the stationarity of time series
climate climate-change climate-data data-analysis data-visualization environment hydrology inrae low-water mann-kendall mann-kendall-tests r stationarity-test statistics time-series
Last synced: 13 Apr 2025
https://github.com/asifdotexe/sentimentscoringmodel
This project focuses on performing sentiment analysis on Amazon reviews using natural language processing (NLP) techniques. It includes various steps, from data exploration and preprocessing to building and evaluating sentiment models.
data-analysis data-visualization natural-language-processing sentiment-analysis
Last synced: 28 Oct 2025
https://github.com/c0deta1ker/matbasex
MatBaseX is an all-in-one database and analytical tool for photoelectron spectroscopy (PES) analysis, focused on materials and their X-ray interactions. It offers features like a Materials Properties Database, IMFP & XPS Sensitivity Factor Calculator, and PES N-Layer Simulations & Curve Fitting utilities. Explore its powerful capabilities today!
cross-sections crystal-structure crystallography data-analysis data-fitting database electron imfp imfp-calculator-matlab material material-database matlab matlab-application matlab-gui matlab-toolbox pes-modelling photoelectron-spectroscopy photoionization simulation xps
Last synced: 01 Jul 2025
https://github.com/irfanchahyadi/ml-notes
Complete personal notes for performing Data Analysis, Preprocessing, and Training ML model.
data-analysis machine-learning plotting python
Last synced: 11 Jul 2025
https://github.com/coderjolly/player-market-value-prediction
There is an intense transfer speculation that surrounds all major player transfers today. An important part of negotiations is predicting the fair market price for a player. Therefore, we are predicting this Market Value of a player using the data provided in csv format.
data-analysis data-visualization decision-tree-regression machine-learning xgboost-regression
Last synced: 22 Jun 2025
https://github.com/alexeyev/hse-spb-bigdata-python-fall2016
Материалы к курсу по программированию и инструментам анализа данных, прочитанному в петербургском филиале НИУ ВШЭ осенью 2016 года
course-materials data-analysis numpy pandas python scikit-learn sklearn
Last synced: 07 Apr 2026
https://github.com/pepe-god/dataprophet
Extracts the identity information citizens from MySQL, creates a family network based on TC ID No. and exports it to CSV
101m 109m adres data-analysis data-extraction database-connector family-tree genealogy gsm hsys identity mysql-database python-script pyton
Last synced: 13 Jul 2025
https://github.com/i10mm/gpt-arxiv-fetcher
Revolutionize your research with our GitHub repository, where GPT meets arXiv API for seamless access and analysis of the latest academic papers!
artificial data-analysis intelligence llm machine-learning
Last synced: 14 Jul 2025
https://github.com/relvaner/nodes4j-core
Framework for parallel processing based on Actor4j. Useful for data analysis.
actor actor-model actor4j actors batch-processing data-analysis graph-processing java java-17 message-passing parallelization reactive-system stream-processing
Last synced: 14 Jul 2025
https://github.com/grburgess/ronswanson
Ron Swanson builds tables for 3ML
3ml astromodels data-analysis interpolation python ron-swanson spectral-fitting
Last synced: 15 Jul 2025
https://github.com/quantumudit/basketball-players-analysis
The project focuses on analyzing salaries and various other in-game metrics of top NBA basketball players from 2005-14 by performing exploratory data analysis with Python and Jupyter Notebook and by visualizing the data in an insightful dashboard made with Power BI
data-analysis jupyter-notebook power-bi python
Last synced: 17 May 2026
https://github.com/woctezuma/regression
Gaussian Process Regression vs. Relevance Vector Machine.
data-analysis data-science gaussian-process-regression machine-learning python regression relevance-vector-machine statistics
Last synced: 17 Jul 2025
https://github.com/srinivasrm/mutual-funds-analysis-and-prediction
In this project I have performed analysis and prediction on 1,3,and 5 year returns on 1064 mutual funds in India. I have scraped data from a website which is the most visited website for mutual fund investments.I have tested regression models linear model,SGD Regressor , Random Forest Regressor,Decision Tree Regressor,Ridge,MLP Regressor and linear model (Lasso).After which I have selected the best perorming model and performed Hyper parameter tuning and then deployed an interactive application which can generate the visualization and send an email with the visualization to the users email address.
beautifulsoup data-analysis data-base data-cleaning data-science deployment etl finanace frontend funds machine-learning mutual mutual-funds pgsql python scikit-learn sql streamlit web webapplication
Last synced: 27 Oct 2025
https://github.com/tushar2704/everyday-sql
Welcome to Everyday SQL Sheets – your go-to resource for everyday SQL cheat sheets, pro tips, interview questions, and more. Whether you're a beginner looking to learn SQL or an experienced developer seeking quick reference materials, this application has got you covered.
artificial-intelligence cheatsheet data-analysis data-science database mysql postgresql query-language sql sqlalchemy streamlit streamlit-tushar2704 tushar2704
Last synced: 05 Apr 2026
https://github.com/accurat/react-dataviz
⚛📊🚀 React components to build powerful interactive data visualizations
d3 data-analysis data-visualization react react-components
Last synced: 19 Jun 2025
https://github.com/negativenagesh/spam-ham_email_detection_machine_learning
This project focuses on classifying spam/ham emails, using machine learning algorithms like LGR, NB, RF, DT etc.. and based on the accuracy score and precision score I chose logistic regression for the classification. And I have used streamlit for frontend.
app data-analysis data-cleaning data-engineering data-science data-visualization data-visualizations jupyter-notebook logistic-regression machine-learning modeling naive-bayes-classifier nlp python
Last synced: 12 Apr 2025
https://github.com/apoorvalal/lalrutils
Misc utility functions in R for personal use.
Last synced: 07 Mar 2026
https://github.com/mukhopadhyay/data-visualization
...
data-analysis data-science data-visualization visualization
Last synced: 21 Mar 2025
https://github.com/cosmoduende/r-ufo-sightings
Are we alone in the universe? - Data Analysis and Data Visualization of UFO sightings with R. How to analyze and visualize data of UFO sightings of the last century in the USA and the rest of the world with R language.
data-analysis data-analytics data-science data-visualisation data-visualization data-visualizations dataviz ovni ovni-dataset r-code r-language r-programming r-stats ufo ufo-analysis ufo-dataset ufo-sighting ufo-sightings
Last synced: 13 May 2025
https://github.com/quantumudit/movie-ratings-analysis
This project focuses on analyzing and finding correlations between the audience and critic ratings for some of the popular movies released between 2009-2011 using Python & Power BI
data-analysis data-visualization jupyter-notebook power-bi python
Last synced: 19 Apr 2026
https://github.com/richiejp/jdp
Automatically collect and normalise data, then run algorithms on it.
automation-framework data-analysis suse-qa
Last synced: 02 Jan 2026
https://github.com/yash22222/tata-data-visualisation-virtual-internship
Data Visualisation: Empowering Business with Effective Insights Gain insights into leveraging data visualisations as a tool for making informed business decisions.
basics ceo charts cmo data-analysis data-interpretation data-science data-visualization graphs machine-learning mcq microsoft-excel microsoft-power-bi microsoft-word powerpoint-presentations python tableau tata tata-data-visualisation
Last synced: 22 Jul 2025
https://github.com/csparpa/last.fm-stats
Exercise on Last.fm data aggregation
data-analysis exercise lastfm lastfm-api python
Last synced: 21 May 2026
https://github.com/scottgriv/river-charts
🌊 📉 A Python, Django, Plotly, and Pandas web application that visualizes river data in real time, pulled using an API from the United States Geological Survey (USGS).
api charts data data-analysis data-visualization dataset django pandas plotly python usgs usgs-api visualization webapp
Last synced: 12 Aug 2025
https://github.com/nicucalcea/raise
An R library that uses ChatGPT / GPT to generate data
chatgpt chatgpt-api chatgpt-app data-analysis gpt gpt-35-turbo openai openai-chatgpt parsing r
Last synced: 05 Mar 2025
https://github.com/lisa-ho/three-investigators
Respository for scraping and analysing fan data on a German audio drama called 'Die Drei Fragezeichen' (the three investigators).
data-analysis data-viz datawrapper python webscraping
Last synced: 25 Oct 2025
https://github.com/slgobinath/wisdom
An adaptive and self-boosting stream processor
cep complex-event-processing data-analysis distributed self-tuning stream-processing wisdom
Last synced: 14 Apr 2025
https://github.com/sushant1827/traffic-forecasting-using-iot-sensor-data
Demonstrates how to utilize XGBoost for traffic forecasting using data gathered from IoT sensors, highlighting its efficiency in processing complex datasets and delivering accurate predictions.
data-analysis data-visualization exploratory-data-analysis feature-engineering feature-importance feature-selection gridsearchcv hyperparameter-optimization hyperparameter-tuning iot random-search xgboost-regression
Last synced: 08 Mar 2026
https://github.com/rdpeng/analyticdesigntheory
Web site for Analytic Design Theory
analytic-design-theory data-analysis exploratory-data-analysis
Last synced: 29 Jul 2025
https://github.com/jimbrig/lossrunAnalyzer
R Package and Shiny App to Analyze Insurance Lossruns
actuarial data-analysis data-mining data-science insurance r record-linkage risk-management shiny
Last synced: 30 Jul 2025
https://github.com/mch-fauzy/data-science
Repository containing portfolio of data science and machine learning projects. Presented in the form of iPython Notebooks
data-analysis data-science data-visualization ipython-notebooks machine-learning natural-language-processing portfolio
Last synced: 24 Sep 2025
https://github.com/jimmymugendi/email-sms-spam_classifier
Email SMS Spam Classifier is a cutting-edge machine learning solution designed to combat spam messages in email and SMS communications. Leveraging advanced Natural Language Processing (NLP) techniques, the system preprocesses text data by tokenizing, removing stopwords, and stemming, ensuring the most accurate classification results.
data-analysis data-visualization machine-learning-algorithms pandas seaborn-plots sklearn-library
Last synced: 25 Sep 2025
https://github.com/louis-heraut/card
🎴 Card of Analyse and Diagnostic in R for a user-friendly experience of data aggregation with parametrisation file.
aggregation climate-change climate-data climate-science data-analysis data-science diagnostic environment environment-variables hydrology hydrology-statistical inrae r statistics tools user-friendly
Last synced: 09 Mar 2026
https://github.com/thecoderpinar/earthquake_prediction_analysis_project
🌍 Welcome to the Earthquake Prediction Analysis Project! 🚀 This project aims to predict earthquake magnitudes using LSTM neural networks and analyze seismic data. Explore, analyze, and forecast earthquakes with ease! 📈🔮
analysis data-analysis data-science earthquake-prediction geocoding geology lstm lstm-neural-networks machine-learning matlab matlab-deep-learning open-source time-series visualization
Last synced: 16 Aug 2025
https://github.com/mirdan08/crafty
Data analysis project i've developed for the web scraping course.
blockchain data-analysis webscraping
Last synced: 30 Jul 2025
https://github.com/benjamindpb/wikidata-preprocessing
Wikidata dump preprocessing & analysis of georreferencial entities
data-analysis preprocessing wikidata wikidata-dump
Last synced: 15 Jul 2025
https://github.com/priyanka7411/dataspark-electronics-retail-analytics
DataSpark is a data analysis project using Python, SQL, and Power BI to analyze global electronics retail sales, focusing on customer behavior, sales performance, product profitability, and store performance to optimize sales strategies.
analytics-providers business-intelligence customer-segmentation data data-analysis electronics-industry global-sales pandas powerbi powerbi-visuals product-profitability python retail-analytics sales-performance sql store-analysis visualization
Last synced: 10 Jul 2025
https://github.com/aivanf/lemuras
A small Python library to deal with big tables
bigdata data-analysis html ipython-notebook join-tables json jupyter-notebook pandas pivot-tables python sql table
Last synced: 28 Oct 2025
https://github.com/tanaylab/naryn
Native Access medical record Retriever for high Yield aNalytics
Last synced: 20 Jul 2025
https://github.com/farfarfun/fundata
数据处理工具包 - 提供数据清洗、转换和分析功能
data-analysis data-processing farfarfun numpy pandas python
Last synced: 17 Feb 2026
https://github.com/kylejgillett/stevepy
A Space Weather data analysis tool for Python.
astronomy aurora data-analysis physics python space-weather space-weather-research
Last synced: 22 Mar 2025
https://github.com/nizarassad/digits-recognition
This project is a machine learning classification task on MNIST using SVM and CNN algorithms
classification colab-notebook convolutional-neural-networks data-analysis data-science deep-learning jupyter-notebook machine-learning neural-network python support-vector-machines
Last synced: 07 May 2025
https://github.com/rafaelbroseghini/data-analysis-visualizations-ml
:bar_chart: NLP, Regression Models, Story telling, Time Series Analysis ... in Python
data-analysis data-science machine-learning machine-learning-algorithms time-series visualization
Last synced: 03 Oct 2025
https://github.com/sherrisherry/cleandata
R Package "cleandata"
cran data-analysis data-mining machine-learning r r-package wrangling
Last synced: 18 Feb 2026
https://github.com/bdslab-upv/dashi
A flexible and powerful Python toolkit for dataset shift analysis and characterization, providing supervised and unsupervised evaluation of temporal and multi-source data shifts, visualization tools, and statistical insights for data integrity and model performance monitoring
data-analysis data-science dataset-shift python temporal-analysis
Last synced: 13 Dec 2025
https://github.com/elkronos/anovatoolbox
This GitHub repository contains a collection of functions for performing various statistical analyses and generating visualizations. The functions are designed to work with different types of data and provide comprehensive outputs for data analysis.
anova anova-model data-analysis r statistics
Last synced: 17 Mar 2025
https://github.com/thecoderpinar/credit-card-fraud-detection-project
This project focuses on the detection of credit card fraud using various data science and machine learning techniques. The dataset includes a record of credit card transactions over a specific period, with the goal of accurately identifying fraudulent activities. 🚀✨
anamoly-detection classification-algorithms credit-card-transactions data-analysis data-preprocessing data-science data-visualization fraud-detection machine-learning python
Last synced: 30 Apr 2025
https://github.com/tixfeniks/batch-iterator
Usefull python implementation of batch iterator.
batch data-analysis data-analytics data-science iterate machine-learning mini-batch minibatch python
Last synced: 23 Oct 2025
https://github.com/gabriel-dp/mineirando_github
Project to mine and analyze public GitHub data. Practical work of the Social Network Mining and Analysis subject at UFSJ
data-analysis data-mining github ufsj
Last synced: 07 Mar 2026
https://github.com/juangesino/behaviouraleconomics
All the files and data for the experiment performed during the course Behavioural Economics @ University of Amsterdam
behavioral-economics behavioural-economics data-analysis economics game-theory statistics
Last synced: 28 Oct 2025
https://github.com/michaelnabil230/laravel-analytics
A Laravel package to retrieve pageviews and other data from Database
data-analysis data-structures database laravel php
Last synced: 25 Jan 2026
https://github.com/omarelgabry/insights.py
A Python package for reading, storing, & analyzing data from Public Data APIs
Last synced: 14 Jul 2025
https://github.com/jaybird1291/anki-llm-review-stats-exporter
Export your Anki review history (revlog) as JSONL so you can analyze it with an LLM (ChatGPT, Claude, local models, etc.) without using any API.
anki anki-addon chatgpt data-analysis data-export jsonl llm review-stats revlog statistics
Last synced: 24 Dec 2025
https://github.com/maximtrp/scikit-na
Missing Data Analysis in Python
analysis data-analysis data-science data-visualization missing-data missing-values pandas python statistics visualization
Last synced: 19 Jan 2026
https://github.com/AivanF/Lemuras
A small Python library to deal with big tables
bigdata data-analysis html ipython-notebook join-tables json jupyter-notebook pandas pivot-tables python sql table
Last synced: 11 Apr 2025
https://github.com/gher-uliege/gher-uliege.github.io
GeoHydrodynamics and Environment Research
data-analysis data-assimilation datavisualization interpolation ocean-modelling ocean-sciences oceanography remote-shell
Last synced: 14 Apr 2025
https://github.com/gher-uliege/seadatacloud
Tools and interfaces to work with DIVA interpolation software tool.
data-analysis data-visualization interpolation nco netcdf ocean-sciences oceanography
Last synced: 30 Mar 2025
https://github.com/abdelmajidlh/eportfolio
ePortfolio Abdelmajid EL HOU
bioinformatics data data-analysis data-science data-visualization database datascience genetics
Last synced: 22 Mar 2025
https://github.com/varunbanka/data-insights
Data Insights is a user-friendly tool for analyzing large CSV files. Its advanced analytics helps uncover hidden patterns and trends, making it perfect for data scientists and analysts.
artificial-intelligence automation data-analysis data-science dataanalysis datahive numpy pandas python
Last synced: 22 Jun 2025
https://github.com/josmarcristello/geokmlanalyzer
The Geo Kml Analyzer is a Python-based tool designed to process and analyze elevation data from KML files. Uses google maps to obtain elevation data (interpolates if necessary) and spherical trigonometry equations to calculate the distance.
data-analysis geolocation geospatial gis google-maps-api gps kml python
Last synced: 30 Jul 2025
https://github.com/antrubtor/socialstats
Analyze your personal data from Discord, Instagram, Snapchat, and WhatsApp. View your call durations, response times, voice messages, and messaging activity in clean Excel charts.
call-duration chat-export data-analysis data-visualization discord excel instagram message-frequency message-history snapchat social-networks-statistics whatsapp
Last synced: 28 Jul 2025
https://github.com/louis-heraut/AEAG_toolbox
🛠️ R toolbox to provide a simple way of interacting with all the code necessary to carry out hydrological stationnarity analysis for the Agence de l'Eau Adour-Garonne (AEAG)
climate climate-change climate-data data-analysis data-visualization environment hydrology inrae low-water mann-kendall mann-kendall-tests r stationarity-test statistics
Last synced: 30 Oct 2025
https://github.com/artdgn/pages
Auto-updating dashboards about COVID-19 https://artdgn.github.io/pages
covid-19 data-analysis modeling
Last synced: 17 Jan 2026
https://github.com/al-ghaly/airline-company-data-warehouse
Data Warehouse modeling, design, implementation, and analysis for an Airline Company.
data-analysis data-warehousing database-modeling sql-server
Last synced: 14 Apr 2025
https://github.com/tideland/go-cells
Light-weight event-processing based on the idea of meshed cells with different pluggable behaviors
cep data-analysis data-stream event-processing events golang
Last synced: 05 Apr 2025
https://github.com/quantumudit/analyzing-suez-services
This project focuses on scraping all the service locations across Australia & New Zealand and their associated attributes from "Suez" website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.
data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping
Last synced: 18 May 2026
https://github.com/mo-karbalaee/python-for-data-analysis-book
All the practice and code that I am doing while I read the book called, Python for data analysis
data-analysis data-science python
Last synced: 02 Aug 2025
https://github.com/mynenik/xyplot-32
Extensible Plotting and Data Analysis Program for 32-bit x86 GNU/Linux
cpp data-analysis data-manipulation data-visualization forth linux-app motif xwindows
Last synced: 02 Aug 2025
https://github.com/jpvt/data_science
Portfolio with my Data Science Projects.
data-analysis data-science deep-learning machine-learning portfolio xgboost
Last synced: 19 Jun 2025
https://github.com/magnaopus1/synthron-cfd-trader-pro
SYNTHRON CFD Trader PRO is a cutting-edge trading platform featuring raw, custom-designed machine learning models. From reinforcement learning for dynamic strategies to predictive analytics, sentiment analysis, and optimization techniques, it empowers trading across stocks, forex, indices, commodities, futures, and crypto with precision.
ai backtesting cfd commodities data-analysis data-science data-structures forex futures indices machine-learning trading
Last synced: 30 Apr 2025
https://github.com/danielpuentee/outdpik
The fundamental toolkit for outliers search and visualization. It aims to be the fundamental high-level package for this purpose.
data-analysis matplotlib numpy python
Last synced: 16 Aug 2025
https://github.com/k1rsn7/kaggle
:basecamp: A collection of Kaggle solutions.
computer-vision cv data-analysis data-science deeplearning deeplearning-ai english english-language jupyter jupyter-notebook jupyter-notebooks kaggle kaggle-challenge russian russian-language
Last synced: 02 Sep 2025
https://github.com/ac-gomes/data-engineering-with-databricks
A simple boilerplate for data engineering and data analysis training in Databricks.
data-analysis data-engineering databricks databricks-notebooks pyspark python unit-testing
Last synced: 30 Apr 2025
https://github.com/zackakil/hot-shot-basketball-tracker
Mini web app for displaying basketball practice metrics.
basketball chartjs data-analysis html sport visualization
Last synced: 28 Jan 2026
https://github.com/valeriopagliarino/tcf-2021-unito-public
Exam project of the course "Computing Tecniques for Physics" - Università degli Studi di Torino - Physics department - 2021
cern-root data-analysis geant4-simulation monte-carlo-simulation object-oriented-programming physics
Last synced: 27 Mar 2025
https://github.com/AnonCatalyst/WebHound
WebHound is your Python-powered command-line assistant for sharp and efficient web searches! It sniffs out data from major search engines and detects social platforms, helping you uncover valuable insights and stay ahead of the game. 🌐🔍✨
awareness data-analysis data-visualization information-retrieval osint osint-python osint-reconnaissance osint-resources osint-tool osint-tools osinttools web-osint webscraping
Last synced: 29 Jul 2025
https://github.com/yashika-malhotra/strategic-analysis-of-retail-brand-in-south-america-using-sql
Leveraged Big Query and MySQL to analyze 100K records for sales optimization, trend identification, and enhancing customer satisfaction for a retail brand in South America and to provide insights and recommendations to improve their userbase and improve their services
bigquery data-analysis data-science database database-schema google-bigquery mysql-server sql
Last synced: 18 Apr 2026
https://github.com/dwhitena/daal-go
Use Intel's DAAL from Go
artificial-intelligence big-data data-analysis data-science machine-learning
Last synced: 13 Sep 2025
https://github.com/malkiii/data-analysis-agent
PandasAI + Gradio app
ai-agents data-analysis gradio llm pandas pandasai python
Last synced: 12 Aug 2025
https://github.com/aydinnyunus/dictionary
Dictionary
data data-analysis data-science data-structures data-visualization database dataset dictionaries dictionary dictionary-learning python python-2 python-3 python-3-6 python-library python-script python2 python27 python3 python36
Last synced: 09 May 2025
https://github.com/forhadulislam/sna-project
A project for Social Network Analysis. Analyzed yahoo query logs
data-analysis data-mining social-network
Last synced: 13 May 2026
https://github.com/mohammadkarbalaee/python-for-data-analysis-book
All the practice and code that I am doing while I read the book called, Python for data analysis
data-analysis data-science python
Last synced: 27 Mar 2025
https://github.com/dicook/tutorial_make_better_data_plots
Materials for a workshop in June 2025
data data-analysis data-science data-visualization r statistical-graphics statistics
Last synced: 25 Jun 2025
https://github.com/gbikram/isoon-leak-exploration
Data Analysis of iSoon's Leaked Data Dump
cyberthreatintelligence data-analysis jupyter-notebook nlp python
Last synced: 19 May 2026
https://github.com/lafayettegabe/nlp-resume-extraction
📝 NER (Named Entity Recognition) project aimed at solving the problem of manually shortlisting resumes by automating the process. This project proposes using NLP techniques and NER model to classify and extract relevant entities from resumes such as person name, college name, academics information, relevant experiences, skill set, etc.
big-data data data-analysis data-science eda ner nlp resume-extractor
Last synced: 03 Apr 2025
https://github.com/karan-malik/uberdataanalysis
Uber Data Analysis and Visualization using Python
data-analysis data-analysis-python data-analytics data-science data-visualization dataanalysis matplotlib-pyplot numpy pandas pandas-dataframe python python3 seaborn uber uber-data
Last synced: 04 Mar 2025
https://github.com/DataHerb/dataherb-python
Python Package for DataHerb: create, search, and load datasets.
data data-analysis data-mining database dataset python
Last synced: 08 May 2025
https://github.com/nragland37/event-optimization-tool
R-based Shiny application that maps availability and identifies optimal engagement times to enhance participation within an organization
data-analysis data-cleaning data-preparation heatmap r shiny shiny-app tidyverse
Last synced: 02 Feb 2026
https://github.com/lucs1590/strava-analysis
🏃📊 Using strava to do personal analyses and to practice data scientist skills.
data-analysis data-science github jupyter-notebook python3 strava strava-api
Last synced: 08 Mar 2026
https://github.com/romac/adaproject
🔬 Project proposal for the Applied Data Analysis course at EPFL
Last synced: 31 Dec 2025
https://github.com/super-lou/card
🎴 Card of Analyse and Diagnostic in R for a user-friendly experience of data aggregation with EXstat
aggregation climate-change climate-data climate-science data-analysis data-science diagnostic environment environment-variables hydrology hydrology-statistical inrae r statistics tools user-friendly
Last synced: 13 Apr 2025
https://github.com/nikolas-virionis/polynomial-regression
Python package that analyses the given datasets and comes up with the best regression representation with either the smallest polynomial degree possible, to be the most reliable without overfitting or other models such as exponentials and logarithms
data-analysis exponential-regression flexibility logarithmic-regression logistic-regression polynomial-regression python sinusoisdal-regression statistics
Last synced: 06 Apr 2026
https://github.com/quantumudit/analyzing-cleanaway-services
This project focuses on scraping all the service locations across Australia and their associated attributes from "Cleanaway" website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.
data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping
Last synced: 25 Jan 2026
https://github.com/louis-heraut/aeag_toolbox
🛠️ R toolbox to provide a simple way of interacting with all the code necessary to carry out hydrological stationnarity analysis for the Agence de l'Eau Adour-Garonne (AEAG)
climate climate-change climate-data data-analysis data-visualization environment hydrology inrae low-water mann-kendall mann-kendall-tests r stationarity-test statistics
Last synced: 08 Apr 2026
https://github.com/cjunwon/youtube-data-analysis
End-to-end Youtube data analysis project using Youtube Data API, MySQL, AWS, Flask
aws-rds data-analysis datapipeline flask nlp pandas python shell sql vader-sentiment-analysis youtube youtube-api
Last synced: 17 Feb 2026
https://github.com/burhanahmed1/recipe-recommendor-using-pyspark
A smart recipe recommendation system that suggests recipes based on ingredient similarities. This project is done in PySpark
data-analysis data-science datawrangling education learning-python machine-learning machine-learning-algorithms nltk-python numpy pandas pyspark python python-project reccomendersystem recommendation-system
Last synced: 13 Mar 2025