Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-16 00:07:27 UTC
- JSON Representation
https://github.com/pythondeveloper6/matplotlib-for-beginners
how to visualize your data using matplotlib
data-analysis matplotlib numpy pandas python visualization
Last synced: 14 Mar 2026
https://github.com/cgivre/drill-geoip-functions
GeoIP Functions for Apache Drill
apache-drill city country data data-analysis data-science drill geoip-functions ip-address ipv4
Last synced: 12 Apr 2025
https://github.com/kmario23/pytorch-padawan
Exercises, Descriptions, and Visualizations to build intuitions and confidence in working with PyTorch for accelerated Scientific Computing
data-analysis data-science data-visualization deep-learning deep-neural-networks distributed-computing gpu-computing jupyter machine-learning ndarray python python3 pytorch scientific-computing scientific-visualization tensor tensor-processing vectorization vectorized-computation visual-computing
Last synced: 07 Apr 2025
https://github.com/mahshaaban/intro_data_r
A gentle introduction to data analysis in R
data-analysis image-analysis qpcr-analysis r
Last synced: 28 Apr 2025
https://github.com/theportus/asc-analysis
DH Scraping and Analyzing the ASC database with Jupyter Notebooks
anglo-saxon asc-database beautifulsoup beautifulsoup4 binder data-analysis data-visualization digital-history digital-humanities history jupyter-notebook medieval network-analysis paleography prosopography python python3 webscraping
Last synced: 22 Jan 2026
https://github.com/rizwanmunawar/data-analysis-on-csv-datasets-machine-learning-
Data Analysis and model building on CSV datasets.
classification classification-model data-analysis data-science machine-learning regression
Last synced: 14 Aug 2025
https://github.com/arsalanjabbari/imdb-top-250-movies-analysis
Academic research on IMDb's Top 250 Movies entails scraping and cleaning data, followed by analyzing genres, directors, release years, IMDb ratings, and actors' influence. This analysis offers insights into evolving cinematic preferences and demonstrates the value of data-driven research in understanding cultural phenomena.
Last synced: 31 Aug 2025
https://github.com/photosynq/photosynq-r
R package to conveniently access project data from the PhotosynQ website
data-analysis photosynq rstudio
Last synced: 22 Oct 2025
https://github.com/zcebeci/adana
A Complete Toolbox for Adaptive and Hybrid Genetic Algorithms in R
adaptive-genetic-algorithms biologically-inspired-algorithm data-analysis data-science evolutionary-algorithms genetic-algorithms global-optimization-algorithms hybrid-genetic-algorithm multi-objective-optimization nature-inspired-algorithms single-objective-optimization
Last synced: 08 Oct 2025
https://github.com/robinmillford/cortex-ai-multi-model-insights-hub
Cortex AI: Multi-Model Insights Hub is an advanced platform that leverages cutting-edge AI to empower your research, analysis, and data exploration. By integrating multiple Large Language Models (LLMs) with a sophisticated Retrieve-and-Generate (RAG) system
article-extractor chatbot data-analysis data-visualization deepseek-chat deepseek-r1 llama3 llm pdf-document-processor rag streamlit-webapp summarizer vector-database
Last synced: 28 Oct 2025
https://github.com/maximo3k/linkedin-sales-navigator-scraper
LinkedIn Sales Navigator saved Search into CSV
data-analysis linkedin linkedin-api linkedin-sales-navigator linkedin-scraper sales salesnavigator scraper
Last synced: 24 Oct 2025
https://github.com/neutrinoceros/gpgi
A lightweight Python library for efficient in-RAM particle deposition on rectilinear, unrefined grids.
data-analysis grid particles performance
Last synced: 22 Apr 2025
https://github.com/vvipjain/bank-loan-report
Bank Loan Reports
data data-analysis data-visualization powerbi sql
Last synced: 19 Jan 2026
https://github.com/carpentries-incubator/data-harvesting-for-agriculture
Data Harvesting for Agriculture
agriculture beta carpentries-incubator data-analysis data-cleaning data-handling english lesson qgis r
Last synced: 25 Oct 2025
https://github.com/marcosalvalaggio/edamame
Exploratory data analysis tools
classification data-analysis data-science explainable-ai exploratory-data-analysis exploratory-data-visualizations feature-engineering machine-learning pandas pypi-package regression sklearn-library
Last synced: 12 Jun 2025
https://github.com/globeandmail/startr-cli
A command-line scaffolder for the startr R project template
data-analysis data-journalism data-visualization journalism r
Last synced: 23 Apr 2025
https://github.com/qin-yu/r-global-financial-crisis
2018 [R] Data analysis: What happened during the 2007-09 financial crisis?
data-analysis data-visualization finance financial-analysis industry r risk-management statistical-analysis statistical-methods
Last synced: 11 Jul 2025
https://github.com/asepscareer/visualization-using-plotly-and-covid-19-data
Covid-19 Analysis using plotly : Choropleth Maps
choropleth-map covid-19 data-analysis data-visualization plotly python
Last synced: 12 Apr 2025
https://github.com/shervinnd/bazar_app_store_eda
Bazar App Data analysis code to find the most downloaded category and most popular installed apps
data data-analysis data-science dataanalysis eda python
Last synced: 15 Apr 2025
https://github.com/maksimekin/umd_data_challange_2020
Ocean Clean up data analysis project for the UMD Data Challenge 2020. Data Exploration for a Sustainable Planet.
cleanup competition data-analysis data-science folium geolocation machine-learning ocean planet pollution sklearn sustainability time-series trash umd
Last synced: 05 Jul 2025
https://github.com/waveform80/structa
A small utility for analyzing data structures (e.g. JSON files)
csv data-analysis data-visualization datajournalism datawrangling json yaml
Last synced: 06 Sep 2025
https://github.com/paezha/edashop
An open educational resource to teach a workshop on Exploratory Data Analysis in R
data-analysis exploratory-data-analysis open-educational-resources package r rstats workshop-materials
Last synced: 18 Mar 2025
https://github.com/theengineeringworld/numpy-data-science
NumPy Data Science Essential Traing COurse. Part of Youtube Course Offered by TheEngineeringWorld.
data-analysis data-science numpy numpy-exercises numpy-library numpy-tutorial python python-3-6 python3 scipy2018
Last synced: 09 Oct 2025
https://github.com/multiwoven/docs
Multiwoven Documentation
data-activation data-analysis data-engineering data-warehouses mlops open-source reverse-etl
Last synced: 14 Apr 2025
https://github.com/darsan-in/rumour-monger-spotter
Rumour Monger Spotter is a prototype developed during a national-level cyber hackathon to identify false information on Twitter. Using the Google Fact Check API and a Multinomial Naive Bayes classifier, the tool analyzes tweet content to assess the likelihood of misinformation. Despite a development window of less than 24 hours, the project won a t
ai data-analysis fact-checking hackathon india naive-bayes national-competition natural-language-processing prototype real-time-analysis social-media text-classification tweet-content twitter
Last synced: 12 Oct 2025
https://github.com/hackersandslackers/plotly-chartstudio-tutorial
📈 📊 Create Cloud-hosted Charts with Plotly Chart Studio.
data-analysis data-science data-visualization pandas pandas-python plotly plotly-chart-studio plotly-python tutorial
Last synced: 28 Apr 2025
https://github.com/ethan-wickstrom/rrrs
Welcome to RRRS, a rapid, hyper-optimized CSV random sampling tool designed with performance and efficiency at its core. Crafted meticulously in Rust, RRRS offers an unparalleled solution for extracting random data samples from CSV files swiftly and effortlessly.
analytics cli command-line command-line-tool data data-analysis data-science dataset rust rust-lang sample samples
Last synced: 16 May 2025
https://github.com/dcs-training/datavisualisationwithr
Data Visualisation with R Workshop (delivered by the Centre in December 2020). This workshop is focusing on visualising your data. Go to the readme file
data-analysis data-visualisation data-wrangling r
Last synced: 25 Apr 2025
https://github.com/solrikk/datadigger
DataDigger is a powerful and intuitive web application designed to extract and analyze data from web pages.
business-intelligence content-extraction data-analysis data-collection data-extraction data-mining go golang-api html-parser marketing-tools metadata-extraction research-tools seo-tools web-application web-crawling web-scraping web-tools
Last synced: 15 Apr 2025
https://github.com/fatihilhan42/data-science-projects
In this repo, there are (beginner-upper) level projects in the field of data science. I will host these projects that I have done in this field every day in this repo. With the hope that it will be useful to those who are interested in the field of data science like me and will just start...
data-analysis data-engineering data-mining data-science data-structures data-visualization database datascience fatihilhan fortytwo fortytwofficial jupyter-notebook python
Last synced: 11 Oct 2025
https://github.com/afsalashyana/whatsapp-chat-analyzer
Analyze WhatsApp chats with beautiful graphs. Written in JavaFX
data-analysis data-visualization javafx javafx-14 javafx-application whatsapp
Last synced: 04 Sep 2025
https://github.com/sondosaabed/data-visualization-with-matplotlib-and-seaborn
Learning to apply sound design and data visualization principles to the data analysis process. Also learning how to use analysis and visualizations to tell a story with data.
data-analysis data-analyst-nanodegree data-visualization matplotlib python seaborn seaborn-plots
Last synced: 09 Apr 2025
https://github.com/mikebild/introduction-python
An introduction to Python, Flask, Numpy, MatPlotLib and Pandas
data-analysis flask introduction iterables json jupyter matplotlib microservice numpy pandas python python3 sqlalchemy tutorial
Last synced: 11 Apr 2026
https://github.com/sondosaabed/introduction-to-data-analysis-with-pandas-and-numpy
Learning the data analysis process of questioning, wrangling, exploring, analyzing, and communicating data. Working with data in Python using libraries like NumPy and pandas.
data-analysis data-analyst-nanodegree data-wrangling numpy pandas python
Last synced: 09 Apr 2025
https://github.com/lucasbotang/real_estate_management_data_analysis
Data analysis for real estate management
data-analysis excel mysql tableau
Last synced: 06 Oct 2025
https://github.com/ttiagojm/ground-truth-vs-prediction
Machine and Deep Learning notebooks
data-analysis data-science deep-learning kaggle machine-learning opencv tensorflow
Last synced: 22 Apr 2025
https://github.com/mindful-ai-assistants/sp2024-election-analysis
📊 An analysis of voting patterns in São Paulo's 2024 elections, focusing on voter behavior, absenteeism, and geographic trends.
beautifulsoup dashboards data-analysis data-cleaning-and-preprocessing data-science dataset-creation datavisualization election-sp-brazil-2024 geolocalization geolocation geolocator maps oneness-consciousness power-bi python web-scraping-python
Last synced: 14 Apr 2025
https://github.com/bjpop/gurita
A convenient and expressive tool for data analytics and plotting on the command line
command-line data-analysis data-science pandas plotting python
Last synced: 04 Feb 2026
https://github.com/1uc1f3r616/dark-net-websites-dataset
Dataset of Onion Websites
crawler darknet data-analysis dataset onion search-engine website
Last synced: 27 Feb 2025
https://github.com/arv-anshul/campusx-project-notebooks
Capstone project by Campusx in DSMP course.
campusx campusx-dsmp data-analysis data-science eda jupyter-notebook machine-learning ml-project nlp project python3 recommender-system regression streamlit
Last synced: 22 Aug 2025
https://github.com/cheminfo/compass
Strategy for improved characterisation of human metabolic phenotypes using a COmbined Multiblock Principal components Analysis with Statistical Spectroscopy (COMPASS)
data-analysis metabolomics metabonomics multiblock nmr-spectroscopy pca population-analysis population-model
Last synced: 23 Mar 2025
https://github.com/cparmet/pandas-checks
🐼🩺 Pandas Checks: Non-invasive health checks for Pandas method chains
data-analysis data-engineering data-science method-chaining pandas
Last synced: 27 May 2026
https://github.com/sushantdhumak/traffic-forecasting-using-iot-sensor-data
Demonstrates how to utilize XGBoost for traffic forecasting using data gathered from IoT sensors, highlighting its efficiency in processing complex datasets and delivering accurate predictions.
data-analysis data-visualization exploratory-data-analysis feature-engineering feature-importance feature-selection gridsearchcv hyperparameter-optimization hyperparameter-tuning iot random-search xgboost-regression
Last synced: 26 Mar 2025
https://github.com/jackfiszr/pl2xl
Nodejs-polars wrapper with `readExcel` and `writeExcel` methods.
data-analysis data-science deno excel excel-reader excel-writer nodejs polars
Last synced: 21 Jan 2026
https://github.com/yuukidach/twitchanal
Using AI for eGaming analytics to discover community interactions and behaviors of Twitch.
data-analysis data-analytics twitch
Last synced: 05 Jan 2026
https://github.com/abhash-rai/traffic-image-classifier
A web-based solution utilizing a robust tensorflow model for precise traffic condition classification made in ReactJs and FastAPI for backend.
cnn cnn-classification cnn-keras cnn-model data-analysis data-science data-visualization fastapi keras keras-tensorflow python python-3 python3 react reactjs tensorflow traffic traffic-classification transfer-learning
Last synced: 23 Feb 2026
https://github.com/itzmeanjan/indian-railway
Exploring Indian Railways time table dataset, with :heart:
data-analysis data-visualization indian-railways matplotlib python python3 railway
Last synced: 17 Oct 2025
https://github.com/efharkin/ez-ephys
Easy IO, inspection, and manipulation of electrophysiological data.
data-analysis electrophysiology neurophysiology neuroscience patch-clamp python
Last synced: 14 Jan 2026
https://github.com/tirendazacademy/hands-on-data-science-with-gcp
Google BigQuery Tutorial
big-data big-data-analytics bigdata bigquery bigquery-ml bigqueryml cloud-computing data-analysis data-analytics data-engineering data-science dataanalysis dataengineering google-bigquery google-cloud-platform machienlearning machine-learning
Last synced: 06 Oct 2025
https://github.com/elfgk/diabetes-data-analysis
diabetes data analysis
analysis data-analysis diabetes-data-analysis eda jupiter-notebook
Last synced: 31 Aug 2025
https://github.com/jonzeolla/lab-securitydataanalysis
An introductory lab to Security Data Analysis (using Apache Metron (incubating)).
apache-metron data-analysis lab metron security
Last synced: 03 Jul 2025
https://github.com/trainingbypackt/splunk-7-essentials-elearning
Build an elaborate Splunk enterprise environment that will extract powerful insights from your machine-generated big data
data-analysis eventgen indexing machine-learning splunk sub-search visualization
Last synced: 01 Mar 2026
https://github.com/lit26/trump_tweet_analysis
Analysis of Trump's original tweets.
data-analysis lda-model topic-modeling
Last synced: 12 Apr 2025
https://github.com/1ayanabil1/healthcare-machine-learning
Explore our open-source repository focused on healthcare machine learning. We've developed predictive models for cardiovascular disease, diabetes, breast cancer, and more. Our projects employ diverse machine learning algorithms and data science techniques, enhancing early detection, diagnosis, and patient outcomes.
data-analysis data-science deep-learning disease disease-detection disease-modeling disease-prediction eda healthcare-application heathcare jupyter-notebook machine-learning machine-learning-algorithms machinelearning-python python
Last synced: 28 Apr 2025
https://github.com/nafisalawalidris/predicting-credit-card-approvals
Explore credit card approval prediction through data analysis and machine learning. Preprocess data, train logistic regression models, and optimize hyperparameters. Learn data preprocessing, feature engineering, model training, and evaluation. Dive into the world of machine learning with Python and popular libraries.
approval-prediction credit-card data-analysis data-preprocessing feature-engineering hyperparameter-optimization libraries logistic-regression machine-learning model-evaluation model-training python python3
Last synced: 19 Apr 2025
https://github.com/nirantak/programming-exercises
Programming exercises / Coding problems
data-analysis image-processing intelligent-systems matlab programming python python3
Last synced: 01 May 2025
https://github.com/vikas-ukani/data-analysis-with-python---zero-to-pandas
Attend Complete Data Analysis Course from freecodecamp.com
data-analysis data-science data-visualization machine-learning numpy numpy-arrays pandas
Last synced: 17 Oct 2025
https://github.com/sonigarima/donation-management-system
A donation management system for NGOs and Donors. The project is designed for Cognizance IITR 2021 - Salesforce Codathon.
data-analysis donation-management reactjs
Last synced: 07 Sep 2025
https://github.com/dsnchz/solid-g6
A SolidJS component library for graph visualization, powered by @antv/g6
analysis data-analysis data-visualization graph graph-visualization node-ui solidjs visualization
Last synced: 13 Oct 2025
https://github.com/martinthoma/bad-stats
Examples of how not to do statistics / visualizations
data-analysis statistics visualizations
Last synced: 07 Jan 2026
https://github.com/mscbuild/mscbuild
🏆 Сreating digital experiences that not only meet user expectations, but also drive engagement, loyalty and, ultimately, business success. Passionate developer from Latvia .
analysis best-practices coding config data-analysis data-science design developer freelance fullstack github-config latvia mscbuild profile readme seo site software-engineering web webapp
Last synced: 31 Jan 2026
https://github.com/prabhupavitra/data-visualization-with-python
This repository houses data visualization with Python.
barplot data-analysis data-visualization datavisualization dotplot grouped-bar-chart heatmap matplotlib matplotlib-pyplot pandas python3 seaborn stacked-bar-chart
Last synced: 09 May 2026
https://github.com/zmyzheng/signature-authentication-pen
Signature Authentication Pen, a cloud based IoT project which realizes identity authentication by exploiting the signature biometric features of the users. Details:
android aws data-analysis identity-authentication iot neural-network signature-authentication-pen
Last synced: 03 May 2026
https://github.com/banisterious/obsidian-oneirometrics
OneiroMetrics (Turning Dreams Into Data). A plugin for Obsidian to track and analyze dream journal metrics.
data-analysis dream-analysis dream-diary dream-journal dreams journaling metrics obsidian obsidian-plugin self-improvement tracking
Last synced: 22 Apr 2026
https://github.com/quantumudit/consumer-goods-sales-analysis
This project focuses on analyzing and visualizing the consumer goods sales in the United States between 2015-2016 using Python & Power BI.
data-analysis data-visualization database jupyter-notebook python sqlite
Last synced: 29 Apr 2026
https://github.com/chaitanyac22/house-price-prediction-project-for-a-us-based-housing-company
The goal of this project is to garner data insights using data analytics to purchase houses at a price below their actual value and flip them on at a higher price. This project aims at building an effective regression model using regularization (i.e. advanced linear regression: Ridge and Lasso regression) in order to predict the actual values of prospective housing properties and decide whether to invest in them or not.
advanced-linear-regression business-analytics data-analysis data-cleaning data-manipulation data-visualization exploratory-data-analysis feature-engineering lasso-regression linear-regression machine-learning model-building model-evaluation prediction-model python3 regularization rfe ridge-regression statistics
Last synced: 30 Apr 2026
https://github.com/depressioncenter/mden
Mobile technologies code from the University of Michigan's Mobile Data Experts Network (MDEN), featuring data cleaning automations, REDCap project templates, and links to useful external modules. [DOI: 10.6084/m9.figshare.25438714]
automation data-analysis data-cleaning fitness-tracker heart-rate-data mobile-data mobile-development mquery powerautomate powerbi powerquery python r sleep-data smartwatch-data tableau
Last synced: 25 Feb 2026
https://github.com/tirendazacademy/pandasai-tutorials
Tutorials for PandasAI
ai data-analysis data-science data-visualization llms openai pandas pandasai python
Last synced: 27 Mar 2026
https://github.com/jggautier/dataverse-curation-assistant
A small software application that provides a UI for automating things in repositories that use the Dataverse software
data-analysis dataverse hacktoberfest python
Last synced: 01 Mar 2026
https://github.com/edaaydinea/dataquest-projects
This repository is included data analyst, and data science-guided projects through Dataquest.
Last synced: 07 Feb 2026
https://github.com/rudra496/science
🔬 Interactive science experiments and research simulations — physics, chemistry, biology with 3D visualizations and real-time data analysis
data-analysis education experiments hacktoberfest javascript python research science simulation threejs
Last synced: 09 Jun 2026
https://github.com/quantumudit/regional-sales-analysis
This project focuses on analyzing and visualizing the United States regional sales for a fictitious company in between 2018-2020 using Python & Power BI.
data-analysis data-visualization databases jupyter-notebook power-bi python sqlite
Last synced: 02 May 2026
https://github.com/abrahamkoloboe27/dashboard-streamlit-atut
Lien de l'application
dashboard data-analysis data-visualization pandas plotly python streamlit visualization
Last synced: 05 Mar 2026
https://github.com/cyyeh/duckdb-data-agent
An AI-powered data analysis agent with a built-in SQL playground. Upload data files (CSV, JSON, Parquet, Excel) and ask questions in plain English — the agent delegates to a specialized subagent for SQL queries and renders charts inline — or switch to the SQL editor for direct queries.
agent claude-code csv data-analysis duckdb excel json langfuse llm parquet python react sql typescript
Last synced: 04 Jun 2026
https://github.com/happybono/avocadosmoothie
VB.NET project for running-median filtering. Users set kernel radius, border count, and pick MiddleMedian or AllMedian. Processing runs in parallel with a progress bar and smooth UI.
algorithms calibration correction data-analysis median outliers quicksort running-median runningmedian smoothing smoothing-methods statistics visual-basic
Last synced: 10 Feb 2026
https://github.com/niamoto/niamoto
Niamoto is a command-line application and library focused on processing and publishing botanical data
botany cli-application data-analysis data-processing data-publication python-library
Last synced: 23 Apr 2026
https://github.com/quantumudit/analyzing-whiskyexchange-whisky
This project focuses on scraping data related to Japanese Whiskey from the Whiskey Exchange website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.
data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping
Last synced: 02 May 2026
https://github.com/karlyndiary/restaurant-ratings-analysis
Restaurant Ratings Analysis using Microsoft Power BI
dashboard data-analysis data-analysis-powerbi power-bi power-bi-dashboard report restaurant restaurant-ratings-analysis restaurant-ratings-dashboard restaurant-ratings-data-analysis restaurant-ratings-power-bi-dashboard
Last synced: 26 Feb 2026
https://github.com/mertcandav/julenum
A high-performance library for numerical methods and scientific computing in Jule
data-analysis jule julelang math matrix scientific-computing statistics
Last synced: 09 Feb 2026
https://github.com/dcs-training/digital-method-of-the-month
In this repository you are going to find the documents we produced to support the discussion in our Digital Methods of the Month. These documents will help you orienting yourself if you want to pickup the method in your research. Go to the readme file
3d-data data-analysis data-visualisation data-wrangling geographical-data gis good-practices-digital-research machine-learning network-analysis open-research preregistration statistics text-analysis
Last synced: 25 Feb 2026
https://github.com/avinashkranjan/basic-data-analysis-and-visualization-in-python
📊 Some of the most important python tools in data science for Data Analysis and Data Visualization.
data-analysis data-science matplotlib matplotlib-pyplot numpy pandas plotly seabourne
Last synced: 30 Oct 2025
https://github.com/natlee/myanimelist-comment-crawler
Crawl all reviews and infomation of Anime works on MyAnimeList. ;)
anime crawler data-analysis data-mining data-science kaggle kaggle-dataset myanimelist python requests scrapy-crawler sqlite
Last synced: 14 Apr 2025
https://github.com/accurat/react-dataviz
⚛📊🚀 React components to build powerful interactive data visualizations
d3 data-analysis data-visualization react react-components
Last synced: 19 Jun 2025
https://github.com/pythondeveloper6/store-sales-eda
simple EDA with some insights on Store Sales
data-analysis eda matplotlib numpy pandas seaborn
Last synced: 11 Apr 2025
https://github.com/mch-fauzy/data-science
Repository containing portfolio of data science and machine learning projects. Presented in the form of iPython Notebooks
data-analysis data-science data-visualization ipython-notebooks machine-learning natural-language-processing portfolio
Last synced: 24 Sep 2025
https://github.com/relvaner/nodes4j-core
Framework for parallel processing based on Actor4j. Useful for data analysis.
actor actor-model actor4j actors batch-processing data-analysis graph-processing java java-17 message-passing parallelization reactive-system stream-processing
Last synced: 14 Jul 2025
https://github.com/sushant1827/traffic-forecasting-using-iot-sensor-data
Demonstrates how to utilize XGBoost for traffic forecasting using data gathered from IoT sensors, highlighting its efficiency in processing complex datasets and delivering accurate predictions.
data-analysis data-visualization exploratory-data-analysis feature-engineering feature-importance feature-selection gridsearchcv hyperparameter-optimization hyperparameter-tuning iot random-search xgboost-regression
Last synced: 08 Mar 2026
https://github.com/ziaeemehr/cpp_workshop
Scientific programming toolbox with C++
cpp data-analysis data-science learning-by-doing programming scientific-computing telegram-channel youtube
Last synced: 15 May 2026
https://github.com/astrodynamic/retailanalitycs-in-postgresql
Develop a SQL script to create a database with tables, views, roles, and functions. Form personalized offers to increase average check, frequency of visits, and cross-selling.
bd csv data-analysis data-export data-input data-manipulation data-validation database-management functions git margin offers postgresql retail role-permission-management selling sql transaction tsv views
Last synced: 06 Apr 2026
https://github.com/jimbrig/lossrunAnalyzer
R Package and Shiny App to Analyze Insurance Lossruns
actuarial data-analysis data-mining data-science insurance r record-linkage risk-management shiny
Last synced: 30 Jul 2025
https://github.com/pythondeveloper6/supermarket-eda-seaborn-for-beginners
learn Seaborn basics using a simple EDA
data-analysis eda numpy pandas seaborn visualization
Last synced: 11 Apr 2025
https://github.com/sowinskibraeden/schedulegeneratorapp
The Desktop Application for my schedule-generator algorithm, allowing users to easily interact with the algorithm and its variables to generate schedules as documents for students individually as well as the master timetable
algorithm csv data-analysis dataclasses python-docx python-typing python311 xlsxwriter
Last synced: 09 Jul 2025
https://github.com/searchformyusername/dark-net-websites-dataset
Dataset of Onion Websites
crawler darknet data-analysis dataset onion search-engine website
Last synced: 16 Jun 2025
https://github.com/irfanchahyadi/ml-notes
Complete personal notes for performing Data Analysis, Preprocessing, and Training ML model.
data-analysis machine-learning plotting python
Last synced: 11 Jul 2025
https://github.com/csparpa/last.fm-stats
Exercise on Last.fm data aggregation
data-analysis exercise lastfm lastfm-api python
Last synced: 21 May 2026
https://github.com/louis-heraut/card
🎴 Card of Analyse and Diagnostic in R for a user-friendly experience of data aggregation with parametrisation file.
aggregation climate-change climate-data climate-science data-analysis data-science diagnostic environment environment-variables hydrology hydrology-statistical inrae r statistics tools user-friendly
Last synced: 09 Mar 2026
https://github.com/quantumudit/basketball-players-analysis
The project focuses on analyzing salaries and various other in-game metrics of top NBA basketball players from 2005-14 by performing exploratory data analysis with Python and Jupyter Notebook and by visualizing the data in an insightful dashboard made with Power BI
data-analysis jupyter-notebook power-bi python
Last synced: 17 May 2026
https://github.com/davidzajac1/reptoro
A Data Visualization and Analytics Platform for the Reptile Industry
analytics data-analysis data-visualization plotly-dash python
Last synced: 15 May 2026
https://github.com/saksham-joshi/sentiment_analyzer
Analyze the sentiment of a text stored in a string or file and understand the reason why your blogs and posts are not ranking up.
data-analysis data-analytics python sentiment-analyser sentiment-analysis sentiment-analysis-without-nltk
Last synced: 22 Aug 2025