Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-02-07 00:07:52 UTC
- JSON Representation
https://github.com/mrigankpawagi/exop
Quest for a Habitable Exoplanet
css data-analysis font-awesome googlefonts html5 javascript jquery materializecss planets pwa space webapp
Last synced: 27 Sep 2025
https://github.com/martinboller/cc-build
Builds latest version of CyberChef and install it with NGINX on another system. CyberChef is a simple, intuitive web app for analyzing and decoding data without having to deal with complex tools or programming languages.
analysis blueteam compression cyberchef data-analysis data-manipulation decode encode encryption hashing parsing virtual-machine
Last synced: 03 Aug 2025
https://github.com/AnonCatalyst/WebHound
WebHound is your Python-powered command-line assistant for sharp and efficient web searches! It sniffs out data from major search engines and detects social platforms, helping you uncover valuable insights and stay ahead of the game. 🌐🔍✨
awareness data-analysis data-visualization information-retrieval osint osint-python osint-reconnaissance osint-resources osint-tool osint-tools osinttools web-osint webscraping
Last synced: 29 Jul 2025
https://github.com/lebrancconvas/personal-saved-datasets
Saved Datasets for doing stuff in the area of Statistics & Data Science.
data-analysis data-science database datasets excel kaggle kaggle-dataset microsoft-excel sample-dataset
Last synced: 26 Feb 2025
https://github.com/julianolaurentino/sql_sample
Estudos voltados para criação de consultas em SQL. Este repositório está sendo alimentando semanalmente.
data-analysis sql sql-query sql-server
Last synced: 05 Mar 2025
https://github.com/neelshah18/neelshah18.github.io
Neel Shah's Website
article blog data-analysis data-science data-visualization deep-learning machine-learning personal-website
Last synced: 10 Aug 2025
https://github.com/josmarcristello/geokmlanalyzer
The Geo Kml Analyzer is a Python-based tool designed to process and analyze elevation data from KML files. Uses google maps to obtain elevation data (interpolates if necessary) and spherical trigonometry equations to calculate the distance.
data-analysis geolocation geospatial gis google-maps-api gps kml python
Last synced: 30 Jul 2025
https://github.com/rafaelbroseghini/data-analysis-visualizations-ml
:bar_chart: NLP, Regression Models, Story telling, Time Series Analysis ... in Python
data-analysis data-science machine-learning machine-learning-algorithms time-series visualization
Last synced: 03 Oct 2025
https://github.com/zmyzheng/signature-authentication-pen
Signature Authentication Pen, a cloud based IoT project which realizes identity authentication by exploiting the signature biometric features of the users. Details:
android aws data-analysis identity-authentication iot neural-network signature-authentication-pen
Last synced: 30 Aug 2025
https://github.com/andreip/twitter-authorities
Find authorities for Twitter topics. [Licenta][Undergraduate thesis]
data-analysis mongodb python twitter-topics
Last synced: 22 Aug 2025
https://github.com/monteirooscar98/salario-minimo-brasil
Extração de dados através de API do Banco Central, WebScraping no site do Dieese e Análise dos dados.
api brasil data-analysis opendata python webscraping
Last synced: 20 Aug 2025
https://github.com/malkiii/data-analysis-agent
PandasAI + Gradio app
ai-agents data-analysis gradio llm pandas pandasai python
Last synced: 12 Aug 2025
https://github.com/danielpuentee/outdpik
The fundamental toolkit for outliers search and visualization. It aims to be the fundamental high-level package for this purpose.
data-analysis matplotlib numpy python
Last synced: 16 Aug 2025
https://github.com/mynenik/xyplot-32
Extensible Plotting and Data Analysis Program for 32-bit x86 GNU/Linux
cpp data-analysis data-manipulation data-visualization forth linux-app motif xwindows
Last synced: 02 Aug 2025
https://github.com/cosmoduende/r-holy-books-sentiment-data-analysis
What's the most positive or negative religion? . Sentiment and Data Analysis of Holy Books with R. Analysis of religious dogmas by exploring their Holy Books (The Bible, The Quran, The Dhammapada, and The Book of Mormon) with R
bible book-of-mormon data-analysis data-analytics data-visualisation data-visualization dataviz dhammapada holy-scriptures quran religions-studies religious religious-studies sentiment-analysis sentiment-polarity sentimental-analysis text-analysis text-analytics text-mining text-mining-analysis
Last synced: 11 Apr 2025
https://github.com/tushar2704/stats-mosaic-streamlit
Stats-Mosaic-Streamlit is a comprehensive GitHub repository that aims to provide a growing collection of curated content and projects centered around statistics and its intersection with data science, machine learning, and artificial intelligence.
artificial-intelligence bivariate-analysis data-analysis data-science hypothesis-testing machine-learning statistical-learning statistics streamlit streamlit-tushar2704 univariate-analysis
Last synced: 04 Nov 2025
https://github.com/bocaletto-luca/world-bank-explorer
World Bank Explorer is an interactive and responsive web application that retrieves, visualizes, and compares global development indicators sourced from the World Bank Open Data API. The application allows users to explore data on multiple scales ... By Bocaletto Luca
api bocaletto-luca chartjs css3 data-analysis data-visualization economic-indicatos economic-trends free-data global-development html5 interactive-dashboard javascript open-data open-source publicdata responsive world-bank
Last synced: 15 Jul 2025
https://github.com/rgalyeon/machine_learning_and_data_analysis
Machine Learning and Data Analysis specialization by Yandex and MIPT
coursera data-analysis data-science machine-learning mipt python yandex
Last synced: 03 Mar 2025
https://github.com/sarincr/data-analytics-with-knime
Data Analytics with KNIME (Konstanz Information Miner), a free and open-source data analytics, reporting and integration platform. KNIME integrates various components for machine learning and data mining through its modular data pipelining concept. A graphical user interface and use of JDBC allows assembly of nodes blending different data sources, including preprocessing (ETL: Extraction, Transformation, Loading), for modeling, data analysis and visualization without, or with only minimal, programming.
ai artificial-intelligence artificial-intelligence-algorithms artificial-neural-networks data-analysis data-mining data-science data-structures data-visualization database datascience deep-learning machine-intelligence machine-learning machine-learning-algorithms machinelearning mining mining-software
Last synced: 14 Mar 2025
https://github.com/kaustubhgupta/data-analysis-hub
This is where all my Data Analysis notebooks are present. All the notebooks are either fully explored and have an explanatory readme or a medium article has been published which is linked in the README.
data-analysis data-science google-play-store kaggle matplotlib pandas seaborn
Last synced: 22 Mar 2025
https://github.com/mindful-ai-assistants/movierevenueanalysis
🎬💰 Analyze movie companies' revenue, release strategies, and financial performance using statistical techniques for actionable insights. This project explores data on total revenue, number of releases, and lifetime gross to uncover patterns that can drive strategic decisions in the film industry.
correlation-analysis data-analysis data-science heatmap jupyter-notebook oneness-consciousness open-source python statistical-analysis statistical-analysis-and-hypothesis-testing statistics ttest
Last synced: 14 Apr 2025
https://github.com/karlyndiary/global-electronics-retailer-sales-and-customer-insights
Developed an analysis using Python, SQL, and Excel to examine sales and customer demographics for a Global Electronics Retailer. The findings aim to enhance business strategies and improve overall performance.
dashboard data-analysis data-cleaning-and-preprocessing data-pipeline data-visualization etl microsoft-excel microsoft-sql-server python sql
Last synced: 28 Jun 2025
https://github.com/emptymalei/mini-lab
Some code snippets used to explain stuff to myself in my personal data science wiki
data-analysis data-mining data-science data-visualization datascience
Last synced: 07 Apr 2025
https://github.com/rawsashimi1604/jobextract
Scrapes LinkedIn data. Conducts sentiment analysis on what traits and qualifications employers are looking for.
data data-analysis data-analytics data-cleaning linkedin mvc python webscraper
Last synced: 06 Nov 2025
https://github.com/ocramz/record-encode
Generic encoding of record types
categorical-data categorical-features data-analysis data-mining data-science generic-programming machine-learning one-hot-encode preprocessing
Last synced: 14 Apr 2025
https://github.com/sivkri/imagecoloranalysis
ImageColorAnalysis is a repository with a Python script for color analysis in images using ImageMagick. It generates bash scripts for individual JPG images to analyze specific colors. It provides a flexible solution for extracting color information from images, applicable in various domains such as image classification and data analysis.
bash-scripts color-analysis computer-vision data-analysis image-classification image-processing imagemagick pavement pavement-images python-scripting stomata stomatal-index
Last synced: 22 Mar 2025
https://github.com/cjunwon/youtube-data-analysis
End-to-end Youtube data analysis project using Youtube Data API, MySQL, AWS, Flask
aws-rds data-analysis datapipeline flask nlp pandas python shell sql vader-sentiment-analysis youtube youtube-api
Last synced: 25 Jul 2025
https://github.com/arjo129/image-sorter
Sort through folders of videos and images. Root out blurred and overexposed images.
computational-photography data-analysis photo-browser photo-gallery photography uwp uwp-apps
Last synced: 25 Jul 2025
https://github.com/cego669/datathonengopevi
Equipe: Embrapeiros. Solução proposta para o Datathon do VI ENGOPE (Encontro Goiano de Probabilidade e Estatística). Obs: FOMOS CAMPEÕES!!!!!!!!
data-analysis data-science datathon python r streamlit xgboost-classifier
Last synced: 26 Jul 2025
https://github.com/nelson-gon/nelson-gon.github.io
Biologically Plausible Programming
bioinformatics blog blogdown computational-biology data-analysis data-exploration ghost ghostwriter-theme github github-pages hugo-site hugo-theme programming python3 r side-project
Last synced: 26 Jul 2025
https://github.com/felixcharotte/ibm_datascience_capstone
In this project, we predicted if the SpaceX Falcon 9 first stage will land successfully by following the data science methodology. We also summarized the results for the business stakeholders.
analysis data-analysis data-science data-visualization databases folium jupyter-notebook machine-learning machine-learning-alrgorithms matplotlib pandas plotly plotly-dash python scikit-learn scipy seaborn sql
Last synced: 26 Jul 2025
https://github.com/mohamedomar2020/random-forest
Creating a Random Forest model to predict the progression of bladder cancer
bladder-cancer cancer-genomics cancer-research data-analysis data-science genomics machine-learning machine-learning-algorithms random-forest
Last synced: 18 Sep 2025
https://github.com/billy-enrizky/kimia-farma-sales-management-database-replica-project
SQL Database Management, Then Visualizing it on Tableau!
analytics data-analysis data-visualization sql
Last synced: 27 Jul 2025
https://github.com/michaelcurrin/water-crisis-scraper
Scrape and explore data related to Cape Town's water crisis (Python3 application)
cape-town cron csv dam-levels data-analysis html open-data python3 schedule scraping south-africa water-crisis water-level webscraping
Last synced: 28 Jul 2025
https://github.com/thisisashukla/survival-analysis
Hands-On Survival Analysis in Python
data-analysis data-science survival-analysis
Last synced: 28 Jul 2025
https://github.com/skylord0001/python-daily
Python - Basic, Apache - Conf, Black Stack Hub, Data analysis, Data Structure, Google Cloud, SQL system
apache-configuration data-analysis data-structure python-scripts python-sql
Last synced: 06 Sep 2025
https://github.com/nafiealhilaly/analyze-coderhub-sa
A simple web app to analyze/explore coderhub.sa API data, this project was my first real react app.
backend data-analysis eda frontend python react reactjs
Last synced: 29 Jul 2025
https://github.com/archived-blueprints/postgresql-blueprints
Simplified blueprints for building data pipelines with PostgreSQL.
cli data-analysis data-engineering data-pipeline data-science database elt etl postgres postgresql
Last synced: 29 Jul 2025
https://github.com/BigBangData/TimesheetAnalysis
R shiny app to help analyze a bookkeeper's business - or anyone with a timesheet and some time.
bookkeeping data-analysis data-viz r-programming shiny-apps shiny-r timesheet-management
Last synced: 29 Jul 2025
https://github.com/mertcandav/julenum
A high-performance library for numerical methods and scientific computing in Jule
data-analysis jule julelang math matrix scientific-computing statistics
Last synced: 29 Jul 2025
https://github.com/casualcomputer/sql.mechanic
Functions that generate SQL queries that summarize high-dimensional tables stored in various databases (e.g. Microsoft SQL Servers, Netezza, DB2, Postgres, Oracle, MySQL, etc.).
data-analysis data-quality-checks data-science database mysql netezza oracle postgres quality-control r sql sql-server
Last synced: 30 Jul 2025
https://github.com/javadtorabikh/businessintelligencesystem
This API provides advanced business analytics capabilities for company data, offering 10 core analytical functions to transform raw business data into actionable insights. The system is built with Python and Flask, designed for reliability, scalability, and performance.
analysis business-analytics business-intelligence data-analysis mashine-learning
Last synced: 22 Sep 2025
https://github.com/frikishaan/browsing-history-analysis
This is a data analysis of my browsing history for the last 7 months.
browsing-history data-analysis jupyter-notebook python
Last synced: 22 Sep 2025
https://github.com/techytushar/india-odi-analysis
Analysis of ODI cricket matches of Indian Team
cricket data-analysis data-science pandas plotting python3
Last synced: 31 Jul 2025
https://github.com/zachlagden/spotify-listening-analyzer
A comprehensive Python tool for analyzing your Spotify listening history data.
analytics data-analysis pandas python spotify-web-api spotipy
Last synced: 31 Jul 2025
https://github.com/simoneas02/data-science
🐍 A planning study to become a data scientist and to improve my current skills. 🤘🏼🌻
data data-analysis data-science data-visualization deep-learning machine-learning pandas python3 r sql
Last synced: 31 Jul 2025
https://github.com/jpcadena/classification-tweets-national-security-ecuador
Classification of Tweets about national security at Ecuador 2022
classification classification-model data-analysis data-science ecuador insecurity machine-learning natural-language-processing nlp nltk numpy pandas python pytorch scikit-learn snscrape supervised-learning tensorflow tweet twitter
Last synced: 02 Aug 2025
https://github.com/jamesquinlan/intro-stats-mat150
Introduction to Statistics
data-analysis statistics university-course
Last synced: 03 Aug 2025
https://github.com/billy-enrizky/dgf-analysis
DGF AI Analysis, from Exploratory Data Analysis, Handling Missing Data, to Predicting DGF with various Machine Learning Model such as Logistic Regression, Support Vector Machine, Gradient Boosting, and Random Forest
data-analysis data-science exploratory-data-analysis machine-learning support-vector-machine
Last synced: 04 Aug 2025
https://github.com/zrkhadija/data-analysis-for-financial-time-series
In this notebook, we performed data analysis on financial time series data from Yahoo Finance for the US market. We examined seasonality, trends, stationarity, and other aspects such as outliers and correlations.
autocorrelation correlation-analysis data-analysis financial-analysis time-series-analysis timeseries-forecasting visualization
Last synced: 05 Aug 2025
https://github.com/prakhar-ff13/yellow-taxi-demand-prediction
Predicting Taxi Demand in various regions of New York City
data-analysis data-analytics data-science data-visualization machine-learning python3 time-series
Last synced: 09 Nov 2025
https://github.com/tesfamichael12/solar-farm-analysis
This repository contains code and analysis for exploring solar farm data from Benin, Sierra Leone, and Togo. It includes EDA, strategic recommendations for optimal solar farm locations, and an interactive Streamlit dashboard.
data-analysis eda ml solar-farm-analysis
Last synced: 07 Aug 2025
https://github.com/simranjeet97/ipl-dataanalysis
Data Analysis performed on IPL Dataset with Data Profiling, Data Pre-Processing, Data Manipulation, and Data Visualization.
artificial-intelligence data-analysis data-manipulation data-mining data-preprocessing data-science data-visualization indian-premier-league-2008-2018 ipl ipl-dataset iplayer python
Last synced: 03 Oct 2025
https://github.com/ezzz-lui/rsm-evaluationproject
Este repositorio es donde esta documentado nuestro proyecto para RSM por parte de actividad final para el bootcamp Data Analyst
Last synced: 04 Oct 2025
https://github.com/rajshrestha86/police-brutality-data-analysis
In this project, we analyze the events after George Floyd’s death. The protests and riots across the United States and sentiments of news articles of three different news sources that have different political leaning. We will see how these media reacted after Floyd’s death and see the effect of media bias on the sentiments of news for #BlackLivesMatter and #AllLivesMatter movement. We will also see if there is a correlation between the police budget and the number of protests. This analysis will help us to see if there is really a need for defunding police to reduce police brutality and casualties. We will also see the correlation of partisan segregation and number of deaths to see if political preference has an effect on the number of deaths by police.
data-analysis matplotlib pandas python sentiment-analysis web-scraping
Last synced: 04 Oct 2025
https://github.com/jethronap/asylumdataku_website
Mini website for reporting analysis of Asylum Data @ DIKU
Last synced: 04 Oct 2025
https://github.com/umbrellaleaf5/drugdesign_data_analysis
Module of the DrugDesign project responsible for loading and pre-processing data from ChEMBL and PubChem, necessary for further modeling and analysis in drug development
chembl chemistry dafe data-analysis doxygen-documentation mipt pubchem python requests
Last synced: 15 Aug 2025
https://github.com/poga/dat-ipynb-demo
use ipython notebook to analyze data in dat archive
dat data-analysis distributed jupyter-notebook
Last synced: 17 Aug 2025
https://github.com/henrylin03/video-games
Using Python and SQL to clean, analyse and visualise video games' data from Metacritic. Includes scraping using BeautifulSoup.
analysis beautifulsoup beautifulsoup4 data data-analysis data-science eda jupyter-notebook pandas python sql sqlite3 video-game video-games
Last synced: 17 Aug 2025
https://github.com/dcs-training/effectivedatavisualisation
This repository hosts the material connected to a training course developed by Dave Elsmore (Edina) for CDCS on good data visualisation. Go to the readme file
data-analysis data-visualisation data-wrangling python
Last synced: 19 Aug 2025
https://github.com/iamfoysal/data-analysis
This repository contains various examples and exercises to help learn data science using Python.
data-analysis data-science database jupyter-notebook python3
Last synced: 20 Aug 2025
https://github.com/olow304/goboard
Python Data Analysis Dashboard using Public Dataset, Django
dashboard dashboard-templates data-analysis data-science django jupyter-notebook machine-learning python sklearn
Last synced: 21 Aug 2025
https://github.com/luizassimoes/fitness-report
Create a personalized Fitness Wrapped report using your Apple Health data with this Streamlit application. Generate comprehensive, detailed summaries of your annual fitness activities, providing valuable insights into your year-long progress and achievements.
data-analysis data-visualization python streamlit
Last synced: 22 Aug 2025
https://github.com/njoyedevs/chatgpt3_riskanalyzer
In this project, ChatGPT3 was fine tuned on 9 data series spanning 40 years. This helped train ChatGPT3 to provide a market risk score. To view, visit: https://www.aimarketrisk.com
chatgpt3 data-analysis flask fred-api full-stack-web-development pandas python
Last synced: 22 Aug 2025
https://github.com/hatamiarash7/matlab_advantech_examples
Matlab Examples To Use Advantech DAQ Cards
advantech daq data-acquisition data-analysis data-science datascience matlab
Last synced: 23 Aug 2025
https://github.com/jimbrig/eda
Exploratory Data Analysis R Package and Shiny App
data-analysis data-visualization eda r shiny
Last synced: 03 Jan 2026
https://github.com/quantumudit/thereyougo-store-analysis
This project focuses on scraping all the products and their related info from the "There You Go" website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.
data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping
Last synced: 26 Aug 2025
https://github.com/martial2023/bank-performance-analysis
Analyse de données bancaires du Berka Dataset (1993-1998) pour calculer et visualiser des KPI clés
dashboard data-analysis data-visualization nextjs pandas plotly-express pymongo python recharts-js sqlalchemy
Last synced: 26 Aug 2025
https://github.com/realkarthiknair/data-science-notes
Data science notes and programs
data-analysis data-science data-visualization
Last synced: 27 Aug 2025
https://github.com/eerkela/bertrand
flexible type extensions for pandas
conversions data-analysis data-engineering data-science multiple-dispatch numpy pandas type-checking type-inference types
Last synced: 27 Aug 2025
https://github.com/ziaeemehr/itng_nest
Nest Simulator quick guides and examples, adding new model using NESTML
computational-neuroscience data-analysis nest-simulator neuroscience
Last synced: 30 Aug 2025
https://github.com/ronylpatil/whatsapplib
WhatsApp Group Chat Analysis Python Package.
data-analysis open-source pypi-package python-library python-package
Last synced: 02 Jan 2026
https://github.com/quantumudit/analyzing-goodreads-famous-quotes
This project focuses on scraping famous quotes and their related data from the GoodReads website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.
data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping
Last synced: 31 Aug 2025
https://github.com/viper373/baidutieba
爬取百度贴吧(指定吧名、起始页数/重点页数、日志输出)
baidutieba-crawler bert data-analysis deep-learning python spider
Last synced: 30 Mar 2025
https://github.com/cosmoduende/r-arduino
Interoperability Data-IoT: How to send and receive data and take control of your Arduino, from R. How to establish interoperability between R and Arduino (Data and IoT) using a data flow between the two
arduino arduino-data arduino-dataflow arduino-serial arduino-serial-data arduino-serial-led arduino-uno data-analysis data-arduino data-cleaning data-iot data-visualization interoperability iot-rstudio r-analytics r-data-visualization r-iot rstudio-arduino serial-read serialport
Last synced: 06 Nov 2025
https://github.com/grburgess/gbm_kitty
Database, reduce, and analyze GBM data without having to know anything. Curiosity killed the catalog.
3ml catalogue data-analysis fermi-science grbs pipelines
Last synced: 29 Jun 2025
https://github.com/zelosleone/finncorr
A .NET Core financial analysis tool/API for calculating correlations between time series data with interactive visualizations powered by ML.NET and Plotly.js.
aspnet-core correlation-analysis csv-parser data-analysis dotnet financial-analysis machine-learning ml-net plotly rest-api statistical-analysis swagger time-series visualization
Last synced: 14 Apr 2025
https://github.com/jonperk318/machine-learning-analysis-of-hyperspectral-data
Using Non-negative Matrix Factorization (NMF) and Variational Autoencoder (VAE) machine learning architectures to analyze spatial and spectral features of hyperspectral cathodoluminescence (CL) spectroscopy images taken from hybrid inorganic-organic perovskite material
data-analysis data-science deep-neural-networks explained-variance hybrid-perovskite hyperspectral-image-classification machine-learning matplotlib nmf non-negative-matrix-factorization python pytorch scikit-learn semi-supervised-learning signal-processing solar-energy spectroscopy unsupervised-learning vae variational-autoencoder
Last synced: 28 Jan 2026
https://github.com/lastancientone/amd-vs-nvda
Analyzing 2 technology stocks using Master Analyst Program (MAP).
data data-analysis data-structures data-visualization excel forecasting time-series-analysis
Last synced: 15 May 2025
https://github.com/grburgess/polarpy
Tools for polar
3ml data-analysis grb polarization polarization-data
Last synced: 15 Jul 2025
https://github.com/fernandezfran/exma
A Python library with C extensions to analyze and manipulate molecular dynamics trajectories and electrochemical data
computational-physics data-analysis molecular-dynamics oop python science
Last synced: 16 Jan 2026
https://github.com/louislefevre/sstubs-miner
Data mining and analysis for the ManySStuBs4J dataset.
data-analysis data-mining manysstubs4j-dataset msr
Last synced: 30 Mar 2025
https://github.com/gattiharishkumar/employee-attendance-leaves-analytics-dashboard
This project showcases a Power BI dashboard created to analyze employee attendance and leaves over a three-month period. The data was sourced from Excel datasets available on the Codebasics website.
dashboards data-analysis data-cleaning data-transformation data-visualization power-query-editor powerbi
Last synced: 26 Jan 2026
https://github.com/walidbosso/r_data_mining
Extract knowledge from a data using different techniques, including Association Rules Hierarchical Agglomerative Clustering (HAC) K-means Clustering Decision Trees
association-rule-mining association-rules clustering data-analysis data-mining data-science data-visualization decision-tree-classifier decision-trees exportation extract-data hac hierarchical-clustering k-means k-means-clustering k-means-r r-programming r-studio
Last synced: 23 Mar 2025
https://github.com/gxelab/tutorials
Tutorials of frequently used software packages and libraries in the lab
bioinformatics data-analysis evolution genetics genomics julia python3 r-language statistics visualization
Last synced: 18 Jan 2026
https://github.com/willie-conway/datavista-command-line-application
A robust 🐍Python application for data analysis that provides a wide range of tools for 🔃loading, 🧹cleaning, and 🔃preprocessing data. It includes features for 📈statistical analysis, 👨🏿🔬hypothesis testing, 🦾machine learning, clustering, ⏳time series forecasting, and 📊data visualization, all designed to enhance your analytical workflow.
analytics big-data command-line data-analysis data-cleaning data-driven data-mining data-pipeline data-preprocessing data-science data-scientist data-visualization data-wrangling exploratory-data-analysis machine-learning pandas predictive-analytics python statistics visualization-tools
Last synced: 30 Jun 2025
https://github.com/jshinm/web-scrapper
Web Scrapper used to extract NeuroData github repo stats
Last synced: 04 Apr 2025
https://github.com/quantumudit/analyzing-quotes
This project focuses on scraping all the quotes and their related data from the "Quotes To Scrape" website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.
data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping
Last synced: 15 May 2025
https://github.com/quantumudit/analyzing-yell-cafes
This project focuses on scraping data related to cafes and coffee shops in London, England from the Yellow Pages (Yell.com) website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.
data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping
Last synced: 15 May 2025
https://github.com/quantumudit/analyzing-gamerevolution-games
This project focuses on scraping data related to video games from the GameRevolution website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.
data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping
Last synced: 15 May 2025
https://github.com/a-r-j/npview
CLI tools for quickly inspecting CSV/TSV & NumPy (.npy) array files
cli csv data-analysis inspector npy numpy python tsv
Last synced: 18 Jan 2026
https://github.com/elfgk/ogretmenanalizantalya
OgretmenAnalizAntalya
analysis data-analysis data-science data-visualization ogretmenanaliz
Last synced: 15 Apr 2025
https://github.com/hayesall/babybear
🐼 It's like pandas, but tiny.
data-analysis data-analysis-python data-science dataframe python teaching teaching-tool
Last synced: 04 Mar 2025
https://github.com/shibam120302/black-friday-sales-data-analysis
This repository contain Data Analysis on Black Friday Sales Data using various Regression ML algorithms
data-analysis eda machine-learning python random-forest regression
Last synced: 15 Mar 2025
https://github.com/kalebers/data_streams_parametric_t-sne
Research for Parametric T-SNE in high to low dimensional data stream, published in 2021 by Kalebe Rodrigues Szlachta and Andre de Macedo Wlodkovski, oriented by Jean Paul Barddal, Computer Science graduation from Pontifical Catholic University of Parana (PUCPR)
classifier data-analysis data-science data-visualization machinelearning parametric parametric-tsne python tsne-algorithm tsne-visualization
Last synced: 08 May 2025
https://github.com/fabienarcellier/qjoin
qjoin is a data manipulation library that provides simple and efficient joining and collection processing functionality
composable data-analysis developer-tools functools python
Last synced: 17 Nov 2025
https://github.com/kylekirkby/cardatasnatch
CarDataSnatch allows you to quickly find information about a car in the uk using a valid number plate. Grab an image of the car in question along with a multitude of other data. Compare two cars' data for fast and easy analysis.
beautifulsoup cars command-line-tool data data-analysis data-mining ethical-hacking python python3 requests scraper social-engineering
Last synced: 15 Apr 2025