Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2025-02-07 00:07:24 UTC
- JSON Representation
https://github.com/xxdavid/commit-message-conventions
🔎 Commit message conventions analysis
commit-conventions commit-message data-analysis git statistics
Last synced: 11 Jan 2025
https://github.com/cworld1/da-learning
Some notes and code about CWorld learning Data Analysis
data-analysis data-science jupyter-book jupyter-notebook python r
Last synced: 23 Jan 2025
https://github.com/banyc/dfsql
SQL REPL/lib for Data Frames
cli csv data-analysis jsonl ndjson repl sql
Last synced: 19 Nov 2024
https://github.com/grburgess/polarpy
Tools for polar
3ml data-analysis grb polarization polarization-data
Last synced: 23 Nov 2024
https://github.com/rush-db/rushdb
RushDB is an open-source alternative to Firebase, built on top of Neo4j
app-backend data-analysis data-engineering data-science database docker firebase graphs instant instant-apps javascript neo4j nestjs self-hosted typescript web-development
Last synced: 08 Jan 2025
https://github.com/victor-lis/regression-ai-model-practice
ai data-analysis python regression-model
Last synced: 14 Dec 2024
https://github.com/arjo129/image-sorter
Sort through folders of videos and images. Root out blurred and overexposed images.
computational-photography data-analysis photo-browser photo-gallery photography uwp uwp-apps
Last synced: 06 Jan 2025
https://github.com/burhanahmed1/recipe-recommendor-using-pyspark
A smart recipe recommendation system that suggests recipes based on ingredient similarities. This project is done in PySpark
data-analysis data-science datawrangling education learning-python machine-learning machine-learning-algorithms nltk-python numpy pandas pyspark python python-project reccomendersystem recommendation-system
Last synced: 06 Jan 2025
https://github.com/henrylin03/video-games
Using Python and SQL to clean, analyse and visualise video games' data from Metacritic. Includes scraping using BeautifulSoup.
analysis beautifulsoup beautifulsoup4 data data-analysis data-science eda jupyter-notebook pandas python sql sqlite3 video-game video-games
Last synced: 14 Jan 2025
https://github.com/mirokeimioniemi/optimizing-insulin-injection-timing
Data processing and analysis for "Determining the optimal timing for insulin injection to minimize glucose level variability after a meal in ideal conditions" - a research project for the IB Standard Level Mathematics Analysis and Approaches course inspired by my type 1 diabetes.
cgm data-analysis data-science dexcom dexcom-g6 diabetes exploration ib insulin insulin-timing international-baccalaureate mathematics optimization python type-1-diabetes
Last synced: 10 Jan 2025
https://github.com/quantumudit/thereyougo-store-analysis
This project focuses on scraping all the products and their related info from the "There You Go" website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.
data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping
Last synced: 26 Dec 2024
https://github.com/shlokashah/student-depression-and-suicide-rate-prediction
https://shlokashah.github.io/Student-Depression-And-Suicide-Rate-Prediction/
data-analysis data-visualization machine-learning student suicide-rate-prediction
Last synced: 28 Dec 2024
https://github.com/mindful-ai-assistants/movierevenueanalysis
🎬💰 Analyze movie companies' revenue, release strategies, and financial performance using statistical techniques for actionable insights. This project explores data on total revenue, number of releases, and lifetime gross to uncover patterns that can drive strategic decisions in the film industry.
correlation-analysis data-analysis data-science heatmap jupyter-notebook open-source python statistical-analysis statistical-analysis-and-hypothesis-testing statistics ttest
Last synced: 22 Jan 2025
https://github.com/simranjeet97/ipl-dataanalysis
Data Analysis performed on IPL Dataset with Data Profiling, Data Pre-Processing, Data Manipulation, and Data Visualization.
artificial-intelligence data-analysis data-manipulation data-mining data-preprocessing data-science data-visualization indian-premier-league-2008-2018 ipl ipl-dataset iplayer python
Last synced: 14 Jan 2025
https://github.com/justsecret123/twitter-sentiment-analysis
A sentiment analysis model trained with Kaggle GPU on 1.6M examples, used to make inferences on 220k tweets about Messi and draw insights from their results.
classification data-analysis data-science deep-learning deep-neural-networks docker glove-embeddings kaggle lstm lstm-neural-networks machine-learning natural-language-processing nlp python rnn scikit-learn sentiment-analysis sentiment-classification tensorflow word-embeddings
Last synced: 05 Nov 2024
https://github.com/quantumudit/analyzing-yell-cafes
This project focuses on scraping data related to cafes and coffee shops in London, England from the Yellow Pages (Yell.com) website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.
data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping
Last synced: 26 Dec 2024
https://github.com/quantumudit/python-projects
Consists of various projects that are primarily powered by Python
data-analysis data-science data-visualization jupyter-notebook projects python pythonapplication pythonprojects
Last synced: 06 Nov 2024
https://github.com/llnl/hdtopology
High-dimensional topological data analysis library for NDDAV
analysis cpp data-analysis data-viz high-dimensional-data topological-data-analysis visualization
Last synced: 11 Nov 2024
https://github.com/shibam120302/black-friday-sales-data-analysis
This repository contain Data Analysis on Black Friday Sales Data using various Regression ML algorithms
data-analysis eda machine-learning python random-forest regression
Last synced: 21 Jan 2025
https://github.com/shlokashah/ipl-data-analysis
Data Analysis and Visualizations done on IPL dataset
data-analysis data-visualization pandas powerbi
Last synced: 28 Dec 2024
https://github.com/ikanurfitriani/project-data-analysis-python
This repository contains the results of data analysis learning using the Python.
data-analysis data-analysis-project data-analysis-python python
Last synced: 26 Jan 2025
https://github.com/chaganti-reddy/evmarket-india
Electric Vehicle Market Segmentation Analysis in India
data-analysis data-science machine-learning market-segmentation pandas python
Last synced: 20 Jan 2025
https://github.com/kaustubhgupta/data-analysis-hub
This is where all my Data Analysis notebooks are present. All the notebooks are either fully explored and have an explanatory readme or a medium article has been published which is linked in the README.
data-analysis data-science google-play-store kaggle matplotlib pandas seaborn
Last synced: 27 Jan 2025
https://github.com/kalebers/data_streams_parametric_t-sne
Research for Parametric T-SNE in high to low dimensional data stream, published in 2021 by Kalebe Rodrigues Szlachta and Andre de Macedo Wlodkowski, oriented by Jean Paul Barddal, Computer Science graduation from Pontifical Catholic University of Parana (PUCPR)
classifier data-analysis data-science data-visualization machinelearning parametric parametric-tsne python tsne-algorithm tsne-visualization
Last synced: 21 Jan 2025
https://github.com/aravind-selvam/covid_dashboard
With Covid death and vaccine data. I have created a dashboard.
covid-19 data-analysis data-science data-visualization tableau tableau-public visualization
Last synced: 14 Jan 2025
https://github.com/idaraabasiudoh/vehicle-co2emission_model
Predicts CO2 emissions from vehicle fuel consumption using a multiple linear regression model trained on sklearn, based on a dataset of engine sizes and corresponding CO2 emissions in Canada.
data-analysis jupyter-notebook machine-learning python3 scikit-learn
Last synced: 14 Jan 2025
https://github.com/iantomasinicola/portfoliodataanalyst
Progetto di Data analysis con Python, Microsoft Sql Server e Excel
data-analysis excel python sql
Last synced: 02 Jan 2025
https://github.com/hatamiarash7/matlab_advantech_examples
Matlab Examples To Use Advantech DAQ Cards
advantech daq data-acquisition data-analysis data-science datascience matlab
Last synced: 20 Dec 2024
https://github.com/saranshbansal/spam-detection-analytics-tool
This is a nice tool to read chunks of sms data from a csv and understand how different algorithms (pre-implemented) perform in identifying spam messages.
analytics data-analysis data-science data-visualization mysql spring-boot
Last synced: 05 Jan 2025
https://github.com/techytushar/india-odi-analysis
Analysis of ODI cricket matches of Indian Team
cricket data-analysis data-science pandas plotting python3
Last synced: 07 Jan 2025
https://github.com/vishnu-t-r/data-analytics-portfolio-projects
This repository contain data analyst portfolio projects developed using various data analytics tools including SQL, Python, Tableau, Looker etc.
data data-analysis data-cleaning data-modeling data-visualization looker looker-studio python sql ssms tableau
Last synced: 10 Nov 2024
https://github.com/ac-gomes/data-engineering-with-databricks
A simple boilerplate for data engineering and data analysis training in Databricks.
data-analysis data-engineering databricks databricks-notebooks pyspark python unit-testing
Last synced: 09 Nov 2024
https://github.com/andreaschandra/who-suicides-statistics
Exploratory Data Analysis for Suicides using Python
data-analysis data-science eda python
Last synced: 19 Dec 2024
https://github.com/trybnetic/tu7-acceleration-sleep-wake-classification
Supporting material for the paper ''Discrimination of sleep and wake periods from a hip-worn raw acceleration sensor using recurrent neural networks''
accelerometer accelerometry actigraphy data-analysis sensors sleep
Last synced: 15 Jan 2025
https://github.com/yeisonmontoya1815/machine-learning_prediction_can_inflation
we aim to predict trends in the Canadian market basket using sentiment analysis techniques. Sentiment analysis involves analyzing text data to determine the sentiment expressed, whether positive, negative, or neutral.
algorithms-and-data-structures data data-analysis data-science data-visualization feature-engineering machine-learning matplotlib-pyplot numerical-analysis numpy pandas pipelines python sklearn structured-data super unsupervised-learning
Last synced: 06 Dec 2024
https://github.com/1ayanabil1/healthcare-machine-learning
Explore our open-source repository focused on healthcare machine learning. We've developed predictive models for cardiovascular disease, diabetes, breast cancer, and more. Our projects employ diverse machine learning algorithms and data science techniques, enhancing early detection, diagnosis, and patient outcomes.
data-analysis data-science deep-learning disease disease-detection disease-modeling disease-prediction eda healthcare-application heathcare jupyter-notebook machine-learning machine-learning-algorithms machinelearning-python python
Last synced: 07 Jan 2025
https://github.com/c0deta1ker/matbase
MatBase provides access to an extensive database of material parameters, inelastic mean free paths (IMFP), photoionization binding energies, cross sections, and asymmetry parameters. Additionally, MatBase includes a suite of functions for users to load, process, model and fit their own data, making it an indispensable tool in the field.
cross-sections crystal-structure crystallography data-analysis data-fitting database electron imfp imfp-calculator-matlab material material-database matlab matlab-application matlab-gui matlab-toolbox pes-modelling photoelectron-spectroscopy photoionization simulation xps
Last synced: 30 Nov 2024
https://github.com/i4ds/ecallisto_ng
Ecallisto NG is a Python package tailored for interacting with Ecallisto data.
data-analysis data-visualization e-callisto ecallisto-international-network numpy pandas python spectrometer
Last synced: 09 Nov 2024
https://github.com/realkarthiknair/data-science-notes
Data science notes and programs
data-analysis data-science data-visualization
Last synced: 21 Dec 2024
https://github.com/emaasit/pydata-book
Learning data analysis with python
data-analysis jupyter pandas python
Last synced: 21 Nov 2024
https://github.com/ahmednasef3/store-sales-full-eda
Simple EDA for Store Sales.
data-analysis data-visualization eda exploratory-data-analysis matplotlib pandas plotly seaborn store
Last synced: 15 Jan 2025
https://github.com/ahmednasef3/titanic-full-eda
Simple EDA for Titanic Dataset.
data-analysis data-visualization eda exploratory-data-analysis matplotlib pandas seaborn titanic titanic-data-analytics
Last synced: 15 Jan 2025
https://github.com/kylejgillett/stevepy
A Space Weather data analysis tool for Python.
astronomy aurora data-analysis physics python space-weather space-weather-research
Last synced: 27 Jan 2025
https://github.com/invictusaman/socioeconomic-indicators-in-chicago-sql-python
This project displays how to create a database connection in notebook, update database using python and how to run Python program and SQL queries together. It uses SQLite and Chicago dataset for analysis.
data-analysis jupyter-notebook python sql sql-queries sqlite
Last synced: 12 Oct 2024
https://github.com/geraldohomero/dh-youtube-research
Digital Humanities Youtube Research Database (Research Proposal)
data-analysis data-mining digital-history digital-humanities python research-data-management research-tool youtube-api-v3
Last synced: 15 Jan 2025
https://github.com/ivanildobarauna-dev/data-pipeline-sync-ingest
ETL Process for Currency Quotes Data" project is a complete solution dedicated to extracting, transforming and loading (ETL) currency quote data. This project uses several advanced techniques and architectures to ensure the efficiency and robustness of the ETL process.
business-intelligence data-analysis data-analytics data-engineering data-pipeline data-visualization etl-pipeline python
Last synced: 26 Jan 2025
https://github.com/nikoshet/exploratory-data-analysis-using-r
Exploratory Data Analysis using R Course Project for M.Sc. 'Data Science and Machine Learning' in NTUA
data data-analysis data-science eda exploratory-data-analysis ggplot2 r
Last synced: 03 Jan 2025
https://github.com/openpmd/openpmd-ccd
A Python Module & LabView Bindings for Storing CCD Images with openPMD
ccd data-analysis database hdf5 open-data open-science openpmd
Last synced: 04 Jan 2025
https://github.com/dcs-training/from-spss-to-r-how-to-make-your-statistical-analysis-reproducible
Comfortable/aware of how to run your stats in SPSS? Curious to learn how to run them in R? You've come to the right place. Go to the readme file
data-analysis data-visualisation data-wrangling good-practices-digital-research r rmarkdown spss statistics
Last synced: 10 Nov 2024
https://github.com/dcs-training/effectivedatavisualisation
This repository hosts the material connected to a training course developed by Dave Elsmore (Edina) for CDCS on good data visualisation. Go to the readme file
data-analysis data-visualisation data-wrangling python
Last synced: 10 Nov 2024
https://github.com/rubydamodar/the-ultimate-pandas-bootcamp
Welcome to the Pandas for Data Science repository! This course is designed to take you from beginner to proficient in using Pandas, the powerful data manipulation library in Python. Whether you're just starting your data science journey or looking to sharpen your skills, this repository contains all the resources
beginner-friendly csv-data data-analysis data-cleaning data-manipulation data-science data-visualization dataframe exploratory-data-analysis jupyter-notebook machine-learning matplotlib numpy pandas python python-pandas series statistical-analysis time-series titanic-dataset
Last synced: 18 Oct 2024
https://github.com/dcs-training/bayesian-statistics
Materials for the CDCS Introduction to Bayesian Statistics course. Go to the readme file
bayesian-statistics data-analysis r statistics
Last synced: 10 Nov 2024
https://github.com/jhrcook/100daysofpython
100 days, at least 1 hour a day, of learning the Python programming language.
100-days-of-code 100daysofcode continued-learning data-analysis data-science decision-trees deep-learning keras keras-tensorflow machine-learning neural-network neural-networks plots python python3 scikit-learn tensorflow
Last synced: 13 Nov 2024
https://github.com/dcs-training/r-qgisintegratingspatialanalysis
This was an intermediate course of three sessions with a focus on developing skills in data visualisation, analysis and integration using both R studio and QGIS. Go to the readme file
data-analysis data-visualisation data-wrangling gis qgis r spatial-analysis
Last synced: 10 Nov 2024
https://github.com/revogati/ecommerce_consumer_behaviour
This is a Full Data Analytics project From data cleaning, preparation, exploration, Interpretation of insights up to Presentation of findings and recommendations..
data-analysis data-exploration ecommerce jupyter-notebook python sql tableau-public visualization
Last synced: 07 Jan 2025
https://github.com/ultralytics/sandd
data-analysis data-science neutrino particle-physics
Last synced: 10 Nov 2024
https://github.com/arnabsaha7/customer-churn_prediction---analysis
Predict customer churn using machine learning. This project employs a RandomForestClassifier to analyze customer data and determine the likelihood of churn. Explore the Jupyter Notebook for insights into the data and model, and contribute to the project's development.
customer-churn-prediction data-analysis machine-learning
Last synced: 12 Jan 2025
https://github.com/jpcadena/classification-tweets-national-security-ecuador
Classification of Tweets about national security at Ecuador 2022
classification classification-model data-analysis data-science ecuador insecurity machine-learning natural-language-processing nlp nltk numpy pandas python pytorch scikit-learn snscrape supervised-learning tensorflow tweet twitter
Last synced: 15 Jan 2025
https://github.com/josechirif/reviews-and-satisfaction-analysis-of-airbnb-brazil-and-mexico-from-june-2010-to-february-2021
This project analyzes the reviews and satisfaction of customers who used AirBnB services. It also studies if there is a relationship between another variables.
data data-analysis data-visualization powerbi sql-server
Last synced: 23 Oct 2024
https://github.com/cosmoduende/r-arduino
Interoperability Data-IoT: How to send and receive data and take control of your Arduino, from R. How to establish interoperability between R and Arduino (Data and IoT) using a data flow between the two
arduino arduino-data arduino-dataflow arduino-serial arduino-serial-data arduino-serial-led arduino-uno data-analysis data-arduino data-cleaning data-iot data-visualization interoperability iot-rstudio r-analytics r-data-visualization r-iot rstudio-arduino serial-read serialport
Last synced: 27 Dec 2024
https://github.com/pitmonticone/covid-italy
References for COVID-19 situation in Italy.
coronavirus covid-19 covid-19-italy data data-analysis documentation testing
Last synced: 22 Jan 2025
https://github.com/marios-mamalis/mca-visualisation
A script for automatic visualisation of Multiple Correspondence Analysis (MCA) results from FactoMineR in 3 dimensions using Plotly (exported as html)
3d-scatterplots correspondence-analysis data-analysis factominer html mca multiple-correspondence-analysis plotly visualisation
Last synced: 13 Jan 2025
https://github.com/simoneas02/data-science
🐍 A planning study to become a data scientist and to improve my current skills. 🤘🏼🌻
data data-analysis data-science data-visualization deep-learning machine-learning pandas python3 r sql
Last synced: 01 Feb 2025
https://github.com/muzammil-13/mimlrepo
Data Analysis using Python Machine Learning Libraries
data-analysis data-science machine-learning numpy pandas python python-library
Last synced: 16 Jan 2025
https://github.com/busraozdemir0/datascienceproject
Youtube Trend Video İstatistiklerinin Analizi
classification-algorithm data-analysis data-analysis-python data-science jupyter-notebook linear-regression-algorithm lineer-regresyon machine-learning machine-learning-algorithms matplotlib nonlinear-regression numpy pandas python seaborn unsupervised-learning
Last synced: 02 Feb 2025
https://github.com/akshat0427/spotify_history
code to find out some insights in spotify streaming data (work in progress)
data-analysis data-visualization
Last synced: 31 Jan 2025
https://github.com/fbecerra/fbecerra.github.io
Source code for my website www.fernandobecerra.com
data-analysis data-science data-visualization dataviz interactive-visualizations
Last synced: 27 Oct 2024
https://github.com/rawsashimi1604/jobextract
Scrapes LinkedIn data. Conducts sentiment analysis on what traits and qualifications employers are looking for.
data data-analysis data-analytics data-cleaning linkedin mvc python webscraper
Last synced: 27 Dec 2024
https://github.com/omarsar/energy_stats
Analyzing energy production with Kibana Lens
data-analysis data-science data-visualization elasticsearch kibana
Last synced: 23 Jan 2025
https://github.com/cbg-ethz/scdna-pipe
Python data analysis pipeline for single cell copy number event history reconstruction
bioinformatics bioinformatics-pipeline data-analysis genomics python snakemake snakemake-workflows workflow
Last synced: 28 Jan 2025
https://github.com/parisaroozgarian/ibm-data-analyst-professional-certificate
The IBM Data Analyst Professional Certificate, consisting of 9 courses, equips with essential skills in Excel, SQL, Python, data visualization, and analysis techniques
big-data business-analysis business-communication communication data-analysis data-management data-structures data-visualization databases general-statistics human-resources planning python-programming spreedsheet sql
Last synced: 19 Dec 2024
https://github.com/ivanildobarauna-dev/data-consumer-api
ETL Process for Currency Quotes Data" project is a complete solution dedicated to extracting, transforming and loading (ETL) currency quote data. This project uses several advanced techniques and architectures to ensure the efficiency and robustness of the ETL process.
business-intelligence data-analysis data-analytics data-engineering data-pipeline data-visualization etl-pipeline python
Last synced: 19 Dec 2024
https://github.com/eerkela/bertrand
flexible type extensions for pandas
conversions data-analysis data-engineering data-science multiple-dispatch numpy pandas type-checking type-inference types
Last synced: 30 Oct 2024
https://github.com/leandronasx/agro-data
Projeto final da formação de analista de dados e dashboard da SoulCode Academy.
bigquery data-analysis gcp looker pandas powerbi python
Last synced: 12 Oct 2024
https://github.com/saadarazzaq/excel-merger
Merge multiple Excel and CSV files into a single dataset with the Excel Merger Streamlit app. 📊🔄🚀
data-analysis excel pandas python streamlit-webapp
Last synced: 23 Jan 2025
https://github.com/michaelcurrin/water-crisis-scraper
Scrape and explore data related to Cape Town's water crisis (Python3 application)
cape-town cron csv dam-levels data-analysis html open-data python3 schedule scraping south-africa water-crisis water-level webscraping
Last synced: 28 Oct 2024
https://github.com/ronylpatil/whatsapplib
WhatsApp Group Chat Analysis Python Package.
data-analysis open-source pypi-package python-library python-package
Last synced: 21 Jan 2025
https://github.com/tushar2704/stats-mosaic-streamlit
Stats-Mosaic-Streamlit is a comprehensive GitHub repository that aims to provide a growing collection of curated content and projects centered around statistics and its intersection with data science, machine learning, and artificial intelligence.
artificial-intelligence bivariate-analysis data-analysis data-science hypothesis-testing machine-learning statistical-learning statistics streamlit streamlit-tushar2704 univariate-analysis
Last synced: 27 Dec 2024
https://github.com/coumbacoulibaly/adventureworkscycles
Repository for Adventure Works Sample Database Analysis
adventureworks data-analysis data-analytics mssql-database mssqlserver sql ssms
Last synced: 17 Nov 2024
https://github.com/asifdotexe/covidporfolioproject
This is a SQL + Tableau Project on real world Covid 19 Dataset from the start of recorded case to 2nd March 2022 i.e My birthday XD
dashboard data-analysis data-exploration data-visualization sql sql-server tableau
Last synced: 15 Jan 2025
https://github.com/asifdotexe/timeseriesanalysis
This repository serves as a central hub for all of my projects related to time series analysis. Here, you'll find a collection of projects, code samples, and resources that explore various aspects of time series data and its analysis.
data-analysis feature-engineering jupyter-notebook pandas python time-series-analysis visualization
Last synced: 15 Jan 2025
https://github.com/tathithienthanh/dataanalysis_diagnosis-of-diabetes-based-on-data-set-of-blood-test-result
Implement all learned knowledge about data analysis and data mining to make a complete project about Diagnosis of diabetes based on data set of blood test result
blood-test classification clustering data-analysis data-processing decision-tree diabetes-prediction diagnosis exercise google-colab health hierarchical ipynb kmeans knn knns py python smote-sampling visualization
Last synced: 04 Feb 2025
https://github.com/jesussantana/data-science-with-python-it-academy
Learn how to extract value from data by ingesting, transforming, storing, analyzing, and visualizing data
classification-model clustering-methods dash data-analysis data-mining data-science database machine-learning matplotlib mongodb numpy pandas plotly python3 regression-models seaborn sklearn sql web-scraping
Last synced: 12 Nov 2024
https://github.com/skylord0001/python-daily
Python - Basic, Apache - Conf, Black Stack Hub, Data analysis, Data Structure, Google Cloud, SQL system
apache-configuration data-analysis data-structure python-scripts python-sql
Last synced: 23 Nov 2024
https://github.com/a-r-j/npview
CLI tools for quickly inspecting CSV/TSV & NumPy (.npy) array files
cli csv data-analysis inspector npy numpy python tsv
Last synced: 17 Dec 2024
https://github.com/elfgk/diabetes-data-analysis
diabetes data analysis
analysis data-analysis diabetes-data-analysis eda jupiter-notebook
Last synced: 04 Jan 2025
https://github.com/praju-1/pandas
The library is widely used in data science and machine learning for data cleaning, preparation, and analysis.
Last synced: 08 Feb 2025
https://github.com/tushar2704/store-demand-forecasting
This project predicts the sales demand for various items in different stores based on historical sales data. The objective is to develop a machine learning model that can provide accurate forecasts for future sales of each store-item combination.
artifi data-analysis data-science python sales-analysis sales-forecasting tushar2704
Last synced: 27 Dec 2024
https://github.com/0mppula/element-compare
A single page Next.js 14 app that allows the user to inspect and compare elements from the periodic table.
compare dark-mode data-analysis elements inspect lucide-react nextjs periodic-table reactjs server-side-rendering shadcn-ui single-page-app typescript vercel zustand
Last synced: 11 Nov 2024
https://github.com/bradleyboehmke/uc-bana-6043
Additional resources for the UC BANA 6043 Statistical Computing course
data-analysis data-science data-visualization python
Last synced: 14 Oct 2024
https://github.com/1994nikunj/nlp-toolkit-desktop-app
The code is a collection of NLP analyses, including text cleaning, most common words, n-grams generation, co-occurrence matrix generation, wordcloud generation, topic modeling (using Latent Dirichlet Allocation), and general text statistics.
data-analysis n-grams network-visualization nlp python text-cleaning topic-modeling wordcloud-generator
Last synced: 25 Nov 2024
https://github.com/ashwinpn/visualization
Data Visualization using Matplotlib, Pandas Visualization, Seaborn, ggplot, and Plotly.
analysis data data-analysis data-science data-visualization graphs plots python python3 visualization
Last synced: 16 Jan 2025
https://github.com/walidbosso/r_data_mining
Extract knowledge from a data using different techniques, including Association Rules Hierarchical Agglomerative Clustering (HAC) K-means Clustering Decision Trees
association-rule-mining association-rules clustering data-analysis data-mining data-science data-visualization decision-tree-classifier decision-trees exportation extract-data hac hierarchical-clustering k-means k-means-clustering k-means-r r-programming r-studio
Last synced: 28 Jan 2025
https://github.com/fabienarcellier/qjoin
qjoin is a data manipulation library that provides simple and efficient joining and collection processing functionality
composable data-analysis developer-tools functools python
Last synced: 28 Nov 2024
https://github.com/w-edward/youtube-keyword-popularity-analyzer
An effort to discover the top trending keywords on Youtube.
data-analysis node-js numpy python webscraping youtube-api
Last synced: 16 Jan 2025
https://github.com/prakhar-ff13/yellow-taxi-demand-prediction
Predicting Taxi Demand in various regions of New York City
data-analysis data-analytics data-science data-visualization machine-learning python3 time-series
Last synced: 28 Jan 2025
https://github.com/aad99bxp/whatsapp-chat-analyzer
A project intended for Business Owners / Managers to analyze Whatsapp chats between their customer care executives and their customers.
data-analysis heroku-deployment python3
Last synced: 22 Jan 2025
https://github.com/alieymsxxn/sql_project_data_job_analysis
This project explores top-paying jobs, in-demand skills, and where high demand meets high salary in data analytics.
data-analysis postgresql sql sqlite
Last synced: 28 Nov 2024
https://github.com/BigBangData/TimesheetAnalysis
R shiny app to help analyze a bookkeeper's business - or anyone with a timesheet and some time.
bookkeeping data-analysis data-viz r-programming shiny-apps shiny-r timesheet-management
Last synced: 04 Dec 2024