Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.


Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/jamesnw/wtb-data

Explore beer addition and style info from WhatToBrew.com

data-analysis homebrewing jupyter-notebook python3

Last synced: 27 Jan 2025

https://github.com/iguptashubham/pizzahut-analysis-sql

A dataset well suited to data analysis practice. Pizza Hut data analysis done by Shubham Gupta in MySQL. The dataset was provided by a friend of mine who interned at Pizza Hut, where it was used for training and practice questions. The data does not reveal anything about Pizza Hut and is safe to share.

data-analysis data-analytics database dataset datasets mysql mysql-database pizzahut

Last synced: 14 Jan 2025

https://github.com/mindgamesnl/yanderestats

https://mindgamesnl.github.io/YandereStats/

data-analysis reporting-pipeline yandere yandere-sim

Last synced: 01 Jan 2025

https://github.com/gowthamsundaresan/eigenscan

Block explorer for EigenLayer

crypto data-analysis eigenlayer nextjs web3

Last synced: 08 Jan 2025

https://github.com/shipyardapp/amazonathena-blueprints

Simplified blueprints for building data pipelines with Amazon Athena.

amazon-athena athena cli data-analysis data-engineering data-science elt etl

Last synced: 04 Dec 2024

https://github.com/garciparedes/castile-and-leon-crops

Data analysis of crop area in Castile and León over recent years

castile-and-leon crops data-analysis data-science jupyter jupyter-notebook notebook spain

Last synced: 16 Jan 2025

https://github.com/shadan100/sales-prediction-analysis

The aim is to build a predictive model and find out the sales of each product at a particular store. Using this model, BigMart will try to understand the properties of products and stores which play a key role in increasing sales.

artificial-intelligence data-analysis data-science django django-framework jupyter-notebook machine-learning matplotlib pandas predictive-modeling python sales-prediction

Last synced: 11 Oct 2024

https://github.com/odeyiany2/flit-apprenticeship-data-science-projects

This repo contains all my projects for my FLiT Apprenticeship

data-analysis data-science data-visualization machine-learning sql

Last synced: 02 Jan 2025

https://github.com/shibam120302/heart-disease-data-analysis-by-shibam

You can read more about heart disease statistics and causes for your own understanding. This project covers manual exploratory data analysis.

analysis data-analysis scraper

Last synced: 21 Jan 2025

https://github.com/steciuk/ium-recommendation-system

Evaluation and comparison of three recommendation models for a simulated web shopping service.

data-analysis model-evaluation recomendation-system

Last synced: 06 Jan 2025

https://github.com/husna-poyraz/titanic-machine-learning

Use machine learning to create a model that predicts which passengers survived the Titanic shipwreck.

data data-analysis data-science data-visualization deep-learning machine-learning missing-data outlier-detection python titanic

Last synced: 09 Jan 2025

https://github.com/richardwarepam16/rental_analysis_using_python_and_sql

Maximizing Rental Profits: Data-Driven Strategies for a Movie Rental Store

data-analysis data-analytics python3 rental-management sakila-db sqlite3

Last synced: 22 Jan 2025

https://github.com/mr-vozhyk/karpov.courses-study

Some of the assignments and projects from karpov.courses

airflow data-analysis git python sql statistics

Last synced: 20 Dec 2024

https://github.com/ondrejhruby/countries-of-the-world

Explore global data with this repository, featuring insights, visualizations, and Python code examples on countries worldwide—perfect for enhancing your data analysis and visualization skills.

data-analysis data-science data-visualization geography jupyter-notebook machine-learning matplotlib pandas python statistics

Last synced: 21 Jan 2025

https://github.com/abeltavares/finstockdash

A streamlit web app for retrieving and analyzing financial data for a stock ticker.

dashboard data-analysis finance financial-analysis financial-reporting financial-statements investments python stocks streamlit web-app

Last synced: 05 Jan 2025

https://github.com/priboy313/pandasflow

A set of custom python modules for friendly workflow on pandas

catboost data-analysis data-science pandas phik python scikit-learn shap

Last synced: 21 Dec 2024

https://github.com/x1ao4/doc-merger

Merge two relatively incomplete documents into one complete document via a Python script

data-analysis data-merging document-analysis document-comparison document-processing documents filtering filtering-data merge merge-documents

Last synced: 30 Dec 2024

https://github.com/matthewgrosman/messenger-analytics

Project that ingests Facebook Messenger conversations and generates analytics.

analytics data-analysis excel facebook facebook-messenger java mongodb

Last synced: 08 Nov 2024

https://github.com/unrndm/dataanalysis

Artifacts and solutions for homework in the "Data Analysis" course of the HSE master's program, 2023-2024

2023-2024 data-analysis hse

Last synced: 01 Feb 2025

https://github.com/leocornus/leocornus-visualdata

JavaScript libraries to make data visualization simpler and easier.

data-analysis data-mining data-visualization data-visualization-simpler javascript-library

Last synced: 08 Jan 2025

https://github.com/akarshankapoor7/tensorflow_tutorial

An easy and fast tutorial for TensorFlow, an open-source machine learning framework by Google used for building and training machine learning and deep learning models.

data-analysis data-science deep-learning machine-learning tensorflow

Last synced: 27 Dec 2024

https://github.com/ganeshkumartk/ncov-2019

[EDA] Statistical modelling of the novel coronavirus (nCoV-2019) outbreak

corona data-analysis ncov ncov-2019 statistics wuhan wuhan-coronavirus wuhan-virus

Last synced: 17 Jan 2025

https://github.com/iguptashubham/ott-churn-eda-ml

Understanding why customers discontinue their subscriptions is crucial to optimizing the user experience, reducing churn, and maximizing customer lifetime value. This project uses a machine learning model to predict customer churn.

data-analysis data-analysis-project data-science data-science-portfolio data-science-projects data-visualization machine-learning python

Last synced: 14 Jan 2025

https://github.com/rohithsaji97/open_gate_dip

An automatic gate-opening system with an additional parking system (using Raspberry Pi).

automated data-analysis digital-image-processing opencv python3 raspberry-pi-3 trained-models

Last synced: 29 Dec 2024

https://github.com/ehopperdietzel/billionaires-analysis

Analysis of the number of billionaires per country. Inspired by the article "Russian Billionaires"

bootstrap data-analysis poisson-distribution prediction

Last synced: 30 Oct 2024

https://github.com/jabhij/tableau_dashboards

Brief information about all of my Tableau dashboards, the insights I drew from them, and the outcomes of analyzing those visualizations.

data-analysis data-analytics data-science data-visualization tableau visualisation

Last synced: 17 Jan 2025

https://github.com/jabhij/eda_experiments

In this repo I'll use different types of datasets to explore and implement various Exploratory Data Analysis (EDA) approaches.

ames-housing analysis battery-life blackfriday-analysis data-analysis data-science data-visualization eda matplotlib-pyplot numpy pandas python seaborn visualization zomato-data-analysis

Last synced: 17 Jan 2025

https://github.com/jabhij/fbi_nics-firearm-background-checks

This project is an attempt to showcase the use of guns across the US.

data-analysis data-analytics data-science data-visualization tableau

Last synced: 17 Jan 2025

https://github.com/whis99/userfunnelanalysis

An ecommerce user funnel conversion data analysis with matplotlib & python.

data-analysis data-analysis-python data-analyst data-visualization google-colab jupyter-notebook matplotlib python

Last synced: 13 Jan 2025

https://github.com/giordano-lucas/tesco-extension

Products clustering and interactive visualization

clustering data-analysis data-visualization tesco

Last synced: 02 Jan 2025

https://github.com/muneeb1030/dataannotation

This project streamlines the process of annotating data for machine learning tasks, making it easier and more efficient for teams to create labeled datasets by leveraging Label Studio and Bulk

bulk data-analysis data-annotation label-studio python

Last synced: 11 Jan 2025

https://github.com/harmanveer-2546/supply-chain

Supply chain analytics is a valuable part of data-driven decision-making in various industries such as manufacturing, retail, healthcare, and logistics. It is the process of collecting, analyzing and interpreting data related to the movement of products and services from suppliers to customers.

customer-segmentation-analysis data data-analysis data-cleaning data-insights ggplot2 numpy pandas performance-evaluation predictive-analytics-for-business python risk-assessment sales-analysis statistical-analysis supply-chain tidyverse trend-analysis

Last synced: 11 Jan 2025

https://github.com/flyingfathead/neurograph-framework

A versatile tool for visualizing entropy loss in TensorFlow-based neural network training, providing insightful scatter plots with annotations.

data-analysis data-analysis-python data-visualization entropy graph graphs neural-network neural-networks neural-networks-visualization nn python python3 tensorflow tensorflow2 training visualization visualization-tools

Last synced: 11 Jan 2025

https://github.com/bala-1409/foreign-exchange-rate-time-series-data-science-project

This project will use time series analysis to forecast the exchange rate between the euro and the US dollar. The project will use a variety of statistical techniques, such as ARIMA to model the data and forecast the exchange rate.

data-analysis data-science data-visualization datapreprocessing eda exploratory-data-analysis forecasting machine-learning-algorithms model modelfitting predictive-modeling python3 scikit-learn statsmodels time-series time-series-analysis

Last synced: 27 Jan 2025

https://github.com/kirkalyn13/opensignal_autogenerate_report

Script used to generate a results summary, including the trends of flagged provinces, from the raw Excel data file.

data-analysis data-science data-visualization matplotlib numpy pandas python

Last synced: 16 Jan 2025

https://github.com/numbersprotocol/dyda

Dynamic data pipeline framework

ai artificial-neural-networks data-analysis data-science

Last synced: 27 Dec 2024

https://github.com/sunnybibyan/exploratory-data-analysis-eda

Welcome to the Titanic Dataset - Exploratory Data Analysis (EDA) project repository! This project aims to uncover insights from the Titanic dataset using Python and Jupyter Notebook. By analyzing key variables such as age, gender, and class, we aim to visualize relationships between passenger characteristics and survival rates.

data-analysis data-visualization jupyter-notebook python titanic-dataset

Last synced: 19 Dec 2024

https://github.com/sarincr/data-analytics-with-knime

Data Analytics with KNIME (Konstanz Information Miner), a free and open-source data analytics, reporting and integration platform. KNIME integrates various components for machine learning and data mining through its modular data pipelining concept. A graphical user interface and use of JDBC allows assembly of nodes blending different data sources, including preprocessing (ETL: Extraction, Transformation, Loading), for modeling, data analysis and visualization without, or with only minimal, programming.

ai artificial-intelligence artificial-intelligence-algorithms artificial-neural-networks data-analysis data-mining data-science data-structures data-visualization database datascience deep-learning machine-intelligence machine-learning machine-learning-algorithms machinelearning mining mining-software

Last synced: 21 Jan 2025

https://github.com/sumitgirwal/procoder-public

"ProCoder", which is a web-based application providing massive open online courses for both professionals and students. It aims to offer a platform for learning coding skills online, accessible to anyone who is interested in learning programming or enhancing their coding knowledge. ProCoder provides courses on various programming languages, tools.

blog-platform bootstrap-4 chat-application css3 data-analysis django-crud django-project html5 javascript numpy-library pandas-library python3

Last synced: 06 Jan 2025

https://github.com/adolbyb/data-science-python

An Introduction to Data Science and Data Visualization with the FAU Data Science and Machine Learning Club

data-analysis data-science data-visualization jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 20 Jan 2025

https://github.com/dcs-training/intromachinelearning

This course provides an introduction to machine learning for those with some beginner-level Python skills. Go to the readme file

data-analysis data-wrangling machine-learning python statistics

Last synced: 07 Jan 2025

https://github.com/khuyentran1401/sample_datapane_script

This repo shows how to use Datapane to create a simple script that ranks authors or publications by publishing frequency

data-analysis data-science datapane python

Last synced: 26 Jan 2025

https://github.com/sufiyanahmed4566/sql-musicmaven

"This Music Store Database Project showcases SQL skills through comprehensive database design, query optimization, and data analysis. Includes ER diagram, database file, query questions (Easy, Medium, Hard), answered queries, and CSV table data. Ideal for recruiters seeking skilled SQL developers for music store management and data analysis.

data-analysis database insights mysql-database oracle-database relational-databases sql

Last synced: 24 Jan 2025

https://github.com/haloapping/pisangijo

A collection of Python-based libraries and frameworks for data analysis, data science, machine learning, deep learning, and much more 🐍.

belajar data-analysis data-science deep-learning forecasting libraries machine-learning perkakas pustaka python3 recommender-system referensi tools

Last synced: 06 Jan 2025

https://github.com/karatechop/noaa-storm-database-data-analysis

Analysis of population health and economic consequences of events documented in the U.S. National Oceanic and Atmospheric Administration’s (NOAA) storm database.

data-analysis knitr r rmarkdown

Last synced: 21 Jan 2025

https://github.com/mardavsj/weather-prediction

Weather prediction model which mainly focuses on visualization.

data-analysis data-visualization matplotlib numpy pandas pandas-dataframe

Last synced: 21 Dec 2024

https://github.com/pinedah/escom_development-of-applications-for-data-analysis

This repository is a personal collection of programs, exercises, and notes from the Development of Applications for Data Analysis course at Instituto Politécnico Nacional (IPN). As part of the Bachelor's in Data Science, the course focuses on developing practical skills in Python for data analysis.

data-analysis data-science data-visualization jupyter-notebook python python-data-analysis

Last synced: 19 Dec 2024

https://github.com/valeriopagliarino/electronics-2021-unito-public

Data analysis and simulations for the course "Electronics laboratory" held at Physics Dep. - University of Turin, 2021

data-analysis electronics physics

Last synced: 01 Feb 2025

https://github.com/zpreisler/modules

Python libraries and modules for processing simulation outputs

data-analysis python scripts tensorflow

Last synced: 11 Jan 2025

https://github.com/md-emon-hasan/data-analytics-project

Data analytics tasks and solutions, featuring hands-on exercises for data cleaning, visualization, and analysis using Python libraries.

cars-dataset census-data covid19-data data-analysis london-house-price police-data weather-data

Last synced: 13 Jan 2025

https://github.com/ahmad-ali-rafique/handwritten-digit-recognition-mnist

This project demonstrates a complete pipeline for recognizing handwritten digits using the MNIST dataset. The project is implemented in Python using Jupyter Notebook, and it covers data loading, preprocessing, model training, and performance evaluation of a Fully Connected Neural Network (FCNN).

ai artificial-intelligence data data-analysis datascience deep-learning deep-neural-networks fcnn fully-connected-network machine-learning machine-learning-algorithms ml modeling

Last synced: 16 Jan 2025

https://github.com/jubinjacob03/heartdiseaseclassify-ml

Heart disease dataset analysis and classification using ML models such as linear regression, support vector machines, k-means, k-nearest neighbors, and logistic regression.

data-analysis data-science data-visualization ipython-notebook kaggle-dataset kmeans knn linear-regression logistic-regression machine-learning matplotlib python seaborn support-vector-machine

Last synced: 11 Oct 2024

https://github.com/ahmad-ali-rafique/weather-prediction-fcnn

This project demonstrates a complete pipeline for weather prediction using a Fully Connected Neural Network (FCNN). The project is implemented in Python using Jupyter Notebook, and it covers data loading, preprocessing, model training, and performance evaluation.

ai artificial-intelligence data-analysis data-science deep-learning deep-neural-networks fully-connected-network machine-learning machine-learning-algorithms weather-information

Last synced: 16 Jan 2025

https://github.com/raccoon-hero/gender-equality-tracker

A web application visualizing gender equality metrics with a focus on Ukraine. Built with Flask, it's powered by live data from global open sources, with dynamic research insights and analysis.

chartjs css dashboard data-analysis data-visualization flask frontend gender-equality global-metrics html linked-data openalex opendata python representation semantic-web ukraine webapp wikidata world-bank-api

Last synced: 27 Dec 2024

https://github.com/talha-1010/imdb-data-analysis

A data analysis project made with python using pandas

data-analysis data-visualization jupyter-notebook pandas pandas-dataframe

Last synced: 14 Jan 2025

https://github.com/famarks/grafarg

Grafarg is an interactive data analytics and graphical data visualization application. Grafarg, a progressive fork of Grafana 7.5.17, continues to be available under the open source Apache 2.0 License.

analytics charts data data-analysis data-science data-visualization grafana grafarg graph

Last synced: 18 Dec 2024

https://github.com/rani-sikdar/python-data-structures

A repository for sharing Python implementations of common data structures and algorithms, including arrays, linked lists, stacks, queues, trees, graphs, and sorting/searching techniques. Perfect for learning, revisiting fundamentals, and collaborating on computer science concepts. Contributions welcome! 🚀

data-analysis data-structure data-visualization numpy-library pandas-dataframe python python-3

Last synced: 20 Jan 2025

https://github.com/ysayaovong/portfolio

Explore my portfolio showcasing projects in data engineering, cybersecurity, software development, and cloud computing. Highlights include SQL tutorials, automation tools, cybersecurity assessments, and innovative Python applications. Dive into my work and see my expertise in action.

api-integration automation aws cloud-computing cybersecurity cybersecurity-risk-assessment data-analysis data-engineering data-science database-management etl linux project-management python scripting security-policy software-development sql system-optimization visualization

Last synced: 30 Jan 2025

https://github.com/alfikiafan/air-quality-analysis

This repository contains a comprehensive data analysis project on an air quality dataset, covering the complete process from data gathering and cleaning through exploratory data analysis (EDA) to building a fully interactive dashboard using Streamlit.

air-quality data-analysis dicoding

Last synced: 17 Jan 2025

https://github.com/faezeh-gholamrezaie/visual-google-scholar-search

A Python script that searches Google Scholar for specific keywords and visually presents the results in various chart formats, enabling researchers to analyze trends and insights in academic literature.

academic academic-research academic-trends ai ai-research bibliometrics data-analysis data-visualization google-scholar publication-analysis python research-trends scholarly scholarly-data word-cloud

Last synced: 28 Dec 2024

https://github.com/gattiharishkumar/blinkit-sales-analysis-dashboard

This project presents a comprehensive sales analysis dashboard for Blinkit, an Indian last-minute delivery app. The dashboard was created using Power BI and provides a detailed overview of the company's sales performance across various outlets and product categories.

dashboard data-analysis data-transformation data-visualization ms-excel-data-analytics power-query powerbi powerbi-visuals

Last synced: 29 Dec 2024

https://github.com/noturlee/imdb-dataanalysis

A data model that uses regression techniques to predict the IMDb rating of a movie based on features like genre, director, and actors.

data-analysis data-cleaning data-modeling data-science data-visualization

Last synced: 22 Dec 2024

https://github.com/wiseaidev/truth-guard

Analyzing a 79k Dataset of Misinformation and Fake News

data-analysis fastapi lstm machine-learning python supervised-learning

Last synced: 20 Dec 2024

https://github.com/lightbridge-ks/zoominterface

A Shiny app for analyzing report files from the Zoom program.

data-analysis r shiny-apps zoom-class zoom-meetings

Last synced: 16 Jan 2025

https://github.com/csoren66/diabetics_prediction

Predicting whether a patient has diabetes based on the features provided to a machine learning model.

data-analysis machine-learning python svm

Last synced: 13 Jan 2025

https://github.com/li-pearl/gene-count-normalizer

First step of data wrangling in MERFISH data project

data-analysis merfish merscope python

Last synced: 12 Jan 2025

https://github.com/stastnypremysl/lsql-csv

lsql-csv is a tool for querying small CSV files from a shell with short queries. It makes it possible to work with small CSV files as if they were read-only relational databases. The tool implements a new language, LSQL, similar to SQL and specifically designed for working with CSV files in a shell.

csv data-analysis data-processing haskell language linux-shell lsql lsql-csv new-language query-language relational-database sql unix-command unix-philosophy unix-shell

Last synced: 02 Feb 2025

https://github.com/githubuseraccountamazing/the-amari-project

A project in which I attempted to push some of the limits of Stable Diffusion while collecting some data along the way

ai ai-generated-images bash data-analysis machine-learning stable-diffusion textual-inversion

Last synced: 20 Jan 2025

https://github.com/tddschn/whatsapp-chat-analyze

Command-line tool to generate pretty charts from exported WhatsApp chats

data-analysis data-visualization plotly python whatsapp whatsapp-data

Last synced: 17 Jan 2025

https://github.com/gattiharishkumar/employee-attendance-leaves-analytics-dashboard

This project showcases a Power BI dashboard created to analyze employee attendance and leaves over a three-month period. The data was sourced from Excel datasets available on the Codebasics website.

dashboards data-analysis data-cleaning data-transformation data-visualization power-query-editor powerbi

Last synced: 29 Dec 2024

https://github.com/ibensusan/wine-properties-assessment

Wine Properties Assessment using Microsoft Excel

data-analysis data-visualization excel

Last synced: 14 Dec 2024

https://github.com/JovaniPink/excel-powerbi

The folder of my work with Excel, VBA, and PowerBI for Data Analysis & Visualization.

data-analysis data-visualization dax excel excel-vba power-pivot power-query powerbi vba-macros

Last synced: 27 Nov 2024

https://github.com/michenriksen/inspectra

A simple web app for data inspection.

data-analysis decoding web-tool

Last synced: 14 Jan 2025

https://github.com/vara-co/sql-challenge

EmployeeSQL "Data modeling, data engineering, and data analysis."

data-analysis data-engineering data-modeling databases employee-database erd erdiagram postgres postgresql schema sql tables

Last synced: 02 Feb 2025

https://github.com/gonzalo123/pivot.pandas

Data Analysis with Python. Pivot tables with Pandas

data-analysis jupyter-notebook pandas pivot-tables python

Last synced: 27 Jan 2025

https://github.com/mohammadreza-mohammadi94/data-analysis-and-machine-learning-projects

A comprehensive collection of data analysis and machine learning projects, showcasing techniques and models for various data challenges. Dive in to explore code examples, analyses, and machine learning workflows.

data-analysis data-science dataframes exploratory-data-analysis pandas python scikit-learn visualization

Last synced: 07 Nov 2024

https://github.com/valeriopagliarino/esp2-2021-unito-public

Physics laboratory 2 course (electromagnetism, optics and modern physics)

data-analysis electronics optics physics

Last synced: 01 Feb 2025

https://github.com/vara-co/space-missions

Space Missions Over Time (1957-2022): Successes vs Failures, and Rocket Usage

data-analysis data-analysis-python history matplotlib pandas pandas-python space space-race spaceships team-project

Last synced: 02 Feb 2025

https://github.com/vara-co/pandas-challenge

PyCitySchools - Analysis between budget and academic performance in schools

budget-analysis data-analysis jupiter-notebook pandas-dataframe python school-performances

Last synced: 02 Feb 2025