Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/vandita2020/merra2_nasa_wind_speed_analysis

In this study, we aim to explore the vulnerability of power grids in the south-east region of the USA with the help of data analysis tools and machine learning algorithms

data-analysis data-science machine-learning-algorithms python

Last synced: 11 Jan 2025

https://github.com/al-ghaly/airline-company-data-warehouse

Data Warehouse modeling, design, implementation, and analysis for an Airline Company.

data-analysis data-warehousing database-modeling sql-server

Last synced: 22 Jan 2025

https://github.com/thecoderpinar/hms-brainactivity-analysiss

Welcome to the GitHub repo for "HMS - EEG Exploration & Neurocritical Care Journey"! Explore EEG data, understand wave patterns, and delve into conditions like LPDs, GPDs, LRDA, and GRDA.

critical-care data-analysis data-science data-visualization deep-neural-networks eeg eeg-signals exploratory-data-analysis healthcare medical-research neuroscience signal-processing

Last synced: 16 Dec 2024

https://github.com/rayyan9477/multiple-disease-prediction-system

This repository contains a Multiple Disease Prediction System leveraging machine learning techniques for accurate predictions. It utilizes Python, Pandas, Scikit-learn, and Flask for data preprocessing, model building, and web deployment. Explore the project and connect on LinkedIn for collaborations.

data-analysis data-science machine-learning python streamlit

Last synced: 10 Jan 2025

https://github.com/rayyan9477/youtube-spam-detection-with-flask-and-machine-learning

This is a web application built using Flask that detects spam comments on YouTube using a Naive Bayes classifier. It leverages techniques such as CountVectorizer for feature extraction and scikit-learn for machine learning. The application reads data from a CSV file and predicts whether a comment is spam or not.

data-analysis data-science machine-learning nlp-machine-learning spam-detection

Last synced: 10 Jan 2025

https://github.com/olekscode/covidanalysis

A setup for COVID-19 data analysis in Pharo

coronavirus covid-19 data-analysis pharo

Last synced: 18 Dec 2024

https://github.com/rayyan9477/household-transactions-analysis-and-clustering

This project involves analyzing household transaction data to gain insights into spending patterns and behaviors. The analysis includes data cleaning, exploratory data analysis (EDA), clustering using K-Means, and visualization of customer segments.

customer-segmentation data-analysis data-cleaning data-science exploratory-data-analysis kmeans-clustering machine-learning

Last synced: 10 Jan 2025

https://github.com/alejo1630/chicago_crimes

A Jupyter Notebook with the data analysis and data visualization of crimes in Chicago from 2017 to 2023 using libraries such as seaborn and folium

data-analysis data-visualization folium pandas python seaborn

Last synced: 31 Dec 2024

https://github.com/seabbs/explorebcgonoutcomes

Analysis to explore the association of BCG vaccination and TB outcomes.

bcg data-analysis regression rstats tuberculosis

Last synced: 01 Jan 2025

https://github.com/lobooooooo14/badwords-pt-br

💬 Wordlist com palavrões em pt-BR para análise de dados, filtros, ou texto considerado "evitável"

badword-filter badwords brasil data-analysis filter filter-lists filterlist portugues portuguese text-analysis wordlist

Last synced: 02 Dec 2024

https://github.com/prernarohra/quakeguard

QuakeGuard is an innovative project for reducing earthquake intensity and structural damage. It takes a proactive approach to seismic activity, by using complex algorithms and real-time data to improve safety and resilience for people in earthquake-prone areas.

artificial-intelligence backend data-analysis data-science earthquake-intensity final-year-project front-end geology machine-learning open-source python visualization

Last synced: 23 Jan 2025

https://github.com/evardnk/dataanalyticsportfolio

Собрание моих проектов по аналитике данных

api automation data-analysis etl-pipeline jupyter-notebook jupyterlab kpis numpy pandas pipeline postgresql powerbi python sql visualization

Last synced: 19 Dec 2024

https://github.com/sumidcyber/dataviz-master

This Python application provides a user-friendly interface to load and visualize the contents of a CSV file. Users can choose from various types of graphs and perform analyses on the dataset.

data-analysis data-analysis-project data-analysis-python database databases python python3

Last synced: 22 Jan 2025

https://github.com/msthamizh/airbnb_analysis

Developing a Streamlit application enabling users to explore and analyze Airbnb listing data. This application allows users to interactively visualize geospatial distributions of listings, analyze pricing trends, and explore availability patterns across different locations. Integrates MongoDB Atlas for data storage and PowerBi for advanced insights

data-analysis data-cleaning data-visualization json mongodb pandas-dataframe plotly powerbi python streamlit

Last synced: 18 Dec 2024

https://github.com/thomascenni/anfavea-data-analysis

Data analysis with Pandas and Datapane.

data-analysis datapane pandas

Last synced: 02 Dec 2024

https://github.com/nomadsdev/financial-trend-analyzer

FinancialTrendAnalyzer helps analyze and visualize sales data to uncover financial trends. It uses Python to calculate total sales, track changes, and generate insightful charts for better decision-making.

business-intelligence data-analysis data-visualization financial-analysis matplotlib numpy pandas python revenue-trends sales-data seaborn time-series-analysis

Last synced: 19 Dec 2024

https://github.com/roberto-butti/fit_explorer

FIT File Explorer, in GO Lang

data-analysis fitness geospatial golang

Last synced: 24 Dec 2024

https://github.com/grypesc/graduateadmissions

Visualization, analysis and predictive modeling of a Kaggle graduate admissions dataset.

data-analysis data-mining data-science data-visualization dataset

Last synced: 16 Dec 2024

https://github.com/emredurukn/data-analysis

Example notebooks for analyzing data

data-analysis data-visualization python

Last synced: 10 Jan 2025

https://github.com/ezzz-lui/rsm-evaluationproject

Este repositorio es donde esta documentado nuestro proyecto para RSM por parte de actividad final para el bootcamp Data Analyst

data-analysis python

Last synced: 03 Dec 2024

https://github.com/discdiver/new-belgium-ratings

Find the most popular New Belgium beers of all time!

beautifulsoup data-analysis pandas python seaborn webscraping

Last synced: 10 Jan 2025

https://github.com/jatin-mehra119/bike-rentals-dataset

This repository focuses on optimizing bike rental availability during peak hours and days using machine learning techniques. Leveraging publicly available data from the UCI Machine Learning Repository, it includes scripts for data preprocessing, model training, and visualization, along with detailed observations and results.

data-analysis data-science ensemble-model pandas scikitlearn-machine-learning

Last synced: 17 Jan 2025

https://github.com/jen-uis/la-crime-data-analysis

This repository contains project materials for the Fall 2023 MGT 256 class. This project is completed with assists from Professor Adem Orsdemir.

business-analytics crime-data crime-data-analysis data-analysis knn la-crimes-from-2020 la-safe r r-markdown r-studio report-generation rmd united-states visualization

Last synced: 21 Jan 2025

https://github.com/john-science/data_science_by_example

Examples of Data Science Tools & Libraries

data-analysis data-science ipython pandas

Last synced: 18 Nov 2024

https://github.com/juliusmarkwei/titanic-data-analysis

Data analysis, data visualization, feature scaling, feature transformation, model selection and model optimization.

data-analysis data-science data-visualization linear-regression model-selection regression

Last synced: 01 Jan 2025

https://github.com/juliusmarkwei/iris-dataset-analysis

Data analysis, data visualization and model training using the popular Iris Dataset

data-analysis data-visualisation linear-regression machine-learning

Last synced: 01 Jan 2025

https://github.com/prernarohra/mental-health-prediction

This project focuses on predicting mental health outcomes using machine learning algorithms. By analyzing various psychological, social, and lifestyle factors, the model aims to identify individuals at risk, enabling early intervention and support.

data-analysis data-science data-visualization machine-learning mental-health python

Last synced: 23 Jan 2025

https://github.com/aravind-selvam/bikeshare-company-analysis

Google Data Analytics Professional Certificate program's Capstone project, of a bike sharing company

analytics business-analytics business-intelligence data data-analysis data-visualization dataanalytics google-data-analytics postgresql sql sql-server

Last synced: 14 Jan 2025

https://github.com/noodleslove/house-of-representative-analysis-i

This project uses public data about the stock trades made by members of the US House of Representatives.

data-analysis data-science eda kaggle-dataset matplotlib-pyplot pandas python stocks-trading

Last synced: 28 Jan 2025

https://github.com/walidalsafadi/titanic-disaster

In this challenge, we ask you to build a predictive model that answers the question: “what sorts of people were more likely to survive?” using passenger data (ie name, age, gender, socio-economic class, etc).

data-analysis data-science decision-trees eda gradient-boosting knearest-neighbors machine-learning-algorithms naive-bayes random-forest titanic-kaggle titanic-survival-prediction

Last synced: 22 Jan 2025

https://github.com/shreeparab1890/flipkart-laptops-analysis-eda

This ipython notebook is the Exploratory data analysis (EDA) of the Laptops listed on Flipkart.

data-analysis eda exploratory-data-analysis matplotlib-pyplot numpy pandas plotly

Last synced: 01 Jan 2025

https://github.com/tim-hub/python-course

A new Python Course, a new trial to offer MOOC style learning resources and content for python learners

data-analysis learning python

Last synced: 23 Jan 2025

https://github.com/leosimoes/udacity-starbucks

Project 3 of the Udacity Machine Learning Engineer Nanodegree Program. Data analysis and machine learning application to Starbukcs data.

amazon-sagemaker data-analysis data-science machine-learning python

Last synced: 02 Dec 2024

https://github.com/leosimoes/datascienceacademy-powerbi-3.0

Projetos do curso Microsoft Power BI Para Data Science Versão 3.0 da DataScienceAcademy. Dashboards para diversos casos de negócios.

business-intelligence dashboard data-analysis data-visualization microsoft-power-bi

Last synced: 02 Dec 2024

https://github.com/leosimoes/uerj-tcc-analisador-dados

Trabalho de conclusão de curso (TCC) em engenharia de computação. Aplicativo Web para análise de dados.

data-analysis data-science data-visualization python streamlit

Last synced: 02 Dec 2024

https://github.com/miroslav-reiter/kurz_jazyk_sql_analytici_datovi_vedci

Materiály ku kurzu Jazyk SQL 1 pre Analytikov a Dátových Vedcov

analysis analytics data data-analysis data-science database mysql reiter sql

Last synced: 22 Dec 2024

https://github.com/aritrakar/statpy

A simple package containing some functions for analysing Gaussian and Binomial distributions. Created for the Udacity AWS MLE Foundations 2021 course.

data-analysis python statistics

Last synced: 01 Jan 2025

https://github.com/alejo1630/sport_stats

Data analysis of information from the summer and winter Olympic games over the years. UC Davis SQL Specialization Final Project

data-analysis jupyter-notebook olympics-dataset plotly python seaborn sql

Last synced: 31 Dec 2024

https://github.com/arv-anshul/easy-analysis

A python package to perform Data Analysis easily. (Not Recommended)

arv-dumped data-analysis data-science easy-analysis eda pypi pypi-package python3

Last synced: 25 Dec 2024

https://github.com/sarincr/basics-of-julia-programming-language

Julia is a high-level, high-performance, dynamic programming language. While it is a general purpose language and can be used to write any application, many of its features are well-suited for high-performance numerical analysis and computational science.

data data-analysis data-mining data-science data-visualization dataanalysis dataanalytics datascience julia julia-language julia-library julia-package julialang machine-learning

Last synced: 21 Jan 2025

https://github.com/leosimoes/uerj-tcc-analisador-dados-texto

Texto do trabalho de conclusão de curso (TCC) em engenharia de computação. Aplicativo Web para análise de dados.

data-analysis data-science data-visualization python streamlit

Last synced: 02 Dec 2024

https://github.com/mayankyadav23/air-bnb-data-analysis

Data analysis and insights from NYC Airbnb listings, focusing on key metrics such as host performance, neighborhood trends, pricing, and customer reviews. Comprehensive documentation of ETL processes and analytical methodologies is provided. Perfect for understanding Airbnb dynamics and decision-making in the NYC market.

advanced-excel business-intelligence data-analysis data-analytics data-visualization power-bi ppt

Last synced: 10 Jan 2025

https://github.com/akash1070/project---applied-statistics-

To dive deep into this data & find some valuable insights.

data-analysis data-science python statistics

Last synced: 29 Jan 2025

https://github.com/akash1070/data-science-advanced-analytics-virtual-experience-program

The BCG Open-Access Data Science & Advanced Analytics Virtual Experience Program

data-analysis data-science machine-learning-algorithms

Last synced: 29 Jan 2025

https://github.com/mysftz/deduction-of-a-law

Python files for proof of deduction of a law within oscillations. Proving theoretical law with physical data and comparing both. A supplementary LaTeX document report is linked below.

data-analysis deduction-of-a-law matplotlib python python3 univeristy university-laboratory university-project

Last synced: 26 Dec 2024

https://github.com/ultrasage-danz/weather-data-analysis

Weather Data Analysis notebook project. Created using Google collab

collaboration data-analysis data-science dataset google google-colab-notebook project

Last synced: 02 Dec 2024

https://github.com/colburncodes/se_pudding_2023

This project is a React app designed to showcase research conducted by a team of data scientists and data analysts. The app is utilizing React and React-Chartjs-2

chartjs-2 data-analysis data-science data-visualization react-chartjs-2 reactjs

Last synced: 22 Jan 2025

https://github.com/gracysapra/r-in-data-science

This repository contains essential guides for data analysis using R, covering topics like data preparation, data reshaping, and data visualization. Each file focuses on fundamental techniques to manipulate, clean, and visualize data effectively using R programming.

data-analysis data-preparation data-reshaping data-science data-visualization data-visualizations ggplot r r-for-data-science

Last synced: 15 Dec 2024

https://github.com/lastancientone/amd-vs-nvda

Analyzing 2 technology stocks using Master Analyst Program (MAP).

data data-analysis data-structures data-visualization excel forecasting time-series-analysis

Last synced: 26 Dec 2024

https://github.com/allanotieno254/british-airways-review-dashboard-using-tableau

An interactive dashboard built using Tableau, visualizing British Airways customer review data. This project analyzes customer sentiment, satisfaction levels, and other key metrics to provide insights into passenger experiences and identify improvement areas for British Airways.

customer-satisfaction-analysis data-analysis data-visualisation tableau-dashboards tableau-public

Last synced: 26 Dec 2024

https://github.com/allanotieno254/employee-performance-tracker-excel-

An Excel-based tool to track and evaluate employee performance, compliance, and skills assessments with summary statistics and visual charts

compliance-tracker data-analysis employee-performance-analysis excel human-resources

Last synced: 26 Dec 2024

https://github.com/allanotieno254/road-accident-data-analysis-dashboard-using-excel

This repository contains the Road Accident Data Analysis Dashboard, a comprehensive Excel-based tool designed to provide in-depth analysis and visualization of road accident data.

dashboards-excel data-analysis excel kpi visualization

Last synced: 26 Dec 2024

https://github.com/allanotieno254/codsoft

This repository showcases a series of data science projects completed during an internship with CODESOFT. Each project utilizes Python and various machine learning techniques to solve specific problems in data analysis, classification, regression, and predictive modeling.

classification data-analysis data-science feature-engineering machine-learning model-evaluation predictive-modeling python-programming regression

Last synced: 26 Dec 2024

https://github.com/akash1070/data-science-virtual-internship-by-accenture

data merging and data cleaning in python as well as data visulaisation with dashboard in Tableau.

data-analysis data-cleaning data-science python3 tableau visualization

Last synced: 29 Jan 2025

https://github.com/johnsesana/eda-liquor-sales

Exploratory Data Analysis on Public Datasets

data-analysis data-visualization sql tableau-dashboards

Last synced: 17 Jan 2025

https://github.com/emso-c/stream-analyser

A tool that analyses YouTube live streams.

cli data-analysis guessing highlights python youtube-video

Last synced: 18 Dec 2024

https://github.com/derrickbaruga7/mapping-median-age-europe

An R project that creates an interactive map of the median age across European regions using Eurostat data and spatial visualization packages.

data-analysis data-science data-visualization datascience european-union mapping r

Last synced: 03 Dec 2024

https://github.com/ituvtu/datamining-ab-testing

This project focuses on conducting A/B testing to evaluate the effectiveness of two marketing campaigns. Using statistical analysis and hypothesis testing, we determine which campaign is more effective in improving conversion rates.

a-b-testing data-analysis data-analysis-python data-mining ipynb jupyter jupyter-notebook python

Last synced: 16 Jan 2025

https://github.com/quantumudit/sales-statistical-analysis

This project focuses on a statistical analysis (using SQL queries) of various key metrics that impacts the overall sales of a certain fictitious store.

data-analysis postgresql sales-analysis sql statistics

Last synced: 26 Dec 2024

https://github.com/jmssnr/shuffle-kit

shuffle-kit: model and analyze playing card shuffles in Python

data-analysis playing-cards python shuffle statistics

Last synced: 02 Jan 2025

https://github.com/sevdanurgenc/python-for-data-science-lecture-notes

In this repo, I have the course contents of Python for Data Science training, which will be given to Siemens by the cooperation of Academy Peak Information Technologies Training and Consultancy between 28 June - 1 July 2022.

data-analysis data-mining data-modeling data-science data-structure data-visualization matplotlib-tutorial numpy-tutorial pandas-tutorial

Last synced: 29 Jan 2025

https://github.com/ajwad-shaikh/sristi-sanshodh-collect

SRISTI Sanshodh Collect is an Android app for filling out forms. It's been used to collect billions of data points in challenging environments. Contribute and make the world a better place! ✨📋✨ https://docs.opendatakit.org/collect-…

collect data-analysis data-collection javarosa odk opendatakit

Last synced: 17 Dec 2024

https://github.com/prankshaw/election-analytica

Analyzing previous election results for Haryana Vidhan Sabha and other factors and to compare them with various parameter to conclude results.

anaconda collection data-analysis data-science data-visualization elections jupyter-notebook python python-3 wrangling

Last synced: 03 Dec 2024

https://github.com/sumitkumargiri/machine-learning-project

This repository contain all the best practices for managing Github repository.

data-analysis github machine-learning opensource project python

Last synced: 19 Jan 2025

https://github.com/pseudomanifold/us-inauguration-speeches

Data & feature extraction for U.S. inauguration speeches

data-analysis data-science inauguration politics speech speeches

Last synced: 26 Dec 2024

https://github.com/karthikmprakash/911-call-dataanalysis

Data Analysis of Emergency (911) Calls: Fire, Traffic, EMS for Montgomery County, PA

911-call-analysis data-analysis data-visualization python3 united-states-data

Last synced: 09 Jan 2025

https://github.com/solrikk/pictrace-web

PicTraceV2 is a highly efficient image matching platform that leverages computer vision using OpenCV, deep learning with TensorFlow and the ResNet50 model, asynchronous processing with aiohttp, and Selenium for browser automation. PicTraceV2 allows users to upload images directly or provide URLs, quickly scanning a vast database to find image

automation computer-vision data-analysis data-extraction deep-learning image-processing image-search machine-learning natural-language-processing opencv openpyxl pandas python selenium tensorflow web-scraping yandex yandex-api

Last synced: 09 Jan 2025

https://github.com/antonijn/polyfit

Fits a polygon to a given data input

c data-analysis linear-algebra toy

Last synced: 17 Dec 2024

https://github.com/cjunwon/youtube-data-analysis

End-to-end Youtube data analysis project using Youtube Data API, MySQL, AWS, Flask

aws-rds data-analysis datapipeline flask nlp pandas python shell sql vader-sentiment-analysis youtube youtube-api

Last synced: 15 Dec 2024

https://github.com/0xpr03/clantool

CF Management & Data Analysis Tool, crawler backend in rust

backend-server crawler data-analysis rust

Last synced: 02 Jan 2025

https://github.com/nomadsdev/sys-moninsight

System Monitoring and Analysis Tool is a utility for real-time performance tracking. It logs CPU, memory, and disk usage, provides visual graphs, and offers performance recommendations. Perfect for optimizing system efficiency.

automation cpu-usage data-analysis data-visualization disk-usage matplotlib memory-usage performance-analysis performance-optimization psutil python real-time-monitoring resource-management sys-moninsight system-metrics

Last synced: 02 Jan 2025

https://github.com/vidhi1290/zomato-data-analysis

Zomato Data Analysis - Explore the world of Zomato restaurant data through Python and data analysis. Uncover trends and insights using Pandas for data manipulation and Matplotlib for visualization. Join us in this journey to reveal the hidden stories within the data!

data-analysis data-analysis-python data-science data-visualization dataprocessing machine-learning machine-learning-algorithms matplotlib numpy pandas python scikit-learn zomato-data-analysis

Last synced: 08 Dec 2024

https://github.com/2kabhishek/pyramen

Data Analysis for Ramen 🍜💹

csv data data-analysis fun python report

Last synced: 12 Jan 2025

https://github.com/vidhi1290/machine-learning-pipeline

Explore a collection of Jupyter notebooks that guide you through various stages of the machine learning pipeline. From data analysis and feature engineering to model training and deployment, these notebooks provide practical insights for both beginners and experienced data enthusiasts. Let's dive into the world of data-driven decision-making! 📊🚀"

data-analysis feature-engineering feature-selection jupyter jupyter-notebook machine-learning machine-learning-algorithms machine-learning-pipeline model-training new-dataset opensource python

Last synced: 08 Dec 2024

https://github.com/djo/data-analysis

Data Analysis course notebooks in R

data-analysis r

Last synced: 09 Dec 2024

https://github.com/as16082023/music-store-analysis

This project involves analyzing music store data using SQL queries in MySQL workbench to enhance decision-making, identify trends, and understand customer behavior

data-analysis music-store-analysis mysql sql

Last synced: 23 Dec 2024

https://github.com/arhcoder/base-hackathon-2022

💸 Sistema que analiza las facturas de compra-venta de una empresa de importaciones y exportaciones, y crea una base de conocimiento con la que crea sugerencias de abastecimiento para las empresas clientes de Banco BASE, con el fin de ahorrarles dinero.

algorithms bank companies data-analysis decision-making exportation hackaton importation javascript mysql python suggestions

Last synced: 10 Jan 2025

https://github.com/randomshek/Working-With-Excel

Using Excel Power Query and PowerPivot, reorganise the data into a star schema and showcasing reports that can be created by data analysts using DAX formulae and PowerPivot

data-analysis excel power-pivot power-query

Last synced: 27 Nov 2024

https://github.com/mafda/seattle_airbnb_data_analysis

This repository contains a comprehensive analysis of the Seattle Airbnb dataset, conducted using the CRISP-DM (Cross Industry Standard Process for Data Mining) methodology.

crisp-dm data-analysis data-science jupyter-notebook pandas-python seattle-data

Last synced: 17 Jan 2025

https://github.com/ssreeramj/youtube_channels_analysis

This web app gives a detailed analysis of the videos uploaded in a particular youtube channel.

data-analysis heroku pandas python streamlit youtube

Last synced: 02 Jan 2025

https://github.com/draym/swmanager

Web-app to help you in your daily life raids in SpacesWars thanks to game statistics and data management

dashboard-application data-analysis data-visualization game-data game-utility

Last synced: 10 Jan 2025

https://github.com/shriram-vibhute/digit_classification

This project demonstrates various machine learning techniques for classifying handwritten digits from the MNIST dataset. It covers data preprocessing, model training, evaluation, and advanced classification strategies.

classification data-analysis data-visualization machine-learning matplotlib numpy pandas sk-learn

Last synced: 15 Jan 2025

https://github.com/fatihilhan42/eda-spacex-launches-falcon9-and-falcon-heavy

In this project, we analyze the space flight data of Spacex space research company Falcon 9 rocket.

data-analysis data-science data-visualization eda elonmusk spacex

Last synced: 29 Jan 2025