Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/supertetelman/kaggle-public

A collection of Python and Matlab projects aimed at utilizing various machine learning techniques to solve big data problems.

cnn data-analysis deep-learning machine-learning matlab python

Last synced: 28 Jan 2025

https://github.com/lacerbi/vbmc

Variational Bayesian Monte Carlo (VBMC) algorithm for posterior and model inference in MATLAB (old location)

bayesian-inference data-analysis gaussian-processes machine-learning matlab variational-inference

Last synced: 05 Feb 2025

https://github.com/hvignolo87/ortex-programming-challenge

Coding challenges required for the Python Developer and Data Engineer job positions.

challenge data-analysis finance pandas python scripting sql sqlalchemy

Last synced: 02 Jan 2025

https://github.com/cego669/datathonengopevi

Team: Embrapeiros. Proposed solution for the Datathon of the VI ENGOPE (Encontro Goiano de Probabilidade e Estatística, the Goiás Meeting on Probability and Statistics). Note: WE WERE THE CHAMPIONS!

data-analysis data-science datathon python r streamlit xgboost-classifier

Last synced: 08 Dec 2024

https://github.com/shivamswarnkar/tesla-stock-prediction

Predicting the closing prices of Tesla stock using different regression methods.

data-analysis data-visualization plotly regression regularization sklearn stock-price-prediction

Last synced: 26 Jan 2025

https://github.com/invictusaman/socioeconomic-indicators-in-chicago-sql-python

This project shows how to create a database connection in a notebook, update the database using Python, and run Python code and SQL queries together. It uses SQLite and a Chicago dataset for analysis.

data-analysis jupyter-notebook python sql sql-queries sqlite

Last synced: 16 Feb 2025

https://github.com/rapidsurveys/oldr

An Implementation of the Rapid Assessment Method for Older People (RAM-OP)

assessment data-analysis epidata estimate odk older-people r ram-op ranalyticflow rapid-assessment

Last synced: 16 Feb 2025

https://github.com/olow304/goboard

Python Data Analysis Dashboard using Public Dataset, Django

dashboard dashboard-templates data-analysis data-science django jupyter-notebook machine-learning python sklearn

Last synced: 04 Jan 2025

https://github.com/mynenik/xyplot-32

Extensible Plotting and Data Analysis Program for 32-bit x86 GNU/Linux

cpp data-analysis data-manipulation data-visualization forth linux-app motif xwindows

Last synced: 24 Jan 2025

https://github.com/c0deta1ker/matbase

MatBase provides access to an extensive database of material parameters, inelastic mean free paths (IMFP), photoionization binding energies, cross sections, and asymmetry parameters. Additionally, MatBase includes a suite of functions for users to load, process, model and fit their own data, making it an indispensable tool in the field.

cross-sections crystal-structure crystallography data-analysis data-fitting database electron imfp imfp-calculator-matlab material material-database matlab matlab-application matlab-gui matlab-toolbox pes-modelling photoelectron-spectroscopy photoionization simulation xps

Last synced: 30 Nov 2024

https://github.com/deep-diver/data-analysis-on-titanic

Applying data analysis to a Titanic data sheet

data-analysis titanic-data

Last synced: 05 Feb 2025

https://github.com/freekatz/english-reading

Postgraduate entrance exam English (10-19) dataset and related data analysis

data-analysis dataset

Last synced: 02 Feb 2025

https://github.com/nafiealhilaly/analyze-coderhub-sa

A simple web app to analyze and explore coderhub.sa API data. This project was my first real React app.

backend data-analysis eda frontend python react reactjs

Last synced: 08 Feb 2025

https://github.com/kenvilar/data-analysis-using-python

Transforming location descriptions from analyzed CSV file data using Pandas with Python 3

bs4 data-analysis jupyter pandas python python3 requests xlrd

Last synced: 18 Nov 2024

https://github.com/deep-diver/enron-data-analysis

Data Analysis and Machine Learning on Enron Data

data-analysis enron-data exploratory-data-analysis machine-learning

Last synced: 05 Feb 2025

https://github.com/frikishaan/browsing-history-analysis

This is a data analysis of my browsing history for the last 7 months.

browsing-history data-analysis jupyter-notebook python

Last synced: 09 Jan 2025

https://github.com/seyedhosseinzadeh/ws_tm

Weather web scraping and a time series model to predict temperature, humidity, and barometric pressure

data-analysis deep-learning lstm-model machine-learning prediction prediction-model weather web-scraping

Last synced: 10 Jan 2025

https://github.com/emptymalei/mini-lab

Some code snippets used to explain stuff to myself in my personal data science wiki

data-analysis data-mining data-science data-visualization datascience

Last synced: 13 Feb 2025

https://github.com/thealphadollar/messiah

Messiah: The Mighty Son Of God Is Here To Help You Through Times Of Calamity

azure backend data data-analysis flask frontend materialize natural-disasters

Last synced: 13 Feb 2025

https://github.com/farahibrar/kpmg-job-simulation

This repository showcases my work from the KPMG Technology Job Simulation by Forage, focusing on Data Analytics and Cloud Engineering. Explore how I tackled real-world business challenges through sales data analysis, regional growth strategies, and AWS architecture design, highlighting my analytical and technical expertise.

aws-architecture business-intelligence cloud-engineering cloud-strategy-and-design data-analysis data-visualization fintech-solutions forage kpmg kpmg-careers python-for-data-analysis sales-data-insights sustainable-retail-analysis

Last synced: 25 Jan 2025

https://github.com/joanacmbarros/ardm-website

Website to support the R in Pharma 2023 workshop on the ARDM

analysis-results automation clinical-data data-analysis data-model r-in-pharma

Last synced: 09 Feb 2025

https://github.com/timzatko/fifa-19-dataset-machine-learning

Player's value prediction and game position classification on FIFA 19 dataset.

data-analysis fifa19 machine-learning scikit-learn

Last synced: 03 Jan 2025

https://github.com/viper373/baidutieba

Crawler for Baidu Tieba (configurable forum name, start/end page numbers, and log output)

baidutieba-crawler bert data-analysis deep-learning python spider

Last synced: 05 Feb 2025

https://github.com/winter000boy/dsa-practice

This repository holds my solutions for LeetCode’s Pandas playlists. Each section includes code and notes on using Pandas to handle real-world data tasks efficiently. Perfect for anyone looking to deepen their understanding of data manipulation with Pandas.

data-analysis data-science leetcode leetcode-python pandas-python python3

Last synced: 30 Jan 2025

https://github.com/lafayettegabe/g2m-insight-for-cab-investment-firm

📊 Exploratory Data Analysis (EDA) on multiple datasets related to the cab industry in the US, to provide actionable insights and recommendations to a private firm looking to invest in the market. The analysis includes data cleaning, transformation, visualization, and hypothesis testing.

big-data data data-analysis data-science data-visualization eda gotomarket

Last synced: 08 Feb 2025

https://github.com/shlokashah/student-depression-and-suicide-rate-prediction

https://shlokashah.github.io/Student-Depression-And-Suicide-Rate-Prediction/

data-analysis data-visualization machine-learning student suicide-rate-prediction

Last synced: 19 Feb 2025

https://github.com/worst001/note_machine_learning

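A compilation of machine learning materials and handbooks, including mathematical foundations, example implementations of machine learning models, and neural networks.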

ai data-analysis deep-learning development guide learning machine-learning markdown mkdocs note notebook

Last synced: 12 Jan 2025

https://github.com/milind220/hk-air-quality-analysis

My final project for a statistics and data analysis course. Whew that was a lot of graphs!

data-analysis jupyter-notebook numpy pandas python python3 scipy seaborn statistics

Last synced: 03 Jan 2025

https://github.com/muzammil-13/data_analysis-inmakes

A data-driven project that leverages machine learning to predict Bitcoin price trends. Using historical Bitcoin data, this analysis provides 30-day price forecasts through advanced statistical modeling.

data-analysis data-science machine-learning numpy pandas python python-library

Last synced: 15 Feb 2025

https://github.com/grburgess/gbm_kitty

Database, reduce, and analyze GBM data without having to know anything. Curiosity killed the catalog.

3ml catalogue data-analysis fermi-science grbs pipelines

Last synced: 23 Jan 2025

https://github.com/poga/dat-ipynb-demo

Use an IPython notebook to analyze data in a dat archive

dat data-analysis distributed jupyter-notebook

Last synced: 08 Feb 2025

https://github.com/thecoderpinar/reta

🍃 Explore the world of renewable energy production, analyze historical data, and predict sustainable energy trends. Join us on the journey to a greener future!

arima clean-energy data-analysis data-science data-visualization energy-future forecasting-models innovation renewable-energy sustainability time-series

Last synced: 09 Feb 2025

https://github.com/jimbrig/eda

Exploratory Data Analysis R Package and Shiny App

data-analysis data-visualization eda r shiny

Last synced: 23 Jan 2025

https://github.com/walidalsafadi/house-prices

Ask a home buyer to describe their dream house, and they probably won't begin with the height of the basement ceiling or the proximity to an east-west railroad. With 79 explanatory variables describing (almost) every aspect of residential homes in Ames, Iowa, this competition challenges you to predict the final price of each home.

cross-validation data-analysis data-science data-visualization decision-trees eda house-price-prediction house-prices jupyter linear-regression machine-learning machine-learning-algorithms mlp-regressor plot python random-forest-regression regression svr xgboost-regression

Last synced: 22 Jan 2025

https://github.com/antononcube/wl-outlieridentifiers-paclet

Wolfram Language (aka Mathematica) paclet that provides outlier identifier functions.

data-analysis hampel outlier-detection outliers

Last synced: 08 Feb 2025

https://github.com/bkataru/physics-ia

Programs and files written for Astrostatistics for IB Physics IA. Topic: Visualizing and analyzing the habitable zones for 150,000 stars from the Hipparcos catalogue.

astronomical-algorithms astronomy astrophysics astrostatistics data-analysis data-science data-visualization matplotlib plotting

Last synced: 15 Feb 2025

https://github.com/yahia3200/become-an-independent-data-scientist

My final project for the Applied Plotting, Charting & Data Representation in Python Course

data-analysis data-science data-visualization matplotlib

Last synced: 22 Jan 2025

https://github.com/v-octal/hows_india_feeling

An application that displays India's region-wise Twitter sentiment on a map.

data-analysis data-visualization flask-restful leaflet nlp-machine-learning sentiment-analysis

Last synced: 20 Feb 2025

https://github.com/virajbhutada/tableau-data-vizzes

Engage with a growing collection of Tableau dashboards covering financial trends, HR analytics, streaming service insights, real estate dynamics, and more. Meticulously crafted for valuable insights, this repository continues to expand with new and compelling visualizations.

business-analytics data-analysis data-visualization hr-analytics industry-trends netflix performance-metrics stock-market-analysis strategic-analytics tableau visual-insights

Last synced: 10 Jan 2025

https://github.com/andr3w03/bike-sharing-dashboard

Bike Sharing Data Analysis Streamlit Dashboard

dashboard data-analysis data-visualization python streamlit

Last synced: 29 Jan 2025

https://github.com/louislefevre/sstubs-miner

Data mining and analysis for the ManySStuBs4J dataset.

data-analysis data-mining manysstubs4j-dataset msr

Last synced: 05 Feb 2025

https://github.com/sevdanurgenc/data-modeling-techniques-lecture-notes

This repo contains the course contents of the Data Modelling Techniques training, given to Innova Technology in cooperation with Academy Peak Information Technologies Training and Consultancy on 25-26 January 2022.

data-analysis data-mining data-modeling data-science data-structure data-visualization

Last synced: 29 Jan 2025

https://github.com/chaitanyac22/house-price-prediction-project-for-a-us-based-housing-company

The goal of this project is to use data analytics to gain insights for purchasing houses at a price below their actual value and reselling them at a higher price. The project aims to build an effective regression model using regularization (i.e. advanced linear regression: Ridge and Lasso regression) to predict the actual values of prospective housing properties and decide whether to invest in them.

advanced-linear-regression business-analytics data-analysis data-cleaning data-manipulation data-visualization exploratory-data-analysis feature-engineering lasso-regression linear-regression machine-learning model-building model-evaluation prediction-model python3 regularization rfe ridge-regression statistics

Last synced: 01 Feb 2025

https://github.com/pratishtha-abrol/astronomy-dataanalysis

A key technique in Data Driven Astronomy

astronomy astropy crossmatch data-analysis

Last synced: 05 Feb 2025

https://github.com/vishrut-b/end-to-end-data-analytics-with-python-and-sql

This project involves the data cleaning and SQL-based analytics of a retail orders dataset using Python and SQL. It focuses on preprocessing data, followed by detailed analytics to extract insights on sales trends and product performance.

data-analysis python retail sql sql-server sqlalchemy

Last synced: 05 Feb 2025

https://github.com/manmolecular/http-response-clustering

📉 Clustering of HTTP responses using k-means++ and the elbow method

data-analysis elbow-method elbow-plot jupyter k-means-plus-plus python3

Last synced: 16 Jan 2025

https://github.com/juliusmarkwei/concrete-data

Data analysis, machine learning, model evaluation and optimization on the Concret_ Dataset

data-analysis data-science data-visualization ensemble-learning machine-learning modeling

Last synced: 01 Jan 2025

https://github.com/zachlagden/spotify-listening-analyzer

A comprehensive Python tool for analyzing your Spotify listening history data.

analytics data-analysis pandas python spotify-web-api spotipy

Last synced: 07 Feb 2025

https://github.com/dogan-the-analyst/developer_survey_analysis

Analysis of the 2024 Stack Overflow developer survey. Tools used include Python, Pandas, Matplotlib, and IBM Cognos.

data-analysis data-visualization ibm-cognos-analytics matplotlib pandas python

Last synced: 16 Feb 2025

https://github.com/kalebers/economic_analysis_data_science

Python data analysis project using an economic database to predict the percentage of good and bad payers

data-analysis data-science machine-learning pandas python scipy sklearn-library

Last synced: 21 Jan 2025

https://github.com/ysayaovong/stockroom_management

The Stockroom Management project is a comprehensive tool that automates and simplifies the process of managing inventory in stockrooms. By incorporating features like real-time updates, report generation, and low-stock alerts, it helps businesses save time, reduce errors, and optimize their inventory operations.

business-applications data-analysis data-visualization database-management inventory-control inventory-management logistics sql warehouse warehouse-management

Last synced: 30 Jan 2025

https://github.com/mohamedhany99/collecting-sound-features-frequency-from-an-audio-file-and-save-it-in-excel

This script takes a single audio file, collects its sound features (frequency, mean, variance, minimum value, maximum value, median, kurtosis, skewness), saves them in an array, and imports that array as a row into the datasheet.

automation data-analysis data-science dataset-generation excel-import signal-processing

Last synced: 17 Jan 2025

https://github.com/rani-sikdar/python-data-structures

A repository for sharing Python implementations of common data structures and algorithms, including arrays, linked lists, stacks, queues, trees, graphs, and sorting/searching techniques. Perfect for learning, revisiting fundamentals, and collaborating on computer science concepts. Contributions welcome! 🚀

data-analysis data-structure data-visualization numpy-library pandas-dataframe python python-3

Last synced: 05 Feb 2025

https://github.com/mardavsj/weather-prediction

Weather prediction model which mainly focuses on visualization.

data-analysis data-visualization matplotlib numpy pandas pandas-dataframe

Last synced: 14 Feb 2025

https://github.com/ysayaovong/car-sales

An analysis of car sales data to uncover market trends and insights through data cleaning, analysis, and visualization.

automotive business-analysis data-analysis data-cleaning data-visualization market-trends matplotlib pandas python sales-data scikit-learn seaborn

Last synced: 23 Jan 2025

https://github.com/its-kanii/predictive-maintenance-for-healthcare-equipment

Predictive Maintenance for Healthcare Equipment utilizes machine learning to analyze operational metrics and predict equipment failures. This project leverages a dataset of usage hours, temperature, and maintenance history to enhance equipment reliability and reduce downtime.

data-analysis data-science failure-prediction feature-engineering healthcare-equipment jupyter-notebook machine-learning predictive-maintenance python time-series-analysis

Last synced: 12 Feb 2025

https://github.com/ezzz-lui/rsm-evaluationproject

This repository documents our project for RSM, done as the final activity for the Data Analyst bootcamp.

data-analysis python

Last synced: 30 Jan 2025

https://github.com/sufiyanahmed4566/sql-musicmaven

This Music Store Database Project showcases SQL skills through comprehensive database design, query optimization, and data analysis. Includes ER diagram, database file, query questions (Easy, Medium, Hard), answered queries, and CSV table data. Ideal for recruiters seeking skilled SQL developers for music store management and data analysis.

data-analysis database insights mysql-database oracle-database relational-databases sql

Last synced: 24 Jan 2025

https://github.com/pxaris/expenditure-analyzer

Application for analyzing expenditure data over time

data-analysis data-visualization docker python statistics

Last synced: 15 Feb 2025

https://github.com/elcaiseri/udacity-advanced-data-analysis

UDACITY - Advanced-Data-Analysis Track Project

data-analysis python

Last synced: 01 Jan 2025

https://github.com/billgewrgoulas/hypothesis-testing-on-37-seasons-of-nba

First assignment for the course Data Mining @CSE.UOI

data-analysis data-science numpy scipy seaborn statistics

Last synced: 16 Jan 2025

https://github.com/rupav/fifa17-detailed-analysis

⚽ FIFA 17 data analysis using various Machine Learning Algorithms. ⚽

data-analysis data-visualization fifa17 machine-learning-algorithms radar-chart

Last synced: 09 Feb 2025

https://github.com/jen-uis/la-crime-data-analysis

This repository contains project materials for the Fall 2023 MGT 256 class. This project was completed with assistance from Professor Adem Orsdemir.

business-analytics crime-data crime-data-analysis data-analysis knn la-crimes-from-2020 la-safe r r-markdown r-studio report-generation rmd united-states visualization

Last synced: 21 Jan 2025

https://github.com/narenkhatwani/arkouda-projects

This repository contains the source code of projects done using Arkouda (a software package that allows a user to interactively issue massive parallel computations on distributed data using functions and syntax that mimic NumPy, the underlying computational library used in most Python data science workflows).

arkouda data-analysis data-analytics data-science high-performance high-performance-computing highperformancecomputing numpy pandas parallel-computing parallel-processing parallelization python

Last synced: 06 Feb 2025

https://github.com/agungbudiwirawan/supermarket_sales_dashboard-sales-analysis-using-pivot-table-and-chart

The objective of this project is to analyze supermarket sales using pivot tables and charts in Microsoft Excel. After the analysis, I created an interactive dashboard that makes it easier for the audience to explore the data.

dashboard data-analysis data-science data-visualization excel sales-dashboard spreadsheet

Last synced: 06 Feb 2025