Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/luminati-io/walmart-dataset-samples

A sample dataset of over 1000 Walmart products, extracted using the Bright Data API, ideal for consumer market insights and competitor analysis.

api data-analysis dataset walmart walmart-scraper web-scraping

Last synced: 23 Jan 2025

https://github.com/nilayhangarge/data-analysis-with-python

This repository provides a practical introduction to data acquisition and analysis using Pandas. It covers loading datasets, exploring data, manipulating data, and gaining insights through statistical summaries. Ideal for beginners, it offers code examples and explanations to enhance your data manipulation skills using Pandas for Python.

data-acquisition data-analysis data-analytics data-binning data-cleaning data-engineering data-fundamentals data-insights data-integration data-preprocessing data-science data-wrangling numpy pandas python

Last synced: 06 Jan 2025

https://github.com/mosalem149/pythonutilities

A collection of Python scripts for common utility tasks including file manipulation, word counting, longest word detection, and grade categorization. Perfect for quick and easy solutions to everyday programming problems.

data-analysis educational-tools file-io file-manipulation grade-calculation python text-analysis text-processing utility word-counting

Last synced: 06 Jan 2025

https://github.com/alphatwirl/qtwirl

qtwirl (quick-twirl), one-function interface to AlphaTwirl

alphatwirl data-analysis data-frame pandas r root-cern

Last synced: 20 Jan 2025

https://github.com/sebastianurdaneguibisalaya/enfermedades-fissal

Análisis holístico de atenciones por enfermedades raras, huérfanas y transplantes coberturados por FISSAL en el Perú.

data-analysis data-visualization python

Last synced: 06 Jan 2025

https://github.com/sebastianurdaneguibisalaya/colocaciones-de-credito-fondo-mivivienda-peru

Exploro las Colocaciones de Crédito del Fondo MIVIVIENDA S.A. entre 2018 y 2022, con un conjunto de datos descargado del Portal Nacional de Datos Abiertos del Perú. 🏠

data-analysis jupyter-notebook python

Last synced: 06 Jan 2025

https://github.com/satyacoder29/crowdfunding-in-sql

Crowdfunding is a method of raising funds for projects or causes by collecting small contributions from a large group of people, usually through online platforms. It enables individuals, startups, and nonprofits to secure funding, offering rewards or recognition in exchange, and helps bring ideas to life without traditional financing.

data-analysis data-cleaning database-management mysql-database quries sql sql-functions sql-server views

Last synced: 28 Dec 2024

https://github.com/al-ogr/sf_pr1_job_analysis_hh

SkillFactory DataScience PROJECT-1. Анализ резюме из HeadHunter

data-analysis data-science ipynb plotly python

Last synced: 20 Jan 2025

https://github.com/vavarm/data-analysis-french-electric-automobile-infrastructure

Data analysis realized in R Shiny and Python about the French electric vehicle and charging station infrastructure

data-analysis data-science data-visualization factominer geojson ggplot2 plotly python r rshiny

Last synced: 20 Jan 2025

https://github.com/nikbarb810/covid_growth_rate_390.51

Exploring Covid Growth Rate of European Population using genetic data analysis

bioinformatics data-analysis r rcpp

Last synced: 01 Jan 2025

https://github.com/vara-co/solar-eclipse-2024

Group Project on the 2024 Solar Eclipse's Path over the US with an interactive map and a couple of visualizations on the data gathered.

data-analysis data-visualizations html-css-javascript interactive-map javascript map solar-eclipse

Last synced: 02 Feb 2025

https://github.com/gurpreet17/uc-davis-sql-for-data-science-specialization

Completed the SQL Basics for Data Science Specialization from the University of California, Davis, gaining proficiency in Data Analysis, SQL, Apache Spark, and Delta Lake.

apache-spark bigdata data-analysis data-science delta-lake sqlite

Last synced: 28 Dec 2024

https://github.com/sferez/gradient_descent

Multiple Linear Regression, Gradient Descent with Python

data-analysis data-science gradient-descent linear-regression python

Last synced: 13 Jan 2025

https://github.com/jethronap/jstat-gui

Web-based GUI application for data analysis

data-analysis data-visualization java jstat mongodb

Last synced: 06 Jan 2025

https://github.com/kashirin-alex/thither.direct-onamove

an android skeleton-example application for using data from Thither.Direct platform on mobile applications

android-application data data-analysis data-structures data-visualization mobile-development mobility query research-data-management

Last synced: 19 Dec 2024

https://github.com/analyticalnahid/pandas-tutorial

A complete tutorial on Pandas for Data Science and Machine Learning

data-analysis pandas pandas-python pandas-tutorial

Last synced: 02 Feb 2025

https://github.com/rahulsm20/car-data

A data analytics project that involves analyzing a car dataset that includes information on various car brands, years, prices, mileage, and fuel types, in order to gain insights into the car market.

data-analysis data-analytics matplotlib numpy pandas python

Last synced: 06 Jan 2025

https://github.com/rahulsm20/storedata

A data analysis project aimed at analyzing the sales data of the super store and providing useful insight into customer preferences.

data-analysis matplotlib numpy pandas python streamlit

Last synced: 06 Jan 2025

https://github.com/inevolin/multivariate-data-analysis

Showcases of modern multivariate & multidimensional data analysis in industrial and high-tech settings.

analytics data-analysis data-science data-visualization javascript

Last synced: 11 Jan 2025

https://github.com/mr-chang95/udacity_movie_project

Movie Data Analysis and Visualization Project for Udacity's Data Analyst Program. Using Python in Jupyter Notebook.

data-analysis data-visualization jupyter-notebook movie python

Last synced: 26 Jan 2025

https://github.com/mr-chang95/datascience_airbnb

Data Science Project for Udacity's Data Scientist Program. Using Python in Jupyter Notebook.

airbnb data-analysis data-science data-visualization jupyter-notebook numpy pandas python sklearn

Last synced: 26 Jan 2025

https://github.com/mr-chang95/sf_data_visualization

In this personal project, I am interested in examining all of the active businesses in the San Francisco Bay Area while performing some simple data visualizations, mainly on categorical variables.

business data-analysis data-visualization jupyter-notebook pandas python san-francisco

Last synced: 26 Jan 2025

https://github.com/vubacktracking/freecodecamp-data-analysis-with-python

5 Projects in Data Analysis With Python Course on Freecodecamp

data-analysis freecodecamp freecodecamp-project python

Last synced: 06 Jan 2025

https://github.com/rahulsm20/trackbyte

A full-stack web application that helps users keep track of their playlist and provides analytics based on their music taste. Built using React, Node.js, Express.js, MySQL and Bootstrap.

bootstrap data-analysis expressjs mysql nodejs reactjs sql

Last synced: 06 Jan 2025

https://github.com/rahulsm20/insurance-data

A data analytics project dealing with risk assessment and it's effects in health insurance.

data-analysis data-analytics machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 06 Jan 2025

https://github.com/sieunnnn/data-lab

데이터 분석을 연습하는 Repository 입니다.

data-analysis jupyter-notebook python

Last synced: 13 Jan 2025

https://github.com/ryuzen6/kaggle-series

This is a series of Machine Learning/Deep Learning Models made for practice.

artificial-intelligence data-analysis data-science deep-learning machine-learning python3

Last synced: 20 Jan 2025

https://github.com/shrutiijoshi/coffee_sales

This project aims to analyze coffee sales data to identify key trends, patterns, and factors influencing sales performance.

data-analysis microsoft-excel

Last synced: 18 Jan 2025

https://github.com/datalopes1/manufacturing_defects

Projeto de EDA utilizando o Manufacturing Defects que pode ser encontrado no Kaggle

data-analysis data-visualization eda exploratory-data-analysis python

Last synced: 02 Feb 2025

https://github.com/datalopes1/fifa21_datacleaning

Neste projeto será feito o processo de limpeza e manipulação a partir do dataset FIFA 21 messy, raw dataset for cleaning/ exploring, que pode ser encontrado no Kaggle, com licensa CC0: Public Domain e enviado por Rachit Toshniwal.

data-analysis data-cleaning python

Last synced: 02 Feb 2025

https://github.com/danielrosehill/data-projects-index

Data apps and datasets deployed to Streamlit Community Cloud, Hugging Face, and elsewhere.

data-analysis data-science data-visualization

Last synced: 20 Jan 2025

https://github.com/datalopes1/warehouse_rfv

Neste projeto será realizada uma análise do tipo RFV (Recência, Frequência e Valor) com dados que encontrei neste video no Youtube do canal Jie Jenn.

analise-rfv data-analysis data-science kmeans python rfm-analysis

Last synced: 02 Feb 2025

https://github.com/datalopes1/bank_marketing

Este projeto será baseado no Dataset Bank Marketing encontrado na UC Irvine - Machine Learning Repository e disponibilizado por S. Moro, R. Laureano e P. Cortez

data-analysis data-science data-visualization eda python

Last synced: 02 Feb 2025

https://github.com/chdre/data-analyzer

A small package to analyze and preprocess data.

data-analysis python

Last synced: 25 Jan 2025

https://github.com/alan-oliveir/state-of-data-2022

Neste projeto faço a análise da distribuição das faixas salariais para os profissionais de nível júnior para o cargo de analista, cientista e engenheiro de dados.

data-analysis jupyter-notebook pandas-python seaborn-python

Last synced: 13 Jan 2025

https://github.com/roland045/smart_fluid_sedimentation_tester

Control program for custom developed smart fluid sedimentation tester system

arduino data-analysis instrumentation measurement sensor

Last synced: 06 Jan 2025

https://github.com/apfirebolt/numpy-and-pandas-examples

Some examples and sample datasets to learn numpy, pandas and other data science libraries in Python

data-analysis jupyter-notebook numpy pandas python

Last synced: 25 Jan 2025

https://github.com/touradbaba/multi-page_dash_application

This repository contains a Multi-Page Dash Application designed to provide interactive visualizations of geo-spatial data, focusing on population and GDP. The app offers insights into demographic and economic trends through interactive maps and various types of charts. It is built with Python, using Plotly and Dash, and is deployed on Heroku.

dash dashboard data-analysis data-visualization exploratory-data-analysis heroku-deployment plotly pythonanywhere

Last synced: 21 Jan 2025

https://github.com/aurcode/breast_cancer_predict_api

API that can classify whether breast cancer is benign or malignant based on measured characteristics.

data-analysis flask python

Last synced: 08 Feb 2025

https://github.com/mrham17/spotify_streaming_analytics

Project is stable & documentation will be completed soon. Thank you for your understanding and patience.

big-data-analytics data-analysis google-colab music-data r-programming spotify streaming-analytics

Last synced: 31 Jan 2025

https://github.com/gaurav-van/optimizing-rate-of-penetration-in-geothermal-drilling-a-digital-twin-approach

Let’s explore something interesting together. In this project, we developed a machine learning digital twin using Intel-optimized XGBoost and daal4py to simulate and optimize the Rate of Penetration (ROP) in geothermal drilling. We leveraged SHAP for Explainable AI (XAI) to interpret model predictions.

data-analysis data-science digital-twin explainable-ai geothermal geothermal-energy jupyter-notebook machine-learning python shap xai xgboost

Last synced: 02 Feb 2025

https://github.com/gaurav-van/data-analysis-projects

Collections of Projects that involves Data Analysis and Informed Decision Making

data-analysis database powerbi sql

Last synced: 02 Feb 2025

https://github.com/mihaildoman/formula-one-insights-with-tableau

Using Tableau to analyze the achievements of drivers and constructors throughout Formula One's history.

data data-analysis data-visualization formula-one formula1

Last synced: 26 Jan 2025

https://github.com/mihaildoman/formula-one-insights-with-python-and-sql

Using Python and SQL to analyze the achievements of drivers and constructors throughout Formula One's history.

data data-analysis data-visualization formula-one formula1

Last synced: 26 Jan 2025

https://github.com/datawithbaraa/sql-data-analytics-project

This repository contains a collection of SQL scripts demonstrating various analytical techniques, such as changes over time, cumulative, performance, data segmentation, part-to-whole analysis.

analytics business-analytics business-intelligence data data-analysis data-analyst data-analytics data-engineering data-science data-scientist database datascience query reporting sql sql-queries sql-query sql-server window-functions window-functions-in-sql

Last synced: 02 Feb 2025

https://github.com/noeyislearning/credit-card-fraud

This project analyzes credit card transactions in the western U.S. to detect fraud. It utilizes a dataset with detailed transaction info including date, time, merchant, purchase type, amount, customer location, and fraud status. The goal is to enhance security measures and ensure customer trust.

credit-card-fraud data-analysis data-science jupyter-notebook python3

Last synced: 01 Feb 2025

https://github.com/bishopce16/pyber_analysis

The purpose of this project was to complete an exploratory analysis and create visualizations of the 2019 ride sharing data from PyBer.

data-analysis data-visualization jupyter-notebook matplotlib pandas python

Last synced: 06 Jan 2025

https://github.com/bishopce16/school_district_analysis

The school board requested an analysis on the various performance metrics for the school district.

data-analysis jupyter-notebook numpy pandas python visual-studio-code

Last synced: 06 Jan 2025

https://github.com/nerooc/device-downtime-detection

Repozytorium dotyczące projektu z przedmiotu "Sztuczne Sieci Neuronowe"

data-analysis detection-model recurrent-neural-networks

Last synced: 26 Jan 2025

https://github.com/tmmvn/analytics-notebooks

A bunch of data analytics notebooks done testing out JetBrains DataLore

ai algorithms data-analysis datalore elements-of-ai helsinki-university-mooc python

Last synced: 28 Dec 2024

https://github.com/hadson0/chess-live-ratings-data

A study project focused on web scraping the live chess ratings from chess.com, with data analysis and visualization on nearly 5000 players in the classical world ranking.

beautifulsoup chess data-analysis data-visualization numpy pandas python seaborn web-scraping

Last synced: 21 Jan 2025

https://github.com/chinmayee4/sales-analysis-for-ferns-n-petals

Analyzed Data By Creating Interactive Dashboard Using MS Excel

data-analysis data-cleaning data-visualization excel pivot-tables powerquery

Last synced: 02 Feb 2025

https://github.com/chinmayee4/vrinda_store_data_analysis

Analyzed Data By Creating Interactive Dashboard Using MS Excel

data-analysis data-cleaning data-visualization excel-dashboard pivot-tables power-query

Last synced: 02 Feb 2025

https://github.com/mumtaz4118/amazon-iphone-12-data-scrapped

Beautiful Soup is a Python library for getting data out of HTML, XML, and other markup languages.

data-analysis data-extraction data-science data-scraping html mark-up python

Last synced: 28 Dec 2024

https://github.com/mumtaz4118/nlp-course

Programming Assignments and Lectures for Stanford's CS 224: Natural Language Processing with Deep Learning

course data data-analysis data-analytics data-science data-visualization deep-learning education machine-learning natural-language-processing neural-network transfer-learning

Last synced: 28 Dec 2024

https://github.com/mohamed3nan/udacity

Udacity Data Analysis Nanodegree Program

data-analysis data-visualization numpy pandas python

Last synced: 13 Jan 2025

https://github.com/elishah-john/happiness-report-2019

Analysis of "Happiness Report 2019" using python.

data-analysis data-visualization educational jupyter-notebook python

Last synced: 21 Jan 2025

https://github.com/haonamnguyen/data-science-job-analysis

Evaluate the factors influencing salary trends in the data science industry, including experience levels, job titles, employment types, company sizes, and remote work arrangements, to help HR teams and hiring managers make data-driven decisions regarding compensation packages and recruitment strategies.

data-analysis data-science data-visualization jupyter-notebook python

Last synced: 21 Jan 2025

https://github.com/myke003/data-analysis-projects

This repository serves as a collection of all my projects.

data-analysis jupyter-notebook powerbi

Last synced: 21 Jan 2025

https://github.com/manisharora96/data-analysis-of-smartwatch

The project is structured with sample data, step-by-step Jupyter notebooks, and modular Python scripts for automated analysis

data-analysis data-visualization jupyter-notebook python smartwatch-analysis

Last synced: 21 Jan 2025

https://github.com/mecha-aima/demographic-analyzer

This project uses pandas to process census data from a csv file and draw useful results from the data by performing various filtering and calculations on it

data-analysis data-science pandas

Last synced: 03 Jan 2025

https://github.com/navdeep-g/data-quality-checker

A comprehensive Python tool for assessing and ensuring data quality

data-analysis data-science pandas python

Last synced: 24 Dec 2024