Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/smkdev-id/smkdevchallenge

Dokumentasi keseluruhan challenges/problems yang diinisiasikan untuk SMKDEV Coding Challenge

backend coding-challenges csharp data-analysis frontend golang java javascript machine-learning php python

Last synced: 09 Nov 2024

https://github.com/mahdi-meyghani/movie-recommendation-system

A Python-based movie recommendation system utilizing popularity-based, content-based, and collaborative filtering models with data science and machine learning techniques.

data-analysis data-science machine-learning recommendation-system scikit-learn scikitlearn-machine-learning

Last synced: 10 Oct 2024

https://github.com/cano1998/data-visualization-project

A project focused on data visualization to explore various aspects of a car dataset. The visualizations provide insights into car performance, efficiency, and characteristics based on different manufacturers and features.

bar-pl bar-plot data-analysis data-visualization histogram jupyter-notebook line-plot

Last synced: 09 Nov 2024

https://github.com/spacebakery/variance-in-weather-project

Codecademy | Statistics for Data Analysis | Variance and Standard Deviation

data-analysis python standard-deviation statistics variance

Last synced: 09 Nov 2024

https://github.com/mohamed3nan/udacity

Udacity Data Analysis Nanodegree Program

data-analysis data-visualization numpy pandas python

Last synced: 14 Nov 2024

https://github.com/carlosvinimsouza/dataanalysiswithpython

Learn Data Analysis using Libs Python (Numpy, Pandas, Matplotlib and Seaborn)

data-analysis data-science free-code-camp matplotlib numpy pandas python python3 seaborn

Last synced: 12 Nov 2024

https://github.com/tszon/data-science-projects

Included are all the worth-noting Data Science projects in my learning journey with DataCamp.

data-analysis data-science exploratory-data-analysis feature-engineering machine-learning modelling preprocessing-data scikit-learn supervised-learning

Last synced: 12 Oct 2024

https://github.com/docuvesta/la-mer-skincare-chicago-duty-free-analysis

Comparing La Mer product selection, availability and pricing from 3 different purchase locations ✈️

analytics cremedelamer data-analysis data-analytics data-science data-visualization lamer luxury plotly python seaborn skincare

Last synced: 09 Nov 2024

https://github.com/dcs-training/null-hypothesis-testing-with-r

This two-class course will focus on developing theoretical and practical skills for null hypothesis testing in R. Go to the readme file

data-analysis data-wrangling r statistics

Last synced: 10 Nov 2024

https://github.com/docuvesta/la-prairie-luxury-skincare-makeup-analysis

Web scraping La Prairie skincare websites for brand and product insights 🛍️

cosmetics data-analysis data-analytics data-visualization jupyter-notebook luxury python science skincare

Last synced: 09 Nov 2024

https://github.com/otonomee/against-the-clock-transcript-analysis

This repository contains code and analysis for exploring the transcripts of the various "Against The Clock" videos featured on the FACT Magazine YouTube channel. The goal is to uncover insights, patterns, and trends across the different artists and their creative process under time constraints.

against-the-clock ai-analysis audio-processing creative-ai creative-process data-analysis fact-magazine machine-learning music-production natural-language-processing nlp text-mining yt-dlp

Last synced: 10 Nov 2024

https://github.com/thanhngan22/data-analyst-fundamental

🧩 data analyst fundamental | Knowledge relevant to a datathon | materials

analyzing-data-using-pandas data-analysis datathon tensorflow

Last synced: 09 Nov 2024

https://github.com/jinkogule/multi-analyst

O Multi Analyst é uma ferramenta de análise de dados com uma usabilidade simples, que utiliza inteligência artificial para interpretar os resultados das análises realizadas, retornando insights úteis aos usuários.

apriori-algorithm bootstrap css data-analysis django html numpy open-ai pandas python web-application

Last synced: 09 Nov 2024

https://github.com/johannaschmidle/road-collisions-project

Understanding Accident Severity for Effective Road Management [Excel]

data-analysis data-visualization excel pivot-tables traffic-analysis

Last synced: 12 Nov 2024

https://github.com/dcs-training/network-analyisis-python

Course material for introducing data visualization with Altair and network analysis with NetworkX (in Python). Go to the readme file

data-analysis data-visualisation network-analysis python text-analysis

Last synced: 10 Nov 2024

https://github.com/johannaschmidle/amazon-cat-couch

Customer product reviews + ratings analysis and visualization [Python, Excel, Tableau, R]

data-analysis data-visualization jupyter-notebook python-notebook r-markdown sentiment-analysis text-analysis web-scraping

Last synced: 12 Nov 2024

https://github.com/dcs-training/introtodatabases

This repository host the material connected to a training developed by Dave Elsmore (Edina) for CDCS. Go to the readme file

data-analysis data-wrangling databases sql

Last synced: 10 Nov 2024

https://github.com/johannaschmidle/netflix-subscription-analysis

Analyzing Netflix subscription trends from 2021 - 2023 [SQL, Tableau]

data-analysis data-cleaning data-trend data-visualization netflix

Last synced: 12 Nov 2024

https://github.com/techshot25/graduateadmissions

Looking at the probability of being accepted in a graduate program using a machine learning model

bayesian-regression correlation-matrices data-analysis data-science linear-regression machie-learning random-forest-regression regression ridge-regression

Last synced: 10 Nov 2024

https://github.com/dcs-training/2023-10-22-carpentry-social-science

Go to https://dcs-training.github.io/2023-10-22-Carpentry-Social-Science/ to follow along the material

data-analysis data-visualisation data-wrangling intro-to-programming r

Last synced: 10 Nov 2024

https://github.com/dcs-training/machinelearning

Introduction to Machine Learning with Python delivered by the centre in the year 2022-23. Go to the read me file

data-analysis data-wrangling machine-learning python

Last synced: 10 Nov 2024

https://github.com/kwonnayeon/urban-parks-childrens-happiness

A thesis project exploring the causal impact of urban parks on children's happiness, with data, results, and code.

causal-inference data-analysis environmental-psychology latex matching propensity-score public-health r social-science statistical-analysis thesis-project urban-studies weighting

Last synced: 02 Nov 2024

https://github.com/abhipatel35/diabetes_ml_classification

Predict diabetes using machine learning models. Experiment with logistic regression, decision trees, and random forests to achieve accurate predictions based on health indicators. Complete lifecycle of ML project included.

classification data-analysis data-science data-visualization descision-tree diabetes-prediction jupiter-notebook logistic-regression machine-learning model-evaluation open-source pandas pycharm-ide python random-forest scikit-learn

Last synced: 31 Oct 2024

https://github.com/dcs-training/good-data-visualisation-with-r

Our guide on how we create data visualisations through R. Go to the readme file

data-analysis data-visualisation r rmarkdown

Last synced: 10 Nov 2024

https://github.com/abdoufermat5/twitter-analysis

Twitter data analysis using Pyspark

data-analysis pyspark spark twitter twitter-api

Last synced: 11 Nov 2024

https://github.com/agustin-caceres/proyecto-data-analyst

Proyecto de Data Analyst sobre servicios de Telecomunicaciones en Argentina

business-analytics business-intelligence data-analysis data-visualization database postgresql python streamlit

Last synced: 11 Nov 2024

https://github.com/davidzajac1/four-percent-rule-pandas-analysis

Analysis of the 4% Personal Finance Rule of Thumb

data-analysis data-visualization pandas python

Last synced: 09 Nov 2024

https://github.com/min-thway-htut/r-programming

Repository for R-Programming

data-analysis r-programming

Last synced: 10 Nov 2024

https://github.com/dthung1602/goodread-bestbook-prediction

Data analysis - trying to predict the result of Goodreads Choice Adward

data-analysis goodreads pca python r xgboost

Last synced: 09 Nov 2024

https://github.com/ninadpatil09/hospital_emergency_room_analysis

This comprehensive analysis delves into the performance and characteristics of the hospital's emergency room over the past year. By scrutinizing key metrics and patient demographics, this study aims to provide valuable insights for optimizing patient care, resource allocation, and overall operational efficiency.

data-analysis tableau-public visualization

Last synced: 09 Nov 2024

https://github.com/ninadpatil09/heart_disease_detection_analysis

The Heart Disease Detection Analysis aims to create a predictive model for identifying individuals at risk of heart disease. Using a dataset with attributes like age, sex, and health metrics, the project focuses on distinguishing patients with and without heart disease.

data-analysis data-cleaning data-science data-visualization machine-learning

Last synced: 09 Nov 2024

https://github.com/pratanup/solar-power-generation-prediction

A solar power generation company wants to optimize solar power production and needs the prediction model to predict ‘Clearsky DHI’, ‘Clearsky DNI’, ‘Clearsky GHI’.

anaconda data-analysis data-science google-colab jupiter-notebook machine-learning machine-learning-algorithms machinelearning-python prediction prediction-model python

Last synced: 08 Nov 2024

https://github.com/prekshivyas/cis-595-big-data-analytics

Comprehensive real estate price prediction project, integrating socioeconomic indicators and property features.

data-analysis data-cleaning data-mining data-preprocessing data-science data-visualization data-wrangling exploratory-data-analysis web-scraping

Last synced: 09 Nov 2024

https://github.com/guilherme-marcello/r-data-analysis-piechart

Reading RDS files, processing and presentation in pie charts

data-analysis data-visualization pie-chart r

Last synced: 10 Nov 2024

https://github.com/aangelone2/das-c

Lightweight parallel Data Analysis Suite in C

c correlation-analysis data-analysis monte-carlo multithreading openmp

Last synced: 14 Nov 2024

https://github.com/sabelomkhwanzi/data-alchemist-boot-camp

Built on Covalent's Unified API, Increment has the full historical data set for 40+ chains including every smart contract, event, transaction, address, etc. With access to all this data you can find:

covalent data-analysis increment

Last synced: 17 Nov 2024

https://github.com/syedanimrafatima/ecommerce-store-sales-analysis-powerbi

The Sales Analysis Dashboard is designed to help an E-commerce Business to overview their Sales performance throughout the year. It includes a report and visualizations that cover sales performance, customer segmentation, product analysis, and more.

business-intelligence csv dashboard data-analysis data-cleaning data-visualization excel powerbi sales-analysis-dashboard storytelling

Last synced: 09 Nov 2024

https://github.com/abishekaditya/machinelearningintro

Some simple stuff with pandas and Scipy

data-analysis ipython machine-learning pandas python scipy

Last synced: 08 Nov 2024

https://github.com/angelmtenor/idafc

Udacity's Intro to Data Analysis

data-analysis

Last synced: 09 Nov 2024

https://github.com/meokullu/prefill

PreFill adds desired characters onto output values to increase their legibility.

alignment data data-analysis data-engineering data-science legibility

Last synced: 14 Nov 2024

https://github.com/imrandil/sql_practice_with_analysis

SQL practice using postgres db and docker as a tool to setup postgres, loving the sql way

data-analysis docker markdown postgres sql

Last synced: 08 Nov 2024

https://github.com/imrandil/excel_learning_dir

Excel learning practice with some data, the doing

data-analysis datasets excel

Last synced: 08 Nov 2024

https://github.com/fisseha-estifanos/telecom

A showcase repository for a specific telecommunication company. Used to analyze several telecommunication data set features and generate useful insights accordingly. Insights generated could be seen at https://github.com/Fisseha-Estifanos/telecom-visualizer or at https://fisseha-estifanos-telecom-visualizer-home-huxgy0.streamlitapp.com/

data-analysis notebooks-jupyter python visual-studio-code visualization

Last synced: 09 Nov 2024

https://github.com/leopeng1995/neuralsql

Make DataStore More Intelligent

data-analysis mongodb sql

Last synced: 08 Nov 2024

https://github.com/drill-n-bass/dealavo-project

Cartesian product from dictionary to list of dictionaries and faster methods for finding index than the `index` method.

data-analysis data-analysis-python matplotlib pandas python python3 random timeit

Last synced: 07 Nov 2024

https://github.com/drill-n-bass/ovh-project

The goal of this task is to prepare statistical analysis of set of data from disks.

anaconda analysis data-analysis data-analysis-python jupyter-notebook matplotlib-python pandas python3 seaborn-plots

Last synced: 07 Nov 2024

https://github.com/arush-codes/paris-olympic-de

data engineering project on paris olympics 2024

azure data-analysis data-engineering microsoft-azure olympics2024 pipeline

Last synced: 14 Oct 2024

https://github.com/nandit123/python_on_excel

Data Analysis using python libraries on excel data

csv data-analysis data-science fill fluctuations graph numpy python python-library

Last synced: 12 Nov 2024

https://github.com/madusales/powerbi-etl-elt

Venho estudando, através do Bootcamp da DIO sobre Data Analytics & Power BI, acerca do uso de SQL para criar soluções em BI. Esse repositório é dedicado a registrar os meus conhecimentos adquiridos até então sobre o que é BI, Tipos de análises, ETL e ELT.

big-data business-intelligence data-analysis powerbi

Last synced: 11 Nov 2024

https://github.com/mecha-aima/demographic-analyzer

This project uses pandas to process census data from a csv file and draw useful results from the data by performing various filtering and calculations on it

data-analysis data-science pandas

Last synced: 09 Nov 2024

https://github.com/muneeb1030/webscrapper_mastodon

The Mastodon Social Platform Scraper is a Python-based web scraping tool designed to explore and extract valuable data from the Mastodon social platform.

data-analysis data-collection mastodon python3 scrapy scrapy-spider selenium-python webscraping

Last synced: 09 Oct 2024

https://github.com/samkazan/business-analysis-tableau

Business Analysis on Global/Superstore data using Tableau.

analysis data-analysis tableau visualization

Last synced: 05 Nov 2024

https://github.com/jrh89/sorting-hat

With a simple and user-friendly interface, the GUI allows users to easily enter data and extract the numbers they need and then sort and graph them.

data-analysis data-visualization datascience executable graphs-algorithms gui python sorting sorting-algorithms sorting-algorithms-implemented

Last synced: 10 Nov 2024

https://github.com/rahulsm20/car-data

A data analytics project that involves analyzing a car dataset that includes information on various car brands, years, prices, mileage, and fuel types, in order to gain insights into the car market.

data-analysis data-analytics matplotlib numpy pandas python

Last synced: 10 Nov 2024

https://github.com/cicku/en.650.672

HW of EN.650.672

analytics data-analysis numpy pandas

Last synced: 09 Nov 2024

https://github.com/rahulsm20/storedata

A data analysis project aimed at analyzing the sales data of the super store and providing useful insight into customer preferences.

data-analysis matplotlib numpy pandas python streamlit

Last synced: 10 Nov 2024

https://github.com/madrury/hot-sauce

Simuation of a Hot Sauce Spicyness Dataset

data-analysis data-science data-visualization dataset machine-learning

Last synced: 10 Nov 2024

https://github.com/rahulsm20/insurance-data

A data analytics project dealing with risk assessment and it's effects in health insurance.

data-analysis data-analytics machine-learning matplotlib numpy pandas python scikit-learn

Last synced: 10 Nov 2024

https://github.com/rahulsm20/trackbyte

A full-stack web application that helps users keep track of their playlist and provides analytics based on their music taste.

bootstrap data-analysis expressjs mysql nodejs reactjs sql

Last synced: 10 Nov 2024

https://github.com/sinsunsan/earth-survival-kit

Global warning data visualisation app to make everyone understand global warning and take actions that matter

angular angular7 d3 data-analysis data-visualization ecology global-warning ngx-charts

Last synced: 08 Nov 2024

https://github.com/abhroroy365/market_analysis

This project explores customer segmentation and market analysis in the context of online retail using an online retail dataset. By applying advanced analytics, we aim to uncover insights that can drive strategic decisions and enhance business performance.

clustering data data-analysis data-visualization kmeans-clustering machine-learning market-analysis python silhouette-analysis

Last synced: 09 Nov 2024

https://github.com/kentlouisetonino/ama-project-data-analysis

A course project for course MATH 6200.

ama-university data-analysis python

Last synced: 12 Nov 2024

https://github.com/anoopgeorge418/linked-analytics

"LinkedAnalytics is a project that scrapes LinkedIn data, analyzes it to uncover valuable insights, builds predictive models, and deploys them for practical applications. This repository contains all scripts, analysis notebooks, and deployment code needed to replicate the process."

beautifulsoup4 bokeh data-analysis data-science linkdin linkdindata machine-learning matplotlib numpy pandas plotly python requests seaborn sql web-scraping

Last synced: 10 Nov 2024

https://github.com/alan-oliveir/state-of-data-2022

Neste projeto faço a análise da distribuição das faixas salariais para os profissionais de nível júnior para o cargo de analista, cientista e engenheiro de dados. Os dados são da State of Data Brazil que é uma das maiores pesquisas sobre o panorama do mercado de trabalho brasileiro na área de dados.

data-analysis jupyter-notebook pandas-python seaborn-python

Last synced: 14 Nov 2024

https://github.com/lacerbi/vbmc

Variational Bayesian Monte Carlo (VBMC) algorithm for posterior and model inference in MATLAB (old location)

bayesian-inference data-analysis gaussian-processes machine-learning matlab variational-inference

Last synced: 24 Oct 2024

https://github.com/abeltavares/postql

Python library and command-line interface (CLI) tool for interacting with PostgreSQL databases, providing simplified database management, query execution, and result export functionalities.

cli command-line-interface data-analysis data-engineering data-export data-management data-processing data-visualization database database-administration database-tools etl oop postgres postgresql psycopg2 python sql sqlalchemy wrapper

Last synced: 31 Oct 2024

https://github.com/svetlanam/pycon-workshop

Pycon CZ workshop: Better data analyses and product recommendations with Instagram data

data-analysis data-science martinus matplotlib pandas pycon2016 pyconcz python scikit-learn workshop

Last synced: 13 Nov 2024

https://github.com/themihirmathur/qlik-intern-project

Qlik Analysis of Road Safety & Accident Patterns in India 📈 Analyzed & visualized road safety data for 20.85k+ accident cases with 9+ accident data patterns in India using Qlik 📉 Reduced inefficiencies by 25% by developing design of an avant-garde data tracking dashboard that monitored injuries.

data-analysis data-visualization presentation qlik qlik-cloud qlik-sense qlikview

Last synced: 12 Nov 2024

https://github.com/shubham200137/cyclistic-case-study

This repository contains a case study for Google's Data Analytics Professional Certificate, focusing on Cyclistic, a fictional bike sharing company in Chicago. The case study aims to drive growth by converting casual riders into members through a marketing strategy.

data-analysis data-visualization numpy-python pandas-python presentation-slides sql tableau

Last synced: 10 Nov 2024

https://github.com/koldlight/bluetab-data-science-2017

Repositorio para compartir material y publicar los retos

course data-analysis data-science exercises

Last synced: 09 Nov 2024

https://github.com/athari22/investigating-netflix-movies-and-guest-stars-in-the-office

Apply basic Python skills in Introduction to Python and Intermediate Python by processing and visualizing film and television data.

data-analysis data-science data-visualization loop loops matplotlib matplotlib-pyplot netflix numpy office pandas python

Last synced: 08 Nov 2024

https://github.com/vubacktracking/freecodecamp-data-analysis-with-python

5 Projects in Data Analysis With Python Course on Freecodecamp

data-analysis freecodecamp

Last synced: 10 Nov 2024

https://github.com/svetlanam/pt-data-analyse

Data analyse of the czech parcel tracking providers

data-analysis matplotlib pandas parcel-tracking python3 visualisation

Last synced: 13 Nov 2024

https://github.com/lucaspadoni/9-11-hijackers-social-network-analysis

Social Network Analysis focused on the events of 9/11/2001. By examining publicly available data through SNA techniques, we gain insights into the organizational structure of the terrorist network, offering valuable perspectives on key relationships and connections.

9-11 data-analysis data-analytics graph-theory hijacking network-analysis sna social-network-analysis terrorism terrorist-attacks

Last synced: 09 Nov 2024

https://github.com/parthds02/pizza_sales_sql

SQL project analyzing pizza sales data. Includes creating tables, executing queries, and solving basic to advanced analytical questions to derive insights from sales data.

analytics data-analysis data-science pizza-sales sql

Last synced: 12 Nov 2024

https://github.com/aymane-maghouti/sentiment-analysis-for-jumia-reviews-and-smartphone-price-prediction-system

The project focuses on customer sentiment analysis for Jumia, aiding informed online decisions. It collects and analyzes product comments to determine sentiments and implements a decision-making algorithm. Additionally, it includes product price prediction system using regression techniques.

beutifulsoup data-analysis data-cleaning data-collection data-preprocessing data-scraping data-visualization eda falsk machine-learning python web-application

Last synced: 16 Nov 2024

https://github.com/marknature/machine-learning-intern

Machine Learning tasks involving the Titanic Dataset and Breast Cancer Wisconsin (Diagnostic) dataset

data-analysis github jupiter-notebook machine-learning matplotlib numpy pandas python scikit-learn sklearn

Last synced: 13 Nov 2024