Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/sanveed-adnan/supermarket-sales-sql-project

SQL-based data analysis project on supermarket sales performance using SQLite and Power BI.

business-intelligence data-analysis data-science data-science-projects data-visualization power-bi sales-data sql sqlite

Last synced: 28 Dec 2024

https://github.com/virajbhutada/credit-card-transaction-analysis-sql

This project provides a structured database schema and SQL scripts to analyze credit card data. It includes tools for managing and analyzing transaction data, helping to identify spending patterns and trends. The project features visual schema diagrams and supporting documentation for easy understanding.

creditcard customer data-analysis data-cleaning data-modeling database database-management insights performance-optimization postgresql query-language schema-design schema-diagram scripts sql transactions trends

Last synced: 10 Jan 2025

https://github.com/virajbhutada/zomato-restaurants-analysis-excel-powerbi-sql-tableau

Uncover insights from Zomato's restaurant data using Excel, SQL, Power BI, and Tableau. Analyze location trends, opening patterns, ratings distribution, and price ranges for a comprehensive understanding of the dining landscape. Dive into this repository and unlock the flavors waiting to be explored. Your culinary adventure starts here!

data-analysis data-driven-decisions data-visualization excel excel-functions interactive-dashboards powerbi powerbi-desktop restaurants sql sql-queries tableau-public tableau-visualization zomato-data-analysis

Last synced: 10 Jan 2025

https://github.com/ljadhav25/django-data-analyzer

Django Data Analyzer is a web application built using the Django framework, designed to streamline data analysis tasks. Users can upload CSV files containing data for analysis. The application utilizes the powerful data manipulation capabilities of Python libraries like pandas and numpy to perform various analyses on the uploaded data.

data-analysis data-visualization django-application matplotlib numpy pandas python seaborn

Last synced: 06 Jan 2025

https://github.com/mafda/seattle_airbnb_data_analysis

This repository contains a comprehensive analysis of the Seattle Airbnb dataset, conducted using the CRISP-DM (Cross Industry Standard Process for Data Mining) methodology.

crisp-dm data-analysis data-science jupyter-notebook pandas-python seattle-data

Last synced: 16 Nov 2024

https://github.com/2013xile/sheethub

Organize, import, export, concatenate sheet files on web application.

data-analysis data-wrangler excel sheets

Last synced: 22 Dec 2024

https://github.com/shimazadeh/ft_linear_regression

Implementing a modular linear regression from scratch to predict the price of cars using a gradient descent algorithm.

data-analysis data-science hyperparameter-tuning linear-regression predictive-modeling

Last synced: 11 Jan 2025

https://github.com/surajwate/datalab

DataLab is a versatile toolkit designed to simplify data exploration, analysis, and visualization for data scientists.

data-analysis data-science python visualization

Last synced: 09 Nov 2024

https://github.com/grindelfp/logistic-regression-study

Example of logical regression data analysis and exercise on it.

data-analysis ipynb logistic-regression python

Last synced: 12 Jan 2025

https://github.com/suhail25/hotel-booking-analysis

Analyzed the cancelling of booking of hotels and summarized insights to the Hotel Manager to increase profit by 30%. Demonstrated data exploration, cleaning, analysis using Python and its libraries: pandas, seaborn, matplot. Documented the results in PDF report: reduced cancellation by 30% and releasing discounts for 10 days in a month.

data-analysis ipynb-notebook matplotlib pandas python seaborn

Last synced: 12 Jan 2025

https://github.com/capjamesg/personal-notebooks

Notebooks for personal experiments with machine learning and computer vision.

data-analysis machine-learning notebooks

Last synced: 15 Dec 2024

https://github.com/souravsuvarna/whatsapp-chat-analyzer-and-visualizer-web-application

The WhatsApp chat analyzer and visualizer uses NLP algorithms to analyze chat data, tracking usage patterns and presenting insights through visually appealing charts and graphs. It helps users understand communication patterns and behaviors on WhatsApp.

data-analysis data-science data-visualization python python3 streamlit

Last synced: 05 Jan 2025

https://github.com/josericodata/statisticsapp

Interactive statistics analysis app using Python and Streamlit. Perform key statistical tests, visualise distributions, and explore data with ease.

alpha-value chi-square-test confidence-intervals data-analysis dublin dublin-ireland europe hyphotesis-tests ireland normal-distribution null-hypothesis p-value portfolio python statistics streamlit t-test tech ubuntu z-test

Last synced: 06 Jan 2025

https://github.com/thanaraklee/exploring-and-analyzing-data-in-oracle-database

This project focuses on data analysis using SQL with Oracle Database 21c. It aims to familiarize with data management and data analysis using SQL commands and Oracle Database 21c.

data-analysis oracle-database sql sql-developer

Last synced: 25 Dec 2024

https://github.com/ysayaovong/portfolio

Explore my portfolio showcasing projects in data engineering, cybersecurity, software development, and cloud computing. Highlights include SQL tutorials, automation tools, cybersecurity assessments, and innovative Python applications. Dive into my work and see my expertise in action.

api-integration automation aws cloud-computing cybersecurity cybersecurity-risk-assessment data-analysis data-engineering data-science database-management etl linux project-management python scripting security-policy software-development sql system-optimization visualization

Last synced: 03 Dec 2024

https://github.com/leticiamilan/dashboard-analitico-de-vendas-globais

Dashboard Analítico de Vendas Globais - DSA - Desenvolvido com Power BI

dashboard dashboard-power-bi data-analysis power-bi powerbi

Last synced: 29 Nov 2024

https://github.com/kahleryasla/car-sales-data-analysis

This project retrieves and cleans car sales data from a Google Sheets document, and performs various analyses on the data, including calculating the average price of cars for each year and the standard deviation of the prices of each brand. The cleaned and modified data is then saved to an Excel file.

data data-analysis data-cleaning data-manipulation excel google-sheets pandas pandas-python

Last synced: 13 Jan 2025

https://github.com/lotfiferaga/amazon-alexa-reviews-sentiment-analysis

Amazon Alexa, developed by Amazon, allows users to interact with technology through voice commands. Analyzing user sentiments about Alexa, with over 40 million users worldwide, is an intriguing data project.

classification data-analysis python sentiment-analysis

Last synced: 09 Jan 2025

https://github.com/brunomontezano/sleep-quality-cognition

💤 Analysis of the paper "Associations between general sleep quality and measures of functioning and cognition in subjects recently diagnosed with bipolar disorder".

bipolar-disorder cognition data-analysis sleep-analysis sleep-research

Last synced: 11 Jan 2025

https://github.com/brunomontezano/digital-interventions-for-depression

📱 "Digital interventions for depressive symptoms: a randomized clinical trial" code

academia clinical-trials cognitive-behavioral-therapy data-analysis digital-health open-science smartphone-app

Last synced: 11 Jan 2025

https://github.com/mrsamsonn/monolithic-polylithic-crystal-segmentation

A grid segmentation algorithm for clustering crystal structures using diffraction patterns. Useful in material science and nanotechnology, this code enables detailed analysis of crystals for research and industrial applications.

clustering crystal-structure crystallography data-analysis diffraction-patterns grid-segmentation image-processing k-means machine-learning matertial-science nanotechnology python research-project research-tools scientific-computing

Last synced: 21 Dec 2024

https://github.com/yash22222/web-scraping-for-data-analysis-predictive-model-on-customer-data

Utilized web scraping for customer feedback at Air India, conducting robust data analysis, and applying machine learning for predictive modeling. Drove data-driven decisions, enhancing services, and elevating customer satisfaction. Expertise in web scraping, analysis, and predictive modeling for actionable insights.

data-analysis data-preprocessing data-science data-visualization exploratory-data-analysis machine-learning powerbi random-forest-classifier sentiment-analysis tableau web-scraping

Last synced: 05 Jan 2025

https://github.com/neerajcodes888/data-science

This repository is a hub for data science enthusiasts, offering a diverse collection of projects, notebooks, and resources covering topics such as data analysis, machine learning, deep learning, and generative AI. Explore innovative ideas, contribute to cutting-edge research, and enhance your skills in the dynamic field of data science

data-analysis data-science data-visualization deep-learning deep-learning-algorithms eda genai jupyter-notebook machine-learning machine-learning-algorithms openai-api pandas plotting python3 sklearn-library streamlit

Last synced: 04 Dec 2024

https://github.com/sreekar0101/bank-financial-loan-performance-trend-analysis

About This project analyzes the performance trends of financial loans using SQL for data extraction and Tableau for visualization. The goal was to perform exploratory data analysis (EDA) to understand key metrics like loan applications, funded amounts, interest rates, and debt-to-income ratios using sql and tableau for visualization

data-analysis data-visualization sql tableau

Last synced: 06 Dec 2024

https://github.com/sreekar0101/electric-vehicle-market-growth-and-incentive-impact-analysis-dashboard

About This project involves the development of a comprehensive Tableau dashboard to analyze the growth and market dynamics of electric vehicles (EVs). The dashboard reveals key insights, including a 20% increase in EV adoption over five years, the dominance of Battery Electric Vehicles (BEVs) which make up 60% of the market

data-analysis data-visualization tableau-desktop

Last synced: 06 Dec 2024

https://github.com/matteofasulo/cdc-finf

Project of fundamentals of Computer Science

data-analysis data-science data-visualization numpy pandas python python3

Last synced: 19 Nov 2024

https://github.com/mleidel/sqlcel

Python GUI to run SQL select on Spreadsheets, CSV, and Sqlite database tables

csv-export csv-import data data-analysis data-science datascience excel-export excel-import pandas python3 sql sqlalchemy sqlite3

Last synced: 20 Dec 2024

https://github.com/muneeb1030/webscrapper_altnews

The project utilizes a combination of Python, Scrapy, and Selenium to navigate through the dynamic content of AltNews.in and collect valuable information for analysis and verification.

data-analysis data-collection python3 scrapy scrapy-spider selenium selenium-python

Last synced: 11 Jan 2025

https://github.com/anjasfedo/data-analysis

Repo to Explore Data Analysis

data-analysis numpy

Last synced: 16 Nov 2024

https://github.com/jabhij/fbi_nics-firearm-background-checks

This project is a try to showcase the use of guns across the US.

data-analysis data-analytics data-science data-visualization tableau

Last synced: 16 Nov 2024

https://github.com/abeltavares/hotel_performance_analysis

A Power BI project that analyzes the performance of a hotel, including revenue, expenses, customer data, hospitality metrics and financial ratios.

business-intelligence data-analysis expenses financial-analysis hospitality-industry power-bi revenue

Last synced: 05 Jan 2025

https://github.com/jabhij/eda_experiments

In this repo I'll use different types of datasets to explore and implement various Exploratory Data Analysis (EDA) approaches.

ames-housing analysis battery-life blackfriday-analysis data-analysis data-science data-visualization eda matplotlib-pyplot numpy pandas python seaborn visualization zomato-data-analysis

Last synced: 16 Nov 2024

https://github.com/jabhij/tableau_dashboards

Consists brief info about all of my tableau dashboards, insights that I got out of them, & the outcomes that I got after analyzing those visualizations.

data-analysis data-analytics data-science data-visualization tableau visualisation

Last synced: 16 Nov 2024

https://github.com/carlosvinimsouza/jupyter-notebook-basic

Armazenado todos os trabalhos referentes a Ciência de Dados.

data-analysis data-science programas-jupyter-notebook python

Last synced: 11 Jan 2025

https://github.com/aalkiyumi/project-4-big-data-analysis-with-pyspark-on-weather-data

In this project, I analyzed weather data from the NCEI Global Surface Summary of Day dataset using PySpark in Jupyter Notebook. Tasks included data cleaning, statistical analysis, and forecasting for temperature, wind speed, precipitation, and extreme weather events. The project also predicts future weather patterns for Cincinnati and Florida.

big-data-analytics cs5165 data-analysis data-cleaning data-engineering data-science introduction-to-cloud-computing jupyter-notebook machine-learning precipitation-analysis predictive-modeling pyspark statistical-analysis temperature-forecasting time-series-forecasting uc uc2026 university-of-cincinnati wind-speed-data

Last synced: 23 Nov 2024

https://github.com/subhojit45/python3-iphones-x-flipkart-sales-analysis

A simple six questions and their insights derived from iphone sales on Flipkart dataset.

data-analysis jupyter-notebook python3 visual-studio-code visualization

Last synced: 23 Nov 2024

https://github.com/virajbhutada/article-recommendation-system

This project aims to redefine content discovery by delivering personalized article recommendations tailored to individual user preferences. We use advanced machine learning techniques like PCA and K-means clustering to analyze user behavior and article characteristics to provide highly accurate recommendations.

anaconda article-recommendation clustering-algorithm data-analysis data-science keras-tensorflow machine-learning machine-learning-algorithms ml-models numpy pandas plotly python scikit-learn scipy

Last synced: 06 Dec 2024

https://github.com/shelton-beep/trading-algorithm

A simple trading algorithm for SPY ETF using a moving average crossover strategy. This project analyzes SPY weekly price data, implements a buy/sell algorithm, and tracks performance metrics to evaluate profitability and risk. Ideal for learning algorithmic trading basics and financial data analysis.

data-analysis financial-analysis investment-strategy jupyter-notebook pandas python quantitative-finance technical-analysis time-series-analysis trading-strategies

Last synced: 23 Nov 2024

https://github.com/mainak-97/weather-data-analysis-using-python

A comprehensive analysis of time-series weather data using Python and Pandas, focusing on data exploration, cleaning, and uncovering insights.

data-analysis jupyter-notebook pandas pandas-dataframe python python3 time-series-analysis

Last synced: 23 Nov 2024

https://github.com/beyzabasarir/northwind-traders-analysis

Northwind dataset analysis using PostgreSQL, Python, and Power BI. Focused on sales, customers, shipping, and performance insights.

dashboard data-analysis data-visualization jupyter-notebook matplotlib numpy pandas postgresql powerbi python seaborn

Last synced: 23 Nov 2024

https://github.com/swat1563/recommendation-system

This repository features a recommendation system and analytics engine using datasets on users, organizations, contents, contacts, events, and recommendations. It includes data preprocessing, building a recommendation system, and creating visual reports with Power BI.

analytics data-analysis data-visualization engine kaggle numpy pandas powerbi powerbi-dashboards powerbi-desktop powerbi-reports python recommendation-engine recommendation-system recommender-systems scikit-learn scipy

Last synced: 06 Dec 2024

https://github.com/mugambi645/exploring-ebay-car-sales-data

Exploring ebay car sales dataset

car-sales data-analysis numpy pandas

Last synced: 30 Nov 2024

https://github.com/rafgpereira/obmep-analise

Código que analisa a retrospectiva das premiações da Obmep em determinada localidade e escola

data-analysis excel pandas python

Last synced: 29 Nov 2024

https://github.com/aneeshmurali-n/project-ml-data-preprocessing

The main objective of this project is to design and implement a robust data preprocessing system that addresses common challenges such as missing values, outliers, inconsistent formatting, and noise. By performing effective data preprocessing, the project aims to enhance the quality, reliability, and usefulness of the data for machine learning.

data-analysis data-cleaning data-encoding data-exploration feature-scaling label-encoding matplotlib minmaxscaler numpy one-hot-encoding outlier-detection pandas standardscaler

Last synced: 09 Jan 2025

https://github.com/rajkumargara/bike_rental_data_analysis

Chicago bike rental data analysis for business insights using R programming

data-analysis data-visualization data-wrangling large-dataset machine-learning-algorithms

Last synced: 13 Jan 2025

https://github.com/thinogueiras/jornada-python

Jornada Python - Hashtag Programação.

data-analysis data-science inteligencia-artificial python rpa

Last synced: 16 Nov 2024

https://github.com/serlo/data-pipeline-interactive-exercises

processing pipeline for exercise dashboards

data-analysis serlo

Last synced: 09 Jan 2025

https://github.com/robthepcguy/ahk-mouse-heatmap

An AutoHotkey script that records left, right, and middle mouse clicks, logging the date, time, and x and y coordinates. It features automatic GUI updates and generates a visual heatmap via a Python script, accessible from the system tray. This tool is ideal for analyzing user interaction and creating detailed mouse activity maps.

autohotkey-script click-counter click-log click-map data-analysis data-visualization heatmap heatmap-visualization mouse-events mouse-tracking python

Last synced: 14 Dec 2024

https://github.com/bhavinpatel4199/artificial-intelligence---ai-for-decision-making

Artificial Intelligence for Decision Making is a collection of projects focused on applying AI and machine learning techniques to solve decision-making challenges. It includes projects on wine quality prediction, Cassandra data modeling, and text classification, showcasing a range of data science and machine learning applications.

artificial-intelligence cassandra-cql data-analysis data-engineering data-preprocessing data-structures decision-making deep-learning feature-selection machine-learning-algorithms sentiment-analysis text-classification

Last synced: 13 Jan 2025

https://github.com/victorlcastro-dsa/pbl-datacamp

This repository features projects from DataCamp's Project-Based Learning (PBL) courses, showcasing practical applications of data analysis, machine learning, and visualization. Explore real-world datasets and interactive results that highlight the skills gained through hands-on learning.

data-analysis data-science data-visualization datacamp-projects hypothesis-testing machine-learning project-based-learning

Last synced: 13 Jan 2025

https://github.com/sharmas1ddharth/data-analysis-with-python

Freecodecamp's Data Analysis with Python Projects Code

data-analysis data-analysis-with-python freecodecamp-project

Last synced: 11 Jan 2025

https://github.com/sharmas1ddharth/mode_of_transport_analysis

This project requires you to understand what mode of transport employees prefers to commute to their office. The data includes employee information about their mode of transport as well as their personal and professional details like age, salary, and work exp. We need to predict whether or not an employee will use private transport. Also, which variables are a significant predictor behind this decision.

data-analysis r-programming

Last synced: 11 Jan 2025

https://github.com/mikma03/datascience_python_datacamp

DataScience with Python. Code and examples. Python libraries, including pandas, NumPy, Matplotlib, and many more.

data-analysis data-science datacamp datascience numpy pandas python

Last synced: 09 Jan 2025

https://github.com/fer-aguirre/covid19-venezuela

Análisis de datos de muertes por covid-19 en Venezuela

covid-19 data-analysis dataviz line-chart

Last synced: 23 Dec 2024

https://github.com/fer-aguirre/cookiecutter-data-analysis-extensive

A cookiecutter template for data analysis projects using Python.

cookiecutter data-analysis project-template python

Last synced: 23 Dec 2024

https://github.com/elzasimoes/challenge-xplab

Challenge for data analysis course.

data-analysis data-science jupyter-notebook python

Last synced: 05 Jan 2025

https://github.com/amyanchen/sf-airbnb

Exploratory Data Analysis of San Francisco Airbnb's

data-analysis data-science data-visualization r rmarkdown statistics

Last synced: 29 Dec 2024

https://github.com/mituskillologies/aiml-dypiemr-sep24

Programs conducted at DYPIEMR, Pune in training on AIML during September 2024.

artificial-intelligence data-analysis data-science machine-learning matplotlib neural-network numpy pandas python3

Last synced: 19 Dec 2024

https://github.com/leandrocollares/nyc-film-permits

NYC film permits: an exploratory data analysis

data-analysis data-visualization pandas plotly

Last synced: 08 Jan 2025

https://github.com/sanchittechnogeek/overscripted-analysis

Geolocation and user language extraction analysis from Mozilla Overscripted dataset

analysis data data-analysis mozilla

Last synced: 29 Nov 2024

https://github.com/yeonjaee/data-analytics

converts raw data into actionable insights

data-analysis text-mining

Last synced: 08 Jan 2025

https://github.com/sanchittechnogeek/rental-data-visualization_python

Statistics and visualization of rental data with python

data-analysis data-science data-visualization statistics

Last synced: 29 Nov 2024

https://github.com/harmanveer-2546/motor-vehicle-accidents-in-india

As per the report, a total of 4,61,312 road accidents have been reported by States and Union Territories (UTs) during the calendar year 2022, which claimed 1,68,491 lives and caused injuries to 4,43,366 persons.

accidents accidents-analysis darkgrid data-analysis eda exploratory-data-analysis indian-roads inline matplotlib motor-vehicles numpy pandas review seaborn visualization

Last synced: 19 Dec 2024

https://github.com/kiran-kumar-k3/sales-performance-dashboard

The Sales Performance Dashboard is an interactive Python-based web application that visualizes and analyzes sales data, providing actionable insights through dynamic charts and metrics.

data-analysis python streamlit

Last synced: 22 Dec 2024

https://github.com/pipe199x/end-to-end-prediction-california

End-to-end prediction project using various technologies to predict housing prices in California.

california-housing data-analysis machine-learning python

Last synced: 11 Jan 2025

https://github.com/danpoynor/data-analysis-of-video-game-sales-2000-2015

This analysis reviews sales for the top 100 video games from the years 2000-2015 to gather insights. Within the notebook I use Python’s Pandas, Matplotlib, and Seaborn libraries to interact with the data and create graphs.

data-analysis jupyter-notebook matplotlib pandas-dataframe python3 seaborn-plots video-game-sales

Last synced: 16 Nov 2024

https://github.com/danpoynor/data-analysis-spotify-songs-2010-2019

Spotify data analysis for songs between 2010 and 2019 using Jupyter Notebooks including pandas and Seaborn plots.

data-analysis jupyter-notebook matplotlib pandas-dataframe python3 seaborn-plots spotify

Last synced: 16 Nov 2024

https://github.com/danpoynor/python-number-guessing-game-with-stats

A number guessing game written in Python 3 that presents median, mode, and mean statistics

console-game data-analysis number-guessing-game python3 statistics

Last synced: 16 Nov 2024

https://github.com/danpoynor/pet-shelter-data-analysis-notebook

Demonstration of skills analyzing data from a pet shelter. The CSV data contains tables detailing the incoming and outgoing animals and I use my knowledge of Pandas to gather and present the requested information.

csv data-analysis data-cleaning data-science jupyter-notebook matplotlib numpy pandas pet-shelter tabular-data

Last synced: 16 Nov 2024

https://github.com/jonek/pv-city-mastr

Extract and analyze data about photovoltaic systems in Germany

data-analysis germany jupyter-notebook pandas photovolatic-power photovoltaic

Last synced: 13 Jan 2025

https://github.com/yalai92/alfalfa_imp_exp_analysis

This repository covers data cleaning, analysis, and visualization of global alfalfa and pellet imports, focusing on trends from 2003 to 2023. It also includes a predictive analysis of global alfalfa demand for 2024-2029, using data science techniques to provide insights for stakeholders in the alfalfa industry.

data-analysis data-cleaning data-visualization matplotlib numpy pandas python sckiit-learn tableau

Last synced: 19 Dec 2024

https://github.com/alinababer/data-science-and-insight-agent-rag-llama3-lava-llm

Data-Science-and-Insight-Agent-RAG-LLama3-Lava-LLM-Django-WebApplication is an advanced AI-driven chatbot designed to assist in data science, document analysis, and image interpretation. This repository contain the Datascience Agent of this project.

artificial-neural-networks classifcation data-analysis data-engineering data-visualization datascience large-language-models llama2 lstm machine-learning python random-forest regression

Last synced: 22 Dec 2024