Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Projects in Awesome Lists tagged with data-analytics

A curated list of projects in awesome lists tagged with data-analytics .

https://github.com/gher-uliege/cost-eumetsat-training

Material, data and presentations for the COST-EUMETSAT training school

bash data-analytics ocean-modelling ocean-sciences oceanography python remote-sensing satellite training

Last synced: 11 Dec 2024

https://github.com/rawsashimi1604/jobextract

Scrapes LinkedIn data. Conducts sentiment analysis on what traits and qualifications employers are looking for.

data data-analysis data-analytics data-cleaning linkedin mvc python webscraper

Last synced: 27 Dec 2024

https://github.com/aymane-maghouti/real-time-streaming-kafka-debezium-spark-streaming

This project demonstrates real-time data streaming and processing architecture using Kafka, Spark Streaming, and Debezium for capturing CDC (Change Data Capture) events. The pipeline collects transaction data, processes it in real time, and updates a dashboard to display real-time analytics for smartphone data.

change-data-capture dashboard data-analytics data-processing debezium docker java kafka mysql-database notifications postgresql-database python reactjs real-time-data-pipeline real-time-systems spark-streaming spring-boot web-development

Last synced: 29 Oct 2024

https://github.com/coumbacoulibaly/adventureworkscycles

Repository for Adventure Works Sample Database Analysis

adventureworks data-analysis data-analytics mssql-database mssqlserver sql ssms

Last synced: 17 Nov 2024

https://github.com/googlecloudplatform/terraform-google-dataplex-auto-data-quality

Move data between environments using Dataplex

cft-terraform data-analytics

Last synced: 07 Oct 2024

https://github.com/emso-exe/investidores_do_tesouro_direto

Projeto de análise de perfil de investidores do Tesouro Direto com base nos dados do site tesourotransparente.gov.br.

analise-de-dados ciencia-de-dados data-analytics data-science dataanalytics datascience powerbi python python-3 python3 tesouro-direto tesourodireto

Last synced: 15 Nov 2024

https://github.com/emso-exe/churn_clientes_de_banco

Projeto de análise de churn, utilizando machine learning na classificação de dados de clientes que poderão ou não efetuar o encerramento de conta bancária.

analise-de-dados ciencia-de-dados data-analytics data-science dataanalytics kaggle kaggle-dataset machine-learning machinelearning python python-3 python3

Last synced: 15 Nov 2024

https://github.com/emso-exe/comercio_eletronico_brasileiro

Projeto de análise de dados do comércio eletrônico brasileiro disponibilizado pela Olist via plataforma Kaggle.

analise-de-dados ciencia-de-dados data-analytics data-science datascience e-commerce postgres postgresql pyspark python python-3 python3 spark spark-sql sql

Last synced: 15 Nov 2024

https://github.com/tushar2704/superstore-sales-dashboard-with-streamlit

Superstore Sales with Streamlit is a data visualization and analysis project that uses the Streamlit framework to create an interactive web application for exploring and analyzing sales data from a superstore. This project aims to provide an easy-to-use interface for users to gain insights into sales trends, Sales performance, product performance,

analytics dashboard data-analytics data-science data-science-projects python streamlit streamlit-tushar2704 trend-analysis tushar2704

Last synced: 27 Dec 2024

https://github.com/quantumudit/analyzing-pokemons

This project focuses on scraping data related to Pokémons from a complete Pokédex; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.

data-analytics data-visualization jupyter-notebook power-bi python webscraping

Last synced: 26 Dec 2024

https://github.com/ivanildobarauna-dev/api-to-dataframe

Python library that simplifies obtaining data from API endpoints by converting them directly into Pandas DataFrames. This library offers robust features, including retry strategies for failed requests.

data-analysis data-analytics data-engineering library pypi-packages python

Last synced: 19 Dec 2024

https://github.com/ivanildobarauna-dev/data-consumer-api

ETL Process for Currency Quotes Data" project is a complete solution dedicated to extracting, transforming and loading (ETL) currency quote data. This project uses several advanced techniques and architectures to ensure the efficiency and robustness of the ETL process.

business-intelligence data-analysis data-analytics data-engineering data-pipeline data-visualization etl-pipeline python

Last synced: 19 Dec 2024

https://github.com/mgobeaalcoba/full_pandas_python_data

Un repo donde iré cargando todos mis trabajos y proyectos del curso de pandas en Platzi.

data-analytics deepnote google-colab jupyter-notebook pandas python

Last synced: 19 Nov 2024

https://github.com/rayyan9477/calorie-burnage-exploratory-data-analysis-

Calorie Burnage is the measure of calories burned during physical activity or exercise, crucial for weight management and fitness goals. This project focuses on analyzing a dataset that includes information on duration, pulse rates, and calories burned during exercise sessions.

data-analytics data-science data-visualization exploratory-data-analysis linear-regression r-language r-programming

Last synced: 11 Nov 2024

https://github.com/ivanildobarauna-dev/currency-quote

Complete solution for extracting currency pair quotes data with comprehensive testing, parameter validation, flexible configuration management, Hexagonal Architecture, CI/CD pipelines, code quality tools, and detailed documentation.

data-analysis data-analytics data-engineering library pypi-packages python

Last synced: 19 Dec 2024

https://github.com/tushar2704/sql-challenges

Collection of SQL challenges designed to demonstrate my skills and expertise in database management, data analysis, and data manipulation using SQL. This repository serves as a showcase of my abilities to handle complex SQL queries, optimize database performance, and solve real-world data problems.

data-analytics data-mining data-science datamanipulation mysql postgresql sql

Last synced: 27 Dec 2024

https://github.com/emso-exe/falsificacao_de_cedulas_banco_central_do_brasil

Projeto de análise de falsificação de cédulas de real (R$) com base nos dados do site dadosabertos.bcb.gov.br.

analise-de-dados ciencia-de-dados dashboard data-analytics data-science datascience powerbi python python-3 python3

Last synced: 15 Nov 2024

https://github.com/emso-exe/anuncios_em_redes_sociais

Projeto de machine learning aplicando regressão logistica nos dados de clientes que tiveram alguma interação com anúncios de redes sociais, se efetuaram ou não uma compra.

analise-de-dados ciencia-de-dados data-analytics data-science dataanalytics datascience kaggle kaggle-dataset logistic-regression machine-learning machinelearning python python-3 python3 regressao-logistica

Last synced: 15 Nov 2024

https://github.com/emso-exe/compra_de_carro

Projeto de machine learning aplicando regressão linear nos dados de compras de carros para criação de um modelo preditivo de valores para novas aquisições de veículos pelos clientes.

analise-de-dados ciencia-de-dados data-analytics data-science dataanalytics datascience kaggle kaggle-dataset machine-learning machinelearning python python-3 python3 regressao-linear regression-linear

Last synced: 15 Nov 2024

https://github.com/tirkarthi/redbus-scraper

A simple redbus scraper to analyse ticket trends

clojure data-analytics scraper

Last synced: 05 Jan 2025

https://github.com/yash22222/real-estate-price-prediction-using-linear-regression

This project employs linear regression to predict property prices based on key features. Through thorough data cleaning, preprocessing, and feature engineering, the model is fine-tuned for accuracy. With insights from exploratory data analysis, the model offers reliable estimates, aiding stakeholders in informed decision-making.

bangalore-house-price-prediction csv data-analytics datasets house-price-analysis house-price-prediction linear-regression machine-learning python real-estate-price-prediction

Last synced: 05 Jan 2025

https://github.com/yash22222/datathon-football-data-dilemma

Integrating FPL, Transfermarkt, and Understat data, our strategy optimizes squad selection. This approach, leveraging market values and detailed player statistics, ensures both financial prudence and on-field performance. Strategic acquisitions align with set rules and budget constraints, emphasizing value-driven choices for a robust squad

business-analytics comma-separated-values data-analytics fpl-analysis microsoft-power-bi no-code python transfermarkt understat web-scrapping

Last synced: 05 Jan 2025

https://github.com/noeyislearning/sql-coding-challenges

SQL Coding Challenges, this repository is a compilation of my journey on enhancing my SQL skills; coding challenges are from LeetCode, HackerRank, and other sources.

coding-challenge data-analytics data-science hackerrank leetcode mysql solutions

Last synced: 06 Dec 2024

https://github.com/demon-2-angel/post-partum-analysis

Postpartum Analysis is a comprehensive assessment conducted after childbirth to evaluate the mother's physical and emotional well-being. It covers factors like recovery progress, mental health, and newborn care, ensuring a holistic understanding of the postpartum experience for optimal support.

data-analytics medical-analysis post-partum-depression tableau-dashboards

Last synced: 14 Dec 2024

https://github.com/bryan-hoang/cmpe-351-advanced-data-analytics

My code written for an Advanced Data Analytics course at Queen's Unversity (CMPE-351).

data-analytics

Last synced: 06 Dec 2024

https://github.com/dennyglee/open-covid19-public

A collaboration between SCRI and Databricks on the analysis of open COVID-19 datasets.

covid-19 data data-analytics data-engineering data-science nlp

Last synced: 30 Nov 2024

https://github.com/cosmoduende/r-uber-trips-analyisis

Explore your activity on Uber with R: How to analyze and visualize your personal data history. Find out how you consume the Uber App using a copy of your data.

analisis-de-data data-analysis data-analytics data-science data-visualisation data-visualization data-viz eda flexdashboard ggmap ggplot2 mobility-as-a-service qmplot r-language r-programming ridesharing uber uber-data visualizacion-de-datos

Last synced: 27 Dec 2024

https://github.com/smrfeld/house-of-reps

Python package to model apportionment in house of representatives

apportionment congress data-analytics data-science government plotly python representation voting

Last synced: 24 Dec 2024

https://github.com/virajbhutada/global-universities-success-analysis-powerbi-sql-excel

This capstone project conducts in-depth analysis using Power BI, SQL, and Excel to explore complex dynamics shaping global university success. Integrating data from diverse ranking systems and criteria, our aim is to unravel the factors influencing universities worldwide.

capstone capstoneproject data-analysis data-analytics data-insights data-science data-science-projects data-visualization excel exploratory-data-analysis mece mysql powerbi powerpoint sql

Last synced: 10 Jan 2025

https://github.com/sumanthvrao/moviebuddy

Movie recommendation system to find common movie interests among a group of people.

collaborative-filtering content-based-filtering data-analytics movie python surprise-library

Last synced: 08 Jan 2025

https://github.com/leocornus/wp-visualdata

Data analyse and data visualization arsenal for WordPress

data-analytics data-visualization wordpress-development wordpress-php-library wordpress-plugin

Last synced: 08 Jan 2025

https://github.com/vimaltiwari2612/chrome-extension-charts

This extension can be used to Create dynamic charts by giving comma separated inputs for X and Y Axis. Mostly helpful for Data visualization and Analytics.

anaylsis bar chart chartsjs chrome-extension chrome-extensions data-analytics data-science data-visualization doughnut graph graphics html javascript javascript-library line-charts pie-chart tool

Last synced: 02 Jan 2025

https://github.com/alexandregazagnes/rica-analysis

This repository contains the code to download, analyse, and modelize the RICA dataset from the french ministry of agriculture.

analysis argiculture business data data-analysis data-analytics food python

Last synced: 03 Jan 2025

https://github.com/jaynil1611/google-play-store-app-launch-study

The project analyzes the insights provided by the client using data visualization in Python and GUI in Tkinter.

data-analytics google-playstore-dataset mysql-database python tkinter-gui

Last synced: 23 Nov 2024

https://github.com/diusmontenegro/pyproject-linear-regression

This code implements a simple linear regression model to generate a dataset and fit a line of best fit. The model calculates R^2 score to evaluate performance. The dataset is plotted using matplotlib library.

data-analytics data-science linear-regression

Last synced: 12 Nov 2024

https://github.com/javianng/housing-pricing-forecasting

Housing-Price-Time-Series-Forecasting: Explore a robust model to forecast Singapore's HDB housing prices using transaction data and geographic insights. Features data scraping, cleaning, feature engineering, and models like XGBoost, LSTM. Check serialized notebooks for workflow details.

data-analytics data-science deep-learning python

Last synced: 09 Jan 2025

https://github.com/marcelohfonseca/moneychart

Este é um projeto pessoal que nasceu privado, mas se tornou aberto para que todas as pessoas interessadas em investimentos e análise de dados possam testar e contribuir com melhorias no código fonte e nas visualizações de dados.

business-intelligence data-analytics data-visualization investing power-bi python stocks stocks-analysis

Last synced: 13 Nov 2024

https://github.com/mohagungnursalim/da-covid-data-cleansing

Identifying and correcting errors, inconsistencies, and inaccuracies within datasets related to the COVID-19 pandemic. This is essential for obtaining reliable and actionable insights from the data.

data-analytics data-visualization pandas-dataframe

Last synced: 03 Dec 2024

https://github.com/dhrupad17/ibm-data-analyst-professional-certificate

Prepare for a career as a data analyst. Build job-ready skills – and must-have AI skills – for an in-demand career. Earn a credential from IBM. No prior experience required.

assignment-solutions coursera data-analytics data-science data-visualization excel ibm pandas professional-certificate professional-certificates python quiz updated-2024

Last synced: 15 Nov 2024

https://github.com/rachit901109/datahack_1_go-jo

The presented solution during DataHack2.0 leverages data analytics and Streamlit to address challenges in analyzing India's booming startup ecosystem from 2018 to 2021. By integrating openly available news and datasets, the interactive dashboard delivers data-driven insights.

case-study data-analytics data-visualization

Last synced: 18 Nov 2024

https://github.com/josgard94/decision-regions-with-knn-algorithm

Implementation of the knn algorithm to generate decision regions using the Iris dataset.

data-analytics data-processing data-science decision-regions iris-dataset knn-algorithm machine-learning

Last synced: 21 Nov 2024

https://github.com/ricardomilhazes/mddbs-northwind

Building a multidimensional database system for NorthWind and analysing it's data

business-intelligence data-analytics data-warehouse mysql powerbi sql

Last synced: 07 Dec 2024

https://github.com/spandan114/luminai-data-analyst

LUMIN: Your data analysis companion that turns natural language questions into powerful insights through AI-driven visualizations and clear explanations.

ai-agents ai-data-analysis ai-tools chatgpt data-analytics fastapi groq langchain llm react sql typescript

Last synced: 25 Nov 2024

https://github.com/shaadclt/store-data-analysis-excel

This project provides a guide for analyzing store data using Microsoft Excel. It demonstrates how to utilize various Excel features and functions to gain insights into sales, trends, and other key metrics related to store performance.

data-analytics data-visualization excel pivot-tables

Last synced: 07 Dec 2024

https://github.com/shaadclt/employee-attrition-dashboard-powerbi

This project provides an interactive employee attrition dashboard created using Power BI. It aims to visualize and analyze employee attrition data to gain insights into factors contributing to employee turnover and develop strategies for retention.

business-intelligence data-analytics data-visualization powerbi

Last synced: 07 Dec 2024

https://github.com/jen-uis/loan-status-prediction

This repository contains project materials for the Winter STAT 206 class, University of California, Riverside, A. Gary Anderson School of Management.

data data-analysis data-analytics data-cleaning data-visualization descriptive-analytics julia julia-language jupyter-notebook predictive-analytics predictive-modeling team-collaboration

Last synced: 21 Nov 2024

https://github.com/deepak5256/data_visualisation

A Maven-based Java project for creating interactive and customizable data visualizations.

charts data-analytics data-visualization interactive-visualization java jfreechart-web-app maven

Last synced: 22 Nov 2024

https://github.com/macagua/entrenamiento.data_scientist_python

Repositorio de manuales y recursos del entrenamiento "Data Scientist en Python" realizado por Leonardo J. Caballero G.

data-analytics data-scientist data-visualization numpy pandas-dataframe python37 streamlit

Last synced: 09 Jan 2025

https://github.com/mdaltamashalam/smart-shopping-assistant---data-analytics

The Smart Shopping Assistant merges machine learning with KNIME for data analytics, offering personalized recommendations and real-time insights to optimize retail strategies and enhance the shopping experience for customers.

data-analytics knimes machine-learning python python-library

Last synced: 08 Dec 2024

https://github.com/mgautam98/ab-testing-for-ecommerce

Analysed results of an A/B test run by an e-commerce website. Using Hypothesis testing and Regression approach.

data-analytics machine-learning pandas

Last synced: 08 Dec 2024

https://github.com/vvhacker007/tsf_internship

✨This repository contains all the tasks📑 that were assigned to me👨🏻‍💻 in The Sparks Foundation Internship.✨

analysis data-analytics data-science dataset internship jupyter-notebook machine-learning python tasks

Last synced: 09 Dec 2024

https://github.com/mustika-putri-m/analysis-of-sales-transactions-in-an-online-shop---london

Crucial Question 1. How was the sales trend over the months? 2. What are the most frequently purchased products? 3. How many products does the customer purchase in each transaction? 4. What are the most profitable segment customers? 5. Based on your findings, what strategy could you recommend to the business to gain more profit?

data data-analysis-python data-analytics data-visualization ecommerce

Last synced: 04 Dec 2024

https://github.com/shridhar1504/sales-forecasting-datascience-project

Develop a data science project using historical sales data to build a regression model that accurately predicts future sales. Preprocess the dataset, conduct exploratory analysis, select relevant features, and employ regression algorithms for model development. Evaluate model performance, optimize hyperparameters, and provide actionable insights.

data-analytics data-cleaning data-science data-testing data-visualization forecasting-models machin model-evaluation model-fitting prediction predictive-modeling python3 regression-algorithms salesforecast sklearn-library supervised-learning

Last synced: 23 Dec 2024

https://github.com/wgierke/distributed_data_analytics

Solutions for the hands-on sessions of the course "Distributed Data Analytics" at Hasso-Plattner-Institute using Akka and Spark.

akka data-analytics distributed inclusion-dependency spark

Last synced: 16 Dec 2024

https://github.com/camara94/tensorflow

TensorFlow est une bibliothèque de Machine Learning, il s’agit d’une boîte à outils permettant de résoudre des problèmes mathématiques extrêmement complexes avec aisance. Elle permet aux chercheurs de développer des architectures d’apprentissage expérimentales et de les transformer en logiciels. Et open source crée par google.

data-analytics data-science ia machine-learning tensorflow

Last synced: 23 Dec 2024

https://github.com/narenkhatwani/arkouda-projects

This repository contains the source codes of the projects done using Arkouda (a software package that allows a user to interactively issue massive parallel computations on distributed data using functions and syntax that mimic NumPy, the underlying computational library used in most Python data science workflows.)

arkouda data-analysis data-analytics data-science high-performance high-performance-computing highperformancecomputing numpy pandas parallel-computing parallel-processing parallelization python

Last synced: 12 Dec 2024

https://github.com/soumyaco/rare-species-analysis-across-uk

A Data Analysis on over 20M data on different species in UK and their location

data-analytics data-science folium-maps pandas python3

Last synced: 12 Dec 2024

https://github.com/bretsw/beds

Bookdown project for an open education resource (OER) book: Becoming Educational Data Scientists

analytics data-analysis data-analytics data-science

Last synced: 13 Dec 2024

https://github.com/virajbhutada/googlelookerstudio-ordersanalytics

This project is dedicated to the creation and management of an interactive and insightful dashboard developed using Google Looker Studio. The dashboard is designed to provide a comprehensive overview of order-related data, facilitating data analysis and decision-making.

analytics-dashboard business-intelligence data-analytics data-insights google-looker-studio orders

Last synced: 10 Jan 2025

https://github.com/rayyan9477/calorie-burnage-exploratory-data-analysis

Calorie Burnage is the measure of calories burned during physical activity or exercise, crucial for weight management and fitness goals. This project focuses on analyzing a dataset that includes information on duration, pulse rates, and calories burned during exercise sessions.

data-analytics data-science data-visualization exploratory-data-analysis linear-regression r-language r-programming

Last synced: 10 Jan 2025

https://github.com/luisfelipepoma/data_mining_tools

This repository contains my work from the Data Mining Tools course, focusing on practical applications of data mining techniques. It includes various projects involving data preprocessing, feature engineering, and the implementation of machine learning algorithms. One key project explores the analysis of the All Steam Games and Metadata dataset

data-analytics data-science datasets maths python

Last synced: 19 Dec 2024

https://github.com/luisfelipepoma/datascience_project_with_r

EXPLORATORY ANALYSIS OF A DATA SET IN R

data-analytics eda r r-studio

Last synced: 19 Dec 2024

https://github.com/jesparzarom/caba_puntos_carga_descarga

Proyecto de mapeo y tratamiento de datos del dataset brindado por el gobierno de la ciudad de Buenos Aires referente a los puntos hábilitados para carga y descarga de mercaderías

data-analytics datasets folium openpyxl pandas-dataframe plotly-express pyhton streamlit streamlit-cloud

Last synced: 25 Dec 2024

https://github.com/aegis301/nyc_high_school_project

Data cleaning project using NYC high school data

data-analytics data-cleaning data-science data-visualization pandas

Last synced: 26 Dec 2024

https://github.com/allanotieno254/data-analytics-with-tableau

Repository showcasing projects and insights generated through Tableau. Contains visualizations, dashboards, and analytical reports on various datasets,

analytics-intelligence business-intelligence dashboards-tableau data-analytics data-storytelling data-visualization tableau

Last synced: 26 Dec 2024

https://github.com/michael-insights/portfolio

This repository showcases my projects and skills in Data Analytics, Data Science, and Machine Learning. It includes hands-on work in data analysis, predictive modeling, and machine learning algorithms, aimed at solving real-world problems.

data-analytics data-science data-visualization datapreprocessing jupyter-notebooks machine-learning matplotlib numpy pandas predictive-modeling python scikit-learn sql

Last synced: 26 Dec 2024

https://github.com/balajimohan18/rafik-s-kitchen-data-analysis

The Project is about the Analysis of the Sales and Expenses Data of a Famous Fast-food Restaurant. This mainly focuses on gaining Insights that will boost the Future Sales and also Business Strategies it Improve the Profit Margins. Handled Tools are SQL, Python, Power BI, MS Office Tools.

business-analytics business-intelligence data-analysis data-analytics data-visualization eda ms-office powerbi-report powerpoint-presentations python sql-server

Last synced: 14 Nov 2024

https://github.com/balajimohan18/foreign-exchange-rate-time-series-datascience-project

This project will use time series analysis to forecast the exchange rate between the euro and the US dollar. The project will use a variety of statistical techniques, such as ARIMA to model the data and forecast the exchange rate.

data-analysis data-analytics data-preprocessing data-science data-transformation data-visualization eda exploratory-data-analysis foreign-exchange-rates machine-learning model-fitting predictive-modeling python3 time-series time-series-analysis

Last synced: 14 Nov 2024

https://github.com/thoratstuti/comprehensive-sales-insight-dashboard

A sales performance dashboard is a crucial tool for businesses to monitor and analyze their sales activities. It provides a comprehensive overview of various sales metrics and key performance indicators (KPIs) in real-time or near real-time.

data-analytics data-visualizations graphs pie-chart powerbi sql

Last synced: 14 Nov 2024

https://github.com/firyanulrizky/ubud-souvenir-center-v1.0

Undergraduate Thesis Project (Mobile Apps Management Sales & E-Marketplace using Apriori Algorithm) Dedicated to Ubud Art Market

apriori-algorithm data-analytics data-mining data-visualization php web-application

Last synced: 07 Jan 2025

https://github.com/stefagnone/-product-positioning-analysis-for-philips-consumer-electronics

Market and consumer analysis for positioning a new Philips product in a competitive landscape. Includes competitor analysis, survey insights, and strategic recommendations based on data-driven findings.

business-strategy competitor-analysis consumer-perception data-analytics market-research marketing-strategy philips-consumer-electronics product-positioning regression-analysis survey-design

Last synced: 19 Dec 2024

https://github.com/rosacarla/geracao-tech-unimed-bh-ciencia-de-dados

Material de aulas, atividades e projetos realizados no bootcamp Geração Tech Unimed-BH - Ciência de Dados, promovido pela DIO.

data-analytics datascience python3

Last synced: 06 Jan 2025