Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2025-02-18 00:08:03 UTC
https://github.com/tejas-130704/whatsapp-analyser
ChatMate is a web app that analyzes WhatsApp chats, providing insightful visualizations like word clouds, heatmaps, and activity timelines. It calculates total messages, words, media, links, and more, helping you understand chat patterns for groups or individuals with ease. Simply upload your chat file and get detailed reports instantly!
data-analysis data-visualization python streamlit web-application whatsapp-analysis
Last synced: 17 Feb 2025
https://github.com/edwinrlambert/investigating-netflix-movies
Demonstrates data analysis and visualization techniques for Netflix movies using Python in a Jupyter notebook. This is a DataCamp project.
data-analysis data-analysis-python netflix python
Last synced: 18 Jan 2025
https://github.com/badranalyst/tips-dataset-analysis-dashboard-with-streamlit-and-plotly
Interactive Streamlit dashboard analyzing the Seaborn 'tips' dataset, which records information on restaurant bills, including total bill amounts, tips, customer demographics (e.g., gender, smoking status), and dining details (e.g., day, time). Visualized with Plotly for insights into tipping patterns.
data-analysis data-analytics data-visualization dataset eda exploratory-data-analysis matplotlib matplotlib-pyplot numpy pandas plotly python seaborn streamlit
Last synced: 17 Feb 2025
https://github.com/aran203/fluxease
Python package for eddy flux data post-processing
data-analysis data-science eddy-covariance python
Last synced: 09 Feb 2025
https://github.com/brownred/python-and-sql
Python and SQL (PostgreSQL & MySQL) for data analysis.
data-analysis databases python3 sql
Last synced: 26 Jan 2025
https://github.com/luizassimoes/fitness-report
Create a personalized Fitness Wrapped report using your Apple Health data with this Streamlit application. Generate comprehensive, detailed summaries of your annual fitness activities, providing valuable insights into your year-long progress and achievements.
data-analysis data-visualization python streamlit
Last synced: 17 Feb 2025
https://github.com/blankscreen-exe/tsf_datascience
Repo for all TSF internship tasks
data-analysis data-mining data-mining-algorithms python
Last synced: 08 Jan 2025
https://github.com/arianarmw/da01-bike-sharing-analysis
🚴♀️ Data analysis project on bike-sharing systems. Includes data wrangling, exploratory data analysis (EDA), visualization, and interactive dashboards built with Streamlit. Explore patterns in bike usage and rental data!
bike-sharing-analysis data-analysis exploratory-data-analysis python streamlit visualization
Last synced: 17 Feb 2025
https://github.com/greenpau/esqrunner
Run Elasticsearch queries and create metrics based on the results of those queries against an Elasticsearch database.
data-analysis elasticsearch query-builder querydsl
Last synced: 26 Jan 2025
https://github.com/luminati-io/Target-dataset-samples
A sample dataset of over 1000 Target products, extracted using the Bright Data API, ideal for brand reputation, tracking inventory, and optimizing prices.
api data-analysis data-mining datasets target web-scraper web-scraping
Last synced: 06 Nov 2024
https://github.com/luminati-io/Amazon-dataset-samples
A sample dataset of over 1,000 Amazon product listings, extracted using the Bright Data API, perfect for competitive analysis, market trends, and eCommerce insights.
amazon api data-analysis data-science dataset ecommerce products web-scraping
Last synced: 06 Nov 2024
https://github.com/luminati-io/Indeed-dataset-samples
A sample dataset of over 1000 Indeed job listings, extracted using the Bright Data API, ideal for market analysis and growth.
api data-analysis datasets indeed jobs web-scraping
Last synced: 06 Nov 2024
https://github.com/luminati-io/Walmart-dataset-samples
A sample dataset of over 1000 Walmart products, extracted using the Bright Data API, ideal for consumer market insights and competitor analysis.
api data-analysis dataset walmart walmart-scraper web-scraping
Last synced: 06 Nov 2024
https://github.com/luminati-io/Shopee-dataset-samples
A sample dataset of over 1000 Shopee products, extracted using the Bright Data API, ideal for pricing optimization, gap analysis, and market strategy refinement.
api data-analysis data-mining datasets products shopee web-scraping
Last synced: 06 Nov 2024
https://github.com/luminati-io/Airbnb-dataset-samples
A sample dataset of over 1000 Airbnb listings, extracted using the Bright Data API, ideal for competitor tracking, brand reputation, and market analysis.
airbnb airbnb-listings api data-analysis datasets web-scraper web-scraper-api web-scraping
Last synced: 06 Nov 2024
https://github.com/edwinrlambert/exploring-airbnb-market-trends
Dive into NYC's Airbnb market trends through detailed analysis of listings data, including prices, types, and review dates. This is a DataCamp project.
airbnb data-analysis jupyter-notebook market-trends python
Last synced: 18 Jan 2025
https://github.com/lunarwhite/lake-george-viz
Lake George data analysis and visualization, ANU COMP1730/6730
Last synced: 17 Feb 2025
https://github.com/malucor/livros
Python program that performs a data analysis of books from an Excel file.
analise-de-dados book books bookshelf data-analysis ipynb jupyter-notebook livro livros python
Last synced: 05 Jan 2025
https://github.com/malucor/analise_exploratoria_dados
Python program that performs an exploratory data analysis of logistics data.
analise-de-dados analise-exploratoria analise-exploratoria-de-dados data-analysis ebac exploratory-data-analysis ipynb jupyter-notebook python
Last synced: 05 Jan 2025
https://github.com/robson-python/customer-cancellation
Data science and analytics project to reduce customer cancellations.
data-analysis data-science data-visualization jupyter-notebook machine-learning matplotlib pandas python scikit-learn seaborn vscode
Last synced: 09 Feb 2025
https://github.com/robson-python/academic-performance
Project to evaluate students' academic performance.
csv-import data data-analysis data-science jupyter-notebook machine-learning matplotlib pandas python scikit-learn seaborn vscode
Last synced: 09 Feb 2025
https://github.com/angchekar28/air-quality-index-analysis
This project analyzes Air Quality Index (AQI) data to identify pollution trends, seasonal variations, and the impact of different pollutants. It includes data visualization, correlation analysis, and insights into air quality variations over time.
data-analysis data-science data-visualization exploratory-data-analysis jupyter-notebook machine-learning python
Last synced: 09 Feb 2025
https://github.com/angchekar28/valorant-gameplay-analysis
This project analyzes Valorant gameplay data to understand key factors affecting match outcomes. It compares various machine learning models to predict player performance, rank classification, and match success.
data-analysis data-science data-visualization exploratory-data-analysis jupyter-notebook machine-learning python
Last synced: 09 Feb 2025
https://github.com/fanisgl/video-games-sales-data
Data Analysis of Sales Dataset using Python.
data-analysis data-science data-visualization dataset jupyter-notebooks matplotlib numpy pandas poisson-distribution python python3 sales statistics
Last synced: 08 Jan 2025
https://github.com/ahmetzamanis/projectcatalog
Catalog of my data science projects.
classification clustering data-analysis data-science data-science-portfolio data-visualization machine-learning python quarto r regression rmarkdown statistics time-series time-series-analysis
Last synced: 29 Dec 2024
https://github.com/ttwag/p9_pandas
Problems that Introduce the DataFrame Object in Python's Pandas Library
data-analysis pandas-dataframe python
Last synced: 17 Feb 2025
https://github.com/kheriberto/pandas_and_seabron_project
In this project I showcase my ability to use pandas and seaborn to shape, transform, and plot data.
data-analysis pandas python seaborn
Last synced: 08 Jan 2025
https://github.com/kheriberto/bedu_dc
Exercises from the "python desde 0" (Python from zero) course on the BEDU platform
Last synced: 08 Jan 2025
https://github.com/sharoonjoseph321/indian-liver-diseases
Indian Liver Disease Analysis and Prediction. This project leverages the Indian Liver Patient Dataset (ILPD) to analyze liver disease trends and develop predictive models for early diagnosis. Through data preprocessing, exploratory analysis, and machine learning, it identifies key risk factors and builds classification models
data-analysis data-science data-visualization logistic-regression machine-learning pandas python seaborn
Last synced: 21 Jan 2025
https://github.com/bilalhameed248/power-bi-learning-and-dev
Power BI Learning And Development
chats data-analysis data-preprocessing dataanalysis dax powerbi statistics visualization
Last synced: 16 Jan 2025
https://github.com/bhavinpatel4199/artificial-intelligence---ai-for-decision-making
Artificial Intelligence for Decision Making is a collection of projects focused on applying AI and machine learning techniques to solve decision-making challenges. It includes projects on wine quality prediction, Cassandra data modeling, and text classification, showcasing a range of data science and machine learning applications.
artificial-intelligence cassandra-cql data-analysis data-engineering data-preprocessing data-structures decision-making deep-learning feature-selection machine-learning-algorithms sentiment-analysis text-classification
Last synced: 13 Jan 2025
https://github.com/chitranjan806/predicting-on-time-premium-deposits
A predictive analysis project to estimate the success rate of on-time premium deposits by policyholders.
analytics-vidhya analytics-vidhya-competition catboostregressor data-analysis data-science linear-regression logistic-regression python3
Last synced: 08 Jan 2025
https://github.com/hannesfht/hotel-reservation-analysis-dashboard
A Power BI dashboard analyzing hotel bookings, cancellations, and operational metrics using the dataset from Kaggle.
business-intelligence data-analysis data-analytics data-cleaning data-visualization hotel-reservations jupyter-notebook power-bi powerbi powerbi-dashboard powerbi-desktop python tableau tableau-public
Last synced: 09 Feb 2025
https://github.com/cartelcreationyt/portfolio-optimization-and-backtesting-using-python-a-pragmatic-approach
Modern Portfolio Theory (MPT) and Monte Carlo simulations to optimize and backtest a portfolio of various financial assets
asset-management data-analysis data-cleaning jupyter-notebook modern-portfolio-theory monte-carlo-simulation multiprocessing multithreading numba numba-jit-compiler perfomance-python python
Last synced: 09 Feb 2025
https://github.com/ayushbaid/football_stats
Analysing the competitiveness in different European football leagues
Last synced: 09 Feb 2025
https://github.com/vvipjain/hockey-tournament-analysis
Hockey Tournament Analysis
beautifulsoup data data-analysis data-visualization databases pandas pandas-dataframe powerbi python python-library python-script requests-library-python sql sql-server sqlalchemy
Last synced: 11 Feb 2025
https://github.com/andimashkulli/vpms
Vehicle Parking Management System for Gjon Buzuku Gymnasium
backend-api data-analysis databases frontend-react mongodb nodejs software
Last synced: 11 Feb 2025
https://github.com/anderson-andre-p/uber-data-analysis
This repository contains a comprehensive data analysis project focused on Uber rides. The dataset used in this project is a spreadsheet obtained from Uber, containing data related to ride details, such as pick-up and drop-off locations, date and time of the ride, and the fare amount.
data-analysis data-science data-visualization python
Last synced: 22 Jan 2025
https://github.com/kayoudan/exploring-flipkart-categories-data
Explore the extensive dataset of Flipkart categories to uncover trends and insights within the e-commerce platform. Analyze the diverse range of products and delve into the shopping preferences of consumers across various categories on Flipkart.
dashboard data-analysis data-analyst data-cleaning dax dax-expression jupyter-notebook mysql pandas power-bi powerbi python sql web-scraping
Last synced: 09 Feb 2025
https://github.com/samuelsoaress/wkd-default-reduction
Reduction of defaults from 35% to 25% or less using machine learning techniques
data-analysis data-exploration data-science machine-learning-algorithms
Last synced: 08 Jan 2025
https://github.com/samuelsoaress/predict-future-sales
Machine Learning applied to sales forecast
data-analysis data-mining data-science data-visualization forecasting-models
Last synced: 08 Jan 2025
https://github.com/xdevmy/nlp-resume-classification
The project leverages advanced NLP techniques to classify resume categories with high precision and recall. Includes a Streamlit interface for seamless resume uploads and predictions. Built to handle edge cases like invalid inputs and out-of-dataset values.
data-analysis data-science django nlp nlpproject nltk numpy python resume-analysis resume-scoring resumes resumescreening textract wordcloud
Last synced: 09 Feb 2025
https://github.com/edoaltamura/rotational-ksz-macsis
Repository for supplementary material from my publication on the rotational kinetic SZ effect in MACSIS
cosmology data-analysis galaxy-clusters high-performance-computing hydrodynamics
Last synced: 05 Jan 2025
https://github.com/cassiofb-dev/projetos-intensivao-python
Projects from the Intensivão de Python event by Hashtag Treinamentos.
automation data-analysis data-science data-visualization jupyter-notebook machine-learning python webscraping
Last synced: 28 Dec 2024
https://github.com/rodrigojunqueiradev/100-days-of-code-bootcamp
100 Days of Code: The Complete Python Pro Bootcamp
data-analysis data-science python python-3 python-library python-script python3
Last synced: 20 Jan 2025
https://github.com/swapnanildutta/prediction-with-python
The projects are made using Jupyter Notebook
data-analysis jupyter-notebook machine-learning prediction python regression-models
Last synced: 08 Jan 2025
https://github.com/nouman6093/advanced-statistical-models
In this repository I will upload everything I have learned about advanced statistical models in data science. There are over 42 statistical models, each of which works on algorithms, and there are over 32 algorithms. Each library has its own way of writing such statistical models. After learning, I will try to upload as many statistical models as possibl
data data-analysis data-science data-visualization
Last synced: 14 Jan 2025
https://github.com/dvarshith/yelp-business-analysis
Big Data analysis on Yelp reviews/businesses for Arizona. Using Hadoop, Spark, PySpark.
arizona-state-university big-data big-data-analytics data-analysis hadoop pyspark spark yelp
Last synced: 17 Feb 2025
https://github.com/athari22/applied-data-science-capstone
Applied-Data-Science-Capstone
api classification data-analysis data-cleaning data-collection data-science data-scraping data-visualization data-wrangling knn machine-learning sql
Last synced: 30 Dec 2024
https://github.com/athari22/investigating-netflix-movies-and-guest-stars-in-the-office
Apply basic Python skills in Introduction to Python and Intermediate Python by processing and visualizing film and television data.
data-analysis data-science data-visualization loop loops matplotlib matplotlib-pyplot netflix numpy office pandas python
Last synced: 30 Dec 2024
https://github.com/myles/notebooks
Some of my random Jupyter Notebooks.
data-analysis data-science jupyter-notebooks
Last synced: 09 Feb 2025
https://github.com/pedrosfaria2/analisetitulosnetflix
Study of the popularity of Netflix movies on IMDb.
analise-de-dados data-analysis jupyter-notebook matplotlib numpy pandas python
Last synced: 05 Jan 2025
https://github.com/pedrosfaria2/fugascomhelicoptero
My first use of Jupyter Notebook in a project
analise-de-dados data-analysis jupyter-notebook matplotlib pandas python
Last synced: 05 Jan 2025
https://github.com/pedrosfaria2/analisandopostshn
Project to analyze posts from the Hacker News community
analise-de-dados data-analysis datetime jupyter-notebook matplotlib python python3
Last synced: 05 Jan 2025
https://github.com/navp7/roadaccident_powerbi
An interactive Power BI dashboard designed to analyze road accident data
dashboards data-analysis data-visualization powerbi
Last synced: 17 Feb 2025
https://github.com/smdlabtech/cy_clients_creditworthiness_py
🌎 Forecasting the creditworthiness of a bank's clients
analysis-of-bank-credit-risk data-analysis data-science jupyter-notebook ml python
Last synced: 13 Feb 2025
https://github.com/tanaybhadula/twitter-trends-dashboard
An interactive dashboard that visualizes data on current Twitter trends by country and globally. It collects data for over 60 countries using the Python Tweepy library, processes it, and visualizes it as bar charts and pie charts using the Plotly Dash framework.
dash dashboard data-analysis data-visualization plotly python trends twitter
Last synced: 10 Jan 2025
https://github.com/smdlabtech/cy_ranaviz_ml_with_shiny
🌎Datamart Analysis with Machine Learning
data-analysis data-science dataviz machine-learning ml r rstudio shiny
Last synced: 13 Feb 2025
https://github.com/mansogf/datascience_introduction
Introductory data science practice exercises
data-analysis data-science data-visualization graph
Last synced: 09 Feb 2025
https://github.com/albamerdani/iot_air_quality_ml
IoT Project for Air Quality and Data Analysis with Machine Learning
air-quality aqi data-analysis data-science decision-tree iot machine-learning-algorithms prediction random-forest raspberry-pi-3 sensors
Last synced: 29 Dec 2024
https://github.com/abdoomohamedd/data-science-projects
A collection of data science projects ranging from exploratory data analysis to predictive modeling and clustering. Each project is designed to solve specific problems or explore particular datasets using various data science techniques and tools.
data-analysis data-analysis-python data-cleaning data-science data-visualization machine-learning machine-learning-algorithms
Last synced: 17 Feb 2025
https://github.com/abdoomohamedd/python-data-analysis-projects
A collection of data analysis projects that I have worked on. Each project involves cleaning data, performing exploratory data analysis (EDA), and creating visualizations to extract insights. The projects utilize various Python libraries, including pandas, numpy, matp
data-analysis data-analysis-python data-cleaning data-visualization matplotlib matplotlib-pyplot numpy pandas python
Last synced: 17 Feb 2025
https://github.com/ntaraujo/cleo
Contact data processor for Cléo
contacts-manager data-analysis data-visualization whatsapp whatsapp-web
Last synced: 27 Jan 2025
https://github.com/cyprianfusi/data-scientist-technical-exercise-10ds
Recommendations to the UK Department for Education on 10 Local Authorities where the National Tutoring Programme (NTP) should be intensified, and a response to the UK Secretary of Health regarding a 76% Accident and Emergency (A&E) performance target that seems far-fetched.
data-analysis data-cleaning data-visualization hypothesis-testing pandas-python policy statistics
Last synced: 26 Jan 2025
https://github.com/nsandoya/python_scrp_project
This is a tool made specifically for the Dipaso ecommerce website. You can extract data from it, analyze it, and see keyword, brand, and category frequencies, price distributions, and other market trends, all in a set of friendly statistical tables and charts (exported from a Jupyter notebook) :)
beautifulsoup4 data data-analysis jupyter-notebook pandas python3
Last synced: 14 Jan 2025
https://github.com/ndomah/mysql-bootcamp-go-from-sql-beginner-to-expert
Repo containing materials from MySQL Bootcamp
aggregate-functions backup-restore crud-operations data-analysis data-modeling database-design date-time logical-operators mysql8 normalization performance-tuning sql sql-joins sql-modes sql-syntax subqueries views window-functions
Last synced: 09 Feb 2025
https://github.com/ianfelps/jornada_python
Projects completed during the Jornada Python by Hashtag Treinamentos in May 2024.
artificial-intelligence automation data-analysis python
Last synced: 21 Jan 2025
https://github.com/itrauco/data-dirtying-tool
A simple command-line tool to generate dirty data and perform common data tasks in Google Cloud
data data-analysis data-engineering data-ops data-pipeline data-science data-visualization data-wrangling dirty-data google-cloud machine-learning
Last synced: 06 Jan 2025
https://github.com/linguini1/tangerineanalyzer
Command line tool for analyzing transactions in CSV format provided by Tangerine Banking. Transactions can be downloaded in CSV format on your Tangerine account.
analysis analytics argparse banking cli command-line command-line-tool csv data-analysis data-analytics finance pandas python tangerine transactions
Last synced: 29 Dec 2024
https://github.com/jidesamuell/data-analytics-projects
This is a repository I have created to showcase my skills, share projects, and track my progress in data analytics.
data-analysis excel matplotlib powrebi python sql
Last synced: 02 Feb 2025
https://github.com/dissorial/prx21_erikz
Analysis of self-tracked data: interactive visualizations & predictive algorithms
analytics data-analysis data-science data-visualization machine-learning matplotlib pandas python python3 visualization
Last synced: 09 Feb 2025
https://github.com/antoniszks/music-category-identifier
A 'Data-Science & Machine Learning' project where we are training a neural network to identify what kind of music we give to it. Based on a university project.
ai artificial-intelligence data-analysis data-science jupyter-notebook machine-learning ml notebook python
Last synced: 08 Jan 2025
https://github.com/antoniszks/data-analysis-problems
A repository containing Python notebooks for real-world data analysis
data-analysis data-analysis-python data-science jupyter-notebook python real-world-data statistics
Last synced: 08 Jan 2025
https://github.com/tralahm/kaggle-titanic-competition
Predicting Titanic Passenger Survival Using Machine Learning
data-analysis jupyter-notebook kaggle-competition kaggle-dataset machine-learning matplotlib numpy pandas predictive-modeling python3 sklearn tralahm tralahtek
Last synced: 13 Jan 2025
https://github.com/giatraskon/sandbox.bio-solutions
Bash scripts replicating the commands from sandbox.bio's interactive bioinformatics tutorials, organized by categories such as Data Exploration, File Formats, Quality Control, and Data Analysis.
bam-files bash bed-files bioinformatics bioinformatics-workflows command-line-tools computational-biology data-analysis data-exploration data-wrangling fasta-files fastq-files file-formats genomic-data quality-control sandbox-bio sandbox-bio-tutorials sequence-alignment unix-shell variant-calling
Last synced: 06 Feb 2025
https://github.com/weybsonalves/prevendo-o-atrito-de-clientes
Project in which I walk through the stages of the data science life cycle in order to predict customer attrition for a bank's credit card service.
data-analysis data-science data-visualization machine-learning python
Last synced: 16 Jan 2025
https://github.com/marielachirinosr/pandas-weather-project
Pandas Weather Data. Explore straightforward Python scripts for weather information analysis.
Last synced: 29 Dec 2024
https://github.com/shubham200137/icc-women-s-t20-world-cup-data-analytics
Created a Power BI report to identify top 11 players for a T20 cricket team by scraping data from espncricinfo with Python, cleaning and transforming the data with pandas, and evaluating various player performance metrics.
beautifulsoup4 data-analysis data-visualization numpy-python pandas-python powerbi web-scraping
Last synced: 08 Jan 2025
https://github.com/shubham200137/customer-churn-analysis
In this case study, we analyze customer churn for a telecom company serving Southern California. The company faces increased competition and wants to retain customers by understanding the reasons for churn. Our objectives include improving service quality, identifying churn factors, pinpointing attractive services, and retaining high LTV customers.
data-analysis data-visualization numpy-python pandas-python sqlite tableau
Last synced: 08 Jan 2025
https://github.com/shubham200137/cyclistic-case-study
This repository contains a case study for Google's Data Analytics Professional Certificate, focusing on Cyclistic, a fictional bike sharing company in Chicago. The case study aims to drive growth by converting casual riders into members through a marketing strategy.
data-analysis data-visualization numpy-python pandas-python presentation-slides sql tableau
Last synced: 08 Jan 2025
https://github.com/faisal-fida/box-office-mojo-analysis
Analyzes box office data from Box Office Mojo, exploring relationships between worldwide revenue, release year, and a combined score that considers both factors. It includes visualizations such as scatter plots and bar charts, and identifies the top- and bottom-performing movies.
box-office data-analysis data-science python revenue-prediction visualization
Last synced: 08 Jan 2025
https://github.com/cur10usitydrives/text-similarity-analysis
bag-of-words cosine-similarity data-analysis machine-learning natural-language-processing nltk-python one-hot-encoding python stemming stop-word-removal stop-words text-mining text-processing text-similarity-analysis tf tf-idf tokenization
Last synced: 07 Nov 2024
https://github.com/shimaa83/eda_v2
Automatic EDA library
data-analysis data-science python
Last synced: 20 Jan 2025
https://github.com/goswamilucky/nanoparticles
Nanoparticles is a college research project website designed to educate users about nanoparticles and their applications in science and technology.
chemistry college-project computational-science data-analysis material-science molecular-modeling nanoparticles nanotechnology research scientific-computing simulations visualization
Last synced: 26 Jan 2025
https://github.com/jofaval/80-cereals
Data analysis of user ratings for almost 80 US cereals in 1993
cereals classification data-analysis data-science data-visualization google-colab kaggle linear-regression logistic-regression machine-learning matplotlib python regression scikit-learn seaborn
Last synced: 04 Feb 2025
https://github.com/jofaval/california-housing-pricing
Data Analysis about the California Housing Pricing in 1997
data-analysis data-science data-visualization deep deep-learning deep-neural-networks google-colab keras machine-learning matplotlib python regression scikit-learn seaborn tensorflow
Last synced: 04 Feb 2025
https://github.com/sanveed-adnan/supermarket-sales-sql-project
SQL-based data analysis project on supermarket sales performance using SQLite and Power BI.
business-intelligence data-analysis data-science data-science-projects data-visualization power-bi sales-data sql sqlite
Last synced: 18 Feb 2025
https://github.com/loginchik/mid_contracts
Analysis of public procurement contracts of the Russian Ministry of Foreign Affairs
data-analysis dataset pandas python
Last synced: 08 Nov 2024
https://github.com/thanaraklee/pyspark-dataframe-operations
This project focuses on utilizing PySpark DataFrames to analyze and visualize data sourced from external datasets, such as CSV files. It provides a practical example of how to manipulate, transform, and gain insights from large datasets using the PySpark framework.
data-analysis dataframe pyspark python
Last synced: 16 Feb 2025
https://github.com/chanmeng666/mnist-handwritten-digit-recognition-project
A comprehensive implementation and analysis of handwritten digit recognition using multiple neural network architectures on the MNIST dataset. Features basic MLP, optimized feature-selected model, and deep CNN approaches with detailed performance comparisons and visualizations.
cnn computer-vision data-analysis data-visualization deep-learning feature-analysis handwritten-digit-recognition keras machine-learning mlp mnist model-optimization neural-networks python scikit-learn tensorflow
Last synced: 02 Jan 2025
https://github.com/kind-unes/time-series-analysis
Learning & applying TSA
data-analysis data-science python time-series-analysis time-series-forecasting
Last synced: 02 Feb 2025
https://github.com/geo-y20/uber-rides-data-analysis
This project aims to analyze Uber ride data to understand various aspects of ride usage, such as the distribution of rides across different categories, purposes, months, days, and times.
dashboard dashboard-templates data data-analysis data-analysis-python data-analytics data-visualization pandas powerbi python recommendation-system rides uber
Last synced: 08 Jan 2025
https://github.com/prasad-chavan1/bank_data_analysis_r
Bank data analysis in R language
data data-analysis data-science r
Last synced: 07 Jan 2025
https://github.com/yutkin/gotohack-2-solution
Solutions of the selection stage
data-analysis data-science hackathon machine-learning
Last synced: 15 Feb 2025
https://github.com/shramkoweb/bookbot
A Python-based text analyzer that counts words and character frequencies in any .txt file, providing a detailed, sorted report. Perfect for quick text insights and learning text processing basics!
automation beginner-friendly character-frequency data-analysis file-processing open-source python text-analysis text-parser text-processing word-count
Last synced: 02 Feb 2025
https://github.com/agustin-caceres/arg-telecom-analisis
Data analyst project on telecommunications services in Argentina
business-analytics business-intelligence data-analysis data-visualization database postgresql python streamlit
Last synced: 09 Feb 2025
https://github.com/cannt39t/wylsacom-analysis-reflinks-datamining
data data-analysis data-mining python3 sql
Last synced: 05 Jan 2025
https://github.com/alessandrodealmeida2/google_advanced_data_analytics
Projects from Google's advanced data analytics course
analise-de-dados ciencia-de-dados data-analysis data-science machine-learning python regression-models statistics
Last synced: 13 Jan 2025