Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2024-11-18 00:06:50 UTC
https://github.com/meokullu/prefill
PreFill adds desired characters onto output values to increase their legibility.
alignment data data-analysis data-engineering data-science legibility
Last synced: 14 Nov 2024
https://github.com/drill-n-bass/dealavo-project
Cartesian product from dictionary to list of dictionaries and faster methods for finding index than the `index` method.
data-analysis data-analysis-python matplotlib pandas python python3 random timeit
Last synced: 07 Nov 2024
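The two ideas named in the entry above can be sketched in a few lines of Python. This is a minimal illustration, not code from the repository: the function names `dict_product` and `build_index` and the sample data are hypothetical. It expands a dictionary of option lists into one dictionary per combination, and builds a value-to-position map so repeated lookups avoid the linear scan that `list.index` performs.

```python
from itertools import product


def dict_product(options):
    """Expand a dict of lists into a list of dicts, one per combination."""
    keys = list(options)
    combos = product(*(options[key] for key in keys))
    return [dict(zip(keys, combo)) for combo in combos]


def build_index(values):
    """Map each value to its first position so lookups take constant time."""
    positions = {}
    for i, value in enumerate(values):
        # setdefault keeps the first position, matching list.index semantics
        positions.setdefault(value, i)
    return positions


# Hypothetical sample data for illustration only
grid = dict_product({"size": ["S", "M"], "color": ["red", "blue"]})
lookup = build_index(["a", "b", "c", "b"])
```

Building the map costs one pass over the list, so it pays off only when many lookups are made against the same list.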
https://github.com/imrandil/sql_practice_with_analysis
SQL practice with a PostgreSQL database, using Docker to set up Postgres, and enjoying the SQL way of working
data-analysis docker markdown postgres sql
Last synced: 08 Nov 2024
https://github.com/imrandil/excel_learning_dir
Hands-on Excel learning practice with sample data
Last synced: 08 Nov 2024
https://github.com/fisseha-estifanos/telecom
A showcase repository for a specific telecommunication company. Used to analyze several telecommunication data set features and generate useful insights accordingly. Insights generated could be seen at https://github.com/Fisseha-Estifanos/telecom-visualizer or at https://fisseha-estifanos-telecom-visualizer-home-huxgy0.streamlitapp.com/
data-analysis notebooks-jupyter python visual-studio-code visualization
Last synced: 09 Nov 2024
https://github.com/danmadeira/algoritmos-estatistica-pl-sql
Demonstration of statistics algorithms in PL/SQL
algorithms data-analysis data-science database oracle oracle-database pl-sql statistics
Last synced: 10 Nov 2024
https://github.com/nandit123/python_on_excel
Data Analysis using python libraries on excel data
csv data-analysis data-science fill fluctuations graph numpy python python-library
Last synced: 12 Nov 2024
https://github.com/drill-n-bass/ovh-project
The goal of this task is to prepare a statistical analysis of a data set from disks.
anaconda analysis data-analysis data-analysis-python jupyter-notebook matplotlib-python pandas python3 seaborn-plots
Last synced: 07 Nov 2024
https://github.com/namratagulati/fraud_detection
A fraud detection model built on linear regression, using feature scaling and feature engineering, and evaluated with the AUC-ROC curve and other metrics.
data-analysis data-visualization machine-learning machine-learning-algorithms machinelearning-python
Last synced: 10 Nov 2024
https://github.com/arush-codes/paris-olympic-de
Data engineering project on the Paris 2024 Olympics
azure data-analysis data-engineering microsoft-azure olympics2024 pipeline
Last synced: 14 Oct 2024
https://github.com/madusales/powerbi-etl-elt
I have been studying, through DIO's Data Analytics & Power BI Bootcamp, the use of SQL to build BI solutions. This repository is dedicated to recording what I have learned so far about what BI is, types of analysis, ETL, and ELT.
big-data business-intelligence data-analysis powerbi
Last synced: 11 Nov 2024
https://github.com/mecha-aima/demographic-analyzer
This project uses pandas to process census data from a csv file and draw useful results from the data by performing various filtering and calculations on it
data-analysis data-science pandas
Last synced: 09 Nov 2024
https://github.com/patricialjohnson/data-visualization-tableau-project
Tableau Visualization Project
business-analytics business-intelligence data-analysis data-visualization digital-marketing digital-marketing-agency kpi microsoft-excel program-management project-management python search-engine-optimization seo sql tableau
Last synced: 10 Nov 2024
https://github.com/muneeb1030/webscrapper_mastodon
The Mastodon Social Platform Scraper is a Python-based web scraping tool designed to explore and extract valuable data from the Mastodon social platform.
data-analysis data-collection mastodon python3 scrapy scrapy-spider selenium-python webscraping
Last synced: 09 Oct 2024
https://github.com/rahulsm20/car-data
A data analytics project that involves analyzing a car dataset that includes information on various car brands, years, prices, mileage, and fuel types, in order to gain insights into the car market.
data-analysis data-analytics matplotlib numpy pandas python
Last synced: 10 Nov 2024
https://github.com/jrh89/sorting-hat
With a simple and user-friendly interface, the GUI allows users to easily enter data and extract the numbers they need and then sort and graph them.
data-analysis data-visualization datascience executable graphs-algorithms gui python sorting sorting-algorithms sorting-algorithms-implemented
Last synced: 10 Nov 2024
https://github.com/rahulsm20/storedata
A data analysis project aimed at analyzing the sales data of the super store and providing useful insight into customer preferences.
data-analysis matplotlib numpy pandas python streamlit
Last synced: 10 Nov 2024
https://github.com/samkazan/business-analysis-tableau
Business Analysis on Global/Superstore data using Tableau.
analysis data-analysis tableau visualization
Last synced: 05 Nov 2024
https://github.com/abeltavares/postql
Python library and command-line interface (CLI) tool for interacting with PostgreSQL databases, providing simplified database management, query execution, and result export functionalities.
cli command-line-interface data-analysis data-engineering data-export data-management data-processing data-visualization database database-administration database-tools etl oop postgres postgresql psycopg2 python sql sqlalchemy wrapper
Last synced: 31 Oct 2024
https://github.com/cicku/en.650.672
HW of EN.650.672
analytics data-analysis numpy pandas
Last synced: 09 Nov 2024
https://github.com/rkreddybogati/data-engineering-interview
Explore data engineering architectures in this Git project
data-analysis data-cleaning data-engi data-engineering-pipeline data-mining data-processing data-visualization python sql-query
Last synced: 08 Nov 2024
https://github.com/madrury/commute-times
Simulated Commute Times Data
data-analysis data-science data-visualization dataset
Last synced: 10 Nov 2024
https://github.com/rahulsm20/insurance-data
A data analytics project dealing with risk assessment and its effects in health insurance.
data-analysis data-analytics machine-learning matplotlib numpy pandas python scikit-learn
Last synced: 10 Nov 2024
https://github.com/madrury/hot-sauce
Simulation of a Hot Sauce Spiciness Dataset
data-analysis data-science data-visualization dataset machine-learning
Last synced: 10 Nov 2024
https://github.com/sinsunsan/earth-survival-kit
Global warming data visualisation app that helps everyone understand global warming and take actions that matter
angular angular7 d3 data-analysis data-visualization ecology global-warning ngx-charts
Last synced: 08 Nov 2024
https://github.com/abhroroy365/market_analysis
This project explores customer segmentation and market analysis in the context of online retail using an online retail dataset. By applying advanced analytics, we aim to uncover insights that can drive strategic decisions and enhance business performance.
clustering data data-analysis data-visualization kmeans-clustering machine-learning market-analysis python silhouette-analysis
Last synced: 09 Nov 2024
https://github.com/rahulsm20/trackbyte
A full-stack web application that helps users keep track of their playlist and provides analytics based on their music taste.
bootstrap data-analysis expressjs mysql nodejs reactjs sql
Last synced: 10 Nov 2024
https://github.com/kentlouisetonino/ama-project-data-analysis
A course project for course MATH 6200.
ama-university data-analysis python
Last synced: 12 Nov 2024
https://github.com/anoopgeorge418/linked-analytics
LinkedAnalytics is a project that scrapes LinkedIn data, analyzes it to uncover valuable insights, builds predictive models, and deploys them for practical applications. This repository contains all scripts, analysis notebooks, and deployment code needed to replicate the process.
beautifulsoup4 bokeh data-analysis data-science linkdin linkdindata machine-learning matplotlib numpy pandas plotly python requests seaborn sql web-scraping
Last synced: 10 Nov 2024
https://github.com/alan-oliveir/state-of-data-2022
In this project I analyze the distribution of salary ranges for junior-level professionals in the roles of data analyst, data scientist, and data engineer. The data comes from State of Data Brazil, one of the largest surveys on the Brazilian job market in the data field.
data-analysis jupyter-notebook pandas-python seaborn-python
Last synced: 14 Nov 2024
https://github.com/themihirmathur/qlik-intern-project
Qlik Analysis of Road Safety & Accident Patterns in India 📈 Analyzed & visualized road safety data for 20.85k+ accident cases with 9+ accident data patterns in India using Qlik 📉 Reduced inefficiencies by 25% by developing design of an avant-garde data tracking dashboard that monitored injuries.
data-analysis data-visualization presentation qlik qlik-cloud qlik-sense qlikview
Last synced: 12 Nov 2024
https://github.com/lacerbi/vbmc
Variational Bayesian Monte Carlo (VBMC) algorithm for posterior and model inference in MATLAB (old location)
bayesian-inference data-analysis gaussian-processes machine-learning matlab variational-inference
Last synced: 24 Oct 2024
https://github.com/danpoynor/python-number-guessing-game-with-stats
A number guessing game written in Python 3 that presents median, mode, and mean statistics
console-game data-analysis number-guessing-game python3 statistics
Last synced: 16 Nov 2024
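The three statistics named in the entry above are all available in Python's standard `statistics` module. The sketch below is not taken from the repository: the helper name `summarize_attempts` and the list of attempt counts are assumptions made for illustration.

```python
import statistics


def summarize_attempts(attempts):
    """Return the mean, median, and mode of a list of attempt counts."""
    return {
        "mean": statistics.mean(attempts),
        "median": statistics.median(attempts),
        "mode": statistics.mode(attempts),
    }


# Hypothetical number of guesses needed in five past games
summary = summarize_attempts([3, 5, 5, 8, 9])
print(summary)
```

For this sample the mean is 6 while the median and mode are both 5, which shows how a couple of long games pull the mean upward without moving the other two measures.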
https://github.com/svetlanam/pycon-workshop
Pycon CZ workshop: Better data analyses and product recommendations with Instagram data
data-analysis data-science martinus matplotlib pandas pycon2016 pyconcz python scikit-learn workshop
Last synced: 13 Nov 2024
https://github.com/svetlanam/pt-data-analyse
Data analysis of Czech parcel tracking providers
data-analysis matplotlib pandas parcel-tracking python3 visualisation
Last synced: 13 Nov 2024
https://github.com/koldlight/bluetab-data-science-2017
Repository for sharing material and publishing the challenges
course data-analysis data-science exercises
Last synced: 09 Nov 2024
https://github.com/thlindustries/mortalidade_neonatal_python_react
A data visualization platform built with Python and React using the Plotly visualization library
data-analysis data-visualization plotly python python3 react reactjs
Last synced: 11 Nov 2024
https://github.com/athari22/applied-data-science-capstone
Applied-Data-Science-Capstone
api classification data-analysis data-cleaning data-collection data-science data-scraping data-visualization data-wrangling knn machine-learning sql
Last synced: 08 Nov 2024
https://github.com/athari22/investigating-netflix-movies-and-guest-stars-in-the-office
Apply basic Python skills in Introduction to Python and Intermediate Python by processing and visualizing film and television data.
data-analysis data-science data-visualization loop loops matplotlib matplotlib-pyplot netflix numpy office pandas python
Last synced: 08 Nov 2024
https://github.com/vubacktracking/freecodecamp-data-analysis-with-python
5 Projects in Data Analysis With Python Course on Freecodecamp
Last synced: 10 Nov 2024
https://github.com/kenatsf/basic_data_analysis
Basic data science project: ETL, forecast and data visualization.
analysis data data-analysis data-science logistic-regression matplotlib matplotlib-pyplot numpy pandas powerbi python scikit-learn time-series time-series-analysis time-series-forecasting
Last synced: 10 Nov 2024
https://github.com/lucaspadoni/9-11-hijackers-social-network-analysis
Social Network Analysis focused on the events of 9/11/2001. By examining publicly available data through SNA techniques, we gain insights into the organizational structure of the terrorist network, offering valuable perspectives on key relationships and connections.
9-11 data-analysis data-analytics graph-theory hijacking network-analysis sna social-network-analysis terrorism terrorist-attacks
Last synced: 09 Nov 2024
https://github.com/parthds02/pizza_sales_sql
SQL project analyzing pizza sales data. Includes creating tables, executing queries, and solving basic to advanced analytical questions to derive insights from sales data.
analytics data-analysis data-science pizza-sales sql
Last synced: 12 Nov 2024
https://github.com/isabelleysseric/data-analysis
Data analysis with R
data-analysis data-processing data-science-projects graph graph-algorithms r
Last synced: 08 Nov 2024
https://github.com/yeonjaee/data-analytics
converts raw data into actionable insights
Last synced: 11 Nov 2024
https://github.com/psyplot/psy-transect
Visualize and explore transects with psyplot
data-analysis data-exploration data-science exploratory-data-analysis psyplot transects
Last synced: 08 Nov 2024
https://github.com/samuelsoaress/predict-future-sales
Machine Learning applied to sales forecast
data-analysis data-mining data-science data-visualization forecasting-models
Last synced: 10 Nov 2024
https://github.com/marknature/machine-learning-intern
Machine Learning tasks involving the Titanic Dataset and Breast Cancer Wisconsin (Diagnostic) dataset
data-analysis github jupiter-notebook machine-learning matplotlib numpy pandas python scikit-learn sklearn
Last synced: 13 Nov 2024
https://github.com/v-octal/random_forest_from_scratch
My implementation of Random Forest regressor in python
data-analysis machine-learning random-forest
Last synced: 08 Nov 2024
https://github.com/kashirin-alex/thither.direct-onamove
An Android skeleton example application for using data from the Thither.Direct platform in mobile applications
android-application data data-analysis data-structures data-visualization mobile-development mobility query research-data-management
Last synced: 01 Nov 2024
https://github.com/eudesgccunha/desafios-dnc
Challenges developed during the Data Scientist training program at Escola DNC.
big-data classification-algorithm clustering data data-analysis data-science data-visualization database excel machine-learning nlp powerbi python recommendation-system regression-models sql
Last synced: 29 Sep 2024
https://github.com/purposeachiever6/discovering_hidden_pattern
Discovering Hidden Patterns in Sequential and Numerical Data
data-analysis r statistical-analysis
Last synced: 12 Nov 2024
https://github.com/diegopino/publibdata_codexhackathon
Public Library Data processing/analysis codex hackathon attempt
data-analysis data-visualization libraries public
Last synced: 08 Nov 2024
https://github.com/rijul007/smartwatch-data-analysis-using-python
Smartwatch Data Analysis to uncover insights into health and activity patterns using Python for data cleaning, exploratory analysis, and interactive visualizations.
data-analysis data-science data-visualization exploratory-data-analysis jupyter-notebook matplotlib numpy pandas plotly python
Last synced: 08 Nov 2024
https://github.com/rijul007/diamonds-analysis-using-r
Diamonds data analysis using R, exploring relationships between diamond attributes (such as carat, cut, color, and clarity) and price, with a focus on providing insights for engagement ring selection through various statistical techniques and data visualizations including histograms, boxplots, scatter plots, and bar charts.
Last synced: 08 Nov 2024
https://github.com/mysto-007/world_population_growth_analysis
World Population Growth Analysis
data-analysis data-science data-visualization kaggle matplotlib
Last synced: 08 Nov 2024
https://github.com/leandrocollares/home-team-advantage-in-epl
Home team advantage in the English Premier League: an exploratory data analysis
data-analysis matplotlib pandas plotly
Last synced: 11 Nov 2024
https://github.com/mysto-007/dog-vision-a-dog-breed-recognizer-kaggle-competition
A solution for Dog Breed Identification on Kaggle competition
colab-notebook data-analysis data-science data-visualization jupyter-notebook kaggle kaggle-competition python
Last synced: 08 Nov 2024
https://github.com/jrdnbradford/the-office-us
Data concerning NBC's mockumentary series The Office (U.S. version)
csv data-analysis json the-office xml
Last synced: 08 Nov 2024
https://github.com/marknature/oibsip
AICTE Oasis Infobyte Data Science Internship
data-analysis data-science data-visualization github google-sheets jupyter-notebook linkedin machine-learning project-management python
Last synced: 13 Nov 2024
https://github.com/achronus/data-exploration
A repository dedicated to interesting data exploration projects I've completed
data-analysis exploratory-data-analysis machine-learning matplotlib pandas python scikit-learn seaborn
Last synced: 13 Oct 2024
https://github.com/salma-mamdoh/exploring-the-evolution-of-linux-project
My Project to learn the Basics of Analysis on DataCamp
data-analysis datacamp pandas python time-series-analysis
Last synced: 08 Nov 2024
https://github.com/salma-mamdoh/project-writing-functions-for-product-analysis
My Project to learn the Basics of Analysis on DataCamp
data-analysis data-camp pandas python
Last synced: 08 Nov 2024
https://github.com/sharmas1ddharth/mode_of_transport_analysis
This project aims to understand which mode of transport employees prefer for commuting to their office. The data includes employee information about their mode of transport as well as personal and professional details such as age, salary, and work experience. The task is to predict whether or not an employee will use private transport, and to identify which variables are significant predictors of that decision.
Last synced: 12 Nov 2024
https://github.com/sharmas1ddharth/data-analysis-with-python
Freecodecamp's Data Analysis with Python Projects Code
data-analysis data-analysis-with-python freecodecamp-project
Last synced: 12 Nov 2024
https://github.com/salma-mamdoh/a-visual-history-of-nobel-prize-winners-project
My project aims to practice Data Analysis and Data Visualization on DataCamp
data-analysis data-visualization datacamp matplotlib pandas python seaborn
Last synced: 08 Nov 2024
https://github.com/iantomasinicola/portfoliodataanalyst
Data analysis project with Python, Microsoft SQL Server, and Excel
data-analysis excel python sql
Last synced: 09 Nov 2024
https://github.com/blankscreen-exe/tsf_datascience
Repo for all TSF internship tasks
data-analysis data-mining data-mining-algorithms python
Last synced: 11 Nov 2024
https://github.com/chaitanyaprasad60/sql-queries
This is a list of complex SQL Queries I have practiced.
data-analysis sql window-functions
Last synced: 11 Nov 2024
https://github.com/mattdelaune/powerbi_healthcare_dashboard
Interactive Hospital Insights Dashboard built with Power BI, showcasing comprehensive analysis of patient demographics, treatment outcomes, and hospital performance.
data-analysis healthcare power-bi visualization
Last synced: 08 Nov 2024
https://github.com/mattdelaune/excel_sales_dashboard
Interactive Excel Dashboard for Coffee Sales Analysis: This project leverages Excel to analyze sales data, uncover seasonal trends, regional preferences, and customer behaviors, providing actionable insights for optimizing inventory and marketing strategies.
data-analysis excel pivot-tables sales-dashboard sales-data
Last synced: 08 Nov 2024
https://github.com/fmind/malpop
Rank the popularity of malware applications by their occurrence on VirusTotal
data-analysis malware popularity ranking virustotal
Last synced: 06 Nov 2024
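Ranking items by how often they occur, as the entry above describes, can be sketched with `collections.Counter`. This is an illustration under stated assumptions, not the repository's method: it does not call the VirusTotal API, and the function name `rank_by_occurrence` and the sample file names are hypothetical.

```python
from collections import Counter


def rank_by_occurrence(observations):
    """Return (item, count) pairs ordered from most to least frequent."""
    return Counter(observations).most_common()


# Hypothetical sightings of application names, for illustration only
sightings = ["a.apk", "b.apk", "a.apk", "c.apk", "a.apk", "b.apk"]
ranking = rank_by_occurrence(sightings)
print(ranking)
```

`most_common` sorts by count in descending order and keeps first-seen order among ties, so the ranking is deterministic for a given input sequence.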
https://github.com/abdelmajidlh/cours
Data engineering and data analysis course.
apache-spark big-data data-analysis data-engineering docker jupyter-notebook pyspark
Last synced: 11 Nov 2024
https://github.com/prasad-chavan1/bank_data_analysis_r
Bank data analysis in R language
data data-analysis data-science r
Last synced: 10 Nov 2024
https://github.com/souza-vitor/stock-market
codecademy data data-analysis data-mining data-science sql sqlite
Last synced: 09 Nov 2024
https://github.com/nikbarb810/covid_growth_rate_390.51
Exploring Covid Growth Rate of European Population using genetic data analysis
bioinformatics data-analysis r rcpp
Last synced: 08 Nov 2024
https://github.com/mikma03/datascience_python_datacamp
DataScience with Python. Code and examples. Python libraries, including pandas, NumPy, Matplotlib, and many more.
data-analysis data-science datacamp datascience numpy pandas python
Last synced: 11 Nov 2024
https://github.com/aishwaryahastak/ipl_analysis
Analysis of IPL dataset using PySpark
Last synced: 08 Nov 2024
https://github.com/loginchik/mid_contracts
Analysis of public procurement contracts of the Russian Ministry of Foreign Affairs
data-analysis dataset pandas python
Last synced: 08 Nov 2024
https://github.com/nikbarb810/motif_detection_in_r
Motif Detection for TFBS in Glycolysis and Glyconeogenesis pathways
bioinformatics data-analysis null-hypothesis pwm r
Last synced: 08 Nov 2024
https://github.com/jhrcook/protein-language-models
Experimenting with protein language model predictions
data-analysis protein-language-model variant-effect-prediction
Last synced: 13 Nov 2024
https://github.com/manishbisht/machine-learning
Machine Learning
data-analysis data-mining machine-learning machine-learning-algorithms machinelearning numpy pandas python
Last synced: 10 Nov 2024
https://github.com/ygalvao/uow_ai_final_project
This was my Final Project for the Artificial Intelligence Diploma program of The University of Winnipeg - Professional, Applied and Continuing Education (PACE).
data-analysis data-analytics dbscan elections k-means k-means-clustering machine-learning som som-clustering
Last synced: 12 Nov 2024
https://github.com/preetesh21/spotme
This repository uses the web-based API provided by Spotify to retrieve data and then analyse it.
Last synced: 08 Nov 2024
https://github.com/marielachirinosr/bellabeat-wellness-data-trends
Analyzing smart device data for insights on user activity patterns to optimize interventions for better health outcomes.
data data-analysis data-visualization pandas python python3 tableau tableau-public
Last synced: 07 Nov 2024
https://github.com/marielachirinosr/hotel-data-analysis
Pandas & Matplotlib Learning Analysis. Repository featuring data analysis projects using Pandas and Matplotlib libraries
data data-analysis matplotlib pandas python
Last synced: 07 Nov 2024
https://github.com/marielachirinosr/nyc-taxi-trip-exploration-2019-2020
Explores passenger behavior & impact of COVID-19 on NYC taxi industry (Q1 2019-2020).
bigquery data data-analysis data-visualization python sql tableau
Last synced: 07 Nov 2024
https://github.com/wilfordaf/dataanalyst-test
Test task for Junior Data Analyst position
data-analysis pandas python trading-data
Last synced: 12 Nov 2024
https://github.com/jabhij/eda_experiments
In this repo I'll use different types of datasets to explore and implement various Exploratory Data Analysis (EDA) approaches.
ames-housing analysis battery-life blackfriday-analysis data-analysis data-science data-visualization eda matplotlib-pyplot numpy pandas python seaborn visualization zomato-data-analysis
Last synced: 16 Nov 2024
https://github.com/weybsonalves/prevendo-o-atrito-de-clientes
Project in which I go through the stages of the data science life cycle in order to predict customer churn for a bank's credit card service.
data-analysis data-science data-visualization machine-learning python
Last synced: 16 Nov 2024
https://github.com/edumoraes1/comissao-reduzida
Creation of audience segmentation via SQL for enjoei's new reduced-commission feature
bq data-analysis salesforce sql
Last synced: 12 Oct 2024
https://github.com/gattiharishkumar/blinkit-sales-analysis-dashboard
This project presents a comprehensive sales analysis dashboard for Blinkit, an Indian last-minute delivery app. The dashboard was created using Power BI and provides a detailed overview of the company's sales performance across various outlets and product categories.
dashboard data-analysis data-transformation data-visualization ms-excel-data-analytics power-query powerbi powerbi-visuals
Last synced: 07 Nov 2024
https://github.com/ray-chew/pycsam
pyCSAM is a robust approach for approximating geodesic subgrid-scale orographic spectra with applications to weather forecasting and broader data analysis
data-analysis gmted icon-model merit-dem orographic spectral-analysis topography weather-forecast
Last synced: 12 Nov 2024
https://github.com/gattiharishkumar/employee-attendance-leaves-analytics-dashboard
This project showcases a Power BI dashboard created to analyze employee attendance and leaves over a three-month period. The data was sourced from Excel datasets available on the Codebasics website.
dashboards data-analysis data-cleaning data-transformation data-visualization power-query-editor powerbi
Last synced: 07 Nov 2024
https://github.com/marielachirinosr/analysis-urgencias-hospital-pitalito
This project involves analyzing emergency room admission data from the E.S.E Hospital Departamental de Pitalito using a star schema model.
bigquery data data-analysis etl-pipeline tableau
Last synced: 12 Oct 2024
https://github.com/shubhammohanty680/uber_data_analysis
bigquery data-analysis gcp-compute gcp-project looker-studio mageai python
Last synced: 12 Oct 2024
https://github.com/enayar478/nomad_machine_learning_dash_app
An interactive Machine Learning app built with Dash and Plotly, developed as part of the Data Analytics Bootcamp at Le Wagon Bordeaux. It allows users to visualize data, make real-time predictions, and explore various model insights.
analytics cachetools dash dashboard-application data-analysis data-science deployment gunicorn interactive-visualization machine-learning pandas plotly plotly-dash prediction-model python python3 render scikit-learn web-application
Last synced: 12 Oct 2024
https://github.com/harmanveer-2546/movie-industry
Investigate the film industry to understand what contributes to success, and use this analysis to create actionable recommendations for companies entering the industry.
business business-analytics data-analysis datatime film-industry graphs matplotlib movie-database numpy pandas python scraping-websites seaborn visualization web-scraping-python
Last synced: 12 Nov 2024