Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2024-11-19 00:06:50 UTC
https://github.com/marielachirinosr/nyc-taxi-trip-exploration-2019-2020
Explores passenger behavior & impact of COVID-19 on NYC taxi industry (Q1 2019-2020).
bigquery data data-analysis data-visualization python sql tableau
Last synced: 07 Nov 2024
https://github.com/virajbhutada/creditcard_complaints_analysis
A comprehensive repository dedicated to the Credit Card Complaints Analysis Dashboard in Tableau. Explore insightful visualizations, trends, and analytics related to credit card complaints. Stay informed and make data-driven decisions with this powerful Tableau analysis tool.
analytics complaints credit-card dashboard data-analysis data-visualization insights tableau visualizations
Last synced: 11 Nov 2024
https://github.com/wilfordaf/dataanalyst-test
Test task for Junior Data Analyst position
data-analysis pandas python trading-data
Last synced: 12 Nov 2024
https://github.com/weybsonalves/prevendo-o-atrito-de-clientes
A project in which I walk through the stages of the data science life cycle in order to predict customer churn for a bank's credit card service.
data-analysis data-science data-visualization machine-learning python
Last synced: 16 Nov 2024
https://github.com/gattiharishkumar/blinkit-sales-analysis-dashboard
This project presents a comprehensive sales analysis dashboard for Blinkit, an Indian last-minute delivery app. The dashboard was created using Power BI and provides a detailed overview of the company's sales performance across various outlets and product categories.
dashboard data-analysis data-transformation data-visualization ms-excel-data-analytics power-query powerbi powerbi-visuals
Last synced: 07 Nov 2024
https://github.com/eudesgccunha/desafios-dnc
Challenges developed during the Data Scientist program at Escola DNC.
big-data classification-algorithm clustering data data-analysis data-science data-visualization database excel machine-learning nlp powerbi python recommendation-system regression-models sql
Last synced: 29 Sep 2024
https://github.com/gattiharishkumar/employee-attendance-leaves-analytics-dashboard
This project showcases a Power BI dashboard created to analyze employee attendance and leaves over a three-month period. The data was sourced from Excel datasets available on the Codebasics website.
dashboards data-analysis data-cleaning data-transformation data-visualization power-query-editor powerbi
Last synced: 07 Nov 2024
https://github.com/edumoraes1/comissao-reduzida
Audience segmentation built with SQL for enjoei's new reduced-commission feature.
bq data-analysis salesforce sql
Last synced: 12 Oct 2024
https://github.com/linguini1/coopscraper
Scrapes the co-op job board provided by Carleton for jobs on my shortlist, then saves the jobs to a CSV file so that I can manipulate them with Excel.
csv data-analysis python selenium webscraper webscraping
Last synced: 07 Nov 2024
https://github.com/marielachirinosr/analysis-urgencias-hospital-pitalito
This project involves analyzing emergency room admission data from the E.S.E Hospital Departamental de Pitalito using a star schema model.
bigquery data data-analysis etl-pipeline tableau
Last synced: 12 Oct 2024
https://github.com/linguini1/tangerineanalyzer
Command line tool for analyzing transactions in CSV format provided by Tangerine Banking. Transactions can be downloaded in CSV format on your Tangerine account.
analysis analytics argparse banking cli command-line command-line-tool csv data-analysis data-analytics finance pandas python tangerine transactions
Last synced: 07 Nov 2024
https://github.com/linguini1/edueval
The BorealisAI Let's Solve It mentorship project: summarizing student feedback submissions on their professor into one cohesive paragraph for faculty consideration during performance reviews.
ai data data-analysis data-science machine-learning machinelearning nlp python pytorch sentiment-analysis
Last synced: 07 Nov 2024
https://github.com/savinrazvan/degrees
A program that computes the "degrees of separation" between two actors by identifying the sequence of movies connecting them, inspired by the Six Degrees of Kevin Bacon game. Uses IMDb-based datasets for actors, movies, and their relationships.
actor-connections ai data-analysis degrees-of-separation educational-project graph-theory imdb movie-database python six-degrees-of-kevin-bacon
Last synced: 11 Nov 2024
https://github.com/itrauco/data-dirtying-tool
A simple command-line tool to generate dirty data and do common data things in Google Cloud.
data data-analysis data-engineering data-ops data-pipeline data-science data-visualization data-wrangling dirty-data google-cloud machine-learning
Last synced: 10 Nov 2024
https://github.com/shubhammohanty680/uber_data_analysis
bigquery data-analysis gcp-compute gcp-project looker-studio mageai python
Last synced: 12 Oct 2024
https://github.com/albamerdani/iot_air_quality_ml
IoT Project for Air Quality and Data Analysis with Machine Learning
air-quality aqi data-analysis data-science decision-tree iot machine-learning-algorithms prediction random-forest raspberry-pi-3 sensors
Last synced: 07 Nov 2024
https://github.com/an0n1mity/spamclassifiereval
A repository for evaluating the misclassification rate of spam classification models using a threshold-based approach.
data-analysis machine-learning natural-language-processing python-programming spam-classification text-classification
Last synced: 07 Nov 2024
https://github.com/willie-conway/datavista
A robust 🐍Python application for data analysis that provides a wide range of tools for 🔃loading, 🧹cleaning, and 🔃preprocessing data. It includes features for 📈statistical analysis, 👨🏿🔬hypothesis testing, 🦾machine learning, clustering, ⏳time series forecasting, and 📊data visualization, all designed to enhance your analytical workflow.
analytics big-data command-line data-analysis data-cleaning data-driven data-mining data-pipeline data-preprocessing data-science data-scientist data-visualization data-wrangling exploratory-data-analysis machine-learning pandas predictive-analytics python statistics visualization-tools
Last synced: 11 Nov 2024
https://github.com/marknature/oibsip
AICTE Oasis Infobyte Data Science Internship
data-analysis data-science data-visualization github google-sheets jupyter-notebook linkedin machine-learning project-management python
Last synced: 13 Nov 2024
https://github.com/achronus/data-exploration
A repository dedicated to interesting data exploration projects I've completed
data-analysis exploratory-data-analysis machine-learning matplotlib pandas python scikit-learn seaborn
Last synced: 13 Oct 2024
https://github.com/enayar478/nomad_machine_learning_dash_app
An interactive Machine Learning app built with Dash and Plotly, developed as part of the Data Analytics Bootcamp at Le Wagon Bordeaux. It allows users to visualize data, make real-time predictions, and explore various model insights.
analytics cachetools dash dashboard-application data-analysis data-science deployment gunicorn interactive-visualization machine-learning pandas plotly plotly-dash prediction-model python python3 render scikit-learn web-application
Last synced: 12 Oct 2024
https://github.com/abdoufermat5/twitter-analysis
Twitter data analysis using Pyspark
data-analysis pyspark spark twitter twitter-api
Last synced: 11 Nov 2024
https://github.com/tanaybhadula/twitter-trends-dashboard
An interactive dashboard that visualizes data on current Twitter trends by country and globally. It collects data from over 60 countries using the Python Tweepy library, processes it, and visualizes it as bar charts and pie charts using the Plotly Dash framework.
dash dashboard data-analysis data-visualization plotly python trends twitter
Last synced: 12 Nov 2024
https://github.com/jakobzmrzlikar/pca-on-genomes
An analysis of human genome mutations from different populations.
data-analysis genome-analysis pca-analysis
Last synced: 07 Nov 2024
https://github.com/simranjeet97/covid-19
COVID-19 data analysis and the important topics to cover to understand its impact and solutions.
coronavirus coronavirus-analysis coronavirus-dataset coronavirus-prediction coronavirus-tracking covid-19-data-analysis covid19 covid19-data covid19-india dash dash-app dash-plotly data data-analysis data-science data-science-projects data-visualization python3
Last synced: 14 Nov 2024
https://github.com/jakobzmrzlikar/trg-dela
Data analysis of student job offers.
data-analysis ipython-notebook web-scraping
Last synced: 07 Nov 2024
https://github.com/navp7/hr_analysis_excel
This project uses Microsoft Excel to conduct a comprehensive analysis of HR data, focusing on identifying the reasons for employee attrition and evaluating job satisfaction.
dashboards data-analysis excel visualization
Last synced: 07 Nov 2024
https://github.com/valyaevgeorgiy/r_basic
Working with the basics of the R environment, and thereby learning a new programming language directly tied to data analysis and to building graphs and charts.
coding data data-analysis r rstudio
Last synced: 07 Nov 2024
https://github.com/harkishen-singh/agriculture-ds
An agriculture-based M.Tech project on data science that predicts crop growth based on previous years' records.
Last synced: 07 Nov 2024
https://github.com/xre22zax/roller-coaster
Explore award-winning wood and steel coasters from 2013-2018 Golden Ticket Awards & Captain Coaster, all powered by Python and interactive visualizations.
analytics data-analysis data-visualization pandas python python-lambda python3 visualization
Last synced: 06 Nov 2024
https://github.com/bhaskaracharjee/student-results-analysis
Analyzing student results to uncover insights
Last synced: 06 Nov 2024
https://github.com/netesf13d/expt-sequence-analysis
Data processing, analysis and visualization package for atomic physics experiments in the single-atom regime.
cold-atoms data-analysis data-visualization optical-tweezers
Last synced: 11 Nov 2024
https://github.com/iguptashubham/pizzahut-analysis-sql
Best dataset for data analysis. Pizza Hut data analysis done by Shubham Gupta in MySQL. The dataset was provided by a friend of mine who interned at Pizza Hut, where it was used for training and asking questions. It does not reveal anything about Pizza Hut and is safe to share.
data data-analysis data-analytics database dataset datasets mysql mysql-database pizzahut
Last synced: 14 Nov 2024
https://github.com/yandexdataschool/ml-sweights-experiments
Experiments for the "Machine Learning on data with sPlot background subtraction" paper
data-analysis high-energy-physics machine-learning statistics
Last synced: 06 Nov 2024
https://github.com/lucs1590/agidatatest
This is a repository with data analysis and data science tests.
data-analysis data-science python test
Last synced: 13 Nov 2024
https://github.com/pseudomanifold/pump
A generic data flow program
c-plus-plus-11 cplusplus data-analysis data-flow small
Last synced: 06 Nov 2024
https://github.com/k8hertweck/intro_r
data-analysis data-analysis-in-r r tidyverse training
Last synced: 14 Nov 2024
https://github.com/pseudomanifold/us-inauguration-speeches
Data & feature extraction for U.S. inauguration speeches
data-analysis data-science inauguration politics speech speeches
Last synced: 06 Nov 2024
https://github.com/jayita11/exploring-most-streamed-songs-for-last-four-decades-eda
Perform EDA to uncover trends in streaming patterns, likes, and artists over the last four decades.
data-analysis eda hypothesis-testing matplotlib most-streamed-songs pandas python seaborn
Last synced: 13 Nov 2024
https://github.com/jayita11/eda-student-exam-performance
This project performs Exploratory Data Analysis (EDA) and hypothesis testing on student performance data. It explores trends based on attributes like gender, race/ethnicity, parental education, lunch type, and test preparation course completion.
data-analysis eda hypothesis-testing matplotlib pandas python seaborn statsmodels student-performance-analysis
Last synced: 13 Nov 2024
https://github.com/amyanchen/sf-airbnb
Exploratory Data Analysis of San Francisco Airbnbs
data-analysis data-science data-visualization r rmarkdown statistics
Last synced: 07 Nov 2024
https://github.com/ahmetzamanis/projectcatalog
Catalog of my data science projects.
classification clustering data-analysis data-science data-science-portfolio data-visualization machine-learning python quarto r regression rmarkdown statistics time-series time-series-analysis
Last synced: 07 Nov 2024
https://github.com/jayita11/atliqo-bank-credit-card-launch-eda
This project involves exploratory data analysis and statistical testing for AtliQo Bank's new credit card launch. Key insights include targeting high-income occupations and the 18-25 age group. Recommendations focus on tailored marketing campaigns, education, and incentives to enhance credit card adoption and usage among young adults.
data-analysis hypothesis-testing matplotlib p-value pandas python seaborn statistics z-test
Last synced: 13 Nov 2024
https://github.com/hyperentangledqubit/shellplot
shellplot -- Generate plot(s) directly from terminal via matplotlib or ggplot2 (plotnine)!
data-analysis ggplot2 graphics matplotlib plotnine plotting pyplot terminal
Last synced: 11 Nov 2024
https://github.com/2kabhishek/pyramen
Data Analysis for Ramen 🍜💹
csv data data-analysis fun python report
Last synced: 12 Nov 2024
https://github.com/andimashkulli/vpms
Vehicle Parking Management System for Gjon Buzuku Gymnasium
backend-api data-analysis databases frontend-react mongodb nodejs software
Last synced: 31 Oct 2024
https://github.com/vvipjain/hockey-tournament-analysis
Hockey Tournament Analysis
beautifulsoup data data-analysis data-visualization databases pandas pandas-dataframe powerbi python python-library python-script requests-library-python sql sql-server sqlalchemy
Last synced: 31 Oct 2024
https://github.com/andremenezesds/pa004_health_insurance
Health Insurance Cross-Sell(Learning to Rank Machine Learning Project)
backend backend-api data-analysis data-science data-visualization dataviz lgbm machine-learning matplotlib numpy optuna pandas python scikit-learn shell-script sql webapi xgboost
Last synced: 11 Nov 2024
https://github.com/silianpan/python-data-analysis-course
python data analysis course of drotion-lega
data-analysis jupyter-notebook panda
Last synced: 06 Nov 2024
https://github.com/motapinto/agent-based-simulation-conquest
Agent-based simulation modeling of the Conquest game mode in Battlefield
agent-based-simulation data-analysis jade java sajas swing
Last synced: 06 Nov 2024
https://github.com/quantumudit/groceries-basket-analysis
This project performs market basket analysis using Power BI and Python to reveal associations between grocery items. It involves transforming raw transaction data into a processed dataset, creating interactive Power BI reports, and generating key insights through Python, enabling data-driven decision-making.
data-analysis data-visualization pandas powerbi python
Last synced: 06 Nov 2024
https://github.com/mirokeimioniemi/classifying-software-pirates
Exploring the factors driving people into software piracy by training two machine learning models to predict whether a person with certain characteristics and sentiments is likely to possess any pirated software or not using a dataset collected via a survey targeting users of music production software.
data-analysis data-science decision-tree-classifier logistic-regression machine-learning piracy python software-piracy survey
Last synced: 12 Nov 2024
https://github.com/dsarceno/portfolio
Data Scientist portfolio. Projects by Diego Sarceño.
computer-vision data-analysis data-science deep-learning docker graph-algorithms investing keras-tensorflow machine-learning markdown neural-networks optimization-algorithms pipelines python sentiment-analysis sklearn tensorflow voice-recognition
Last synced: 10 Oct 2024
https://github.com/karencofre/riesgorelativo-lookerstudio
Data analysis and predictive analysis project in Looker Studio and Google Colab.
bigquery data-analysis data-science machine-learning matplotlib python sklearn sql
Last synced: 13 Oct 2024
https://github.com/lunarwhite/lake-george-viz
Simple data analysis and visualization of Lake George with Python.
Last synced: 06 Nov 2024
https://github.com/chrispsang/customerchurnanalysis
Predicting customer churn using a RandomForestClassifier with detailed EDA, model evaluation, and visualization. Includes a Tableau dashboard for interactive insights.
customerchurn data-analysis data-visualization datapreprocessing machine-learning python scikit-learn tableau
Last synced: 10 Oct 2024
https://github.com/kianaasd93/sensors-
Data analysis of wearable-technology and autonomous-system sensors in physiotherapy, including a comprehensive analysis of Xsens MTx sensor data.
classification data-analysis data-science jupyter jupyter-notebook knn machine-learning physiotherapy python sensor svm wearable-devices wearable-technology
Last synced: 13 Nov 2024
https://github.com/luminati-io/Airbnb-dataset-samples
A sample dataset of over 1000 Airbnb listings, extracted using the Bright Data API, ideal for competitor tracking, brand reputation, and market analysis.
airbnb airbnb-listings api data-analysis datasets web-scraper web-scraper-api web-scraping
Last synced: 06 Nov 2024
https://github.com/luminati-io/Shopee-dataset-samples
A sample dataset of over 1000 Shopee products, extracted using the Bright Data API, ideal for pricing optimization, gap analysis, and market strategy refinement.
api data-analysis data-mining datasets products shopee web-scraping
Last synced: 06 Nov 2024
https://github.com/szymon-budziak/real_estate_house_prices_prediction
Predicting real estate house prices using various machine learning algorithms, including data exploration, preprocessing, model training, and evaluation.
data-analysis data-preprocessing data-science eda jupyter-notebook machine-learning matplotlib numpy optuna pandas predictive-modeling price-prediction python random-forest regression scikit-learn seaborn
Last synced: 10 Oct 2024
https://github.com/luminati-io/Walmart-dataset-samples
A sample dataset of over 1000 Walmart products, extracted using the Bright Data API, ideal for consumer market insights and competitor analysis.
api data-analysis dataset walmart walmart-scraper web-scraping
Last synced: 06 Nov 2024
https://github.com/luminati-io/Indeed-dataset-samples
A sample dataset of over 1000 Indeed job listings, extracted using the Bright Data API, ideal for market analysis and growth.
api data-analysis datasets indeed jobs web-scraping
Last synced: 06 Nov 2024
https://github.com/luminati-io/Amazon-dataset-samples
A sample dataset of over 1,000 Amazon product listings, extracted using the Bright Data API, perfect for competitive analysis, market trends, and eCommerce insights.
amazon api data-analysis data-science dataset ecommerce products web-scraping
Last synced: 06 Nov 2024
https://github.com/luminati-io/Target-dataset-samples
A sample dataset of over 1000 Target products, extracted using the Bright Data API, ideal for brand reputation, tracking inventory, and optimizing prices.
api data-analysis data-mining datasets target web-scraper web-scraping
Last synced: 06 Nov 2024
https://github.com/khanovico/python-stock-analyzer
This is a web app implemented in Python with several data science frameworks, enabling online stock trend analysis.
amcharts-js-charts data-analysis data-visualization flask javascript pandas python scikit-learn
Last synced: 03 Nov 2024
https://github.com/blakeziegler/binary-classification-competition
Binary classification for the Insurance Cross-Selling Kaggle competition
data-analysis data-science database kaggle kaggle-competition machine-learning python rstudio scikit-learn xgboost
Last synced: 10 Oct 2024
https://github.com/lopez86/datascienceexamples
Examples of various data science & data analysis topics using various sources of data.
data-analysis data-science pandas scikit-learn tutorial visualization
Last synced: 06 Nov 2024
https://github.com/lopez86/rust-mlearn
Machine Learning Tools in Rust
data-analysis data-science machine-learning rust
Last synced: 06 Nov 2024
https://github.com/rickcontreras/modelos1
Classification model to predict student performance on the Saber Pro tests in Colombia. Includes exploratory data analysis, preprocessing, and machine learning models.
classification colombia data-analysis data-science education educational-assessment exploratory-data-analysis jupyter-notebook machine-learning python saber-pro scikit-learn student-performance
Last synced: 10 Oct 2024
https://github.com/samuelsoaress/wkd-default-reduction
Reduction of the default rate from 35% to 25% or less with machine learning techniques
data-analysis data-exploration data-science machine-learning-algorithms
Last synced: 10 Nov 2024
https://github.com/edumoraes1/republicacao-produtos
SQL query written to automate sending push notifications via Salesforce
bq data-analysis salesforce sql
Last synced: 12 Oct 2024
https://github.com/balajimohan18/loan-classification-datascience-project
This project uses machine learning algorithms to predict the classification of loan status. The dataset is loaded and transformed using SQL to obtain a proper dataset with valid information.
classification data-analysis data-cleaning data-science data-visualization loan-prediction loan-status machine-learning sql supervised-learning
Last synced: 14 Nov 2024
https://github.com/edumoraes1/journey_active_users
User-base segmentation via SQL for the active sellers journey
bq data-analysis salesforce sql
Last synced: 12 Oct 2024
https://github.com/shuddha2021/stellar-candidate-selector
A sophisticated candidate selection algorithm leveraging multi-criteria analysis and machine learning to identify top software engineering candidates. This tool features flexible filtering, score adjustment, and detailed visualizations to streamline the recruitment process.
candidate-selection data-analysis data-visualization machine-learning pandas plotting-in-python python python-data-analysis recruitment scikit-learn
Last synced: 06 Nov 2024
https://github.com/balajimohan18/loan-clustering-datascience-project
This project uses machine learning to cluster loans based on their similarities. It uses a dataset of loan applications that includes information about the loan amount and balance, then applies a clustering algorithm to group the loans by similarity.
clustering-algorithm data-analysis data-science data-visualization eda kmeans-clustering machine-learning sql unsupervised-learning
Last synced: 14 Nov 2024
https://github.com/m0saan/python-for-data-analysis
Materials and IPython notebooks for "Python for Data Analysis" by Wes McKinney.
data-analysis data-science ipython-notebook machine-learning matplotlib numpy pandas python
Last synced: 16 Nov 2024
https://github.com/quocduyenanhnguyen/roi-modeling-and-analysis-of-sports-dataset
In this project, you will find my ROI model for retirement savings and PowerPoint presentation of my ROI model, as well as my data analysis/visualization of Sports Ticket Sales dataset that I concluded with a PDF group written report
data-analysis data-visualization microsoft-excel rate-of-return-modeling sports-ticket-sales-dataset
Last synced: 08 Nov 2024
https://github.com/rnddave/python-like-a-boss
This is where I stash my Python study material.
data data-analysis data-engineering data-science data-visualization datascience ipynb ipynb-jupyter-notebook ipynb-notebook numpy pandas python python3
Last synced: 11 Oct 2024
https://github.com/quocduyenanhnguyen/human-trafficking-analysis
I analyzed human trafficking data
data-analysis data-analytics data-visualization human-trafficking mysql mysql-database mysql-workbench query sql tableau tableau-dashboards tableau-public
Last synced: 08 Nov 2024
https://github.com/quocduyenanhnguyen/california-crime-data-analysis
I analyzed crime incident-based data in California in the year 2022. I used SQL for analysis and Tableau for visualization.
2022 california crime-data dashboard data-analysis data-analytics data-manipulation data-modeling data-visualisation data-visualization database fbi mysql mysql-workbench nibrs sql tableau tableau-dashboards tableau-public
Last synced: 08 Nov 2024
https://github.com/leeway64/lwwordcounter
C++ application that analyzes the frequency of words in a text file
bson cmake conan cpp data-analysis json json-schema text-analysis ubjson
Last synced: 08 Nov 2024
https://github.com/leeway64/lwtaiwancurrencyconverter
Converts US dollars (USD) to New Taiwan Dollars (TWD) and vice versa
cfg currency currency-converter data-analysis data-visualization docker exchange-rates federal-reserve julia taiwan twd united-states usd
Last synced: 08 Nov 2024
https://github.com/mysftz/statistical-analysis
An in-depth review of statistical analysis in Python from datasets.
data-analysis python python3 statistics university university-project
Last synced: 06 Nov 2024
https://github.com/mysftz/statistics-analysis
A Python statistical analysis of a dataset and probability.
data-analysis matplotlib python python3 statistical-analysis
Last synced: 06 Nov 2024
https://github.com/aisurjyasamantaray/-optimizing-target-s-brazilian-operations-insights-from-order-processing-pricing-and-payment-trends-
This project offers an in-depth analysis of consumer behavior, logistical performance, and payment preferences within the e-commerce sector. By examining order costs, delivery times, and payment methods, businesses can uncover valuable insights into operational efficiency and customer preferences.
bigquery consumer-insights data-analysis database sql target
Last synced: 12 Oct 2024
https://github.com/mysftz/numerical-methods-in-matlab
Multiple MATLAB scripts across multiple data analysis assignments.
data-analysis data-science matlab university university-assignment
Last synced: 06 Nov 2024
https://github.com/thoratstuti/power-bi-dashboards-for-finance-analysis
Power BI can group and gather information from multiple systems to present the whole picture of business data analytics in one “single view”. It made the staff of the financial institution work in a collective digital platform, where they can compute and share relevant data.
data-analysis data-visualizations excel graph pie-chart powerbi
Last synced: 14 Nov 2024
https://github.com/saitoxu/data-analysis-workspace
Docker image for data analysis
Last synced: 10 Nov 2024
https://github.com/yash1882/music-store-data-analysis
A project focused on analyzing music store data using SQL ♬
begineer-friendly data-analysis music music-store-data music-store-data-analysis sql-project
Last synced: 16 Nov 2024
https://github.com/abdoomohamedd/python-data-analysis-projects
Welcome to the Python Data Analysis Projects repository! This repository contains a collection of data analysis projects that I have worked on. Each project involves cleaning data, performing exploratory data analysis (EDA), and creating visualizations to extract insights. The projects utilize various Python libraries, including pandas, numpy, matp
data-analysis data-analysis-python data-cleaning data-visualization matplotlib matplotlib-pyplot numpy pandas python
Last synced: 06 Nov 2024
https://github.com/mumtaz4118/scraping-medium-and-data-analytics
The file DataExtraction.py extracts information from the JSON files scraped by the scraper medium_scrapper_post.py. To extract information from JSON files scraped by medium_scrapper_tag_archive.py (scraping from the tags archive), use Data_Extraction_Archive_Tags.py.
data data-analysis data-analytics data-extraction data-preprocessing data-science data-scraping deep-learning machine-learning python
Last synced: 07 Nov 2024
https://github.com/manjit-baishya-datascience/flipkart-laptop-listing-eda
This project analyzes laptop price data from Flipkart using AutoScraper for web scraping. It includes data loading, EDA, cleaning, statistical analysis, and visualization. The goal is to derive insights for pricing strategies and market positioning. Explore the repository for detailed documentation and code.
data-analysis ecommerce-platform flipkart laptop python
Last synced: 13 Nov 2024
https://github.com/mrsamsonn/monolithic-polylithic-crystal-segmentation
A grid segmentation algorithm for clustering crystal structures using diffraction patterns. Useful in material science and nanotechnology, this code enables detailed analysis of crystals for research and industrial applications.
clustering crystal-structure crystallography data-analysis diffraction-patterns grid-segmentation image-processing k-means machine-learning matertial-science nanotechnology python research-project research-tools scientific-computing
Last synced: 02 Nov 2024
https://github.com/mumtaz4118/covid-19-data-visualization
COVID-19 data visualization
covid-19 covid-data data-analysis data-analytics data-science machine-learning
Last synced: 07 Nov 2024
https://github.com/mumtaz4118/nlp-course
Programming Assignments and Lectures for Stanford's CS 224: Natural Language Processing with Deep Learning
course data data-analysis data-analytics data-science data-visualization deep-learning education machine-learning natural-language-processing neural-network transfer-learning
Last synced: 07 Nov 2024
https://github.com/mumtaz4118/amazon-iphone-12-data-scrapped
Beautiful Soup is a Python library for getting data out of HTML, XML, and other markup languages.
data-analysis data-extraction data-science data-scraping html mark-up python
Last synced: 07 Nov 2024
https://github.com/mateusoliveira30/top-intelligent-people
This project performs an exploratory analysis of the top_intelligent_people_in_the_world_5000.csv dataset, featuring some of the world's most intelligent individuals. Using pandas and matplotlib, the analysis includes checking for missing values, describing variables, and visualizing data.
data-analysis graphics kaggle-dataset python3
Last synced: 08 Nov 2024