Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2024-11-19 00:06:50 UTC
https://github.com/rnddave/python-like-a-boss
This is where I stash my Python study material.
data data-analysis data-engineering data-science data-visualization datascience ipynb ipynb-jupyter-notebook ipynb-notebook numpy pandas python python3
Last synced: 11 Oct 2024
https://github.com/mysftz/statistical-analysis
An in-depth review of statistical analysis in Python from datasets.
data-analysis python python3 statistics university university-project
Last synced: 06 Nov 2024
https://github.com/mysftz/statistics-analysis
A Python statistical analysis of a dataset and probability.
data-analysis matplotlib python python3 statistical-analysis
Last synced: 06 Nov 2024
https://github.com/mysftz/numerical-methods-in-matlab
Multiple MATLAB scripts covering multiple data analysis assignments.
data-analysis data-science matlab university university-assignment
Last synced: 06 Nov 2024
https://github.com/saitoxu/data-analysis-workspace
Docker image for data analysis
Last synced: 10 Nov 2024
https://github.com/steviecurran/multi-dish
Scripts to reduce data from large radio telescopes (GMRT, VLA)
data-analysis interferometer radio-astronomy telescopes
Last synced: 14 Nov 2024
https://github.com/abdoomohamedd/python-data-analysis-projects
Welcome to the Python Data Analysis Projects repository! This repository contains a collection of data analysis projects that I have worked on. Each project involves cleaning data, performing exploratory data analysis (EDA), and creating visualizations to extract insights. The projects utilize various Python libraries, including pandas, numpy, matp
data-analysis data-analysis-python data-cleaning data-visualization matplotlib matplotlib-pyplot numpy pandas python
Last synced: 06 Nov 2024
https://github.com/mumtaz4118/scraping-medium-and-data-analytics
The file DataExtraction.py extracts information from the JSON files scraped by the scraper medium_scrapper_post.py. To extract information from JSON files scraped by medium_scrapper_tag_archive.py (which scrapes the tags archive), use Data_Extraction_Archive_Tags.py.
data data-analysis data-analytics data-extraction data-preprocessing data-science data-scraping deep-learning machine-learning python
Last synced: 07 Nov 2024
https://github.com/mumtaz4118/covid-19-data-visualization
COVID-19 data visualization
covid-19 covid-data data-analysis data-analytics data-science machine-learning
Last synced: 07 Nov 2024
https://github.com/mumtaz4118/nlp-course
Programming Assignments and Lectures for Stanford's CS 224: Natural Language Processing with Deep Learning
course data data-analysis data-analytics data-science data-visualization deep-learning education machine-learning natural-language-processing neural-network transfer-learning
Last synced: 07 Nov 2024
https://github.com/mumtaz4118/amazon-iphone-12-data-scrapped
Beautiful Soup is a Python library for getting data out of HTML, XML, and other markup languages.
data-analysis data-extraction data-science data-scraping html mark-up python
Last synced: 07 Nov 2024
https://github.com/mateusoliveira30/top-intelligent-people
This project performs an exploratory analysis of the top_intelligent_people_in_the_world_5000.csv dataset, featuring some of the world's most intelligent individuals. Using pandas and matplotlib, the analysis includes checking for missing values, describing variables, and visualizing data.
data-analysis graphics kaggle-dataset python3
Last synced: 08 Nov 2024
https://github.com/manjit-baishya-datascience/flipkart-laptop-listing-eda
This project analyzes laptop price data from Flipkart using AutoScraper for web scraping. It includes data loading, EDA, cleaning, statistical analysis, and visualization. The goal is to derive insights for pricing strategies and market positioning. Explore the repository for detailed documentation and code.
data-analysis ecommerce-platform flipkart laptop python
Last synced: 13 Nov 2024
https://github.com/mrsamsonn/monolithic-polylithic-crystal-segmentation
A grid segmentation algorithm for clustering crystal structures using diffraction patterns. Useful in material science and nanotechnology, this code enables detailed analysis of crystals for research and industrial applications.
clustering crystal-structure crystallography data-analysis diffraction-patterns grid-segmentation image-processing k-means machine-learning matertial-science nanotechnology python research-project research-tools scientific-computing
Last synced: 02 Nov 2024
https://github.com/aisurjyasamantaray/-optimizing-target-s-brazilian-operations-insights-from-order-processing-pricing-and-payment-trends-
This project offers an in-depth analysis of consumer behavior, logistical performance, and payment preferences within the e-commerce sector. By examining order costs, delivery times, and payment methods, businesses can uncover valuable insights into operational efficiency and customer preferences.
bigquery consumer-insights data-analysis database sql target
Last synced: 12 Oct 2024
https://github.com/tmmvn/analytics-notebooks
A bunch of data analytics notebooks made while testing out JetBrains Datalore
ai algorithms data-analysis datalore elements-of-ai helsinki-university-mooc python
Last synced: 07 Nov 2024
https://github.com/chrispsang/healthcare-dataanalysis
Analyze synthetic patient data to identify trends, improve healthcare delivery, and predict patient outcomes using machine learning models. Includes data exploration, preprocessing, model building, and visualizations.
data-analysis data-science data-visualization healthcare jupyter-notebook machine-learning python
Last synced: 06 Nov 2024
https://github.com/thisisashukla/survival-analysis
Hands-On Survival Analysis in Python
data-analysis data-science survival-analysis
Last synced: 07 Nov 2024
https://github.com/kheriberto/pandas_and_seabron_project
In this project I showcase my ability to use pandas and seaborn to mold, transform, and plot data.
data-analysis pandas python seaborn
Last synced: 10 Nov 2024
https://github.com/andryadsm/asset-analyzer
📈 Project Asset Analyzer (Python)
commodities data-analysis data-visualization economics financial-markets investing matplotlib numpy pandas python seaborn stock-market strategy trading
Last synced: 06 Nov 2024
https://github.com/jbn/vaquero
A Python library for iterative and interactive data wrangling at laptop-scale.
data data-analysis data-cleaning data-mining dirty-data elt etl etl-framework
Last synced: 13 Nov 2024
https://github.com/andryadsm/predicting-house-prices
🏘️ Project Predicting House Prices (Python)
data-analysis data-preprocessing data-visualization feature-engineering house-prices machine-learning matplotlib numpy pandas python real-estate seaborn sklearn
Last synced: 06 Nov 2024
https://github.com/andryadsm/pizza-sales-report
🍕 Project Pizza Sales Report (MySQL, Tableau)
dashboards data-analysis data-visualization database-management mysql sales sql tableau
Last synced: 06 Nov 2024
https://github.com/kwonnayeon/medium-post-projects
Contains the code and projects from my Medium posts. I share what I've learned through trial and error to help others tackle similar work smoothly.
data-analysis data-science data-visualization medium-articles python r-language sql
Last synced: 13 Nov 2024
https://github.com/sunsided/esc2024
Exploratory Data Analysis on the ESC 2024 results
csv data-analysis eurovision-song-contest scraping
Last synced: 02 Nov 2024
https://github.com/dataforgeopenaihub/steam-sales-analysis
This repository features an ETL pipeline for retrieving, processing, validating, and ingesting game metadata and sales data from SteamSpy and Steam APIs. Data is stored in a MySQL database on Aiven Cloud and visualized using Tableau dashboards for insightful analysis of gaming trends and sales performance.
data-analysis data-engineering data-pipepline data-warehousing games mysql-database python steam-api tableau typer-cli
Last synced: 10 Oct 2024
https://github.com/themihirmathur/uber-data-analytics
The goal of this project is to perform comprehensive data analytics on Uber trip data using a modern data engineering stack on Google Cloud Platform (GCP).
bigquery data-analysis data-engineering etl-pipeline google-cloud-platform looker python
Last synced: 12 Oct 2024
https://github.com/darkdk123/handwashing-discovery-analysis
A guided project in a boot camp to analyse the original data used in the discovery of viruses and hand washing by Dr. Ignaz Semmelweis at Vienna General Hospital in the 1840s.
data-analysis data-science data-visualization matplotlib-pyplot numpy pandas plotly-python python seaborn-plots
Last synced: 07 Nov 2024
https://github.com/rupeshtr78/machine_learning
Machine Learning TensorFlow Neural Networks Deep Learning
classification data-analysis deep-learning deep-neural-networks flink jupyter-notebook keras machine-learning machinelearning-python perceptron python3 spark tensorflow
Last synced: 13 Nov 2024
https://github.com/hamzazafar10/movie-recommendation-system
Content-based movie recommendation system using cosine similarity.
cosine-similarity data-analysis data-preprocessing data-science data-structures data-visualization jupyter-notebook machine-learning movie-recommendation python
Last synced: 16 Nov 2024
https://github.com/darkdk123/house-valuation-model
A challenge project in a boot camp to create an ML model that predicts house prices in Boston, Massachusetts, from multiple parameters using multivariable regression.
data-analysis data-science data-visualization matplotlib-pyplot multivariate-regression predictive-modeling statistics
Last synced: 07 Nov 2024
https://github.com/aidan-zamfir/the-iliad
Data analysis & relationship network for the characters of Homer's Iliad
data data-analysis dataframes networks python selenium webscraping
Last synced: 13 Nov 2024
https://github.com/pranav016/exploratory-data-analysis-of-google-app-store-dataset
This is a data analysis done on the Google app store dataset to answer a few questions related to the data through data visualization techniques.
Last synced: 05 Nov 2024
https://github.com/thesfinox/fit-the-data
Data analysis using Wolfram Mathematica
analysis data data-analysis lab mathematica wolfram wolfram-mathematica
Last synced: 06 Nov 2024
https://github.com/thesfinox/mltools
A collection of simple tools for data science and machine learning projects.
ai data-analysis data-science data-visualization logging machine-learning matplotlib neural-network python toolbox
Last synced: 06 Nov 2024
https://github.com/spaghettifunk/gvb
Analysis of GVB in Amsterdam
data-analysis public-transportation
Last synced: 08 Nov 2024
https://github.com/pranav016/exploratory-data-analysis-of-sp500-dataset
This is a data analysis that I performed on the S&P 500 dataset, answering a few questions through data visualization techniques.
Last synced: 05 Nov 2024
https://github.com/marcogdepinto/olympichistoryanalysis
Python visual analysis of the Olympic Games history. Kaggle gold medal with 15000+ views, 200+ upvotes and 100+ comments.
data-analysis data-science jupyter-notebook olympic-games python seaborn
Last synced: 10 Nov 2024
https://github.com/beaprogrammer02345/python_data_analysis
Sales Analysis using Python
data-analysis data-visualization python
Last synced: 06 Nov 2024
https://github.com/madeiradata/microsoft-data-analysts-club
Open-source Repository of Useful Scripts and Solutions for Microsoft Data Analysts
data-analysis data-visualization microsoft-data-analysis powerbi powerbi-report
Last synced: 06 Nov 2024
https://github.com/banyc/dfplot
Summarize a data frame by plotting. `cargo install --git https://github.com/Banyc/dfplot.git`.
csv data-analysis plotly plotting statistics
Last synced: 19 Nov 2024
https://github.com/findmyway/dataframe-in-julia
A quick introduction to DataFrame in Julia for users coming from Python
data-analysis dataframe julia jupyter-notebook
Last synced: 12 Oct 2024
https://github.com/vidyadnina/cyclistic-sql-tableau-project
Trip data analysis for a bike-sharing service company using SQL and Tableau.
bigquery dashboard data-analysis data-analytics-sql data-cleaning data-visualization sql
Last synced: 12 Oct 2024
https://github.com/manishkr1754/nifty50_data_analysis_nsetools_nsepy_python
NIFTY50 Data Analysis from scratch (Data Extraction & Visualization to Investment Insights)
candlestick-chart cufflinks data-analysis data-visualization heikin-ashi interactive-visualizations moving-average nsepy nsetools plotly portfolio-analysis portfolio-analytics portfolio-and-investment-analysis python sharpe-ratio technical-analysis
Last synced: 23 Oct 2024
https://github.com/grindelfp/logistic-regression-study
An example of logistic regression data analysis, with an exercise on it.
data-analysis ipynb logistic-regression python
Last synced: 13 Nov 2024
https://github.com/cannt39t/wylsacom-analysis-reflinks-datamining
data data-analysis data-mining python3 sql
Last synced: 09 Nov 2024
https://github.com/dionixius7/titanic-disaster-ml-model
This project predicts the survival of passengers on the Titanic by using Kaggle Titanic Disaster Dataset. The dataset contains information related to passengers, such as age, gender, and class. Different machine learning algorithms have been applied for this predictive model to accomplish an accurate prediction that will define the survival chances
data-analysis data-science data-visualization eda knn-classifier machine-learning neural-network python scikit-learn svm tensorflow titanic-kaggle titanic-survival-prediction
Last synced: 17 Nov 2024
https://github.com/ljadhav25/django-data-analyzer
Django Data Analyzer is a web application built using the Django framework, designed to streamline data analysis tasks. Users can upload CSV files containing data for analysis. The application utilizes the powerful data manipulation capabilities of Python libraries like pandas and numpy to perform various analyses on the uploaded data.
data-analysis data-visualization django-application matplotlib numpy pandas python seaborn
Last synced: 10 Nov 2024
https://github.com/zeshanfareed/spam_mail_detection_ml_django_project
Machine learning projects through Django
css data-analysis data-visualization django-framework frontend html machine-learning mailcsvfile matplotlib pandas project-repository projects python
Last synced: 10 Nov 2024
https://github.com/ksharma67/eda-on-ipl
This Python notebook analyzes IPL matches from 2008 to 2020 using Python packages such as pandas, matplotlib, and seaborn.
data-analysis data-science eda matplotlib numpy pandas python seaborn
Last synced: 06 Nov 2024
https://github.com/buildwithlal/introduction-to-data-science-in-python-coursera
Introduction to Data Science in Python, part of the Applied Data Science using Python Specialization from the University of Michigan, offered by Coursera
data-analysis matplotlib numpy pandas
Last synced: 08 Nov 2024
https://github.com/emanoelcampos/python-onemonth
This repository contains educational materials and projects developed during a Python course offered by OneMonth. It covers Python basics, intermediate concepts, web development with Flask, and data analysis with pandas. The course is structured into weeks, each focusing on a different aspect of Python programming and its applications.
data-analysis flask jupyter-notebook onemonth python python3
Last synced: 07 Nov 2024
https://github.com/mlund2k/project-1-baseball-performance-vs.-attendance
Project assets for my first exploratory data analysis: Baseball Performance vs. Attendance.
bigquery data-analysis data-cleaning data-visualization excel rstudio sql tableau tidyverse
Last synced: 12 Oct 2024
https://github.com/faezeh-gholamrezaie/visual-google-scholar-search
A Python script that searches Google Scholar for specific keywords and visually presents the results in various chart formats, enabling researchers to analyze trends and insights in academic literature.
academic academic-research academic-trends ai ai-research bibliometrics data-analysis data-visualization google-scholar publication-analysis python research-trends scholarly scholarly-data word-cloud
Last synced: 07 Nov 2024
https://github.com/the-pinbo/dimensionalityredux-pca-vs-autoencoders
Comparative study of PCA and Autoencoders for effective dimensionality reduction, assessed through PSNR and SSIM metrics.
autoencoder-mnist autoencoders data-analysis dimensionality-reduction image-compression mnist neural-networks pca psnr ssim
Last synced: 06 Nov 2024
https://github.com/abidshafee/google.colaboratory_projects
This repository contains a collection of interactive Python notebooks (ipynb) from some of my projects on Data Science, Machine Learning (ML), and Natural Language Processing (NLP).
colaboratory data-analysis data-science lstm machine-learning nlp statistics time-series
Last synced: 06 Nov 2024
https://github.com/ireneli393/music-recommendation-system-with-listenbrainz-dataset
Recommendation System
alternate-least-squares baseline-model data-analysis data-sceince lightfm-library python recommendation-system
Last synced: 12 Nov 2024
https://github.com/virajbhutada/article-clustered-recommendation-system-ml
This project aims to redefine content discovery by delivering personalized article recommendations tailored to individual user preferences. We use advanced machine learning techniques like PCA and K-means clustering to analyze user behavior and article characteristics to provide highly accurate recommendations.
anaconda article-recommendation clustering-algorithm data-analysis data-science keras-tensorflow machine-learning machine-learning-algorithms ml-models numpy pandas plotly python scikit-learn scipy
Last synced: 15 Oct 2024
https://github.com/tirendazacademy/hands-on-data-science-with-gcp
Google BigQuery Tutorial
big-data big-data-analytics bigdata bigquery bigquery-ml bigqueryml cloud-computing data-analysis data-analytics data-engineering data-science dataanalysis dataengineering google-bigquery google-cloud-platform machienlearning machine-learning
Last synced: 08 Nov 2024
https://github.com/vaishnavipaithane/bellabeat-data-analysis-case-study
This capstone project was done as part of the Google Data Analytics Professional Certificate course.
bigquery data-analysis sql tableau
Last synced: 12 Oct 2024
https://github.com/alinenog/desenvolve_gb_2022
Grupo Boticário's Desenvolve 2022 training program in the data field
data-analysis data-science googlesheet machine-learning numpy pandas python
Last synced: 06 Nov 2024
https://github.com/theultrabadduck/dinoanalysis
Dino who tries to understand their future
data-analysis data-science datascience jupyter jupyter-notebook jupyter-notebooks jupyterlab python python3
Last synced: 08 Nov 2024
https://github.com/alexgenovese/react-charts-covid-19-data
Examples of charting COVID-19 data with different chart libraries: G2, G2Plot, Plotly, ApexCharts
data-analysis data-science data-visualization react reactjs
Last synced: 07 Nov 2024
https://github.com/datalopes1/desafio_delivery
A challenge from the Universidade dos Dados subscription club (Clube de Assinaturas) that simulates the real-world demands placed on a data analyst
Last synced: 11 Oct 2024
https://github.com/dharininadkar/movies-data-dashboard
Data Analysis of Movies data
data-analysis data-mining data-science data-visualization ms-excel ms-sql-server tableau
Last synced: 17 Nov 2024
https://github.com/ramapinnimty/udacity-mlfoundation-nanodegree
This is a repository containing solutions to the assignments that are a part of the Udacity Machine Learning Foundation Nanodegree program.
assignments data-analysis python3 statistics udacity-machine-learning-nanodegree
Last synced: 08 Nov 2024
https://github.com/zimmi48/nixpkgs-issues
Analysis of nixpkgs issue lifetimes.
data-analysis github-api nixpkgs
Last synced: 06 Nov 2024
https://github.com/bineet-ratna-shakya/data-science-salary-analysis
Analyzing a dataset containing salaries of data science professionals from 2020 to 2023.
data-analysis data-science data-visualization jupyter numpy pandas python
Last synced: 11 Oct 2024
https://github.com/baguilar6174/python-jupyter-notebooks
Explore data analysis projects with Python, Jupyter and more tools. Discover stunning visualizations and reveal meaningful information in datasets to make informed decisions.
data-analysis jupyter-notebook kaggle pandas python
Last synced: 07 Nov 2024
https://github.com/shreeparab1890/chat-analyzer
A data analysis project that analyzes WhatsApp chats.
data-analysis numpy pandas python
Last synced: 08 Nov 2024
https://github.com/nickenshidqia/sql-for-financial-data-analysis
Design SQL queries to generate accurate and timely financial reports including Profit and Loss statements, Balance Sheets, and Cash Flow statements
azure-data-studio data-analysis finance microsoft-sql-server sql
Last synced: 17 Nov 2024
https://github.com/itsmeyogesh22/people-s-analytics-case-study
Part of Danny Ma's virtual apprenticeship online program, "People's Analytics Case Study" aims to demonstrate practical use of various SQL Concepts like Materialized Views, Snapshot Data and Historical Data
danny-ma data-analysis dataanalysis datascience mssqlserver pgadmin4 postgresql snapshot-data sqlserver t-sql
Last synced: 17 Nov 2024
https://github.com/maazie-khan/olympics-data-enigeering
Worked with Azure Data Factory, Databricks, Data Lake Storage, and Synapse Analytics to build an ETL pipeline for processing and analyzing Olympic Games data from Kaggle.
azure big-data data-analysis dataengineering devops pipeline
Last synced: 14 Oct 2024
https://github.com/shreeparab1890/flipkart-laptops-analysis-eda
This IPython notebook is an exploratory data analysis (EDA) of the laptops listed on Flipkart.
data-analysis eda exploratory-data-analysis matplotlib-pyplot numpy pandas plotly
Last synced: 08 Nov 2024
https://github.com/shreeparab1890/indian-elections-2019-analysis-eda
This IPython notebook is an exploratory data analysis (EDA) of the Indian Lok Sabha Elections 2019.
data data-analysis data-science data-visualization eda exploratory-data-analysis matplotlib numpy pandas plotly python python3 visualization
Last synced: 08 Nov 2024
https://github.com/shreeparab1890/unicorns-of-india-till-sep-2022-analysis-eda
This IPython notebook is an exploratory data analysis (EDA) of the unicorns of India through Sep 2022.
analysis data-analysis eda exploratory-data-analysis matplotlib-pyplot numpy pandas plotly
Last synced: 08 Nov 2024
https://github.com/tj2904/lfb-callout-analysis
An investigation into the London Fire Brigade's callout data.
data-analysis decsion-tree kmeans lfb-incidents london-fire-brigade pandas python seaborn
Last synced: 07 Nov 2024
https://github.com/astropenguin/optimap
Optimized integrated intensity map method for spectral cubes
astronomy data-analysis data-science python python3 radio-astronomy spectral-cubes
Last synced: 05 Nov 2024
https://github.com/madhursinghbhadoriya/data_analysis_fifa-players
Processed important information and characteristic traits in a Jupyter Notebook using NumPy, Matplotlib, pandas, etc.
analysis data-analysis data-science graphs jupyter-notebook pandas python
Last synced: 05 Nov 2024
https://github.com/madhursinghbhadoriya/atliq_salesdata_analysis-powerbi
AtliqH_SalesData Analysis - Power
dashboard data-analysis powerbi
Last synced: 05 Nov 2024
https://github.com/madhursinghbhadoriya/data_analysis_sales_insights_using_tableau
Performed data cleaning using MySQL; carried out data analysis and ETL in Tableau; created an interactive dashboard with significant information about sales insights and profit and revenue analysis.
data-analysis data-visualization dataanalysis etl mysql tableau-dashboards tableau-desktop
Last synced: 05 Nov 2024
https://github.com/abhinav-codealchemist/open-government-data-analysis
Data Analysis Using Pandas
data-analysis data-science jupyter-notebook python
Last synced: 13 Nov 2024
https://github.com/bkataru/physics-e.e
Project Repository for IB physics extended essay
astrometry astronomical-algorithms astronomical-images astronomy astrophotography astrostatistics data-analysis data-science data-visualization modeling physics polynomial-regression regression-analysis
Last synced: 05 Nov 2024
https://github.com/bkataru/chem-ia
Data and analysis code for IB Chemistry IA
data-analysis data-science data-visualization matplotlib modeling plotting regression-analysis regression-models
Last synced: 05 Nov 2024
https://github.com/bkataru/math-ia
Data and analysis for IB Math IA
data-analysis data-science data-visualization matplotlib modeling plotting regression-analysis regression-models
Last synced: 05 Nov 2024
https://github.com/josafary-ds/curso_dnc
Repository for storing study files and projects from DNC - Data Scientist
data-analysis data-science data-visualization machine-learning powerbi python
Last synced: 19 Nov 2024
https://github.com/prakashjha1/new-analysis-using-llm-locally
An interactive news analysis tool built with Streamlit and local LLMs. This app allows users to analyze and gain insights from the latest news articles using advanced language models, all running locally. Explore trends, sentiment, and key topics with an intuitive interface.
artificial-intelligence data-analysis data-science llms ollama python streamlit
Last synced: 12 Oct 2024
https://github.com/changyeop-yang/study-datasciencefoundation
Big data science and analytics play a major role in this decade. Cleaning and preparing your data for analysis is still a challenge, as are performing basic visualization, modeling and curve-fitting your data, and finally presenting your findings to wow the audience.
data-analysis ios kyungpook-national-university swift
Last synced: 06 Nov 2024
https://github.com/tsffarias/my-books
Exploratory analysis of my Dataset 'All_the_Books_I_read' which contains all the books I've read
books data-analysis python tableau
Last synced: 05 Nov 2024
https://github.com/virajbhutada/diamond-price-estimator
This project develops a predictive model to estimate diamond prices based on characteristics like carat, cut, color, and clarity. It covers data preprocessing, feature engineering, model selection, training, and evaluation. The final product is a web app where users can input diamond attributes to get accurate and instant price predictions.
cross-validation css data-analysis data-science-projects data-visualization eda feature-engineering html hyperparameter-tuning jupyter-notebooks machine-learning ml-algorithms model-deployment model-selection performance-optimization predictive-modeling python python-app user-interface
Last synced: 11 Nov 2024
https://github.com/virajbhutada/hr-analytics-excel-sql-tableau-powerbi
Explore a comprehensive HR Analytics portfolio showcasing data analysis and visualization skills. Featuring dashboards in Power BI, Excel, and Tableau, along with SQL queries for deeper insights. A holistic view of expertise in HR analytics, data visualization, and database management. Let's dive into the game of data insights!
data-analysis data-management data-visualization excel hr-analytics interactive-dashboards portfolio-project postgresql powerbi powerbi-visuals sql sql-queries tableau tableau-public
Last synced: 11 Nov 2024
https://github.com/borjamome/visualization_supermarkets_with_r
Visualization using R and OpenStreetMap
data-analysis datavisualization openstreetmap r
Last synced: 12 Oct 2024
https://github.com/greenpau/esqrunner
Run Elasticsearch queries and create metrics based on the results of those queries in an Elasticsearch database.
data-analysis elasticsearch query-builder querydsl
Last synced: 13 Oct 2024
https://github.com/alinababer/data-science-and-insight-agent-rag-llama3-lava-llm
Data-Science-and-Insight-Agent-RAG-LLama3-Lava-LLM-Django-WebApplication is an advanced AI-driven chatbot designed to assist in data science, document analysis, and image interpretation. This repository contains the data science agent of this project.
artificial-neural-networks classifcation data-analysis data-engineering data-visualization datascience large-language-models llama2 lstm machine-learning python random-forest regression
Last synced: 12 Oct 2024
https://github.com/an4pdm/relatorio-de-vendas
This project was built with the tools offered by Power BI in order to improve my knowledge of ETL. The data used came from the Kaggle website.
data-analysis data-visualization database etl powerbi
Last synced: 13 Nov 2024
https://github.com/aniketmondal/dataanalysis
Contains cleaning, transformation, and exploratory analysis of various data sets using Python Pandas, NumPy, re, random, etc.
analysis data-analysis data-science pandas python
Last synced: 07 Nov 2024
https://github.com/abdelrahmanbayoumi/titanic-machine-learning-from-disasters
Given a training set of samples listing passengers who survived or did not survive the Titanic disaster, can our model determine, from a test dataset that lacks the survival information, whether the passengers in that test dataset survived?
data-analysis data-science data-visualization machine-learning pandas
Last synced: 05 Nov 2024
https://github.com/jacksonrakena/lcg-toolkit
Simulator and plotter for linear congruential generator (LCG) functions in Python
borland congruence congruent congruential data-analysis data-science generation generator lcg linear linear-congruential-generator numerical-recipes randu randu-function randu-generator randu-rng rng
Last synced: 05 Nov 2024
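For readers unfamiliar with the linear congruential generators named in the last entry, here is a minimal sketch of the recurrence x_{n+1} = (a * x_n + c) mod m. It is not taken from the lcg-toolkit repository; the function name `lcg` and the seed value are illustrative assumptions, and the constants are the well-known RANDU parameters (a = 65539, c = 0, m = 2**31), chosen because that generator appears among the entry's tags.

```python
from itertools import islice


def lcg(seed, a, c, m):
    """Yield successive states of x_{n+1} = (a * x_n + c) mod m.

    Hypothetical helper for illustration; not the lcg-toolkit API.
    """
    x = seed
    while True:
        x = (a * x + c) % m
        yield x


# RANDU parameters: multiplier 65539, increment 0, modulus 2**31.
# The seed of 1 is an arbitrary choice (RANDU expects an odd seed).
randu_states = list(islice(lcg(seed=1, a=65539, c=0, m=2**31), 3))

# Scale the integer states into floats in [0, 1).
randu_floats = [x / 2**31 for x in randu_states]
```

Passing different `a`, `c`, and `m` values to the same function gives other generators from this family.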