Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists tagged with dataanalysis
A curated list of projects in awesome lists tagged with dataanalysis .
https://github.com/tanu-n-prabhu/python
This repository helps you understand python from the scratch.
data dataanalysis datascraping google-colab google-colab-notebook jupyter-notebook machine-learning numpy numpy-arrays pandas-dataframe prediction python python-3 python3
Last synced: 19 Dec 2024
https://github.com/Tanu-N-Prabhu/Python
This repository helps you understand python from the scratch.
data dataanalysis datascraping google-colab google-colab-notebook jupyter-notebook machine-learning numpy numpy-arrays pandas-dataframe prediction python python-3 python3
Last synced: 08 Nov 2024
https://github.com/prateekiiest/code-sleep-python
Awesome Projects in Python - Machine Learning Applications, Games, Desktop Applications all in Python :snake:
analysis caesar-cipher case-study cipher classification dataanalysis hacktoberfest hamlet kharagpur-winter-of-code projects python python-script social-network-analysis translation
Last synced: 18 Dec 2024
https://github.com/prateekiiest/Code-Sleep-Python
Awesome Projects in Python - Machine Learning Applications, Games, Desktop Applications all in Python :snake:
analysis caesar-cipher case-study cipher classification dataanalysis hacktoberfest hamlet kharagpur-winter-of-code projects python python-script social-network-analysis translation
Last synced: 15 Nov 2024
https://github.com/mtahiraslan/data-analyst-roadmap
Based on my own experience, I think this roadmap will answer all the questions of how to become a data analyst from zero, which technologies and programming languages are better to know, what kind of soft skills do we need, how do I start my professional career in this field.
blogs businessintelligence courses data dataanalysis dataanalyst excel interview mtahiraslan powerbi programming python resources resume roadmap softskills sql statistics tableau tutorials
Last synced: 25 Oct 2024
https://github.com/akabe/ocaml-jupyter
An OCaml kernel for Jupyter (IPython) notebook
dataanalysis datascience functional-programming jupyter jupyter-kernels jupyter-notebook machine-learning ocaml ocaml-kernel ocaml-repl
Last synced: 15 Dec 2024
https://github.com/cs-mohamedayman/data-science-case-studies
Data Science Case Studies for computer science students.
casestudies dashboard dataanalysis datacamp datascience datascienceindustries deeplearning excel googlesheets hackerrank kaggle leetcode machinelearning powerbi powerpoint sql tableau
Last synced: 09 Nov 2024
https://github.com/ptyadana/mysql-tableau-for-data-analytics-and-business-intelligence
collection of SQL - Tableau integration projects for Data Analytics and Business Intelligence
business-analytics business-intelligence csv data-analysis data-analytics data-visualizations dataanalysis datavisualization integration mysql mysqlworkbench sql tableau tableau-desktop tableau-public
Last synced: 16 Dec 2024
https://github.com/GeoDaCenter/rgeoda
R library for spatial data analysis based on libgeoda and GeoDa
dataanalysis geoda geospatial r
Last synced: 22 Nov 2024
https://github.com/cis-team/datascience-squad
Data Science Squad Roadmap
cis-team computer-science data-science dataanalysis
Last synced: 08 Nov 2024
https://github.com/akabe/docker-ocaml-jupyter-datascience
Dockerfiles for data science in OCaml on Jupyter
dataanalysis datascience docker dockerfile functional-programming jupyter-notebook machine-learning ocaml
Last synced: 30 Oct 2024
https://github.com/gher-uliege/diva
DIVA (Data-Interpolating Variational Analysis) is a software tool dedicated to the spatial interpolation of in situ data in oceanography.
analysis dataanalysis emodnet interpolation ocean-data ocean-sciences oceanography odv seadatacloud seadatanet
Last synced: 11 Dec 2024
https://github.com/georgehanymilad/data-analysis-and-bi-resources
Data Analysis and BI Resources 📊
business-intelligence data-visualization dataanalysis database excel powerbi python sql tableau
Last synced: 16 Nov 2024
https://github.com/yahoo/cubed
Data Mart As A Service
bigdata businessintelligence dataanalysis datamart etl funnel-analysis saas
Last synced: 13 Nov 2024
https://github.com/hexastack/eazychart
EazyChart is a reactive chart library 📈, it allows you to easily add SVG charts in your React and Vue web applications.
chart charts d3 data dataanalysis dataviz graphs hacktoberfest hacktoberfest2022 javascript library react typescript visualization vue web
Last synced: 17 Dec 2024
https://github.com/uditmahato/heart-attack-analysis
"Heart Attack Analysis" - A data science project for predicting heart attacks using machine learning on health-related data.
dataanalysis heartattack jupyternotebook python
Last synced: 15 Nov 2024
https://github.com/hemansnation/python-for-beginners
Course for Python Beginners
dataanalysis python pythonforbeginner
Last synced: 08 Nov 2024
https://github.com/pingcap/dbt-tidb
A dbt adapter for TiDB
dataanalysis database dbt mysql python sql tidb
Last synced: 06 Nov 2024
https://github.com/developer-student-clubs/dataxchange
Welcome to DataScience Collaborative, a community-driven data science project where data enthusiasts, analysts, and machine learning practitioners come together to collaborate on data analysis tasks and projects. Whether you're a seasoned data scientist or just getting started with data analysis, this is the place to learn, contribute, and grow you
data-sciense dataanalysis hacktoberfest machine-learning
Last synced: 23 Nov 2024
https://github.com/MohammedSardar/Bive
Bive is a Kurdish profanity language processing project.
data dataanalysis kurdish kurdish-corpus kurdish-dataset kurdish-language-processing kurdishdata kurdishnlp
Last synced: 14 Nov 2024
https://github.com/arm-university/arduino-projects-for-schools
Arduino MKR Projects for Schools is a colourful entry-level resource, which introduces learners to the exciting world of microcontrollers, the Internet of Things and Data Science. Learners use both simulators and physical devices to build systems and solve real-life problems.
arduino arduinomkr computerscience computing cs dataanalysis dataanalysisusingpython datascience education embeddedsystems iot pbl physicalcomputing
Last synced: 29 Nov 2024
https://github.com/cecivieira/cotas-genero-eleicoes-e-proposicoes-legislativas
Análise de dados sobre cotas de gênero e seu impacto nas eleições e proposições legislativas da Câmara dos Deputados Federais entre 1934 e 2021. Parte do TCC da pós-graduação em Inteligência Artificial e Aprendizado de Máquina na @pucminas
dataanalysis pandas preprocessing-data python randomforestclassifier
Last synced: 13 Dec 2024
https://github.com/ahammadnafiz/predicta
Predicta: Simplify your workflow with our powerful data analysis and machine learning tool.
analytics data-science data-visualization dataanalysis machine-learning pandas project python streamlit streamlit-webapp webapp
Last synced: 03 Dec 2024
https://github.com/akashkobal/data-science
I'm excited to share my data science project🚀, where I've applied various techniques and insights to solve a specific problem. The project follows best practices for maintainability and reproducibility, using the Data Science Project Template. Dive into the project to explore the code, datasets, documentation, and resources that showcase MyJourney
akash akash-kobal akashkobal applied-data-science artificial-intelligence classification data-science dataanalysis dataanalytics datascienceproject datascientist deep-learning kobal machine-learning prediction regression
Last synced: 05 Dec 2024
https://github.com/karan-malik/prepdata
Automating the process of Data Preprocessing for Data Science
classification data dataanalysis dataframe datapreprocessing datascience machine-learning numpy pandas pip preprocessing pypi-package python python3 random-forest regress sklearn
Last synced: 14 Oct 2024
https://github.com/praveendecode/youtube-data-harvesting-warehousing
Efficient YouTube data harvesting and warehousing with Python, SQL, MongoDB, and Streamlit, enabling seamless analysis and visualization for insightful decision-making in content management and audience engagement strategies
api apiintegration dataanalysis dataharvesting datawarehousing eda mongodb postgres python sql
Last synced: 17 Dec 2024
https://github.com/ersinaksar/make-your-jarvis-usin-gpt-3-and-python
Python to convert audio input from the microphone to text, generate a response from GPT-3 using the OpenAI API, convert the response to speech using the gTTS library, and save the audio to a file.
artificial-intelligence data-science dataanalysis gpt-3 jarvis machine-learning neural-network python smartassistant voice-assistant voice-commands
Last synced: 12 Dec 2024
https://github.com/ashishpatel26/parul-university-sttp-on-data-analysis-with-python
Data Analysis with Python STTP at Parul University
data-science dataanalysis python
Last synced: 19 Nov 2024
https://github.com/huseyincenik/data_science
Data Science materials
data data-science data-structures data-visualization dataanalysis dataengineering datapreparation dataprocessing datascience dataset time-series time-series-analysis timeline timeseries timeseries-analysis timeseriesforecasting
Last synced: 01 Dec 2024
https://github.com/farahibrar/programming-in-python
Explore a comprehensive collection of Python programming for diverse data analysis and data science projects. This repository covers data exploration, visualization, statistical analysis, machine learning, NLP, and model deployment. Perfect for enthusiasts looking to delve into practical examples and advanced techniques.
beautifulsoup dataanalysis docker flask folium jupyter-notebook machine-learning matplotlib nltk numpy pandas python pytorch scikit-learn scikitlearn scipy seaborn spacy statsmodels tensorflow
Last synced: 06 Dec 2024
https://github.com/easonlai/databricks_odbc_connection_to_azure_sql_db_with_azure_ad_user_access_token
Making ODBC connection from Databricks (Azure Databricks) to Azure SQL Database with Azure AD User Access Token.
azure azuread azuredatabricks azuresql azuresqldb bigdata data-analysis dataanalysis dataanalytics databricks databricks-notebooks datascience microsoft microsoft-azure microsoftazure odbc odbc-driver pandas pyodbc spark
Last synced: 10 Nov 2024
https://github.com/sevdanurgenc/pythonsamples
The scope of this project includes examples of data analysis related to python.
artifical-intelligence data dataanalysis datascience machine-learning oop python
Last synced: 30 Nov 2024
https://github.com/sondosaabed/data-analyst-nanodegree
I aquired a full scholarship from Google Launchpad. Advanced data wrangling skills to work with messy, complex real-world datasets. Highly customized visualizations using the Matplotlib Python library
data-science dataanalysis datawrangling nanodegree python udacity-nanodegree
Last synced: 06 Nov 2024
https://github.com/thecoderpinar/ml-perceptron-project
This project demonstrates the implementation of the Perceptron algorithm for binary classification tasks. It includes various advanced features such as data augmentation, feature engineering, and deep learning techniques to enhance model performance and robustness.
artificialintelligence binaryclassification dataanalysis datascience deep-learning jupyter-notebook machinelearning opensource perceptronalgorithm programming python
Last synced: 16 Dec 2024
https://github.com/coderixc/pricefeedx
This Module will load NSE Top 100 Stocks and Its Price (Current LTP and 5 Day Previos Price) will store all NSE Stocks Symbol in Database(using Mysql) .
algorithms algotrading bhavcopy bse dataanalysis equity kiteconnect multicharts nse quant-dev stock-market
Last synced: 28 Nov 2024
https://github.com/sumitgirwal/google-play-store-data-analysis
This Google Play Store dataset from Kaggle, analysis using Python, NumPy , Pandas , and Matplotlib.In dataset analysis, you can view the best app 5-star rating app, most review app, or most download app, etc.
dataanalysis matplotlib numpy pandas playstore-data-analysis python3
Last synced: 25 Nov 2024
https://github.com/nagipragalathan/animal_prediction
Animal Prediction is a web application developed using Python and Django that allows users to predict the animal species based on uploaded photos. The application utilizes machine learning algorithms to analyze the image data and provide accurate predictions.
animalclassification animalprediction animalrecognition computervision dataanalysis datascience deeplearning django githubproject imageclassification imageprocessing machinelearning opensource predictiveanalysis python speciesprediction webapplication
Last synced: 18 Dec 2024
https://github.com/newking9088/sql-guide-to-solve-complex-data-science-problems
This is a comprehensive SQL guide for both MySQL users and PostgreSQL users, covering topics from basic `SELECT` statements to advanced window functions. My SQL learning journey and the suggestions from my mentees, colleagues and seniors to document this were the motivations to write this document.
dataanalysis datascience-machinelearning mysql postgresql sql
Last synced: 27 Oct 2024
https://github.com/adritpal08/export-and-import-data-analysis-dashborad-using-power-bi
Export and Import Data Analysis Dashboard using Power BI
dataanalysis datamodeling datavisualization export-import powerbi
Last synced: 22 Nov 2024
https://github.com/ctroupin/ctroupin.github.io
Personal webpage
data-visualization dataanalysis julia leaflet oceanography python remote-sensing running science trail-running
Last synced: 20 Oct 2024
https://github.com/adritpal08/e-commerce-sales-analysis-dashboard-using-power-bi
E-Commerce Sales Analysis Dashboard using Power BI
dataanalysis datamodeling datavisualization e-commerce-project powerbi
Last synced: 22 Nov 2024
https://github.com/faizanmohd5/web-scraping-iphone-11-reviews
This is a web scraping project that extracts customer reviews for the iPhone 11 from Flipkart.com using Python and BeautifulSoup. The extracted data is saved in a CSV file for further analysis. Use it as a starting point for your own web scraping projects or for analyzing customer reviews of the iPhone 11.
beautifulsoup csv data-visualization dataanalysis dataextraction datainsights datamining datapreprocessing ecommerce-website ipython-notebook jupyter-notebook python reviews reviewscrapper webscraping
Last synced: 13 Nov 2024
https://github.com/omkarpattnaik8080/pandas
Using dataset from kaggle I am implementing basic and advanced panda operations to analyze data from the dataset
colab-notebook data-science dataanalysis jupyter-notebook machine-learning pandas python
Last synced: 14 Nov 2024
https://github.com/sarincr/data-visualizations-and-dashboards
Data visualization is an interdisciplinary field that deals with the graphic representation of data. It is a particularly efficient way of communicating when the data is numerous as for example a Time Series.
analytics artificial-intelligence big-data bokeh businessintelligence dashboard data-science dataanalysis dataanalytics datavisualization dataviz deep-learning machine-learning matplotlib plotly python python3 seaborn visualization
Last synced: 20 Nov 2024
https://github.com/kingabzpro/kaggle-competition-2020
In the past decade, computer science has evolved and the importance of Data Science has become the new norm, every company s looking to invest more in Machine learning. This is where someone like me who never had an Interest before got curious and started learning more about this world of Kaggle, it took me 3 months to get hold of some of the basic and advanced tools used by Data Scientists. I am using those same tools to evaluate this data set and come up with the best conclusion. In this Notebook, I will be telling you the story of data and I will be sharing my own experience so that any beginner can learn from my mistake and get ahead.
beginner-project data-visualization dataanalysis kaggle-competition learning python3
Last synced: 17 Nov 2024
https://github.com/sukanyabag/alexa-reviews-sentiment-analysis
This repository's notebook provides the insight of customer reviews on Amazon's kid Alexa, by data visualization, sentiment analysis and classification using NLP techniques.
dataanalysis datavisualization kaggle-dataset natural-language-processing sentiment-analysis sentiment-classification
Last synced: 10 Nov 2024
https://github.com/adritpal08/customer-churn-analysis-report-using-power-bi
Customer Churn Analysis Report using powerbi
customer-churn-analysis dataanalysis datamodeling datavisualization powerbi
Last synced: 22 Nov 2024
https://github.com/sumitgirwal/pokemon-dataset-data-analysis
This is a simple analysis of Pokemon dataset from Kaggle using Python , NumPy, Pandas, and Matplotlib.
dataanalysis kaggledatasets matplotlib numpy pandas-python pokemon-dataset python3
Last synced: 25 Nov 2024
https://github.com/emso-exe/venda_de_medicamentos_controlados_e_antimicrobianos_-_industrializados
Projeto de análise de vendas de medicamentos controlados por um período de 12 meses e perfil dos consumidores com base nos dados disponibilizados pela Anvisa.
analise-de-dados anvisa dataanalysis dataanalyst dataanalytics medicament medicamento medicamentos medicaments python python-3 python3
Last synced: 15 Nov 2024
https://github.com/arm-university/asp_smart-schools-on-arduino
Our Smart Schools resource provides accessible and engaging projects for teachers and learners that utilise the more advanced features of Arduino in real-world contexts.
arduino computerscience computing computingscience cs dataanalysis electronics embeddedsystems iot physicalcomputing pthon
Last synced: 29 Nov 2024
https://github.com/fortunewalla/airportdb
We used the http://flughafendb.cc/ airport database (small) and converted most of the fields to English.
airport airport-data airport-db airportdb dataanalysis database datascience flughafen flughafendb heatwave machinelearning mysql oracle sample-database sql sqlquery
Last synced: 04 Dec 2024
https://github.com/solrikk/datadigger
DataDigger is a powerful and intuitive web application designed to extract and analyze data from web pages.
automation competitive-analysis content-analysis data data-automation data-processing dataanalysis excel excelize go golang goquery http scraper-tools web-crawler webscraping
Last synced: 08 Dec 2024
https://github.com/camara94/data-science
Bienvenu dans ce tutorie, aucours duquel nous allons découvrir la librairie pandas qui est l'une des libraire les plus importantes en python, lorsque nous voulons découvrir la data science. Avec cette librairie nous pouvons faire tout ce dont nous pouvons imaginer en data science en python
data-science dataanalysis pandas-dataframe python
Last synced: 05 Nov 2024
https://github.com/syedfaiqueali/the-sparks-foundation
Data Science and Business Analytics Tasks
data-science dataanalysis sparksfoundation-intern
Last synced: 17 Dec 2024
https://github.com/mrfoxak/artificial-intelligence
This is All About AI & ML
airtificialintelligence data-science dataanalysis datapreprocessing datavisualization deep-learning feature-engineering feature-extraction feature-selection jyputer-notebook machine-learning machine-learning-algorithms natural-language-processing neural-network python
Last synced: 28 Nov 2024
https://github.com/deva-246/data-anlaysis-on-real-time-swiggy-data-using-excel
Analyzing Real time food ordering data using Excel!
dataanalysis dataorder datatransformation datavisualization keytrends microsoftexcel pivottabe statisticalfunctions
Last synced: 28 Nov 2024
https://github.com/rushilsharma1/financialdataset_frauddetection_analysis
Analysis of the fraud detection from the financial dataset using sql workbench and command line
command-line dataanalysis datascience insights sql sqlquery sqlworkbench
Last synced: 09 Nov 2024
https://github.com/davidemiceli/social-analytics-platform
Open Source Web Platform for Social Media Analytics
analytics artificial-intelligence bigdata dataanalysis datascience hapijs marketing monitoring nodejs socialmedia vuejs2
Last synced: 27 Nov 2024
https://github.com/keanteng/kaggle_notebook
📚A repository that stores my notebooks on Kaggle and other learnings.
dataanalysis datascience kaggle learning notebook python r
Last synced: 02 Dec 2024
https://github.com/joyalshaji135/r-workshop
R programming is a popular language for data analysis and statistical computing. It offers a rich ecosystem of packages and libraries for a wide range of data-related tasks. In R, you can manipulate data frames, perform statistical analysis, create data visualizations, and generate reports with ease. The language's open-source nature and active com
dataanalysis datavisualization r
Last synced: 28 Nov 2024
https://github.com/shuklayash02/complete_data_analysis_project
A Full Data Analysis project where a sales data is ask,prepare,process,analyze,share and act through data analysis process
data data-visualization dataanalysis database datacleaning powerbi sql
Last synced: 05 Nov 2024
https://github.com/sumitgirwal/super-market-research
This is simple research on supermarket.its must you know how your organization or corporation growth is increasing or decreasing.
dataanalysis matplotlib numpy pandas python3 supermarket-dataset
Last synced: 25 Nov 2024
https://github.com/wahidpanda/apple-company-stock_market_prediction-forecasting-ml
Apple stock market prediction and Forecasting for next 30 days with LSTM
buisness-intelligence data-science dataanalysis forecasting-models lstm-neural-networks machine-learning-algorithms machinelearningprojects prediction stock-market stock-prediction stock-price-prediction
Last synced: 21 Nov 2024
https://github.com/mart-dore/datascience-portfolio
Complete data science analysis + API deployemeny of 'MPG car' dataset
api dataanalysis datascience flask machinelearning
Last synced: 09 Nov 2024
https://github.com/pavankethavath/microsoft-classifying-cybersecurity-incidents-with-ml
A machine learning pipeline for classifying cybersecurity incidents as True Positive(TP), Benign Positive(BP), or False Positive(FP) using the Microsoft GUIDE dataset. Features advanced preprocessing, XGBoost optimization, SMOTE, SHAP analysis, and deployment-ready models. Tools: Python, scikit-learn, XGBoost, LightGBM, SHAP and imbalanced-learn
classificationreport correlation-analysis dataanalysis decision-tree-classifier exploratory-data-analysis feature-engineering feature-selection gradientboosting hyperparameter-tuning joblib lgbmclassifier logistic-regression machine-learning modelselection pandas randomforestclassifier randomsearchcv shap smote xgboost-classifier
Last synced: 10 Dec 2024
https://github.com/aisurjyasamantaray/customer-purchase-analysis
This project involves an in-depth analysis of customer purchasing behavior and sales performance to drive business insights and strategies.
customer-purchasing customer-segmentation data-visualization dataanalysis dataanalysis-projects marketing-insights matplotlib numpy pandas python revenue-and-performance-analysis seaborn-plots
Last synced: 18 Nov 2024
https://github.com/sumitgirwal/drinks-dataset-data-analysis
A drinks dataset from Kaggle. Applying analysis using Python, NumPy, Pandas, Matplotlib.
dataanalysis drinks-dataset kaggle matplotlib numpy pandas python3
Last synced: 25 Nov 2024
https://github.com/georgehanymilad/heart-disease-detection
Machine Learning Project
anaconda classification classification-algorithm dataanalysis datascience kaggle machine-learning machine-learning-algorithms matplotlib numpy pandas python python3 seaborn
Last synced: 16 Nov 2024
https://github.com/reddyprasade/bicycle-sharing-system-in-us
A bicycle-sharing system, public bicycle system, or bike-share scheme, is a service in which bicycles are made available for shared use to individuals on a short term basis for a price or free. Many bike share systems allow people to borrow a bike from a "dock" and return it at another dock belonging to the same system. Docks are special bike racks that lock the bike, and only release it by computer control. The user enters payment information, and the computer unlocks a bike. The user returns the bike by placing it in the dock, which locks it in place. Other systems are dockless. For many systems, smartphone mapping apps show nearby available bikes and open docks.
dataanalysis jupyter maplotlib numpy pandas python r
Last synced: 06 Dec 2024
https://github.com/sarincr/basics-of-julia-programming-language
Julia is a high-level, high-performance, dynamic programming language. While it is a general purpose language and can be used to write any application, many of its features are well-suited for high-performance numerical analysis and computational science.
data data-analysis data-mining data-science data-visualization dataanalysis dataanalytics datascience julia julia-language julia-library julia-package julialang machine-learning
Last synced: 20 Nov 2024
https://github.com/harmanveer2546/movie-industry
Investigate the film industry to gain sufficient understanding of what attributes to success and in turn utilize this analysis to create actionable recommendations for companies to enter the industry.
dataanalysis datetime matplotlib numpy pandas python seaborn
Last synced: 13 Nov 2024
https://github.com/javipsanchez/gcp_projects
Google Cloud Projects
cloud data dataanalysis dataengineering datascience deployment
Last synced: 24 Nov 2024
https://github.com/kingabzpro/digital-learning-during-covid19-eda
In this project, we will be using data analysis tools to figure out trends in digital learning and how it is effective towards improvised communities. We will be comparing districts and states on factors like demography, internet access, learning product access, and finance.
covid-19 data-science dataanalysis eda education learnplatform usa
Last synced: 17 Nov 2024
https://github.com/shibam120302/data-scraper-swiggy-restaurants
A simple scraper to scrape restaurant data from swiggy
dataanalysis scraper selenium webscraping
Last synced: 20 Nov 2024
https://github.com/lisakey/datacamp-data-analyst-python-sql-projects
Several projects completed during my Data Analyst 📊 training on the DataCamp platform with Python 🐍 and SQL 🗃️. Each project addresses real-world challenges using modern analytical tools and techniques.
analysis cleaning-data data dataanalysis dataanalyst matplotlib pandas python seaborn sql transformation visuali
Last synced: 06 Nov 2024
https://github.com/mchenryspagg/creating-a-dimensional-data-model
This project involves creating a dimensional data model using MySQL Workbench for a car repair shop’s operations in western Canada by examining a sample invoice,
dataanalysis dimensional-modeling erdiagram mysql mysql-database salesanalysis
Last synced: 06 Dec 2024
https://github.com/kunalshelke90/predict-bank-credit-risk-using-south-german-credit-data
This is an end-to-end ML project, which aims at developing a classification model for the problem of classifying a given customer profile into either of the risk category (safe or not safe). The final classifier used for this project is CatBoost classifier. Deployed in AWS.
aws cassandra catboost-classifier classification credit-risk data data-science dataanalysis dockerfile finance financial-analysis flask github-actions logging machine-learning mlflow numpy pandas python
Last synced: 19 Nov 2024
https://github.com/mishaa931/aircovid
This repository contains Python code using Plotly and Pandas to analyze airport traffic data during the COVID-19 pandemic. The code aims to provide visual insights into the impact of the pandemic on airport operations .
data-science dataanalysis datavisualization interactive-visualizations pandas python
Last synced: 12 Nov 2024
https://github.com/montanaz0r/imdb-ratings-auto-inserter
A Python script that enables auto-inserting movie ratings into the IMDB profile.
data data-science dataanalysis imdb movies pandas pandas-dataframe python3 selenium selenium-webdriver webscraping
Last synced: 14 Dec 2024
https://github.com/imsalmanmalik/linear-regression-model-airbnb-prices-seattle
Linear Regression Model on Airbnb prices of Seattle using Dash and Python
airbnb choropleth-map dash dataanalysis datacleaning datamanupilation datascience exploratory-data-analysis feature-engineering machine-learning matplotlib normalization numpy onehot-encoding pandas python seaborn-plots sklearn-library trainandtestsets visualization
Last synced: 09 Nov 2024
https://github.com/trainingbypackt/beginning-data-science-with-python-and-jupyter-elearning
Perform reproducible data analyses with these data exploration tools
dataanalysis datascience html jupyter jupyter-notebook python
Last synced: 14 Nov 2024
https://github.com/san089/black-friday-sales-analysis
This Project gives an insight into few statistics related to black Friday Sale.
custom data dataanalysis insights sales statistics
Last synced: 16 Nov 2024
https://github.com/karan-malik/uberdataanalysis
Uber Data Analysis and Visualization using Python
data-analysis data-analysis-python data-analytics data-science data-visualization dataanalysis matplotlib-pyplot numpy pandas pandas-dataframe python python3 seaborn uber uber-data
Last synced: 14 Nov 2024
https://github.com/bhavik-jikadara/movie-recommendation-system
Recommendation systems are among the most popular applications of data science. They are used to predict the Rating or Preference that a user would give to an item.
data-science dataanalysis deep-learning machine-learning movie-recommendation movierecommendationsystem python
Last synced: 09 Nov 2024
https://github.com/hectorta1989/802.15.4-wireless-mac-level-performance
MAC-level performance and parameters dataset for 802.15.4 wireless networks.
802154 dataanalysis datasets layer2 lr-wpan macprotocol python3 wireless
Last synced: 10 Nov 2024
https://github.com/anoopgeorge418/end2end-datascience
This repository is a treasure trove of data science projects that showcase my learning journey. Here, you'll find a variety of projects covering the entire data science lifecycle—from data cleaning and exploration to machine learning, model evaluation, and deployment. Each project is designed to tackle real-world problems, with detailed explanation
dataanalysis datacollection datascience flask machine-learning python
Last synced: 10 Nov 2024
https://github.com/prajwalsinha/unveiling-climate-change-dynamics-through-earth-surface-temperature-analysis
Climate change analysis through global surface temperature data. Includes data preprocessing, statistical analysis, visualizations, and forecasting. Python-based project using Pandas, Matplotlib, and Scikit-learn.
data dataanalysis dynamic-mapping pyplot python scikit-learn seaborn
Last synced: 21 Dec 2024
https://github.com/pavankethavath/car_dekho_car_price_prediction
A Streamlit web app utilizing Python, scikit-learn, and pandas for used car price prediction. Features data preprocessing (scaling, encoding), Random Forest model optimization with GridSearchCV, and interactive user input handling. Achieves high accuracy (R² score: 0.9028), showcasing skills in machine learning, data engineering, and deployment.
dataanalysis datacleaning datapreprocessing eda encoding feature-extraction feature-selection featureimportance fine-tuning machine-learning minmaxscaling normalization pandas pickle prediction-model python random-forest randomsearch-cv regression streamlit
Last synced: 10 Dec 2024
https://github.com/yessasvini23/cisco-data-analytics-essentials_-virtual-_internship
From the CISCO Networking Academy
data dataanalysis database datascience excel relational-databases sql statistics structured-query tableau
Last synced: 09 Nov 2024
https://github.com/shoaib-33/dhaka-city-rent-analysis
A project to visualize the rent analysis of entire Dhaka city
data-science dataanalysis datavisualization selenium tableau webscraping
Last synced: 13 Nov 2024
https://github.com/akashash01/hr-analytics
Power BI report created for Job application tracking(HR analysis) under certain requirements. note: Flat file converted into Star Schema(Data modelling).
dashboard dataanalysis powerbi starschema
Last synced: 14 Dec 2024
https://github.com/akashash01/restaurant-analysis
Power BI report created for Restaurant sales analysis under certain conditions and requirements.
dashboard dataanalysis dax grouping-and-summarizing powerbi sales
Last synced: 14 Dec 2024
https://github.com/tanay-dwivedi/netflix-dataset-data-analysis
This project involves analyzing a Netflix dataset to derive insights on show ratings, release trends, durations, geographical distribution, and content types, facilitating strategic decision-making and content curation efforts.
dataanalysis netflix python seaborn visualization
Last synced: 07 Nov 2024
https://github.com/akashash01/kpi-analysis
Power Bi report created for KPI analysis of an Industrial machines under certain requirements.
dashboard dataanalysis dax kpi-report powerbi
Last synced: 14 Dec 2024
https://github.com/akashash01/performance-report
This is an simple Data analysis report shows the Annual Performance rate of an Employee.
dataanalysis dax-languague excel performance-analysis powerbi
Last synced: 14 Dec 2024
https://github.com/tanay-dwivedi/marathon-running-data-analysis
The project involves exploratory data analysis of marathon race data to uncover patterns, explore relationships between variables, and generate insights aimed at optimizing athlete performance and race organization strategies.
dataanalysis dataset marathon matplotlib-pyplot python seaborn visualization
Last synced: 07 Nov 2024
https://github.com/akashash01/energy-consumption-report
Analyzed the energy consumption data on different attributes over the years on USA.
dataanalysis datavisualization dax energy-consumption excel powerbi
Last synced: 14 Dec 2024
https://github.com/akashash01/coffee_sales_analysis
Analyzed the coffee sales data using My SQL and retrieved Business related insights.
dataanalysis mysql sales-analysis
Last synced: 14 Dec 2024