Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2024-11-19 00:06:50 UTC
https://github.com/gjjvdburg/veld
Easy command line analytics
cli command-line-tool data-analysis data-science data-visualization statistics
Last synced: 09 Nov 2024
https://github.com/johannaschmidle/road-collisions-project
Understanding Accident Severity for Effective Road Management [Excel]
data-analysis data-visualization excel pivot-tables traffic-analysis
Last synced: 12 Nov 2024
https://github.com/johannaschmidle/amazon-cat-couch
Customer product reviews + ratings analysis and visualization [Python, Excel, Tableau, R]
data-analysis data-visualization jupyter-notebook python-notebook r-markdown sentiment-analysis text-analysis web-scraping
Last synced: 12 Nov 2024
https://github.com/techshot25/blackfriday
Shopping behavior data analysis
consumer data-analysis data-science gradient-descent machine-learning normal-equation regression-analysis shopping shopping-habits statistics
Last synced: 10 Nov 2024
https://github.com/johannaschmidle/netflix-subscription-analysis
Analyzing Netflix subscription trends from 2021 - 2023 [SQL, Tableau]
data-analysis data-cleaning data-trend data-visualization netflix
Last synced: 12 Nov 2024
https://github.com/abhirajp595/python
Data Science Project using Python
data-analysis data-science data-visualization eda jyputer-notebook numpy pandas statistics
Last synced: 15 Nov 2024
https://github.com/techshot25/graduateadmissions
Looking at the probability of being accepted in a graduate program using a machine learning model
bayesian-regression correlation-matrices data-analysis data-science linear-regression machie-learning random-forest-regression regression ridge-regression
Last synced: 10 Nov 2024
https://github.com/zeshanfareed/graduation_admission_prediction_ml_django_project
The predict file is the Django Framework code file
data-analysis data-visualization datasets-csv django-framework machine-learning machinelearning-python python
Last synced: 11 Oct 2024
https://github.com/davidzajac1/four-percent-rule-pandas-analysis
Analysis of the 4% Personal Finance Rule of Thumb
data-analysis data-visualization pandas python
Last synced: 09 Nov 2024
https://github.com/shubham200137/cyclistic-case-study
This repository contains a case study for Google's Data Analytics Professional Certificate, focusing on Cyclistic, a fictional bike sharing company in Chicago. The case study aims to drive growth by converting casual riders into members through a marketing strategy.
data-analysis data-visualization numpy-python pandas-python presentation-slides sql tableau
Last synced: 10 Nov 2024
https://github.com/min-thway-htut/r-programming
Repository for R-Programming
Last synced: 10 Nov 2024
https://github.com/burhanahmed1/recipe-recommendor-using-pyspark
A smart recipe recommendation system that suggests recipes based on ingredient similarities. This project is done in PySpark
data-analysis data-science datawrangling education learning-python machine-learning machine-learning-algorithms nltk-python numpy pandas pyspark python python-project reccomendersystem recommendation-system
Last synced: 10 Nov 2024
https://github.com/analitico-771/etf_analyzer
An application that pulls and analyzes ETF data from a database
conda-environment data-analysis data-structures data-visualization database etf-investments fintech hvplot pandas-dataframe python quantitative-finance sqlalchemy
Last synced: 09 Nov 2024
https://github.com/dthung1602/goodread-bestbook-prediction
Data analysis - trying to predict the result of the Goodreads Choice Award
data-analysis goodreads pca python r xgboost
Last synced: 09 Nov 2024
https://github.com/ninadpatil09/hospital_emergency_room_analysis
This comprehensive analysis delves into the performance and characteristics of the hospital's emergency room over the past year. By scrutinizing key metrics and patient demographics, this study aims to provide valuable insights for optimizing patient care, resource allocation, and overall operational efficiency.
data-analysis tableau-public visualization
Last synced: 09 Nov 2024
https://github.com/ninadpatil09/heart_disease_detection_analysis
The Heart Disease Detection Analysis aims to create a predictive model for identifying individuals at risk of heart disease. Using a dataset with attributes like age, sex, and health metrics, the project focuses on distinguishing patients with and without heart disease.
data-analysis data-cleaning data-science data-visualization machine-learning
Last synced: 09 Nov 2024
https://github.com/samalyarov/practicum_projects
Various data analysis projects displaying tools and instruments that I am proficient with
data-analysis datetime folium geojson matplotlib numpy pandas plotly postgresql powerpoint python regular-expressions requests-library-python scipy seaborn sql sqlalchemy tableau tqdm
Last synced: 13 Nov 2024
https://github.com/aymane-maghouti/sentiment-analysis-for-jumia-reviews-and-smartphone-price-prediction-system
The project focuses on customer sentiment analysis for Jumia, aiding informed online decisions. It collects and analyzes product comments to determine sentiments and implements a decision-making algorithm. Additionally, it includes a product price prediction system using regression techniques.
beutifulsoup data-analysis data-cleaning data-collection data-preprocessing data-scraping data-visualization eda falsk machine-learning python web-application
Last synced: 16 Nov 2024
https://github.com/mahdi-meyghani/movie-recommendation-system
A Python-based movie recommendation system utilizing popularity-based, content-based, and collaborative filtering models with data science and machine learning techniques.
data-analysis data-science machine-learning recommendation-system scikit-learn scikitlearn-machine-learning
Last synced: 10 Oct 2024
https://github.com/puspacempaka/superstore-analysis-with-sql
This repository showcases various data analyses on the popular Superstore dataset using SQL queries. The analyses cover a range of business insights, including sales performance, customer segmentation, and product profitability. Each analysis is documented with the SQL queries used and explanations of the steps involved.
business-intelligence data-analysis sales-analysis sql superstore-dataset
Last synced: 15 Nov 2024
https://github.com/agustin-caceres/proyecto-data-analyst
Data Analyst project on telecommunications services in Argentina
business-analytics business-intelligence data-analysis data-visualization database postgresql python streamlit
Last synced: 11 Nov 2024
https://github.com/whis99/data_analysis_journey
A repository of my data analysis projects.
data data-analysis data-analysis-python data-visualization dataset jupyter-notebook matplotlib python visualization
Last synced: 13 Nov 2024
https://github.com/pratanup/solar-power-generation-prediction
A solar power generation company wants to optimize solar power production and needs the prediction model to predict ‘Clearsky DHI’, ‘Clearsky DNI’, ‘Clearsky GHI’.
anaconda data-analysis data-science google-colab jupiter-notebook machine-learning machine-learning-algorithms machinelearning-python prediction prediction-model python
Last synced: 08 Nov 2024
https://github.com/pratanup/bank-customer-churn
Prediction models based on ML and DL, with their performances compared, to find churned customers
adaboost-classifier ann churn-prediction data-analysis data-visualization decision-tree-classifier deep-learning deep-learning-algorithms gaussian-naive-bayes-classification gradient-boosting-classifier k-nearest-neighbours logistic-regression machine-learning machine-learning-algorithms random-forest-classifier svc svm-classifier xgboost-classifier
Last synced: 08 Nov 2024
https://github.com/aangelone2/das
A simple Data Analysis Suite
correlation-analysis data-analysis monte-carlo numpy statistics
Last synced: 14 Nov 2024
https://github.com/aangelone2/das-c
Lightweight parallel Data Analysis Suite in C
c correlation-analysis data-analysis monte-carlo multithreading openmp
Last synced: 14 Nov 2024
https://github.com/inevolin/multivariate-data-analysis
Showcases of modern multivariate & multidimensional data analysis in industrial and high-tech settings.
analytics data-analysis data-science data-visualization javascript
Last synced: 12 Nov 2024
https://github.com/sabelomkhwanzi/data-alchemist-boot-camp
Built on Covalent's Unified API, Increment has the full historical data set for 40+ chains including every smart contract, event, transaction, address, etc. With access to all this data you can find:
covalent data-analysis increment
Last synced: 17 Nov 2024
https://github.com/vagnerbellacosa/116_usandoamazontextractocrextracaodadosdynamodb
Using Amazon Textract as OCR for data extraction in DynamoDB
amazon-textract aws data-analysis data-extraction digital-innovation-one dio dynamodb lab ocr python
Last synced: 09 Nov 2024
https://github.com/syedanimrafatima/ecommerce-store-sales-analysis-powerbi
The Sales Analysis Dashboard is designed to help an E-commerce Business to overview their Sales performance throughout the year. It includes a report and visualizations that cover sales performance, customer segmentation, product analysis, and more.
business-intelligence csv dashboard data-analysis data-cleaning data-visualization excel powerbi sales-analysis-dashboard storytelling
Last synced: 09 Nov 2024
https://github.com/abishekaditya/machinelearningintro
Some simple stuff with pandas and Scipy
data-analysis ipython machine-learning pandas python scipy
Last synced: 08 Nov 2024
https://github.com/meokullu/prefill
PreFill adds desired characters onto output values to increase their legibility.
alignment data data-analysis data-engineering data-science legibility
Last synced: 14 Nov 2024
https://github.com/kwonnayeon/urban-parks-childrens-happiness
A thesis project exploring the causal impact of urban parks on children's happiness, with data, results, and code.
causal-inference data-analysis environmental-psychology latex matching propensity-score public-health r social-science statistical-analysis thesis-project urban-studies weighting
Last synced: 02 Nov 2024
https://github.com/imrandil/sql_practice_with_analysis
SQL practice using a Postgres database, with Docker as the tool to set up Postgres; loving the SQL way
data-analysis docker markdown postgres sql
Last synced: 08 Nov 2024
https://github.com/imrandil/excel_learning_dir
Excel learning practice with some data, the doing
Last synced: 08 Nov 2024
https://github.com/fisseha-estifanos/telecom
A showcase repository for a specific telecommunication company. Used to analyze several telecommunication data set features and generate useful insights accordingly. Insights generated could be seen at https://github.com/Fisseha-Estifanos/telecom-visualizer or at https://fisseha-estifanos-telecom-visualizer-home-huxgy0.streamlitapp.com/
data-analysis notebooks-jupyter python visual-studio-code visualization
Last synced: 09 Nov 2024
https://github.com/nandit123/python_on_excel
Data Analysis using python libraries on excel data
csv data-analysis data-science fill fluctuations graph numpy python python-library
Last synced: 12 Nov 2024
https://github.com/drill-n-bass/dealavo-project
Cartesian product from dictionary to list of dictionaries and faster methods for finding index than the `index` method.
data-analysis data-analysis-python matplotlib pandas python python3 random timeit
Last synced: 07 Nov 2024
https://github.com/drill-n-bass/ovh-project
The goal of this task is to prepare a statistical analysis of a set of data from disks.
anaconda analysis data-analysis data-analysis-python jupyter-notebook matplotlib-python pandas python3 seaborn-plots
Last synced: 07 Nov 2024
https://github.com/ankit21111/carpredict
This project predicts car prices using machine learning models, including Simple and Multiple Linear Regression. It covers data acquisition, feature selection, and optimization techniques like Ridge Regression. The best model, Multiple Linear Regression, achieved an R² score of 0.84. Check out the full analysis in the repository!
data-analysis data-visualization matplotlib numpy pandas pyhton scipy seaborn sklearn
Last synced: 19 Nov 2024
https://github.com/madusales/powerbi-etl-elt
Through DIO's Bootcamp on Data Analytics & Power BI, I have been studying the use of SQL to build BI solutions. This repository is dedicated to recording what I have learned so far about what BI is, types of analysis, ETL, and ELT.
big-data business-intelligence data-analysis powerbi
Last synced: 11 Nov 2024
https://github.com/abhipatel35/diabetes_ml_classification
Predict diabetes using machine learning models. Experiment with logistic regression, decision trees, and random forests to achieve accurate predictions based on health indicators. Complete lifecycle of ML project included.
classification data-analysis data-science data-visualization descision-tree diabetes-prediction jupiter-notebook logistic-regression machine-learning model-evaluation open-source pandas pycharm-ide python random-forest scikit-learn
Last synced: 31 Oct 2024
https://github.com/ireneflorez/exploration_r
Data exploration on the 'White Wine Quality' dataset using R
data-analysis data-visualization r
Last synced: 12 Nov 2024
https://github.com/mecha-aima/demographic-analyzer
This project uses pandas to process census data from a csv file and draw useful results from the data by performing various filtering and calculations on it
data-analysis data-science pandas
Last synced: 09 Nov 2024
https://github.com/davydantoniuk/statistics-r-2024
data-analysis r regression statistics
Last synced: 19 Nov 2024
https://github.com/patricialjohnson/data-visualization-tableau-project
Tableau Visualization Project
business-analytics business-intelligence data-analysis data-visualization digital-marketing digital-marketing-agency kpi microsoft-excel program-management project-management python search-engine-optimization seo sql tableau
Last synced: 10 Nov 2024
https://github.com/danpoynor/pet-shelter-data-analysis-notebook
Demonstration of skills analyzing data from a pet shelter. The CSV data contains tables detailing the incoming and outgoing animals and I use my knowledge of Pandas to gather and present the requested information.
csv data-analysis data-cleaning data-science jupyter-notebook matplotlib numpy pandas pet-shelter tabular-data
Last synced: 16 Nov 2024
https://github.com/jrh89/sorting-hat
With a simple and user-friendly interface, the GUI allows users to easily enter data and extract the numbers they need and then sort and graph them.
data-analysis data-visualization datascience executable graphs-algorithms gui python sorting sorting-algorithms sorting-algorithms-implemented
Last synced: 10 Nov 2024
https://github.com/cicku/en.650.672
HW of EN.650.672
analytics data-analysis numpy pandas
Last synced: 09 Nov 2024
https://github.com/davydantoniuk/yahoo-financial-analyse
data-analysis jupyter-notebook python yahoo-finance
Last synced: 19 Nov 2024
https://github.com/pipe199x/end-to-end-prediction-california
End-to-end prediction project using various technologies to predict housing prices in California.
california-housing data-analysis machine-learning python
Last synced: 12 Nov 2024
https://github.com/rkreddybogati/data-engineering-interview
Explore data engineering architectures in this Git project
data-analysis data-cleaning data-engi data-engineering-pipeline data-mining data-processing data-visualization python sql-query
Last synced: 08 Nov 2024
https://github.com/madrury/commute-times
Simulated Commute Times Data
data-analysis data-science data-visualization dataset
Last synced: 10 Nov 2024
https://github.com/madrury/hot-sauce
Simulation of a Hot Sauce Spiciness Dataset
data-analysis data-science data-visualization dataset machine-learning
Last synced: 10 Nov 2024
https://github.com/davydantoniuk/stackoverflow-graph-analyse-r
data-analysis graph r stackoverflow
Last synced: 19 Nov 2024
https://github.com/priyadarshinijain/air-quality-data-analysis-and-visualization
🌍 Air Quality Data Analysis and Visualization
data-analysis jupyter-notebook python visualization
Last synced: 31 Oct 2024
https://github.com/sinsunsan/earth-survival-kit
Global warning data visualisation app to make everyone understand global warning and take actions that matter
angular angular7 d3 data-analysis data-visualization ecology global-warning ngx-charts
Last synced: 08 Nov 2024
https://github.com/abhroroy365/market_analysis
This project explores customer segmentation and market analysis in the context of online retail using an online retail dataset. By applying advanced analytics, we aim to uncover insights that can drive strategic decisions and enhance business performance.
clustering data data-analysis data-visualization kmeans-clustering machine-learning market-analysis python silhouette-analysis
Last synced: 09 Nov 2024
https://github.com/zafir100100/cancer-stage-prediction
This code predicts cancer data using various regression models, calculates their average R-squared scores, and prints the best model.
cross-validation data-analysis data-preprocessing decision-trees gradient-boosting linear-regression machine-learning-algorithms numpy pandas random-forest regression scikit-learn
Last synced: 11 Nov 2024
https://github.com/rahulsm20/car-data
A data analytics project that involves analyzing a car dataset that includes information on various car brands, years, prices, mileage, and fuel types, in order to gain insights into the car market.
data-analysis data-analytics matplotlib numpy pandas python
Last synced: 10 Nov 2024
https://github.com/rahulsm20/storedata
A data analysis project aimed at analyzing the sales data of the super store and providing useful insight into customer preferences.
data-analysis matplotlib numpy pandas python streamlit
Last synced: 10 Nov 2024
https://github.com/kentlouisetonino/ama-project-data-analysis
A course project for MATH 6200.
ama-university data-analysis python
Last synced: 12 Nov 2024
https://github.com/anoopgeorge418/linked-analytics
LinkedAnalytics is a project that scrapes LinkedIn data, analyzes it to uncover valuable insights, builds predictive models, and deploys them for practical applications. This repository contains all scripts, analysis notebooks, and deployment code needed to replicate the process.
beautifulsoup4 bokeh data-analysis data-science linkdin linkdindata machine-learning matplotlib numpy pandas plotly python requests seaborn sql web-scraping
Last synced: 10 Nov 2024
https://github.com/alan-oliveir/state-of-data-2022
In this project I analyze the distribution of salary ranges for junior-level professionals in the roles of data analyst, data scientist, and data engineer. The data comes from State of Data Brazil, one of the largest surveys of the Brazilian job market in the data field.
data-analysis jupyter-notebook pandas-python seaborn-python
Last synced: 14 Nov 2024
https://github.com/rahulsm20/insurance-data
A data analytics project dealing with risk assessment and its effects in health insurance.
data-analysis data-analytics machine-learning matplotlib numpy pandas python scikit-learn
Last synced: 10 Nov 2024
https://github.com/rahulsm20/trackbyte
A full-stack web application that helps users keep track of their playlist and provides analytics based on their music taste.
bootstrap data-analysis expressjs mysql nodejs reactjs sql
Last synced: 10 Nov 2024
https://github.com/douglasdavis/twaml
tW Analysis Machine Learning
data-analysis high-energy-physics machine-learning python
Last synced: 11 Nov 2024
https://github.com/themihirmathur/qlik-intern-project
Qlik Analysis of Road Safety & Accident Patterns in India. 📈 Analyzed and visualized road safety data for 20.85k+ accident cases with 9+ accident data patterns in India using Qlik. 📉 Reduced inefficiencies by 25% by designing an avant-garde data-tracking dashboard that monitored injuries.
data-analysis data-visualization presentation qlik qlik-cloud qlik-sense qlikview
Last synced: 12 Nov 2024
https://github.com/koldlight/bluetab-data-science-2017
Repository for sharing material and publishing the challenges
course data-analysis data-science exercises
Last synced: 09 Nov 2024
https://github.com/leandrocollares/nyc-film-permits
NYC film permits: an exploratory data analysis
data-analysis data-visualization pandas plotly
Last synced: 11 Nov 2024
https://github.com/leandrocollares/street-cherry-trees-in-vancouver
Street cherry trees in Vancouver: an exploratory data analysis
data-analysis data-visualization folium pandas plotly-express
Last synced: 11 Nov 2024
https://github.com/athari22/applied-data-science-capstone
Applied-Data-Science-Capstone
api classification data-analysis data-cleaning data-collection data-science data-scraping data-visualization data-wrangling knn machine-learning sql
Last synced: 08 Nov 2024
https://github.com/athari22/investigating-netflix-movies-and-guest-stars-in-the-office
Apply basic Python skills from Introduction to Python and Intermediate Python by processing and visualizing film and television data.
data-analysis data-science data-visualization loop loops matplotlib matplotlib-pyplot netflix numpy office pandas python
Last synced: 08 Nov 2024
https://github.com/lacerbi/vbmc
Variational Bayesian Monte Carlo (VBMC) algorithm for posterior and model inference in MATLAB (old location)
bayesian-inference data-analysis gaussian-processes machine-learning matlab variational-inference
Last synced: 24 Oct 2024
https://github.com/prekshivyas/cis-595-big-data-analytics
Comprehensive real estate price prediction project, integrating socioeconomic indicators and property features.
data-analysis data-cleaning data-mining data-preprocessing data-science data-visualization data-wrangling exploratory-data-analysis web-scraping
Last synced: 09 Nov 2024
https://github.com/arush-codes/paris-olympic-de
Data engineering project on the Paris Olympics 2024
azure data-analysis data-engineering microsoft-azure olympics2024 pipeline
Last synced: 14 Oct 2024
https://github.com/lucaspadoni/9-11-hijackers-social-network-analysis
Social Network Analysis focused on the events of 9/11/2001. By examining publicly available data through SNA techniques, we gain insights into the organizational structure of the terrorist network, offering valuable perspectives on key relationships and connections.
9-11 data-analysis data-analytics graph-theory hijacking network-analysis sna social-network-analysis terrorism terrorist-attacks
Last synced: 09 Nov 2024
https://github.com/parthds02/pizza_sales_sql
SQL project analyzing pizza sales data. Includes creating tables, executing queries, and solving basic to advanced analytical questions to derive insights from sales data.
analytics data-analysis data-science pizza-sales sql
Last synced: 12 Nov 2024
https://github.com/vubacktracking/freecodecamp-data-analysis-with-python
5 Projects in Data Analysis With Python Course on Freecodecamp
Last synced: 10 Nov 2024
https://github.com/isabelleysseric/data-analysis
Data analysis with R
data-analysis data-processing data-science-projects graph graph-algorithms r
Last synced: 08 Nov 2024
https://github.com/psyplot/psy-transect
Visualize and explore transects with psyplot
data-analysis data-exploration data-science exploratory-data-analysis psyplot transects
Last synced: 08 Nov 2024
https://github.com/v-octal/random_forest_from_scratch
My implementation of Random Forest regressor in python
data-analysis machine-learning random-forest
Last synced: 08 Nov 2024
https://github.com/kenatsf/basic_data_analysis
Basic data science project: ETL, forecast and data visualization.
analysis data data-analysis data-science logistic-regression matplotlib matplotlib-pyplot numpy pandas powerbi python scikit-learn time-series time-series-analysis time-series-forecasting
Last synced: 10 Nov 2024
https://github.com/purposeachiever6/discovering_hidden_pattern
Discovering Hidden Patterns in Sequential and Numerical Data
data-analysis r statistical-analysis
Last synced: 12 Nov 2024
https://github.com/samuelsoaress/predict-future-sales
Machine Learning applied to sales forecast
data-analysis data-mining data-science data-visualization forecasting-models
Last synced: 10 Nov 2024
https://github.com/bhavinpatel4199/artificial-intelligence---ai-for-decision-making
Artificial Intelligence for Decision Making is a collection of projects focused on applying AI and machine learning techniques to solve decision-making challenges. It includes projects on wine quality prediction, Cassandra data modeling, and text classification, showcasing a range of data science and machine learning applications.
artificial-intelligence cassandra-cql data-analysis data-engineering data-preprocessing data-structures decision-making deep-learning feature-selection machine-learning-algorithms sentiment-analysis text-classification
Last synced: 14 Nov 2024
https://github.com/diegopino/publibdata_codexhackathon
Public Library Data processing/analysis codex hackathon attempt
data-analysis data-visualization libraries public
Last synced: 08 Nov 2024
https://github.com/rijul007/smartwatch-data-analysis-using-python
Smartwatch Data Analysis to uncover insights into health and activity patterns using Python for data cleaning, exploratory analysis, and interactive visualizations.
data-analysis data-science data-visualization exploratory-data-analysis jupyter-notebook matplotlib numpy pandas plotly python
Last synced: 08 Nov 2024
https://github.com/rijul007/diamonds-analysis-using-r
Diamonds data analysis using R, exploring relationships between diamond attributes (such as carat, cut, color, and clarity) and price, with a focus on providing insights for engagement ring selection through various statistical techniques and data visualizations including histograms, boxplots, scatter plots, and bar charts.
Last synced: 08 Nov 2024
https://github.com/muneeb1030/webscrapper_mastodon
The Mastodon Social Platform Scraper is a Python-based web scraping tool designed to explore and extract valuable data from the Mastodon social platform.
data-analysis data-collection mastodon python3 scrapy scrapy-spider selenium-python webscraping
Last synced: 09 Oct 2024
https://github.com/mysto-007/world_population_growth_analysis
World Population Growth Analysis
data-analysis data-science data-visualization kaggle matplotlib
Last synced: 08 Nov 2024
https://github.com/mysto-007/dog-vision-a-dog-breed-recognizer-kaggle-competition
A solution for the Dog Breed Identification competition on Kaggle
colab-notebook data-analysis data-science data-visualization jupyter-notebook kaggle kaggle-competition python
Last synced: 08 Nov 2024
https://github.com/jrdnbradford/the-office-us
Data concerning NBC's mockumentary series The Office (U.S. version)
csv data-analysis json the-office xml
Last synced: 08 Nov 2024
https://github.com/aneeshmurali-n/project-ml-data-preprocessing
The main objective of this project is to design and implement a robust data preprocessing system that addresses common challenges such as missing values, outliers, inconsistent formatting, and noise. By performing effective data preprocessing, the project aims to enhance the quality, reliability, and usefulness of the data for machine learning.
data-analysis data-cleaning data-encoding data-exploration feature-scaling label-encoding matplotlib minmaxscaler numpy one-hot-encoding outlier-detection pandas standardscaler
Last synced: 11 Nov 2024
https://github.com/salma-mamdoh/exploring-the-evolution-of-linux-project
My Project to learn the Basics of Analysis on DataCamp
data-analysis datacamp pandas python time-series-analysis
Last synced: 08 Nov 2024
https://github.com/salma-mamdoh/project-writing-functions-for-product-analysis
My Project to learn the Basics of Analysis on DataCamp
data-analysis data-camp pandas python
Last synced: 08 Nov 2024