Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2025-02-11 00:07:23 UTC
- JSON Representation
https://github.com/amrrs/introduction-to-eda-with-python
Introduction to EDA with Python Session Files
Last synced: 15 Nov 2024
https://github.com/gxjansen/user-analysis-with-r-google-analytics
Analyzing user behavior of an E-commerce website with R and (mainly) Google Analytics Data
analytics analytics-api conversion-rate-optimization data-analysis ecommerce google google-analytics r
Last synced: 30 Oct 2024
https://github.com/johnsell620/sentiment-analysis-goodreads-reviews
Document-level sentiment analysis of book reviews scraped from the Goodreads website. Technologies used include TensorFlow, Spark, HDFS, Sqoop, Scrapy, and D3.js.
data-analysis data-visualization recurrent-neural-networks web-scraping
Last synced: 29 Nov 2024
https://github.com/DataHerb/dataherb-python
Python Package for DataHerb: create, search, and load datasets.
data data-analysis data-mining database dataset python
Last synced: 15 Nov 2024
https://github.com/mdh266/wikimedia_challenge
Analyzing click-through rates from Wikimedia
data-analysis data-challenge exploratory-data-analysis matplotlib pandas python
Last synced: 31 Jan 2025
https://github.com/quantumudit/alteryx-weekly-challenges
This repository contains Alteryx solutions to the weekly challenges published in Alteryx Community
alteryx alteryx-workflow data-analysis data-science data-transformation data-visualization etl
Last synced: 26 Dec 2024
https://github.com/quantumudit/demographic-data-analysis
This project focuses on analyzing and finding correlations between the three important metrics by 195 countries,i.e., birth rate, internet users, and income group.
data-analysis jupyter-notebook power-bi python
Last synced: 26 Dec 2024
https://github.com/quantumudit/analyzing-gamerevolution-games
This project focuses on scraping data related to video games from the GameRevolution website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.
data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping
Last synced: 26 Dec 2024
https://github.com/verbasik/yandex.practicum.datascience
Портфолио проектов Data Science, выполненных в рамках профессиональной переподготовки в Яндекс.Практикум. Включает исследования в области финансов, недвижимости, кинопроката и других, с использованием статистики, машинного обучения и анализа данных.
data-analysis data-science machine-learning yandex-praktikum
Last synced: 10 Jan 2025
https://github.com/quantumudit/analyzing-quotes
This project focuses on scraping all the quotes and their related data from the "Quotes To Scrape" website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.
data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping
Last synced: 26 Dec 2024
https://github.com/quantumudit/uk-student-accommodation-analysis
This project focuses on scraping student properties related data from the UK Student Accommodation website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.
data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping
Last synced: 26 Dec 2024
https://github.com/quantumudit/analyzing-goodreads-famous-quotes
This project focuses on scraping famous quotes and their related data from the GoodReads website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.
data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping
Last synced: 26 Dec 2024
https://github.com/quantumudit/analyzing-suez-services
This project focuses on scraping all the service locations across Australia & New Zealand and their associated attributes from "Suez" website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.
data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping
Last synced: 26 Dec 2024
https://github.com/1994nikunj/nlp-toolkit-desktop-app
The code is a collection of NLP analyses, including text cleaning, most common words, n-grams generation, co-occurrence matrix generation, wordcloud generation, topic modeling (using Latent Dirichlet Allocation), and general text statistics.
data-analysis n-grams network-visualization nlp python text-cleaning topic-modeling wordcloud-generator
Last synced: 25 Nov 2024
https://github.com/yahia3200/become-an-independent-data-scientist
My final project for the Applied Plotting, Charting & Data Representation in Python Course
data-analysis data-science data-visualization matplotlib
Last synced: 22 Jan 2025
https://github.com/alieymsxxn/sql_project_data_job_analysis
This project explores top-paying jobs, in-demand skills, and where high demand meets high salary in data analytics.
data-analysis postgresql sql sqlite
Last synced: 28 Nov 2024
https://github.com/arv-anshul/campusx-project-notebooks
Capstone project by Campusx in DSMP course.
campusx campusx-dsmp data-analysis data-science eda jupyter-notebook machine-learning ml-project nlp project python3 recommender-system regression streamlit
Last synced: 25 Dec 2024
https://github.com/sabujxi/python-scraper-and-data-analysts-admin-panel-in-django
A data scraper from texas govt site and a helping web app for managing, reviewing and editing the data
analyst data data-analysis data-entry data-scraper django django-application python python-scraper real-estate regex scraper texas
Last synced: 10 Feb 2025
https://github.com/uts58/international-student-job-insights-usa
Data-driven insights on job hunting for international students in the USA, analyzing listings, roles, and trends.
career-insights cpt data-analysis eb1 eb2 eb3 h1b handshake job-analytics job-trends jobs jupyter-notebook opt python work-visa
Last synced: 25 Dec 2024
https://github.com/app-generator/devtool-db-introspection
Database Introspection Tool - Open-Source | AppSeed
data-analysis database-schema database-tool db-scan db-tool developer-tools peewee peewee-orm python-database python-datatypes python-db python-tool
Last synced: 20 Dec 2024
https://github.com/justsecret123/twitter-sentiment-analysis
A sentiment analysis model trained with Kaggle GPU on 1.6M examples, used to make inferences on 220k tweets about Messi and draw insights from their results.
classification data-analysis data-science deep-learning deep-neural-networks docker glove-embeddings kaggle lstm lstm-neural-networks machine-learning natural-language-processing nlp python rnn scikit-learn sentiment-analysis sentiment-classification tensorflow word-embeddings
Last synced: 05 Nov 2024
https://github.com/arjo129/image-sorter
Sort through folders of videos and images. Root out blurred and overexposed images.
computational-photography data-analysis photo-browser photo-gallery photography uwp uwp-apps
Last synced: 06 Jan 2025
https://github.com/rubydamodar/the-ultimate-pandas-bootcamp
Welcome to the Pandas for Data Science repository! This course is designed to take you from beginner to proficient in using Pandas, the powerful data manipulation library in Python. Whether you're just starting your data science journey or looking to sharpen your skills, this repository contains all the resources
beginner-friendly csv-data data-analysis data-cleaning data-manipulation data-science data-visualization dataframe exploratory-data-analysis jupyter-notebook machine-learning matplotlib numpy pandas python python-pandas series statistical-analysis time-series titanic-dataset
Last synced: 18 Oct 2024
https://github.com/programmer-rd-ai/moviedatascraper
Explore the cinematic universe with our IMDb web scraping project! Dive into movie data with ease, uncovering insights from cast to critical reviews. With dynamic visualizations and reliable data, let's journey through the world of movies like never before. Lights, camera, analysis!
beautifulsoup beautifulsoup4 data data-analysis jupyter-notebook matplotlib numpy pandas programming python python3 scraping seaborn software web
Last synced: 12 Jan 2025
https://github.com/manmolecular/http-response-clustering
:chart_with_downwards_trend: Clustering of HTTP responses using k-means++ and the elbow method
data-analysis elbow-method elbow-plot jupyter k-means-plus-plus python3
Last synced: 16 Jan 2025
https://github.com/juliusmarkwei/concrete-data
Data analysis, machine learning, model evaluation and optimization on the Concret_ Dataset
data-analysis data-science data-visualization ensemble-learning machine-learning modeling
Last synced: 01 Jan 2025
https://github.com/invictusaman/socioeconomic-indicators-in-chicago-sql-python
This project displays how to create a database connection in notebook, update database using python and how to run Python program and SQL queries together. It uses SQLite and Chicago dataset for analysis.
data-analysis jupyter-notebook python sql sql-queries sqlite
Last synced: 12 Oct 2024
https://github.com/c0deta1ker/matbase
MatBase provides access to an extensive database of material parameters, inelastic mean free paths (IMFP), photoionization binding energies, cross sections, and asymmetry parameters. Additionally, MatBase includes a suite of functions for users to load, process, model and fit their own data, making it an indispensable tool in the field.
cross-sections crystal-structure crystallography data-analysis data-fitting database electron imfp imfp-calculator-matlab material material-database matlab matlab-application matlab-gui matlab-toolbox pes-modelling photoelectron-spectroscopy photoionization simulation xps
Last synced: 30 Nov 2024
https://github.com/jimbrig/eda
Exploratory Data Analysis R Package and Shiny App
data-analysis data-visualization eda r shiny
Last synced: 23 Jan 2025
https://github.com/prajwalchapke055/cognizant-artificial-intelligence-job-simulation-forage
Advise one of Cognizant’s clients on a supply chain issue by applying knowledge of machine learning models.
artificial-intelligence cognizant communication data-analysis data-modeling data-visualization development evaluation forage job-simulation machine-learning machine-learning-algorithms machine-learning-engineering model-interpretation presentation problem-statement python quality-assurance skills virtual-internship
Last synced: 22 Jan 2025
https://github.com/rapidsurveys/oldr
An Implementation of the Rapid Assessment Method for Older People (RAM-OP)
assessment data-analysis epidata estimate odk older-people r ram-op ranalyticflow rapid-assessment
Last synced: 24 Dec 2024
https://github.com/draym/covid19tracker
Coronavirus COVID-19 dashboard to track global cases
covid-19 covid19-tracker dashboard data-analysis
Last synced: 02 Feb 2025
https://github.com/simoneas02/data-science
🐍 A planning study to become a data scientist and to improve my current skills. 🤘🏼🌻
data data-analysis data-science data-visualization deep-learning machine-learning pandas python3 r sql
Last synced: 01 Feb 2025
https://github.com/prakhar-ff13/yellow-taxi-demand-prediction
Predicting Taxi Demand in various regions of New York City
data-analysis data-analytics data-science data-visualization machine-learning python3 time-series
Last synced: 28 Jan 2025
https://github.com/jamesquinlan/intro-stats-mat150
Introduction to Statistics
data-analysis statistics university-course
Last synced: 10 Feb 2025
https://github.com/leewannacott/datacamp-projects
DataCamp project solutions.
data-analysis data-mining data-science datacamp-projects machine-learning python r
Last synced: 30 Jan 2025
https://github.com/jhrcook/100daysofpython
100 days, at least 1 hour a day, of learning the Python programming language.
100-days-of-code 100daysofcode continued-learning data-analysis data-science decision-trees deep-learning keras keras-tensorflow machine-learning neural-network neural-networks plots python python3 scikit-learn tensorflow
Last synced: 13 Nov 2024
https://github.com/tirendazacademy/data-sets
Data sets for Tirendaz Akademi Youtube
Last synced: 01 Jan 2025
https://github.com/ivanildobarauna-dev/data-pipeline-sync-ingest
ETL Process for Currency Quotes Data" project is a complete solution dedicated to extracting, transforming and loading (ETL) currency quote data. This project uses several advanced techniques and architectures to ensure the efficiency and robustness of the ETL process.
business-intelligence data-analysis data-analytics data-engineering data-pipeline data-visualization etl-pipeline python
Last synced: 26 Jan 2025
https://github.com/simranjeet97/ipl-dataanalysis
Data Analysis performed on IPL Dataset with Data Profiling, Data Pre-Processing, Data Manipulation, and Data Visualization.
artificial-intelligence data-analysis data-manipulation data-mining data-preprocessing data-science data-visualization indian-premier-league-2008-2018 ipl ipl-dataset iplayer python
Last synced: 14 Jan 2025
https://github.com/dataopstix/modelt
Modelt(mow·delt) is a modern data integration solution that connects data to data for advanced analytics.
airbyte airflow airflow-docker data data-analysis data-visualization database dbt elt etl etl-automation metabase metadata modern modern-dev modernization
Last synced: 02 Feb 2025
https://github.com/mindlessmuse666/client-data-analysing-tool
Проект производственной практики: Инструмент для анализа данных, построенный с использованием Python (бэкэнд, фронтэнд PyQt6), Pandas, Matplotlib и SQLite. Это приложение позволяет пользователям загружать данные в формате CSV, фильтровать их, визуализировать ключевые показатели с помощью графиков и создавать отчеты.
data-analysis desktop-application matplotlib pandas pyqt6 pyqt6-desktop-application python sqlite student-project
Last synced: 23 Dec 2024
https://github.com/fabienarcellier/qjoin
qjoin is a data manipulation library that provides simple and efficient joining and collection processing functionality
composable data-analysis developer-tools functools python
Last synced: 28 Nov 2024
https://github.com/zelosleone/finncorr
A .NET Core financial analysis tool/API for calculating correlations between time series data with interactive visualizations powered by ML.NET and Plotly.js.
aspnet-core correlation-analysis csv-parser data-analysis dotnet financial-analysis machine-learning ml-net plotly rest-api statistical-analysis swagger time-series visualization
Last synced: 06 Feb 2025
https://github.com/janheinrichmerker/song-analysis
Analysing the Million Song Dataset.
big-data data-analysis data-science hadoop hadoop-mapreduce java kotlin songs
Last synced: 24 Dec 2024
https://github.com/virajbhutada/cliquebait-digital-marketing-analysis-using-sql
This GitHub repository contains the CliqueBait Digital Marketing Analysis project, utilizing SQL for comprehensive analysis of marketing campaigns, user engagement, product performance, and website interactions within the Clique Bait food app. The project offers actionable insights for optimizing marketing strategies in competitive landscape.
campaign-website data-analysis data-extraction data-science digital-marketing food-store microsoft-excel mysql product-performance sql sql-database sql-project user-engagement website-analytics
Last synced: 10 Jan 2025
https://github.com/yogeshnile/nifty50-index-time-series-analysis
In this repo i did analysis of Nifty50 five year data from 01-04-2015 to 31-03-2020. Data Downloaded from nse official website.
data-analysis matplotlib nifty numpy pandas plotly python3 time-series-analysis
Last synced: 10 Jan 2025
https://github.com/thisisashukla/survival-analysis
Hands-On Survival Analysis in Python
data-analysis data-science survival-analysis
Last synced: 28 Dec 2024
https://github.com/yogeshnile/covid-19-time-series-data-analysis
In this repo created a Covid-19 Time Series Data Analysis on python
covid-19 covid19 data-analysis folium folium-maps pandas plotly time-series-analysis visualization
Last synced: 10 Jan 2025
https://github.com/karan-malik/uberdataanalysis
Uber Data Analysis and Visualization using Python
data-analysis data-analysis-python data-analytics data-science data-visualization dataanalysis matplotlib-pyplot numpy pandas pandas-dataframe python python3 seaborn uber uber-data
Last synced: 14 Jan 2025
https://github.com/phollemans/cwutils
CoastWatch Utilities software for working with satellite data files from NOAA CoastWatch and elsewhere
cdat coastwatch-utilities data-analysis data-visualization install4j java noaa-coastwatch remote-sensing satellite-imagery
Last synced: 12 Nov 2024
https://github.com/mindful-ai-assistants/movierevenueanalysis
🎬💰 Analyze movie companies' revenue, release strategies, and financial performance using statistical techniques for actionable insights. This project explores data on total revenue, number of releases, and lifetime gross to uncover patterns that can drive strategic decisions in the film industry.
correlation-analysis data-analysis data-science heatmap jupyter-notebook open-source python statistical-analysis statistical-analysis-and-hypothesis-testing statistics ttest
Last synced: 22 Jan 2025
https://github.com/hvignolo87/ortex-programming-challenge
Coding challenges required for the Python Developer and Data Engineer job positions.
challenge data-analysis finance pandas python scripting sql sqlalchemy
Last synced: 02 Jan 2025
https://github.com/virajbhutada/tableau-data-vizzes
Engage with a growing collection of Tableau dashboards covering financial trends, HR analytics, streaming service insights, real estate dynamics, and more. Meticulously crafted for valuable insights, this repository continues to expand with new and compelling visualizations.
business-analytics data-analysis data-visualization hr-analytics industry-trends netflix performance-metrics stock-market-analysis strategic-analytics tableau visual-insights
Last synced: 10 Jan 2025
https://github.com/abeltavares/nps_performance_analysis
Analyzing and Monitoring Net Promoter Score (NPS) Performance for Healthcare Companies using SQL and Power BI
customer-satisfaction dashboard data-analysis data-visualization healthcare net-promoter-score nps-analysis performance-monitoring power-bi sql
Last synced: 05 Jan 2025
https://github.com/zrkhadija/data-analysis-for-financial-time-series
In this notebook, we performed data analysis on financial time series data from Yahoo Finance for the US market. We examined seasonality, trends, stationarity, and other aspects such as outliers and correlations.
autocorrelation correlation-analysis data-analysis financial-analysis time-series-analysis timeseries-forecasting visualization
Last synced: 09 Feb 2025
https://github.com/aad99bxp/whatsapp-chat-analyzer
A project intended for Business Owners / Managers to analyze Whatsapp chats between their customer care executives and their customers.
data-analysis heroku-deployment python3
Last synced: 22 Jan 2025
https://github.com/saadarazzaq/excel-merger
Merge multiple Excel and CSV files into a single dataset with the Excel Merger Streamlit app. 📊🔄🚀
data-analysis excel pandas python streamlit-webapp
Last synced: 23 Jan 2025
https://github.com/michaelcurrin/water-crisis-scraper
Scrape and explore data related to Cape Town's water crisis (Python3 application)
cape-town cron csv dam-levels data-analysis html open-data python3 schedule scraping south-africa water-crisis water-level webscraping
Last synced: 28 Oct 2024
https://github.com/w-edward/youtube-keyword-popularity-analyzer
An effort to discover the top trending keywords on Youtube.
data-analysis node-js numpy python webscraping youtube-api
Last synced: 16 Jan 2025
https://github.com/shengyuan-lu/mgmt-172
Material for MGMT 172 Data and Programming for Analytics. (Course)
csv data-analysis data-visualization jupyter-notebook pandas sql sqllite yfinance
Last synced: 09 Feb 2025
https://github.com/walidbosso/r_data_mining
Extract knowledge from a data using different techniques, including Association Rules Hierarchical Agglomerative Clustering (HAC) K-means Clustering Decision Trees
association-rule-mining association-rules clustering data-analysis data-mining data-science data-visualization decision-tree-classifier decision-trees exportation extract-data hac hierarchical-clustering k-means k-means-clustering k-means-r r-programming r-studio
Last synced: 28 Jan 2025
https://github.com/nafiealhilaly/analyze-coderhub-sa
A simple web app to analyze/explore coderhub.sa API data, this project was my first real react app.
backend data-analysis eda frontend python react reactjs
Last synced: 08 Feb 2025
https://github.com/ashwinpn/visualization
Data Visualization using Matplotlib, Pandas Visualization, Seaborn, ggplot, and Plotly.
analysis data data-analysis data-science data-visualization graphs plots python python3 visualization
Last synced: 16 Jan 2025
https://github.com/thecoderpinar/reta
🍃 Explore the world of renewable energy production, analyze historical data, and predict sustainable energy trends. Join us on the journey to a greener future!
arima clean-energy data-analysis data-science data-visualization energy-future forecasting-models innovation renewable-energy sustainability time-series
Last synced: 09 Feb 2025
https://github.com/0mppula/element-compare
A single page Next.js 14 app that allows the user to inspect and compare elements from the periodic table.
compare dark-mode data-analysis elements inspect lucide-react nextjs periodic-table reactjs server-side-rendering shadcn-ui single-page-app typescript vercel zustand
Last synced: 11 Nov 2024
https://github.com/ivangrigorov/neutrino-search-engine
Creating Java search engine both for HTML or document type of files
data data-analysis data-knowledge information-extraction information-retrieval java-language search-engine
Last synced: 06 Feb 2025
https://github.com/jupyterphysscilab/documentation
Documentation for the Jupyter Physical Science Lab Suite of Packages
analog-to-digital-converter data-acquisition data-analysis education jupyter-notebooks pandas physical-sciences plotting python raspberry-pi
Last synced: 04 Dec 2024
https://github.com/praju-1/pandas
The library is widely used in data science and machine learning for data cleaning, preparation, and analysis.
Last synced: 08 Feb 2025
https://github.com/jesussantana/data-science-with-python-it-academy
Learn how to extract value from data by ingesting, transforming, storing, analyzing, and visualizing data
classification-model clustering-methods dash data-analysis data-mining data-science database machine-learning matplotlib mongodb numpy pandas plotly python3 regression-models seaborn sklearn sql web-scraping
Last synced: 12 Nov 2024
https://github.com/emaasit/pydata-book
Learning data analysis with python
data-analysis jupyter pandas python
Last synced: 21 Nov 2024
https://github.com/tathithienthanh/dataanalysis_diagnosis-of-diabetes-based-on-data-set-of-blood-test-result
Implement all learned knowledge about data analysis and data mining to make a complete project about Diagnosis of diabetes based on data set of blood test result
blood-test classification clustering data-analysis data-processing decision-tree diabetes-prediction diagnosis exercise google-colab health hierarchical ipynb kmeans knn knns py python smote-sampling visualization
Last synced: 04 Feb 2025
https://github.com/jimbrig/EDA
Exploratory Data Analysis R Package and Shiny App
data-analysis data-visualization eda r shiny
Last synced: 04 Dec 2024
https://github.com/yeisonmontoya1815/machine-learning_prediction_can_inflation
we aim to predict trends in the Canadian market basket using sentiment analysis techniques. Sentiment analysis involves analyzing text data to determine the sentiment expressed, whether positive, negative, or neutral.
algorithms-and-data-structures data data-analysis data-science data-visualization feature-engineering machine-learning matplotlib-pyplot numerical-analysis numpy pandas pipelines python sklearn structured-data super unsupervised-learning
Last synced: 06 Dec 2024
https://github.com/asifdotexe/covidporfolioproject
This is a SQL + Tableau Project on real world Covid 19 Dataset from the start of recorded case to 2nd March 2022 i.e My birthday XD
dashboard data-analysis data-exploration data-visualization sql sql-server tableau
Last synced: 15 Jan 2025
https://github.com/a-r-j/npview
CLI tools for quickly inspecting CSV/TSV & NumPy (.npy) array files
cli csv data-analysis inspector npy numpy python tsv
Last synced: 10 Feb 2025
https://github.com/casualcomputer/sql.mechanic
Functions that generate SQL queries that summarize high-dimensional tables stored in various databases (e.g. Microsoft SQL Servers, Netezza, DB2, Postgres, Oracle, MySQL, etc.).
data-analysis data-quality-checks data-science database mysql netezza oracle postgres quality-control r sql sql-server
Last synced: 04 Dec 2024
https://github.com/duart38/sponge
Quickly make endpoints for testing
cms data-analysis deno developer-tools development-tools helper-tool mock server sponge testing testing-tools toolkit tools
Last synced: 06 Feb 2025
https://github.com/mohamedomar2020/random-forest
Creating a Random Forest model to predict the progression of bladder cancer
bladder-cancer cancer-genomics cancer-research data-analysis data-science genomics machine-learning machine-learning-algorithms random-forest
Last synced: 30 Jan 2025
https://github.com/mindful-ai-assistants/credit-card-prediction
💳 This repository focuses on building a predictive model to assess the likelihood of credit card defaults. The project includes data analysis, feature engineering, and machine learning to provide accurate default predictions.
artificial-intelligence data-analysis data-science jupyter logistic-regression machine-learning predictive-modeling python3 scikit-learn
Last synced: 09 Dec 2024
https://github.com/omarsar/energy_stats
Analyzing energy production with Kibana Lens
data-analysis data-science data-visualization elasticsearch kibana
Last synced: 23 Jan 2025
https://github.com/ikanurfitriani/project-data-analysis-python
This repository contains the results of data analysis learning using the Python.
data-analysis data-analysis-project data-analysis-python python
Last synced: 26 Jan 2025
https://github.com/nhsdigital/sde_example_analysis
Example of what you can do in Databricks in the Secure Data Environment (SDE) using Python, SQL, and R.
data-analysis data-science databricks-notebooks machine-learning mlflow
Last synced: 23 Dec 2024
https://github.com/shlokashah/ipl-data-analysis
Data Analysis and Visualizations done on IPL dataset
data-analysis data-visualization pandas powerbi
Last synced: 28 Dec 2024
https://github.com/cbg-ethz/scdna-pipe
Python data analysis pipeline for single cell copy number event history reconstruction
bioinformatics bioinformatics-pipeline data-analysis genomics python snakemake snakemake-workflows workflow
Last synced: 28 Jan 2025
https://github.com/akshat0427/spotify_history
code to find out some insights in spotify streaming data (work in progress)
data-analysis data-visualization
Last synced: 31 Jan 2025
https://github.com/shlokashah/student-depression-and-suicide-rate-prediction
https://shlokashah.github.io/Student-Depression-And-Suicide-Rate-Prediction/
data-analysis data-visualization machine-learning student suicide-rate-prediction
Last synced: 28 Dec 2024
https://github.com/josechirif/reviews-and-satisfaction-analysis-of-airbnb-brazil-and-mexico-from-june-2010-to-february-2021
This project analyzes the reviews and satisfaction of customers who used AirBnB services. It also studies if there is a relationship between another variables.
data data-analysis data-visualization powerbi sql-server
Last synced: 23 Oct 2024
https://github.com/asifdotexe/timeseriesanalysis
This repository serves as a central hub for all of my projects related to time series analysis. Here, you'll find a collection of projects, code samples, and resources that explore various aspects of time series data and its analysis.
data-analysis feature-engineering jupyter-notebook pandas python time-series-analysis visualization
Last synced: 15 Jan 2025
https://github.com/burhanahmed1/recipe-recommendor-using-pyspark
A smart recipe recommendation system that suggests recipes based on ingredient similarities. This project is done in PySpark
data-analysis data-science datawrangling education learning-python machine-learning machine-learning-algorithms nltk-python numpy pandas pyspark python python-project reccomendersystem recommendation-system
Last synced: 06 Jan 2025
https://github.com/gustavohnsv/teamwork_mqa
Repositório dedicado ao trabalho em grupo baseado nos estudos de métodos para análise de dados da matéria Métodos Quantitativos para Anáise Multivariada.
data-analysis group-project r team-repo
Last synced: 16 Dec 2024
https://github.com/cosmoduende/r-arduino
Interoperability Data-IoT: How to send and receive data and take control of your Arduino, from R. How to establish interoperability between R and Arduino (Data and IoT) using a data flow between the two
arduino arduino-data arduino-dataflow arduino-serial arduino-serial-data arduino-serial-led arduino-uno data-analysis data-arduino data-cleaning data-iot data-visualization interoperability iot-rstudio r-analytics r-data-visualization r-iot rstudio-arduino serial-read serialport
Last synced: 27 Dec 2024
https://github.com/pitmonticone/covid-italy
References for COVID-19 situation in Italy.
coronavirus covid-19 covid-19-italy data data-analysis documentation testing
Last synced: 22 Jan 2025
https://github.com/haloapping/malas-ngetik-clf
Saya malas ngetik, makanya saya buat aja template proyek kompetisi Kaggle 😜. Template ini khusus untuk kasus klasifikasi.
data-analysis exploratory-data-analysis feature-engineering kaggle kaggle-competition machine-learning python3 scikit-learn
Last synced: 06 Jan 2025
https://github.com/bkataru/physics-ia
Programs and files written for Astrostatistics for IB Physics IA. Topic: Visualizing and analyzing the habitable zones for 150,000 stars from the hipparcos catalogue.
astronomical-algorithms astronomy astrophysics astrostatistics data-analysis data-science data-visualization matplotlib plotting
Last synced: 22 Dec 2024
https://github.com/jpcadena/classification-tweets-national-security-ecuador
Classification of Tweets about national security at Ecuador 2022
classification classification-model data-analysis data-science ecuador insecurity machine-learning natural-language-processing nlp nltk numpy pandas python pytorch scikit-learn snscrape supervised-learning tensorflow tweet twitter
Last synced: 15 Jan 2025
https://github.com/marios-mamalis/mca-visualisation
A script for automatic visualisation of Multiple Correspondence Analysis (MCA) results from FactoMineR in 3 dimensions using Plotly (exported as html)
3d-scatterplots correspondence-analysis data-analysis factominer html mca multiple-correspondence-analysis plotly visualisation
Last synced: 13 Jan 2025