Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2024-11-18 00:06:50 UTC
- JSON Representation
https://github.com/abeltavares/nps_performance_analysis
Analyzing and Monitoring Net Promoter Score (NPS) Performance for Healthcare Companies using SQL and Power BI
customer-satisfaction dashboard data-analysis data-visualization healthcare net-promoter-score nps-analysis performance-monitoring power-bi sql
Last synced: 09 Nov 2024
https://github.com/shridhar1504/milk-production-time-series-forecasting-datascience-project
This project uses time series forecasting to predict future milk production. The data used in this project is monthly milk production data from January 1962 to December 1975. The ARIMA (autoregressive integrated moving average) model is used to forecast the milk production. The model is evaluated using various metric.
adf arima-model augmented-dickey-fuller-test data-analysis data-analytics data-science data-visualization eda exploratory-data-analysis machine-learning machine-learning-algorithms python python3 residuals sarimax seasonality time-series time-series-forecasting trends
Last synced: 05 Nov 2024
https://github.com/smahala02/magnetism-lab
This repository contains Python scripts and data for analyzing inductance in toroidal coils to calculate the magnetic permeability of ferrite materials. The project helps classify materials as soft or hard magnets based on experimental data.
data-analysis inductance jupyter-notebook magnetism python toroids
Last synced: 31 Oct 2024
https://github.com/dvaser/heart-attact-analysis-prediction
DATA ANALYSIS
classification data data-analysis data-visualization jupyter jupyter-notebook lineer-regresyon machine-learning python regression
Last synced: 31 Oct 2024
https://github.com/jayita11/customer-engagement-insights-for-yelp-restaurant-business-success
This project analyzes Yelp restaurant data using SQLite, Python, and Tableau to explore user engagement, reviews, and ratings. It provides insights into restaurant success across cities, regions, and user behavior.
customer-engagement data-analysis interactive-visualizations json python ratings review sqlite3 tableau-dashboards-for-data-visualization yelp-restaurants
Last synced: 02 Nov 2024
https://github.com/jayqi/data-analysis-tools
Presentation on Data Analysis Tools
data-analysis presentation-slides
Last synced: 21 Oct 2024
https://github.com/muskanmi/data_analysis_python
Data analysis on students result dataset using python libraries.
boxplot countplots data-analysis numpy pandas pie-chart python3 seaborn
Last synced: 05 Nov 2024
https://github.com/cego669/datathonengopevi
Solução proposta para o Datathon do VI ENGOPE (Encontro Goiano de Probabilidade e Estatística).
data-analysis data-science datathon python r streamlit xgboost-classifier
Last synced: 19 Oct 2024
https://github.com/amlanmohanty1/genai-data-analysis-report-generator
Generating data analysis and EDA reports from CSV files using Generative AI - Langchain, Llama, Groq.
ai data-analysis data-science flask generative-ai groq langchain llama3 llm prompt-engineering python
Last synced: 19 Oct 2024
https://github.com/tameronline/ai-financial-analyst
AI-driven financial analyst system utilizing LangChain and Ollama for real-time stock analysis, market trends, and financial insights.
ai data-analysis finance financial-analysis langchain machine-learning nlp ollama stock-market
Last synced: 19 Oct 2024
https://github.com/avs/avs-go
Common web components for AVS data visualization products
avs big-data charting-library csharp csharp-library data-analysis data-science data-visualization datamanagement high-performance high-performance-computing java java-library javascript multi-dimensional multi-platform multi-threading rendering-3d-graphics rendering-engine webcomponents
Last synced: 21 Oct 2024
https://github.com/abeltavares/online_retail_pyspark_analysis
PySpark data analysis of the Online Retail Data Set
business-intelligence churn-analysis customer-segmentation data-analysis data-visualization jupyter-notebook machine-learning market-basket-analysis online-retail product-affinity-analysis pyspark
Last synced: 09 Nov 2024
https://github.com/jagoda11/elastic-vision
This repository contains a full-stack application designed to explore data from ElasticSearch🧐indices and visualize it using charts and graphs. The backend is built using Node.js and the frontend is powered🚀 by React.
backend chartjs dashboard-development data-analysis data-visualization docker elasticsearch frontend fullstack javascript material-ui monorepo mui-x node pie-chart react restful-api tables
Last synced: 08 Nov 2024
https://github.com/jillie-wink/sql-portfolio
SQL Data Analysis Projects
data-analysis data-manipulation portfolio sql sqlite
Last synced: 12 Oct 2024
https://github.com/a-iceberg/whisper_model_evaluator
WER, MER, WIL of Whisper vs Vosk vs Google transcribators comparator
asr audio-to-text automatic-speech-recognition data-analysis evaluation google-speech-recognition python tuning-parameters visualization vosk whisper
Last synced: 24 Oct 2024
https://github.com/ibrahimhabibeg/national-university-of-singapore-sms-analysis
Analysis of SMS messages collected by the National University of Singapore
analytics data-analysis data-science nlp python
Last synced: 05 Nov 2024
https://github.com/abeltavares/hotel_performance_analysis
A Power BI project that analyzes the performance of a hotel, including revenue, expenses, customer data, hospitality metrics and financial ratios.
business-intelligence data-analysis expenses financial-analysis hospitality-industry power-bi revenue
Last synced: 09 Nov 2024
https://github.com/shridhar1504/titanic-survivor-prediction-datascience-classification-project
This projects predicts whether a passenger on the titanic survived or not using machine learning algorithms with the given details of the passenger data.
classification-algorithm data-analysis data-analytics data-cleaning data-preprocessing data-science data-visualization eda exploratory-data-analysis gradient-boosting jupyter-notebook machine-learning machine-learning-algorithms matplotlib naive-bayes-classifier predictive-modeling python3 scikit-learn seaborn supervised-learning
Last synced: 05 Nov 2024
https://github.com/multitagging/benchmarks
Provides benchmarks to test the MultiTagging framework
benchmarks data-analysis ethereum smart-contracts vulnerabilities
Last synced: 11 Oct 2024
https://github.com/bonelesswater/tradingbot
This project is a web application for a trading bot that displays financial data and indicators. It includes functionality for researching financial data, displaying market indicators, and more.
ai azure css d3 data-analysis django html javascript jquery materializecss python stock-market
Last synced: 25 Oct 2024
https://github.com/eubrunoo/beer-consumption-predictor
An R project analyzing the impact of environmental factors on beer consumption in São Paulo, with a predictive linear regression model.
data-analysis data-science data-visualization machine-learning r statistical-analysis statistics
Last synced: 28 Oct 2024
https://github.com/dcs-training/exploratory-data-analysis-and-visualisation-with-observable-plot
This two-hour workshop will teach you how to follow an exploratory data analysis pipeline with Observable Plot, a new JavaScript library based on the Grammar of Graphics, that proposes a simple yet expressive interface to create powerful graphics easily shareable on the web. Go to the Readme file
d3 data-analysis data-visualisation javascript observable-notebook
Last synced: 25 Oct 2024
https://github.com/upes-open/open-cryptocurrency-analysis
A web app to visualise and predict the cryptocurrency’s impact by using Web scraping, data exploration, EDA and Data Visualization.
analysis cryptocurrency data-analysis data-science data-visualization jupyter-notebook streamlite visualization
Last synced: 08 Nov 2024
https://github.com/katiesaund/tidy_tuesday
A weekly data project in R from the R4DS online learning community
data-analysis data-visualization datascience plot r rstats tidytuesday
Last synced: 14 Oct 2024
https://github.com/cosmoduende/r-twitter
Explore your Twitter activity with R: Sentiment Analysis and Data Visualization. How to analyze your Twitter account (or any account), discover your habits and sentiments with the "rtweet" package and NLP.
data-analysis data-visualization lemmatization nlp nlp-library nlp-resources nltk nltk-library r-package r-programming r-studio rtweet stemming twitter twitter-api twitter-data twitter-data-analysis twitter-data-extraction twitter-sentiment-analysis udpipe
Last synced: 07 Nov 2024
https://github.com/quocduyenanhnguyen/yelp-analysis
Yelp data analysis of business rating, categories, any trends/patterns, correlation, etc.
csv-to-database data-analysis data-analytics data-visualization database json json-parsing json-to-csv mysql mysql-database mysql-workbench pycharm python python3 restaurant sql tableau tableau-dashboards tableau-public yelp-dataset
Last synced: 31 Oct 2024
https://github.com/cosmoduende/r-ggcats
StrangeR things: Adding… Cats? to your plots on R. How to analyze and visualize data with the help of funny cats with the ”ggcat” package.
data-analysis data-analytics data-science data-visualisation data-visualization data-viz dataviz ggcats r-language r-library r-package r-programming r-scripts r-studio rstats rstudio
Last synced: 07 Nov 2024
https://github.com/Fisseha-Estifanos/telecom
A showcase repository for a specific telecommunication company. Used to analyze several telecommunication data set features and generate useful insights accordingly. Insights generated could be seen at https://github.com/Fisseha-Estifanos/telecom-visualizer or at https://fisseha-estifanos-telecom-visualizer-home-huxgy0.streamlitapp.com/
data-analysis notebooks-jupyter python visual-studio-code visualization
Last synced: 23 Oct 2024
https://github.com/cosmoduende/r-earthquakes
Análisis y visualización de datos de actividad sísmica en México con R. Cómo analizar y visualizar la historia sísmica de México con datos del SSN (Servicio Sismológico Nacional)
data-analysis data-analytics data-science dataviz earthquakes r-code r-programming r-studio rstudio sismo sismologia sismos ssn ssnmx terremoto terremotos
Last synced: 07 Nov 2024
https://github.com/analyticace/data-analysis-projects
Collection of Open Source Data Analysis Projects
data-analysis jupyter-notebook prediction predictive-analytics python statistics
Last synced: 05 Nov 2024
https://github.com/ezmiller/esd-viz
Visualization of European Social Survey (http://www.europeansocialsurvey.org/data/)
clojure data-analysis visualization
Last synced: 15 Nov 2024
https://github.com/bishopce16/school_district_analysis
The school board requested an analysis on the various performance metrics for the school district.
data-analysis jupyter-notebook numpy pandas python visual-studio-code
Last synced: 10 Nov 2024
https://github.com/bishopce16/pyber_analysis
The purpose of this project was to complete an exploratory analysis and create visualizations of the 2019 ride sharing data from PyBer.
data-analysis data-visualization jupyter-notebook matplotlib pandas python
Last synced: 10 Nov 2024
https://github.com/jidesamuell/data-analytics-projects
This is a repository i have created to showcase my skills, share projects and track my progress in Data Analytics areas.
data-analysis excel matplotlib powrebi python sql
Last synced: 18 Oct 2024
https://github.com/mkoeppe/jiawei-computations
Computations supporting Chapters 2 and 3 of Jiawei Wang's dissertation "Subadditivity of Piecewise Linear Functions", UC Davis, Ph.D. program in Mathematics, 2020
benchmark-framework branch-and-bound cluster cutting-planes data-analysis hpc integer-programming reproducible-research sagemath
Last synced: 23 Oct 2024
https://github.com/deeksha-dhawan/pizza-outlet-analysis-using-sql
This project analyzes pizza sales data to gain insights into customer behavior and revenue patterns. Key analyses include customer insights, popular pizza types and sizes, revenue generation, and order trends. The findings help optimize menu offerings, staffing, and marketing strategies to boost overall business performance.
coding-challenge data-analysis data-science microsoft my portfolio-project programming project projects sql sql-analysis sql-project sqlproject sqlserver
Last synced: 14 Oct 2024
https://github.com/muneeb706/exploratory-data-analysis
Exploratory Data Analysis of some problems using python (numpy & pandas)
data-analysis exploratory-data-analysis jupyter-n numpy pandas python3
Last synced: 15 Oct 2024
https://github.com/muneeb706/r-programming
R-Programming examples for data analysis.
Last synced: 15 Oct 2024
https://github.com/muneeb706/human_activity_recognition
This project performs data cleaning and data exploration steps for Human Activity Recognition Using Smartphones Data Set in R programming language.
data-analysis data-cleaning data-exploration r-programming
Last synced: 15 Oct 2024
https://github.com/mdaffailhami/king_county_home_sales_analysis
This repository contains code and analysis for exploring home sales data in King County, featuring geospatial mapping to visualize trends and factors influencing housing prices, including location, size, and various property features, using Python and popular data analysis libraries.
data-analysis data-science folium-maps geospatial python
Last synced: 07 Nov 2024
https://github.com/mh0386/motorcycle_data_analysis
Data analysis applied to motorcycle dataset.
Last synced: 07 Nov 2024
https://github.com/farzeennimran/apriori-algorithm
Apriori Algorithm for Association Rule Mining
algorithm apriori apriori-algorithm association-rule-mining association-rules data-analysis data-mining data-science python
Last synced: 07 Nov 2024
https://github.com/marlysson/craw
A system to show the data collected from various sources using chartjs - ⚡️
chartsjs data-analysis data-science web-scraping
Last synced: 14 Oct 2024
https://github.com/thenorthkun/movies-dataset-analysis
Analysis & categorizing of Movies based on Actors, Genres, Gross covered etc 🦸🏼🧜🏼♀️🎧
data-analysis data-visualization filtering
Last synced: 14 Oct 2024
https://github.com/erickchacon/day2day
Functions that can be useful in the day-to-day data analysis. It comprehends functions to find paths for projects, make summaries of databases inside folder and so on.
data-analysis exploratory-data-analysis simulation spatial-analysis
Last synced: 17 Nov 2024
https://github.com/hickee032/python
python
data-analysis data-visualization python study
Last synced: 08 Nov 2024
https://github.com/justinhennis1/hackathon24
Hofstra's Hacknology Competition 2024 - Team Null Pointers
data data-analysis data-science data-visualization data-visualization-python dataanalysis dataanalytics traveling web webapplication
Last synced: 13 Nov 2024
https://github.com/dmvianna/python-nix
Trivial Nix environment with pandas and postgresql
Last synced: 15 Oct 2024
https://github.com/wadeChriestenson/Main_Application
A Django application to host my personal resume.
data-analysis data-visualization django plotly python ui-design
Last synced: 23 Oct 2024
https://github.com/pedrosfaria2/analisandopostshn
Projeto para analisar as postagens da comunidade HackerNews
analise-de-dados data-analysis datetime jupyter-notebook matplotlib python python3
Last synced: 09 Nov 2024
https://github.com/akshaypratapsingh09/zomato-blogs-all-links-dataset
Engineering / Culture / Blogs Data gathered for Educational and Learning purposes from Zomato's Blogs and spreading the better problem solving Methodologies adapted by Modern Unicorns
data-analysis dataset regex selenium webdriver zomato-data-analysis
Last synced: 01 Nov 2024
https://github.com/mumtaz4118/employee-satisfaction-and-attrition
Analysis of attrition based on environmental satisfaction from a Kaggle dataset.
data data-analysis data-science data-visualization ipynb jupyter-notebook machine-learning python statistical-analysis statistical-models
Last synced: 18 Oct 2024
https://github.com/prangonghose/wikipedia-blocking-policies
This study investigates the relationship between editors’ disruptive behavior and regulation policies on English Wikipedia, focusing on the Blocking Policy page. The study collects and analyzes data from 2004 to 2022 using the Wikipedia API, page statistics, and keyword extraction.
data-analysis data-visualization matplotlib open-source pandas python3 seaborn
Last synced: 20 Oct 2024
https://github.com/waseemofficial/ml-practice
ML Practice
data data-analysis jupyter-notebook machine-learning ml python
Last synced: 24 Oct 2024
https://github.com/bpkaur/exploring-the-evolution-of-linux
This project explores the evolution of the Linux kernel by finding top 10 contributors and visualization of commits over the years.
data-analysis data-science datacamp ipynb-jupyter-notebook python3
Last synced: 13 Nov 2024
https://github.com/farzeennimran/fashion-mnist-dataset-classification-using-neural-network
Implementation of a Multi-layer Perceptron classifier with hyperparameter tuning and k-fold cross-validation employing GridSearchCV for classifying images on the Fashion MNIST dataset 👗👚👖
artificial-intelligence data-analysis data-mining data-science dataset deep-learning fashion-mnist-dataset gridsearchcv hyperparameter-tuning kfold-cross-validation machine-learning multilayer-perceptron-network neural-network numpy pandas python sklearn
Last synced: 07 Nov 2024
https://github.com/byte7/fifa17-analysis-and-prediction
⚽ FIFA 17 Analysis and Prediction⚽
data-analysis dreamteam fifa17 fifa17-analysis ultimate-team
Last synced: 15 Oct 2024
https://github.com/thijswillemmoens/historical_document_analysis
Trying to do some Data Science with OpenAI and LLMs.
data-analysis llama2 ollama-api openai openai-api python
Last synced: 08 Nov 2024
https://github.com/jayita11/healthcare-management-optimization-analysis-and-visualization
This project analyzes healthcare data from 2019 to May 2024, optimizing patient care, resource allocation, and financial management. Insights include billing trends, blood bank management, doctor performance, and medication demand, supported by excel,interactive Tableau dashboards and SQL analysis.
data-analysis excel healthcare interactive-dashboards mysql sql tableau-dashboards
Last synced: 14 Oct 2024
https://github.com/ashwin331133/sql-healthcare-data
This repository contains SQL queries designed to analyze health care data. The queries focus on patient demographics, encounter costs, and flu shot statistics, aiming to provide insights into patient behavior and financial impacts. The datasets include information on patient encounters, flu shots, and hospital admissions.
Last synced: 14 Oct 2024
https://github.com/touchesir/twitter_physicalactivity
Companion Data / Analysis for "Monitoring Physical Activity Levels using Social Media Data"
Last synced: 15 Oct 2024
https://github.com/lit26/novel-corona-virus-2019
Data Analysis for Novel Corona Virus 2019
analysis coronavirus-case data-analysis sir-model
Last synced: 15 Oct 2024
https://github.com/shyamkumarnagilla/ai-powered-forecasting-for-agricultural-productivity
AI Powered Forecasting for Agricultural Productivity is a project that utilizes machine learning to predict crop yields and optimize farming practices. By harnessing historical and real-time data, this model empowers farmers with data-driven insights to enhance productivity and sustainability in agriculture.
data-analysis data-visualization deep-learning flask neural-network
Last synced: 09 Nov 2024
https://github.com/pedrosfaria2/analisetitulosnetflix
Estudo de popularidade dos filmes da Netflix no IMDB.
analise-de-dados data-analysis jupyter-notebook matplotlib numpy pandas python
Last synced: 09 Nov 2024
https://github.com/pedrosfaria2/fugascomhelicoptero
Meu primeiro uso do Jupyter Notebook em um projeto
analise-de-dados data-analysis jupyter-notebook matplotlib pandas python
Last synced: 09 Nov 2024
https://github.com/saymyname1337/bachelor-s-thesis
Bachelor's thesis of a student of the MPEI of Shevts G. V.
Last synced: 05 Nov 2024
https://github.com/yutkin/gotohack-2-solution
Solutions of the selection stage
data-analysis data-science hackathon machine-learning
Last synced: 05 Nov 2024
https://github.com/adrianlardies/feelms_predict_by_emotion
Feelms is a mood-based movie recommendation app that uses collaborative filtering and machine learning to suggest films based on your emotions. Built with Streamlit and powered by AWS, Feelms personalizes each user's experience through simulated interactions and tailored predictions. AWS, Feelms brings
aws-ec2 aws-rds data-analysis data-science machine-learning python streamlit
Last synced: 31 Oct 2024
https://github.com/stribedi-94/comcast-telecom-consumer-complaints-data-analysis
Comcast Telecom Consumer Complaints Analysis
comcast-telcom-complaints data-analysis data-science data-visualization python
Last synced: 16 Nov 2024
https://github.com/rileynwong/forecasting-coffee-prices
Predict coffee prices in Kenya
data data-analysis data-scraping data-visualization forecasting forecasting-models forecasting-prices jupyter-notebook prophet prophet-model
Last synced: 14 Oct 2024
https://github.com/nitins17/tableauvisualizations
Visualizations I created while learning to work with Tableau
data-analysis data-science data-visualization tableau visualization
Last synced: 28 Oct 2024
https://github.com/ntaraujo/cleo
Contact data processor for Cléo
contacts-manager data-analysis data-visualization whatsapp whatsapp-web
Last synced: 14 Oct 2024
https://github.com/anindyadas2001/resume
My Resume
cloud computer-science-engineering data-analysis freshers resume-template
Last synced: 15 Nov 2024
https://github.com/kheriberto/knn_project
This is a simple project that uses dummie data to practice and demonstrate my knowledge of the KNN algorithm.
data-analysis knn-classifier numpy python scikit-learn seaborn
Last synced: 18 Oct 2024
https://github.com/vishal-verma-96/capstone_project_by_skill_academy
Capstone Project by skill Academy- Exploratory Analysis, Visualization and Prediction of Used Car Prices. Deploying the highest-scoring model with Streamlit web app
data-analysis data-science jupyter-notebook machine-learning machine-learning-algorithms matplotlib numpy pandas python3 regression-algorithms scikit-learn seaborn
Last synced: 18 Oct 2024
https://github.com/eesunmoon/genai_cor-recom
[Project] Outfit Coordination Recommender System using KoAlpaca
data-analysis fine-tuning generative-ai huggingface keyword llm numpy pandas python python3 selenium
Last synced: 18 Oct 2024
https://github.com/mxagar/data_science_udacity
My personal notes, code and projects of the Udacity Data Science Nanodegree.
dashboard data-analysis data-engineering data-science machine-learning-pipelines
Last synced: 05 Nov 2024
https://github.com/salfaris/toy-data-analysis
Random toy data projects. For my portfolio data projects, see linked website
Last synced: 15 Nov 2024
https://github.com/ct83/become-a-data-analyst-udacity
This repository contains all of the code, projects and reports that I wrote as I pursued my Udacity - Data Analyst NanoDegree.
data-analysis data-analysis-python data-analyst data-visualisation data-visualization-project datascience python udacity udacity-data-analyst-nanodegree
Last synced: 13 Nov 2024
https://github.com/tusharpandey003/chat_analysis
Analysis of group chat with respect to individual member of group
chat-analysis chat-analyzer data-analysis data-science streamlit whatsapp whatsapp-chat whatsapp-web
Last synced: 14 Oct 2024
https://github.com/i-e-b/dynamictimewarp
A quick C# implementation of https://jeremykun.com/2012/07/25/dynamic-time-warping/
data-analysis pattern-matching working
Last synced: 14 Oct 2024
https://github.com/jimmymugendi/email-sms-spam_classifier
Email SMS Spam Classifier is a cutting-edge machine learning solution designed to combat spam messages in email and SMS communications. Leveraging advanced Natural Language Processing (NLP) techniques, the system preprocesses text data by tokenizing, removing stopwords, and stemming, ensuring the most accurate classification results.
data-analysis data-visualization machine-learning-algorithms pandas seaborn-plots sklearn-library
Last synced: 16 Nov 2024
https://github.com/swethajoseph/credit-risk-assessment-eda-case-study
Conducted an Exploratory Data Analysis (EDA) using Python to assess credit risk, identifying key factors that contribute to loan defaults and improving lending decisions
data-analysis data-visualization datacleaning datapreparation exploratory-data-analysis feature-engineering jupyter-notebook matplotlib-pyplot numpy-library pandas-library python-library risk-analysis risk-assessment risk-management seaborn-plots visual-studio-code
Last synced: 31 Oct 2024
https://github.com/manish506/loan-approval-prediction
Explore predictive modeling in this project by applying classification techniques to a loan approval dataset. Analyze and preprocess the data, then use models like K-Nearest Neighbors, Random Forest, SVC, and Logistic Regression to predict loan outcomes. Gain insights into approval factors and enhance prediction accuracy.
classification classification-models data-analysis data-science jupyter-notebook loan-approval-prediction machine-learning predictive-analytics predictive-modeling project python
Last synced: 31 Oct 2024
https://github.com/farizsidki/proyek-analisis-data
Submission Dicoding Indonesia "Proyek Analisis Data"
data-analysis data-visualization exploratory-data-analysis jupyter-notebook python streamlit
Last synced: 31 Oct 2024
https://github.com/risdorn/restaurant-delivery-platforms-analysis-bdm-project
This project analyzes restaurant delivery platforms to understand customer preferences, industry competition, and expansion opportunities. Conducted as part of the BDM project from IITM, it includes descriptive stats, distribution, correlation, regression, and geospatial analysis using multiple datasets.
data-analysis data-visualization jupyter-notebook kaggle
Last synced: 31 Oct 2024
https://github.com/mxagar/eda_fe_summary
An 80/20 guide for Data Processing: Data Cleaning, Exploratory Data Analysis, Feature Engineering, Feature Selection.
data-analysis data-cleaning data-modeling data-science data-visualization eda exploratory-data-analysis feature-engineering feature-selection machine-learning pandas
Last synced: 05 Nov 2024
https://github.com/se7en69/rna-seq-data-processing-and-analysis-pipeline
This pipeline automates essential steps for RNA-Seq data analysis, including quality control, read trimming, alignment to a reference genome, and coverage quantification. It leverages tools like FastQC, fastp, STAR, and bedtools to ensure high-quality results, with MultiQC reports providing an overview at each stage.
bioinformaitcs-scripting bioinformatics bioinformatics-pipeline data-analysis linux scripts shell
Last synced: 31 Oct 2024
https://github.com/billgewrgoulas/hypothesis-testing-on-37-seasons-of-nba
First assignment for the course Data Mining @CSE.UOI
data-analysis data-science numpy scipy seaborn statistics
Last synced: 16 Nov 2024
https://github.com/roland045/smart_fluid_sedimentation_tester
Control program for custom developed smart fluid sedimentation tester system
arduino data-analysis instrumentation measurement sensor
Last synced: 10 Nov 2024
https://github.com/omr5221/bi-scripts
Example of DI BI tool scripting
automation configuration-files data-analysis data-warehousing dimensional-analysis diver etl-pipeline lookup modeling perl-script python shell-script sql summarization
Last synced: 12 Oct 2024
https://github.com/cecoeco/sas_certificate
my code from Coursera's SAS programming specialization
Last synced: 07 Nov 2024
https://github.com/mxagar/space_exploration
This repository is a collection of mini-projects and tutorials related to space images and geo-spatial data.
data-analysis deep-learning geospatial machine-learning
Last synced: 05 Nov 2024
https://github.com/lulloooo/bizdata-nexus
Collection of my Business & Data Analysis projects, from professional/academic endeavors to passion-driven explorations 📊
business-analysis data-analysis economics etl excel finance mysql python r risk-analysis
Last synced: 28 Oct 2024
https://github.com/gracysapra/r-in-data-science
This repository contains essential guides for data analysis using R, covering topics like data preparation, data reshaping, and data visualization. Each file focuses on fundamental techniques to manipulate, clean, and visualize data effectively using R programming.
data-analysis data-preparation data-reshaping data-science data-visualization data-visualizations ggplot r r-for-data-science
Last synced: 28 Oct 2024
https://github.com/rijul007/market-basket-analysis-using-r
Market Basket Analysis using association rules, leveraging R’s powerful tools for data-driven retail strategies.
Last synced: 28 Oct 2024