Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.


Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/cego669/datathonengopevi

Team: Embrapeiros. Proposed solution for the Datathon of the VI ENGOPE (Encontro Goiano de Probabilidade e Estatística, the Goiás Meeting on Probability and Statistics). Note: WE WERE THE CHAMPIONS!

data-analysis data-science datathon python r streamlit xgboost-classifier

Last synced: 08 Dec 2024

https://github.com/praju-1/pandas

The library is widely used in data science and machine learning for data cleaning, preparation, and analysis.

data-analysis pandas python

Last synced: 15 Dec 2024

https://github.com/virajbhutada/tableau-data-vizzes

Engage with a growing collection of Tableau dashboards covering financial trends, HR analytics, streaming service insights, real estate dynamics, and more. Meticulously crafted for valuable insights, this repository continues to expand with new and compelling visualizations.

business-analytics data-analysis data-visualization hr-analytics industry-trends netflix performance-metrics stock-market-analysis strategic-analytics tableau visual-insights

Last synced: 10 Jan 2025

https://github.com/deep-diver/data-analysis-on-titanic

Applying data analysis to the Titanic data sheet

data-analysis titanic-data

Last synced: 18 Jan 2025

https://github.com/deep-diver/enron-data-analysis

Data Analysis and Machine Learning on Enron Data

data-analysis enron-data exploratory-data-analysis machine-learning

Last synced: 18 Jan 2025

https://github.com/joanacmbarros/ardm-website

Website to support the R in Pharma 2023 workshop on the ARDM

analysis-results automation clinical-data data-analysis data-model r-in-pharma

Last synced: 16 Dec 2024

https://github.com/rapidsurveys/oldr

An Implementation of the Rapid Assessment Method for Older People (RAM-OP)

assessment data-analysis epidata estimate odk older-people r ram-op ranalyticflow rapid-assessment

Last synced: 24 Dec 2024

https://github.com/thecoderpinar/credit-card-fraud-detection-project

This project focuses on the detection of credit card fraud using various data science and machine learning techniques. The dataset includes a record of credit card transactions over a specific period, with the goal of accurately identifying fraudulent activities. 🚀✨

anamoly-detection classification-algorithms credit-card-transactions data-analysis data-preprocessing data-science data-visualization fraud-detection machine-learning python

Last synced: 16 Dec 2024

https://github.com/thecoderpinar/reta

🍃 Explore the world of renewable energy production, analyze historical data, and predict sustainable energy trends. Join us on the journey to a greener future!

arima clean-energy data-analysis data-science data-visualization energy-future forecasting-models innovation renewable-energy sustainability time-series

Last synced: 16 Dec 2024

https://github.com/worst001/note_machine_learning

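A collection of machine learning materials and handbooks, covering mathematical foundations, example implementations of machine learning models, and neural networks.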

ai data-analysis deep-learning development guide learning machine-learning markdown mkdocs note notebook

Last synced: 12 Jan 2025

https://github.com/thecoderpinar/gen-expression

Gene expression analysis is a fundamental component of genomics research, providing valuable insights into how genes are regulated and their impact on various biological processes. This project delves into the realm of gene expression data, aiming to uncover hidden patterns and relationships within complex datasets. 🚀

bioinformatics biotechnology data-analysis data-science data-visualization genomics kaggle machine-learning pca python

Last synced: 16 Dec 2024

https://github.com/jimbrig/eda

Exploratory Data Analysis R Package and Shiny App

data-analysis data-visualization eda r shiny

Last synced: 23 Jan 2025

https://github.com/thecoderpinar/hms-brainactivity-analysiss

Welcome to the GitHub repo for "HMS - EEG Exploration & Neurocritical Care Journey"! Explore EEG data, understand wave patterns, and delve into conditions like LPDs, GPDs, LRDA, and GRDA.

critical-care data-analysis data-science data-visualization deep-neural-networks eeg eeg-signals exploratory-data-analysis healthcare medical-research neuroscience signal-processing

Last synced: 16 Dec 2024

https://github.com/antononcube/wl-outlieridentifiers-paclet

Wolfram Language (aka Mathematica) paclet that provides outlier identifier functions.

data-analysis hampel outlier-detection outliers

Last synced: 15 Dec 2024

https://github.com/shashankbansal6/signal-analysis-for-patient-monitoring

A reliable patient monitoring system which analyzes the correlated physiological signals collected from the patient's body, and generates alarms for abnormalities.

data-analysis patient-monitoring

Last synced: 17 Dec 2024

https://github.com/gjbex/python-dashboards

Repository that contains material for training sessions on creating dashboards using Python.

dash dashboard data-analysis data-exploration data-science data-visualization panel python streamlit training training-materials visualization

Last synced: 22 Nov 2024

https://github.com/juliusmarkwei/concrete-data

Data analysis, machine learning, model evaluation and optimization on the Concret_ Dataset

data-analysis data-science data-visualization ensemble-learning machine-learning modeling

Last synced: 01 Jan 2025

https://github.com/bkataru/physics-ia

Programs and files written for Astrostatistics for IB Physics IA. Topic: Visualizing and analyzing the habitable zones for 150,000 stars from the Hipparcos catalogue.

astronomical-algorithms astronomy astrophysics astrostatistics data-analysis data-science data-visualization matplotlib plotting

Last synced: 22 Dec 2024

https://github.com/ziaeemehr/itng_nest

NEST Simulator quick guides and examples, including adding a new model using NESTML

computational-neuroscience data-analysis nest-simulator neuroscience

Last synced: 01 Jan 2025

https://github.com/mwoss/mlflow-stock-market-example

Stock market prediction - machine learning pipeline using MLFlow.

anaconda data-analysis databricks example lstm mlflow python stock-market stock-price-prediction tutorial

Last synced: 24 Jan 2025

https://github.com/uts58/international-student-job-insights-usa

Data-driven insights on job hunting for international students in the USA, analyzing listings, roles, and trends.

career-insights cpt data-analysis eb1 eb2 eb3 h1b handshake job-analytics job-trends jobs jupyter-notebook opt python work-visa

Last synced: 25 Dec 2024

https://github.com/adirthaborgohain/community-data-analysis

Data and Visual Analysis on several different communities generated using Louvain Algorithm in Neo4j on the dblp dataset.

data-analysis lda python

Last synced: 11 Dec 2024

https://github.com/frikishaan/browsing-history-analysis

This is a data analysis of my browsing history for the last 7 months.

browsing-history data-analysis jupyter-notebook python

Last synced: 09 Jan 2025

https://github.com/farahibrar/kpmg-job-simulation

This repository showcases my work from the KPMG Technology Job Simulation by Forage, focusing on Data Analytics and Cloud Engineering. Explore how I tackled real-world business challenges through sales data analysis, regional growth strategies, and AWS architecture design, highlighting my analytical and technical expertise.

aws-architecture business-intelligence cloud-engineering cloud-strategy-and-design data-analysis data-visualization fintech-solutions forage kpmg kpmg-careers python-for-data-analysis sales-data-insights sustainable-retail-analysis

Last synced: 25 Jan 2025

https://github.com/jshinm/web-scrapper

Web scraper used to extract NeuroData GitHub repo stats

data-analysis web-scraping

Last synced: 17 Dec 2024

https://github.com/mynenik/xyplot-32

Extensible Plotting and Data Analysis Program for 32-bit x86 GNU/Linux

cpp data-analysis data-manipulation data-visualization forth linux-app motif xwindows

Last synced: 24 Jan 2025

https://github.com/anushadatta/airbnb-in-seattle

🏨 Understanding the Airbnb rental landscape in Seattle using data science.

airbnb data-analysis data-exploration data-visualization datascience sentiment-analysis

Last synced: 11 Dec 2024

https://github.com/thennen/py-ivtools

A package for flexible and reproducible measurement and analysis of current-voltage characteristics of electronic devices.

current-voltage data-analysis data-visualization electrical-engineering emerging-technology instrumentation measurements

Last synced: 24 Jan 2025

https://github.com/pratishtha-abrol/astronomy-dataanalysis

A key technique in Data Driven Astronomy

astronomy astropy crossmatch data-analysis

Last synced: 12 Dec 2024

https://github.com/hvignolo87/ortex-programming-challenge

Coding challenges required for the Python Developer and Data Engineer job positions.

challenge data-analysis finance pandas python scripting sql sqlalchemy

Last synced: 02 Jan 2025

https://github.com/depressioncenter/data-and-design-core

Code developed by the EFDC Data and Design Core team to support mental health research.

data-analysis data-science efdc inference r statistical-analysis umich

Last synced: 25 Jan 2025

https://github.com/lacerbi/vbmc

Variational Bayesian Monte Carlo (VBMC) algorithm for posterior and model inference in MATLAB (old location)

bayesian-inference data-analysis gaussian-processes machine-learning matlab variational-inference

Last synced: 12 Dec 2024

https://github.com/mindlessmuse666/client-data-analysing-tool

Industrial internship project: a data analysis tool built with Python (backend, with a PyQt6 frontend), Pandas, Matplotlib, and SQLite. The application lets users load data in CSV format, filter it, visualize key metrics with charts, and generate reports.

data-analysis desktop-application matplotlib pandas pyqt6 pyqt6-desktop-application python sqlite student-project

Last synced: 23 Dec 2024

https://github.com/sevdanurgenc/r-programming-for-data-science-lecture-notes

This repo contains the course contents of the R Programming for Data Science training, given to Sigorta Bilgi ve Gözetim Merkezi in cooperation with Academy Peak Information Technologies Training and Consultancy on 21-23 March 2023.

data-analysis data-science data-visualization r r-programming r-programming-projects

Last synced: 30 Nov 2024

https://github.com/tathithienthanh/datamining-banking-dataset

Implement some learned data mining techniques and predict if the client will subscribe to a term deposit

apriori association-rules classification clustering data-analysis data-mining data-processing google-colab ipynb kmeans naive-bayes py python scikit-learn svm visualization

Last synced: 25 Jan 2025

https://github.com/atxtechbro/flightradar24

Advanced Python application leveraging the power of APIs and the pandas library to retrieve and perform in-depth analysis of flight data from Flightradar24. It uncovers insights such as the most common departure and arrival cities, contributing to the field of aviation data science.

api-integration aviation-data data-analysis data-science data-visualization flightradar24-api pandas-library python requests-library web-scraping

Last synced: 25 Jan 2025

https://github.com/poga/dat-ipynb-demo

Use an IPython notebook to analyze data in a Dat archive

dat data-analysis distributed jupyter-notebook

Last synced: 15 Dec 2024

https://github.com/kevinyang372/san-francisco-crime-data-analysis

An ARIMA prediction model for forecasting potential crimes based on users' time and location

data-analysis machine-learning

Last synced: 02 Dec 2024

https://github.com/yahia3200/become-an-independent-data-scientist

My final project for the Applied Plotting, Charting & Data Representation in Python Course

data-analysis data-science data-visualization matplotlib

Last synced: 22 Jan 2025

https://github.com/quantumudit/uk-student-accommodation-analysis

This project focuses on scraping student properties related data from the UK Student Accommodation website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.

data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping

Last synced: 26 Dec 2024

https://github.com/quantumudit/analyzing-goodreads-famous-quotes

This project focuses on scraping famous quotes and their related data from the GoodReads website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.

data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping

Last synced: 26 Dec 2024

https://github.com/quantumudit/analyzing-quotes

This project focuses on scraping all the quotes and their related data from the "Quotes To Scrape" website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.

data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping

Last synced: 26 Dec 2024

https://github.com/quantumudit/analyzing-suez-services

This project focuses on scraping all the service locations across Australia & New Zealand and their associated attributes from the "Suez" website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.

data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping

Last synced: 26 Dec 2024

https://github.com/quantumudit/demographic-data-analysis

This project focuses on analyzing and finding correlations among three important metrics across 195 countries, i.e., birth rate, internet users, and income group.

data-analysis jupyter-notebook power-bi python

Last synced: 26 Dec 2024

https://github.com/quantumudit/alteryx-weekly-challenges

This repository contains Alteryx solutions to the weekly challenges published in Alteryx Community

alteryx alteryx-workflow data-analysis data-science data-transformation data-visualization etl

Last synced: 26 Dec 2024

https://github.com/quantumudit/analyzing-gamerevolution-games

This project focuses on scraping data related to video games from the GameRevolution website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.

data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping

Last synced: 26 Dec 2024

https://github.com/verbasik/yandex.practicum.datascience

Portfolio of Data Science projects completed as part of professional retraining at Yandex.Practicum. Includes studies in finance, real estate, film distribution, and other areas, using statistics, machine learning, and data analysis.

data-analysis data-science machine-learning yandex-praktikum

Last synced: 10 Jan 2025

https://github.com/gustavohnsv/teamwork_mqa

Repository dedicated to the group project based on the study of data analysis methods for the course Métodos Quantitativos para Análise Multivariada (Quantitative Methods for Multivariate Analysis).

data-analysis group-project r team-repo

Last synced: 16 Dec 2024

https://github.com/mindful-ai-assistants/credit-card-prediction

💳 This repository focuses on building a predictive model to assess the likelihood of credit card defaults. The project includes data analysis, feature engineering, and machine learning to provide accurate default predictions.

artificial-intelligence data-analysis data-science jupyter logistic-regression machine-learning predictive-modeling python3 scikit-learn

Last synced: 09 Dec 2024

https://github.com/sabujxi/python-scraper-and-data-analysts-admin-panel-in-django

A data scraper for a Texas government site and a companion web app for managing, reviewing, and editing the data

analyst data data-analysis data-entry data-scraper django django-application python python-scraper real-estate regex scraper texas

Last synced: 01 Jan 2025

https://github.com/nhsdigital/sde_example_analysis

Example of what you can do in Databricks in the Secure Data Environment (SDE) using Python, SQL, and R.

data-analysis data-science databricks-notebooks machine-learning mlflow

Last synced: 23 Dec 2024

https://github.com/kenvilar/data-analysis-using-python

Transforming location descriptions from analyzed CSV file data using Pandas with Python 3

bs4 data-analysis jupyter pandas python python3 requests xlrd

Last synced: 18 Nov 2024

https://github.com/casualcomputer/sql.mechanic

Functions that generate SQL queries that summarize high-dimensional tables stored in various databases (e.g. Microsoft SQL Server, Netezza, DB2, Postgres, Oracle, MySQL).

data-analysis data-quality-checks data-science database mysql netezza oracle postgres quality-control r sql sql-server

Last synced: 04 Dec 2024

https://github.com/yusufcinarci/covid-19-data-analysis-visualization

The first project of our data visualization studies is the COVID-19 data analysis project. In this project, we analyzed, country by country, data on the COVID-19 pandemic, which started in the first month of 2020 and still continues to affect the world. You can find brief details of the project, which we carried out in 3 stages, in the readme file. We have tried to explain the details of the project step by step below. We wish you healthy days.

covid-19-data-visualization data-analysis data-science data-visualization

Last synced: 26 Dec 2024

https://github.com/goggle/dataisbeautiful

Some data analysis Jupyter notebooks, mainly intended for submissions on the subreddit /r/dataisbeautiful.

data-analysis data-visualisation jupyter-notebook notebook reddit

Last synced: 26 Dec 2024

https://github.com/njoyedevs/chatgpt3_riskanalyzer

In this project, ChatGPT3 was fine-tuned on 9 data series spanning 40 years. This helped train ChatGPT3 to provide a market risk score. To view, visit: https://www.aimarketrisk.com

chatgpt3 data-analysis flask fred-api full-stack-web-development pandas python

Last synced: 02 Dec 2024

https://github.com/lafayettegabe/nlp-resume-extraction

📝 NER (Named Entity Recognition) project aimed at solving the problem of manually shortlisting resumes by automating the process. This project proposes using NLP techniques and an NER model to classify and extract relevant entities from resumes, such as person name, college name, academic information, relevant experience, and skill set.

big-data data data-analysis data-science eda ner nlp resume-extractor

Last synced: 16 Dec 2024

https://github.com/lafayettegabe/g2m-insight-for-cab-investment-firm

📊 Exploratory Data Analysis (EDA) on multiple datasets related to the cab industry in the US, to provide actionable insights and recommendations to a private firm looking to invest in the market. The analysis includes data cleaning, transformation, visualization, and hypothesis testing.

big-data data data-analysis data-science data-visualization eda gotomarket

Last synced: 16 Dec 2024

https://github.com/jimbrig/EDA

Exploratory Data Analysis R Package and Shiny App

data-analysis data-visualization eda r shiny

Last synced: 04 Dec 2024

https://github.com/grburgess/gbm_kitty

Database, reduce, and analyze GBM data without having to know anything. Curiosity killed the catalog.

3ml catalogue data-analysis fermi-science grbs pipelines

Last synced: 23 Jan 2025

https://github.com/freekatz/english-reading

Postgraduate entrance exam English (10-19) dataset and related data analysis

data-analysis dataset

Last synced: 07 Dec 2024

https://github.com/milind220/hk-air-quality-analysis

My final project for a statistics and data analysis course. Whew, that was a lot of graphs!

data-analysis jupyter-notebook numpy pandas python python3 scipy seaborn statistics

Last synced: 03 Jan 2025

https://github.com/timzatko/fifa-19-dataset-machine-learning

Player value prediction and game position classification on the FIFA 19 dataset.

data-analysis fifa19 machine-learning scikit-learn

Last synced: 03 Jan 2025

https://github.com/tirendazacademy/data-sets

Datasets for the Tirendaz Akademi YouTube channel

data-analysis dataset

Last synced: 01 Jan 2025

https://github.com/al-ghaly/airline-company-data-warehouse

Data Warehouse modeling, design, implementation, and analysis for an Airline Company.

data-analysis data-warehousing database-modeling sql-server

Last synced: 22 Jan 2025

https://github.com/seyedhosseinzadeh/ws_tm

Weather web scraping and a time series model to predict temperature, humidity, and barometric pressure

data-analysis deep-learning lstm-model machine-learning prediction prediction-model weather web-scraping

Last synced: 10 Jan 2025

https://github.com/iamgmujtaba/scholar_search

This project provides a tool for extracting and analyzing the quantity and distribution of scholarly articles related to a particular topic or field over a desired time span, using Google Scholar search results and built-in data visualization functionality.

academia academic academic-papers data-analysis data-visualization google google-scholar scholarly-articles

Last synced: 16 Dec 2024

https://github.com/olow304/goboard

Python Data Analysis Dashboard using Public Dataset, Django

dashboard dashboard-templates data-analysis data-science django jupyter-notebook machine-learning python sklearn

Last synced: 04 Jan 2025

https://github.com/shivamswarnkar/tesla-stock-prediction

Predicting closing prices of Tesla stock using different regression methods.

data-analysis data-visualization plotly regression regularization sklearn stock-price-prediction

Last synced: 26 Jan 2025

https://github.com/walidalsafadi/house-prices

Ask a home buyer to describe their dream house, and they probably won't begin with the height of the basement ceiling or the proximity to an east-west railroad. With 79 explanatory variables describing (almost) every aspect of residential homes in Ames, Iowa, this competition challenges you to predict the final price of each home.

cross-validation data-analysis data-science data-visualization decision-trees eda house-price-prediction house-prices jupyter linear-regression machine-learning machine-learning-algorithms mlp-regressor plot python random-forest-regression regression svr xgboost-regression

Last synced: 22 Jan 2025

https://github.com/ehtisham-sadiq/ai-pioneers-datascience-arena

This repository is dedicated to the AI Amigos team's participation in the Artificial Intelligence (AI) competition with a focus on Data Science.

artificial-intelligence competition data-analysis data-science data-visualization machine-learning model-building model-evaluation numpy pandas python3 supervised-learning unsupervised-learning

Last synced: 11 Jan 2025

https://github.com/elhaban3ro/thewildtool

TheWildTool is a tool developed with the main objective of saving time when working with audio datasets, whether to prepare them, obtain them, or train a model with them. 🤖

ai audio audio-processing data-analysis data-science dataset deeplearning python

Last synced: 03 Dec 2024

https://github.com/vandita2020/merra2_nasa_wind_speed_analysis

In this study, we aim to explore the vulnerability of power grids in the southeastern region of the USA with the help of data analysis tools and machine learning algorithms.

data-analysis data-science machine-learning-algorithms python

Last synced: 11 Jan 2025

https://github.com/yogeshnile/nifty50-index-time-series-analysis

This repo contains an analysis of five years of Nifty50 data, from 01-04-2015 to 31-03-2020. Data downloaded from the official NSE website.

data-analysis matplotlib nifty numpy pandas plotly python3 time-series-analysis

Last synced: 10 Jan 2025

https://github.com/ysayaovong/car-sales

An analysis of car sales data to uncover market trends and insights through data cleaning, analysis, and visualization.

automotive business-analysis data-analysis data-cleaning data-visualization market-trends matplotlib pandas python sales-data scikit-learn seaborn

Last synced: 23 Jan 2025

https://github.com/narenkhatwani/arkouda-projects

This repository contains the source code of projects built with Arkouda (a software package that allows a user to interactively issue massively parallel computations on distributed data using functions and syntax that mimic NumPy, the underlying computational library used in most Python data science workflows).

arkouda data-analysis data-analytics data-science high-performance high-performance-computing highperformancecomputing numpy pandas parallel-computing parallel-processing parallelization python

Last synced: 12 Dec 2024