Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/sevdanurgenc/python-for-data-science-lecture-notes

In this repo, I have the course contents of Python for Data Science training, which will be given to Siemens by the cooperation of Academy Peak Information Technologies Training and Consultancy between 28 June - 1 July 2022.

data-analysis data-mining data-modeling data-science data-structure data-visualization matplotlib-tutorial numpy-tutorial pandas-tutorial

Last synced: 29 Jan 2025

https://github.com/mohamedhany99/human-voice-identifier-counter

the application developed in (KIVY) it can identify the users imported into the dataset based on the support vector machine training model it has two features ( Importing new voice - Detection to detect the human voices and count them)

android android-app android-application automation automation-framework data data-analysis data-mining data-science data-visualization datascience kivy kivy-framework machine-learning python

Last synced: 18 Jan 2025

https://github.com/fatihilhan42/web_scraping_football_statistics_per_game_data-main

In this notebook I will describe the process of scraping data from web portal understat.com that has a lot of statistical information about all games in top 5 European football leagues.

data-analysis data-manipulation data-science data-scraping data-visualization jupyter-notebook python

Last synced: 29 Jan 2025

https://github.com/gmbeddard/ee152-realtime_embedded_systems-finalproject

An STM32-based implementation of the Pan-Tompkins algorithm for real-time QRS detection. Includes robust debugging tools, heart rate monitoring, and live ECG signal support via a python graphing script.

cpp-programming data-analysis ecg embedded-c freertos stm32

Last synced: 20 Jan 2025

https://github.com/ayaanjawaid/google_playstore_data_analysis

This project provides an in-depth analysis of Google Play Store apps and user reviews, focusing on understanding app performance, user sentiment, and key trends in app categories. Using Python, I performed data cleaning, feature engineering, and exploratory data analysis (EDA) on app data and reviews.

data-analysis eda html numpy pandas-dataframe plotly python vizualisation

Last synced: 31 Oct 2024

https://github.com/madhuresh2011/amazon-sales-report-analysis-using-python

This project focuses on analyzing Amazon sales data using Python to uncover insights into sales performance, customer behavior, and product trends

charts cleaning-data data-analysis jupyter-notebook matplotlib numpy pandas python seaborn visualization

Last synced: 02 Feb 2025

https://github.com/pheithar/socialdata_madridcentral

Social data and visualization course at DTU - 2022. Effectiveness of Madrid Central

data-analysis data-visualization jupyer-notebook madrid python

Last synced: 29 Jan 2025

https://github.com/faezeh-gholamrezaie/visual-google-scholar-search

A Python script that searches Google Scholar for specific keywords and visually presents the results in various chart formats, enabling researchers to analyze trends and insights in academic literature.

academic academic-research academic-trends ai ai-research bibliometrics data-analysis data-visualization google-scholar publication-analysis python research-trends scholarly scholarly-data word-cloud

Last synced: 28 Dec 2024

https://github.com/manikantasanjay/time_series_data_analysis_on_stocks

Time Series Data Analysis project on Daily Stock Prices of the following companies(Apple, Microsoft, Google, Amazon) for a span of 5 years.

data-analysis pandas stock time-series time-series-analysis

Last synced: 22 Dec 2024

https://github.com/faisal-khann/diwali-sales-analysis

The "Diwali Sales Analysis" project aims to analyze the sales data during the Diwali festival period to uncover insights and trends that can help improve marketing strategies and sales performance in the future

csv data-analysis eda jupyter-notebook matplotlib numpy pandas python seaborn

Last synced: 29 Jan 2025

https://github.com/matthewgrosman/messenger-analytics

Project that ingests Facebook Messenger conversations and generates analytics.

analytics data-analysis excel facebook facebook-messenger java mongodb

Last synced: 08 Nov 2024

https://github.com/titanscouting/tra-analysis

Titan Robotics 2022 Strategy Team Analysis Repository

data-analysis frc frc-scouting hacktoberfest python

Last synced: 09 Feb 2025

https://github.com/vara-co/sql-challenge

EmployeeSQL "Data modeling, data engineering, and data analysis."

data-analysis data-engineering data-modeling databases employee-database erd erdiagram postgres postgresql schema sql tables

Last synced: 02 Feb 2025

https://github.com/dcs-training/intromachinelearning

This course is aimed at providing an introduction to machine learning for those with some beginner level python skills. Go to the readme file

data-analysis data-wrangling machine-learning python statistics

Last synced: 07 Jan 2025

https://github.com/vara-co/space-missions

Space Missions Over Time (1957-2022): Successes vs Failures, and Rocket Usage

data-analysis data-analysis-python history matplotlib pandas pandas-python space space-race spaceships team-project

Last synced: 02 Feb 2025

https://github.com/hayatiyrtgl/data_analysis_project

Financial data analysis: preprocess, visualize, calculate technical indicators.

data-analysis data-analysis-python data-science dataframe numpy pandas python python3 stock-price-prediction talib trade-analysis

Last synced: 22 Dec 2024

https://github.com/vara-co/pandas-challenge

PyCitySchools - Analysis between budget and academic performance in schools

budget-analysis data-analysis jupiter-notebook pandas-dataframe python school-performances

Last synced: 02 Feb 2025

https://github.com/dannyben/datamix

DSL for manipulating tabular data

csv data data-analysis data-engineering gem ruby tabular-data

Last synced: 02 Feb 2025

https://github.com/noodleslove/house-of-representative-analysis-i

This project uses public data about the stock trades made by members of the US House of Representatives.

data-analysis data-science eda kaggle-dataset matplotlib-pyplot pandas python stocks-trading

Last synced: 28 Jan 2025

https://github.com/gowthamsundaresan/eigenscan

blockexplorer for eigenlayer

crypto data-analysis eigenlayer nextjs web3

Last synced: 08 Jan 2025

https://github.com/gaurav-van/house_price_predictor_streamlit_web_app

Data Science Project to Predict House Prices in Bangalore using the concept of Regression. This Repository is used for Deployment of the Project

data-analysis data-science exploratory-data-analysis machine-learning prediction python regression streamlit

Last synced: 02 Feb 2025

https://github.com/ibnaleem/cyberchef-discord

A versatile Discord bot that implements CyberChef's features for encoding, decoding, encrypting, compressing, analysing data directly and more in your Discord server

compression cti cyberchef cybersecurity data-analysis data-manipulation discord-bot discord-js encoding encryption hashing infosec parsing redteam

Last synced: 02 Feb 2025

https://github.com/sarincr/basics-of-julia-programming-language

Julia is a high-level, high-performance, dynamic programming language. While it is a general purpose language and can be used to write any application, many of its features are well-suited for high-performance numerical analysis and computational science.

data data-analysis data-mining data-science data-visualization dataanalysis dataanalytics datascience julia julia-language julia-library julia-package julialang machine-learning

Last synced: 21 Jan 2025

https://github.com/x1ao4/doc-merger

通过 python 脚本将两个相对不完整的文档合并为一个完整的文档 / merge two relatively incomplete documents into one complete document via python script

data-analysis data-merging document-analysis document-comparison document-processing documents filtering filtering-data merge merge-documents

Last synced: 30 Dec 2024

https://github.com/sumitgirwal/procoder-public

"ProCoder", which is a web-based application providing massive open online courses for both professionals and students. It aims to offer a platform for learning coding skills online, accessible to anyone who is interested in learning programming or enhancing their coding knowledge. ProCoder provides courses on various programming languages, tools.

blog-platform bootstrap-4 chat-application css3 data-analysis django-crud django-project html5 javascript numpy-library pandas-library python3

Last synced: 06 Jan 2025

https://github.com/jen-uis/la-crime-data-analysis

This repository contains project materials for the Fall 2023 MGT 256 class. This project is completed with assists from Professor Adem Orsdemir.

business-analytics crime-data crime-data-analysis data-analysis knn la-crimes-from-2020 la-safe r r-markdown r-studio report-generation rmd united-states visualization

Last synced: 21 Jan 2025

https://github.com/solrikk/pictrace-web

PicTraceV2 is a highly efficient image matching platform that leverages computer vision using OpenCV, deep learning with TensorFlow and the ResNet50 model, asynchronous processing with aiohttp, and Selenium for browser automation. PicTraceV2 allows users to upload images directly or provide URLs, quickly scanning a vast database to find image

automation computer-vision data-analysis data-extraction deep-learning image-processing image-search machine-learning natural-language-processing opencv openpyxl pandas python selenium tensorflow web-scraping yandex yandex-api

Last synced: 09 Jan 2025

https://github.com/jrschmidtt/csv-to-html

Convert csv file to html table in javascript.

body-parser csv data-analysis javascript nodejs oop

Last synced: 03 Jan 2025

https://github.com/sandk21/detection_faux_billets

Algorithme de détection de faux billets selon leurs dimensions géométriques et application web pour générer les prédictions

data-analysis data-science data-visualization machine-learning pandas python scipy sklearn streamlit

Last synced: 02 Feb 2025

https://github.com/teja-1403/forage-standard-bank-data-science

This repository contains solutions to the 4 different tasks that must be performed during the Data Science virtual internship provided by Standard Bank via Forage.

automl communication-skills data-analysis data-science machine-learning python sql

Last synced: 30 Dec 2024

https://github.com/shipyardapp/amazonathena-blueprints

Simplified blueprints for building data pipelines with Amazon Athena.

amazon-athena athena cli data-analysis data-engineering data-science elt etl

Last synced: 04 Dec 2024

https://github.com/abdelhakim-gh/machine-learning_data-analysis_project

recognizing handwritten numbers & comparing the Life Expectancy vs Fertility in 1960 & 2013 of regions

data-analysis jupyter-notebook machine-learning python r r-studio

Last synced: 30 Jan 2025

https://github.com/shriram-vibhute/data-analysis

This repository offers a comprehensive collection of data analysis techniques using NumPy Pandas, Matplotlib and Seaborn.

data-aggregation data-analysis data-visualization data-wrangling matplotlib numpy pandas seaborn

Last synced: 15 Jan 2025

https://github.com/willie-conway/datavista

A robust 🐍Python application for data analysis that provides a wide range of tools for 🔃loading, 🧹cleaning, and 🔃preprocessing data. It includes features for 📈statistical analysis, 👨🏿‍🔬hypothesis testing, 🦾machine learning, clustering, ⏳time series forecasting, and 📊data visualization, all designed to enhance your analytical workflow.

analytics big-data command-line data-analysis data-cleaning data-driven data-mining data-pipeline data-preprocessing data-science data-scientist data-visualization data-wrangling exploratory-data-analysis machine-learning pandas predictive-analytics python statistics visualization-tools

Last synced: 10 Jan 2025

https://github.com/shriram-vibhute/digit_classification

This project demonstrates various machine learning techniques for classifying handwritten digits from the MNIST dataset. It covers data preprocessing, model training, evaluation, and advanced classification strategies.

classification data-analysis data-visualization machine-learning matplotlib numpy pandas sk-learn

Last synced: 15 Jan 2025

https://github.com/airscholar/data_analysis_with_ai

A repository showing how to use AI and ChatGPT for Data Analysis with Pandas and Python

chatgpt data-analysis gpt4 openai pandas pandasai python

Last synced: 13 Jan 2025

https://github.com/fbraza/paris_airbnb

Analysis of Paris AirBnB data using R and Shiny

analysis data data-analysis paris-airbnb r shiny

Last synced: 26 Jan 2025

https://github.com/gappeah/global-shipping-analytics-dashboard

This Tableau project provides a comprehensive visual analysis of global sales, shipping costs, and quality metrics across different regions and countries.

data data-analysis data-analyst data-visualization metrics tableau

Last synced: 07 Jan 2025

https://github.com/fbraza/python-dataframe-skim

Get an extended statistic summary of your pandas DataFrame

data-analysis data-science dataframe pandas python3

Last synced: 26 Jan 2025

https://github.com/dual-points/dplearn

A Python package for data analysis.

data-analysis data-science python python-package

Last synced: 01 Feb 2025

https://github.com/dsrodrigovieira/houserocketsales

Este repositório contém um projeto desenvolvido para praticar habilidades de análise de dados utilizando Python

data-analysis data-visualization heroku kaggle-dataset python

Last synced: 30 Dec 2024

https://github.com/virajbhutada/global-universities-success-analysis-powerbi-sql-excel

This capstone project conducts in-depth analysis using Power BI, SQL, and Excel to explore complex dynamics shaping global university success. Integrating data from diverse ranking systems and criteria, our aim is to unravel the factors influencing universities worldwide.

capstone capstoneproject data-analysis data-analytics data-insights data-science data-science-projects data-visualization excel exploratory-data-analysis mece mysql powerbi powerpoint sql

Last synced: 10 Jan 2025

https://github.com/johnsesana/eda-video-game-sales

Exploratory Data Analysis on Public Datasets

data-analysis data-visualization excel

Last synced: 17 Jan 2025

https://github.com/mrankitgupta/titanic-survival-prediction-93-xgboost

Titanic Survival Prediction Project (93% Accuracy)🛳️ In this notebook, The goal is to correctly predict if someone survived the Titanic shipwreck using different Machine Learning Model & Hyperparameter tunning.

classification data-analysis data-science data-visualization gradient-boosting kaggle-competition linear-regression logistic-regression machine-learning machine-learning-algorithms ml ml-models nlp prediction predictive-modeling random-forest titanic titanic-kaggle titanic-survival-prediction xgboost

Last synced: 17 Jan 2025

https://github.com/aekanshd/crazytics-suicidesindia

Basic interpretation of the Suicides in India data-set using R.

data-analysis data-science graph india r suicides

Last synced: 15 Jan 2025

https://github.com/walidalsafadi/titanic-disaster

In this challenge, we ask you to build a predictive model that answers the question: “what sorts of people were more likely to survive?” using passenger data (ie name, age, gender, socio-economic class, etc).

data-analysis data-science decision-trees eda gradient-boosting knearest-neighbors machine-learning-algorithms naive-bayes random-forest titanic-kaggle titanic-survival-prediction

Last synced: 22 Jan 2025

https://github.com/namratha2301/best-selling-books

Comprehensive examination of best-selling books, focusing on understanding sales patterns, genre distributions, and the impact of various features on book performance.This project aims to predict book sales and classify genres, providing valuable insights for authors, publishers, and readers.

data-analysis data-visualization matplotlib pandas sckiit-learn seaborn

Last synced: 28 Jan 2025

https://github.com/maskedsyntax/taskit

A simple web based Task Tracker for better focus

charts data-analysis python3 streamlit task-tracker-app todo-list

Last synced: 05 Feb 2025

https://github.com/abhinavsharma07/fraud_analytics-credit_card_fraud_detection

The aim of this project is to predict fraudulent credit card transactions with the help of different machine learning models.

banking data-analysis decision-trees hyperparameter-optimization machine-learning-algorithms pipelines random-forest-classifier svm-classifier xgboost-classifier

Last synced: 10 Jan 2025

https://github.com/phammings/sales-management-analysis

Sales management analysis and Power BI dashboard for sample business request and user stories

data-analysis excel powerbi sql

Last synced: 15 Jan 2025

https://github.com/akshat0427/python_youtube_history

a bunch of data science operations performed on youtube history data

data-analysis data-science extracting-features

Last synced: 11 Jan 2025

https://github.com/2003harsh/house-price-prediction-using-machine-learning

This project features a web app that predicts house prices using a linear regression model. Users can input details like location, square footage, bathrooms, and bedrooms through an HTML form. I've added a CI/CD pipeline with GitHub Actions, unit testing with pytest, and automated Docker containerization to improve deployment and robustness.

ci-cd data-analysis docker-image flask linear-regression machine-learning matplotlib mlops-workflow requests scikit-learn

Last synced: 09 Feb 2025

https://github.com/raad07/sql_project-world_layoffs_dataset

This is a SQL project which comprises the Data Cleaning in the first part and Exploratory Data Analysis (EDA) in the second part.

data-analysis database mysql sql

Last synced: 22 Jan 2025

https://github.com/phomint/udacity_dataanalysis

All projects and activities

data-analysis python udacity-nanodegree

Last synced: 15 Jan 2025

https://github.com/aadityatamrakar/futures_spread_chart

Cash Market & Futures Daily Spread Chart - NSE Stocks

data data-analysis data-mining expressjs nodejs requests

Last synced: 27 Jan 2025

https://github.com/rakumar99/power-bi-projects

This repository contains various power bi projects and dashboards of Humaan Resources , Financial Analysis using Power BI Desktop.

dashboards data-analysis data-visualization databases datacleaning datamodeling etl powerbi powerquery reports

Last synced: 08 Jan 2025

https://github.com/vkbo/osirisanalysis

Matlab toolbox for analysing simulation results from Osiris 3

data-analysis matlab matlab-gui physics-simulation

Last synced: 16 Nov 2024

https://github.com/rahul-404/full_stack_data_science_with_generative_ai

Welcome to the repository for the course "Full Stack Data Science with Generative AI". This repository is designed to accompany the course and provide resources, exercises, and projects related to the study of data science and generative AI techniques.

data-analysis data-science data-visualization database deep-learning exploratory-data-analysis feature-engineering generative-ai machine-learning nlp python statistics

Last synced: 01 Feb 2025

https://github.com/colburncodes/se_pudding_2023

This project is a React app designed to showcase research conducted by a team of data scientists and data analysts. The app is utilizing React and React-Chartjs-2

chartjs-2 data-analysis data-science data-visualization react-chartjs-2 reactjs

Last synced: 22 Jan 2025

https://github.com/sumidcyber/dataviz-master

This Python application provides a user-friendly interface to load and visualize the contents of a CSV file. Users can choose from various types of graphs and perform analyses on the dataset.

data-analysis data-analysis-project data-analysis-python database databases python python3

Last synced: 22 Jan 2025

https://github.com/yash-kavaiya/ai-analytics

This is a Streamlit app that uses Pandas and AI to perform data analytics on uploaded CSV files.

data-analysis generative-ai pandas streamlit

Last synced: 24 Dec 2024

https://github.com/aravind-selvam/bikeshare-company-analysis

Google Data Analytics Professional Certificate program's Capstone project, of a bike sharing company

analytics business-analytics business-intelligence data data-analysis data-visualization dataanalytics google-data-analytics postgresql sql sql-server

Last synced: 14 Jan 2025

https://github.com/gholamrezadar/favourite-youtube-channels

this program goes through your youtube watch history and sorts channels based how many of their videos you have watched!

data-analysis data-visualization python

Last synced: 30 Dec 2024

https://github.com/gholamrezadar/most-profitable-actors

Finds the list of actors with the most boxoffice profit using TMDB API.

crawling data-analysis tmdb

Last synced: 30 Dec 2024

https://github.com/emso-c/stream-analyser

A tool that analyses YouTube live streams.

cli data-analysis guessing highlights python youtube-video

Last synced: 10 Feb 2025

https://github.com/mattdelaune/retail_rfm_analysis

Power BI multi-page report leveraging advanced data visualization for RFM analysis. Delivers deep analytical insights into customer behavior, engagement, and spending patterns, driving strategic business decisions.

data-analysis dax powerbi report rfm-analysis sales-data visualization

Last synced: 30 Dec 2024

https://github.com/mg380/ibm-applied-data-science-capstone

This Capstone is the 10th (final) course in IBM Data Science Professional Certificate specialization, and it actually summarises in the form of project all materials that have been learned during this specialization

capstone data data-analysis data-science datascience ibm machine-learning plotly python scikit-learn sql

Last synced: 09 Feb 2025

https://github.com/rickcontreras/modelos1

Modelo de clasificación para predecir el desempeño de estudiantes en las Pruebas Saber Pro en Colombia. Incluye análisis exploratorio de datos, preprocesamiento y modelos de machine learning.

classification colombia data-analysis data-science education educational-assessment exploratory-data-analysis jupyter-notebook machine-learning python saber-pro scikit-learn student-performance

Last synced: 09 Feb 2025

https://github.com/prime-infinity/type-one

Software to visualize and analyze GitHub repos based on certain statistics such as stars, forks and issues

data-analysis data-visualization

Last synced: 24 Nov 2024

https://github.com/abhi18av/innovation-competition

Submission for a programming challenge

clojure clojurescript data-analysis

Last synced: 05 Jan 2025

https://github.com/danhenriquex/data-science-project

The main goal of this project was to apply the concepts of data visualization and analysis.

data-analysis data-science numpy pandas python

Last synced: 14 Jan 2025

https://github.com/wiseaidev/truth-guard

Analyzing a 79k Dataset of Misinformation and Fake News

data-analysis fastapi lstm machine-learning python supervised-learning

Last synced: 20 Dec 2024