Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/ktmud/github-life

A data explorer for GitHub projects' life cycles

data-analysis github scraper time-series

Last synced: 23 Jan 2025

https://github.com/nmicht/food-language-variety

A map to display distribution of food names using their synonyms in different regions.

data-analysis language-distribution language-variety map

Last synced: 13 Feb 2025

https://github.com/agungbudiwirawan/e-commerce_analysis_using_sql

The objective of this project is to provide an analysis of Olist Store (Brazilian E-commerce) sales.

data-analysis data-science e-commerce e-commerce-project microsoft-sql-server sql sql-server

Last synced: 30 Jan 2025

https://github.com/gher-uliege/seadatacloud

Tools and interfaces to work with DIVA interpolation software tool.

data-analysis data-visualization interpolation nco netcdf ocean-sciences oceanography

Last synced: 05 Feb 2025

https://github.com/ascender1729/iris-flower-classification-2024

An exploratory data analysis and machine learning project using the Iris dataset to classify flower species with a K-Nearest Neighbors classifier. It includes data visualization, feature scaling, model training, and evaluation with 100% accuracy on the test set.

classification data-analysis iris-dataset k-nearest-neighbors machine-learning matplotlib pandas python scikit-learn seaborn

Last synced: 06 Feb 2025

https://github.com/dmytrovoytko/data-engineering-amazon-reviews

Data Engineering project for ZoomCamp`24: JSONL -> PostgreSQL/BigQuery + Metabase + Mage.AI

bash-script bigquery codespaces data-analysis data-visualization etl metabase pipeline python-script

Last synced: 14 Nov 2024

https://github.com/nasdin/drone-strike-visualization

Connecting to Drone Strike API and performing an analysis on frequency of drone strikes, trend and patterns

analysis api basemap data-analysis data-science data-visualization drone drone-strikes eda geopy jupyter-notebook visualization

Last synced: 19 Feb 2025

https://github.com/kwokhing/exploratory-data-analysis-on-smrt-tweets

Demo on performing exploratory data analysis (EDA) on train service disruptions based on scrapped (user generated contents) tweets from the train operator's (SMRT) twitter account

data-analysis data-cleaning data-collection data-preparation exploratory-data-analysis exploratory-data-visualizations folium geospatial-data leaflet-map python python3 regex scraping selenium selenium-python social-media text-processing user-generated-content web-scraping webscraping

Last synced: 02 Dec 2024

https://github.com/agungbudiwirawan/socioeconomic_analysis

The objective of this project is to analyze the socio-economic in Chicago.

chicago-crime crime-data data-analysis data-science microsoft-sql-server project sql sql-project sql-server

Last synced: 02 Dec 2024

https://github.com/zmyzheng/signature-authentication-pen

Signature Authentication Pen, a cloud based IoT project which realizes identity authentication by exploiting the signature biometric features of the users. Details:

android aws data-analysis identity-authentication iot neural-network signature-authentication-pen

Last synced: 05 Feb 2025

https://github.com/mohammadkarbalaee/python-for-data-analysis-book

All the practice and code that I am doing while I read the book called, Python for data analysis

data-analysis data-science python

Last synced: 01 Feb 2025

https://github.com/arnavk-09/world-population

🌏 Taipy demo to explore world population data...

data-analysis python taipy world-population

Last synced: 06 Feb 2025

https://github.com/ihabbendidi/diamond-analysis

Exploratory statistical analysis of a Diamond dataset

data-analysis data-visualization exploratory-data-analysis machine-learning r

Last synced: 09 Feb 2025

https://github.com/asifdotexe/sentimentscoringmodel

This project focuses on performing sentiment analysis on Amazon reviews using natural language processing (NLP) techniques. It includes various steps, from data exploration and preprocessing to building and evaluating sentiment models.

data-analysis data-visualization natural-language-processing sentiment-analysis

Last synced: 15 Nov 2024

https://github.com/vidhi1290/robust-yield-prediction-

"Predicting a Greener Future 🌾📊 Delve into the world of agriculture and data science with our Yield Prediction project. We harness machine learning and weather data to forecast crop yields accurately. Join us in cultivating smarter farming practices for a sustainable tomorrow."

artificial-intelligence data-analysis data-cleaning-and-preprocessing data-science data-visualization dataexploration devops docker machine-learning machine-learning-algorithms matplotlib matplotlib-pyplot pandas python scikit-learn scikitlearn-machine-learning streamlit yield-prediction-for-food-processing

Last synced: 02 Feb 2025

https://github.com/darsan-in/rumour-monger-spotter

Rumour Monger Spotter is a prototype developed during a national-level cyber hackathon to identify false information on Twitter. Using the Google Fact Check API and a Multinomial Naive Bayes classifier, the tool analyzes tweet content to assess the likelihood of misinformation. Despite a development window of less than 24 hours, the project won a t

ai data-analysis fact-checking hackathon india naive-bayes national-competition natural-language-processing prototype real-time-analysis social-media text-classification tweet-content twitter

Last synced: 12 Dec 2024

https://github.com/mykhode/python-sic-mini-project

SAMSUNG SIC Finish Project Course - Python

data-analysis python-analysis samsung-sic

Last synced: 11 Jan 2025

https://github.com/storopoli/r_scripts

Couple of handy R Scripts that I use in a daily basis for Scientific Research

data-analysis data-science data-visualization r scientific

Last synced: 20 Nov 2024

https://github.com/cheminfo/compass

Strategy for improved characterisation of human metabolic phenotypes using a COmbined Multiblock Principal components Analysis with Statistical Spectroscopy (COMPASS)

data-analysis metabolomics metabonomics multiblock nmr-spectroscopy pca population-analysis population-model

Last synced: 28 Jan 2025

https://github.com/thecoderpinar/credit-card-fraud-detection-project

This project focuses on the detection of credit card fraud using various data science and machine learning techniques. The dataset includes a record of credit card transactions over a specific period, with the goal of accurately identifying fraudulent activities. 🚀✨

anamoly-detection classification-algorithms credit-card-transactions data-analysis data-preprocessing data-science data-visualization fraud-detection machine-learning python

Last synced: 09 Feb 2025

https://github.com/DataHerb/dataherb-python

Python Package for DataHerb: create, search, and load datasets.

data data-analysis data-mining database dataset python

Last synced: 15 Nov 2024

https://github.com/andreip/twitter-authorities

Find authorities for Twitter topics. [Licenta][Undergraduate thesis]

data-analysis mongodb python twitter-topics

Last synced: 12 Feb 2025

https://github.com/yashksaini-coder/meriskill

Data analytics extracts insights from raw data through statistical analysis and visualization. It empowers informed decision-making, guiding strategies and addressing challenges with a data-driven approach.

analytics data data-analysis data-science internship-task

Last synced: 20 Feb 2025

https://github.com/elkronos/anovatoolbox

This GitHub repository contains a collection of functions for performing various statistical analyses and generating visualizations. The functions are designed to work with different types of data and provide comprehensive outputs for data analysis.

anova anova-model data-analysis r statistics

Last synced: 24 Jan 2025

https://github.com/juangesino/behaviouraleconomics

All the files and data for the experiment performed during the course Behavioural Economics @ University of Amsterdam

behavioral-economics behavioural-economics data-analysis economics game-theory statistics

Last synced: 23 Jan 2025

https://github.com/michaelnabil230/laravel-analytics

A Laravel package to retrieve pageviews and other data from Database

data-analysis data-structures database laravel php

Last synced: 15 Nov 2024

https://github.com/benjamindpb/wikidata-preprocessing

Wikidata dump preprocessing & analysis of georreferencial entities

data-analysis preprocessing wikidata wikidata-dump

Last synced: 19 Jan 2025

https://github.com/sondosaabed/advanced-data-wrangling

In this advanced course, Learning the three phases of data wrangling: gathering, assessing, and cleaning data.

data-analysis data-analyst-nanodegree data-wrangling numpy pandas python

Last synced: 06 Nov 2024

https://github.com/lebrancconvas/personal-saved-datasets

Saved Datasets for doing stuff in the area of Statistics & Data Science.

data-analysis data-science database datasets excel kaggle kaggle-dataset microsoft-excel sample-dataset

Last synced: 08 Jan 2025

https://github.com/garrettj403/albertaenergysources

Get grid data from Alberta Electric System Operator (AESO)

alberta canada data-analysis energy-data

Last synced: 27 Jan 2025

https://github.com/jesussantana/ibm-data-analysis-with-python-da0101en

This course will take you from the basics of Python to exploring many different types of data.

anova correlation data-analysis model-evaluation numpy pandas prepare-data python regression-models statistics

Last synced: 12 Nov 2024

https://github.com/crafterkolyan/applied-statistical-data-analysis

Курс прикладного статистического анализа данных. ВМК МГУ. Весна 2020

autoexec-scripts autotest data-analysis github-actions statistics university

Last synced: 11 Feb 2025

https://github.com/rafaelbroseghini/data-analysis-visualizations-ml

:bar_chart: NLP, Regression Models, Story telling, Time Series Analysis ... in Python

data-analysis data-science machine-learning machine-learning-algorithms time-series visualization

Last synced: 04 Feb 2025

https://github.com/erictleung/microbiome-analysis-resources

:microscope: Resources and notes on studying the human microbiome

bioinformatics data-analysis gut-microbiome microbiology microbiome notes resources

Last synced: 09 Feb 2025

https://github.com/autuanliu/rlang

科学统计实验与研究(数据方面)

data-analysis r

Last synced: 22 Jan 2025

https://github.com/sondosaabed/data-visualization-with-matplotlib-and-seaborn

Learning to apply sound design and data visualization principles to the data analysis process. Also learning how to use analysis and visualizations to tell a story with data.

data-analysis data-analyst-nanodegree data-visualization matplotlib python seaborn seaborn-plots

Last synced: 06 Nov 2024

https://github.com/manikantasanjay/loan_repayment_regression_project

Prediction Of Loan Repayment using Sequential Neural Networks on Lending Club Dataset.

data-analysis data-visualization lending-club loan-repayment matplotlib numpy pandas-library seaborn tensorboard tensorflow

Last synced: 05 Nov 2024

https://github.com/mggg/gerrytools

Tools for evaluating districting plans.

data-analysis tooling

Last synced: 06 Dec 2024

https://github.com/virajbhutada/power-bi-resources

Comprehensive Power BI resources covering interview questions, group training materials, project portfolio ideas, and a cheat sheet for quick reference. Elevate your Power BI skills with curated content designed to enhance your proficiency and boost project success.

data-analysis data-modeling data-visualization dax-expression interview-preparation portfolio-projects powerbi quick-reference visualization-techniques

Last synced: 10 Jan 2025

https://github.com/rickyxume/data_statistic_analysis

2021数据统计与分析大赛全国一等奖方案

data-analysis data-mining data-science

Last synced: 05 Jan 2025

https://github.com/ahammadmejbah/ibm-project-03-analyzing-spreadsheet-data-with-python

Spreadsheets are computer programs that allow users to enter, view, and change information in a gridlike format. One of the most useful programs on a PC is a spreadsheet, which allows users to organize data in tabular form. A spreadsheet's primary purpose is to store numerical data and sometimes a few words of text.

data-analysis data-analysis-python data-analyst data-science excel python spreadsheet

Last synced: 11 Nov 2024

https://github.com/liweitianux/chandra-acis-analysis

Chandra ACIS analysis tools and documents

acis chandra data-analysis

Last synced: 06 Jan 2025

https://github.com/joamag/pandas

Loads of pandas data from China with awesome data

data data-analysis jupyter notebook pandas

Last synced: 16 Jan 2025

https://github.com/mszell/fyp2021

INSTRUCTOR's VERSION: Lectures of First Year Project 2021, Project 1

data-analysis data-science education python teaching-materials

Last synced: 12 Nov 2024

https://github.com/dataket/dataket

Propuesta de proyecto para el Datatón Anticorrupción 2021. Equipo Dataket

corruption data-analysis dataviz open-government

Last synced: 18 Jan 2025

https://github.com/lucasbotang/real_estate_management_data_analysis

Data analysis for real estate management

data-analysis excel mysql tableau

Last synced: 25 Jan 2025

https://github.com/forhadulislam/sna-project

A project for Social Network Analysis. Analyzed yahoo query logs

data-analysis data-mining social-network

Last synced: 11 Jan 2025

https://github.com/felipeheide/technicalcryptobot

Python script to analyze and trade Bitcoin (BTC) based on technical indicators like RSI, MACD, MMS, and support/resistance levels. Fetches prices from CryptoCompare and CoinGecko APIs. Allows investing a specified amount and displays potential profit/loss. Runs continuously, updating every minute.

api bitcoin coingecko cryptocompare cryptocurrency data-analysis python trading-bot

Last synced: 10 Jan 2025

https://github.com/bts-cm/airdrop_tool

Fetch & analyse blockchain tickets. View leaderboards and user tickets. Calculate and perform provably fair airdrops.

airdrop bitshares bitsharesjs blockchain crypto data-analysis data-science electron etl javascript nodejs react ticket tusc

Last synced: 15 Nov 2024

https://github.com/valeriopagliarino/tcf-2021-unito-public

Exam project of the course "Computing Tecniques for Physics" - Università degli Studi di Torino - Physics department - 2021

cern-root data-analysis geant4-simulation monte-carlo-simulation object-oriented-programming physics

Last synced: 01 Feb 2025

https://github.com/quantumudit/analyzing-cleanaway-services

This project focuses on scraping all the service locations across Australia and their associated attributes from "Cleanaway" website; performing necessary transformations on the scraped data and then analyzing & visualizing it using Jupyter Notebook and Power BI.

data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping

Last synced: 06 Nov 2024

https://github.com/martinboller/cc-build

Builds latest version of CyberChef and install it with NGINX on another system. CyberChef is a simple, intuitive web app for analyzing and decoding data without having to deal with complex tools or programming languages.

analysis blueteam compression cyberchef data-analysis data-manipulation decode encode encryption hashing parsing virtual-machine

Last synced: 16 Nov 2024

https://github.com/gxjansen/user-analysis-with-r-google-analytics

Analyzing user behavior of an E-commerce website with R and (mainly) Google Analytics Data

analytics analytics-api conversion-rate-optimization data-analysis ecommerce google google-analytics r

Last synced: 30 Oct 2024

https://github.com/AnonCatalyst/WebHound

WebHound is your Python-powered command-line assistant for sharp and efficient web searches! It sniffs out data from major search engines and detects social platforms, helping you uncover valuable insights and stay ahead of the game. 🌐🔍✨

awareness data-analysis data-visualization information-retrieval osint osint-python osint-reconnaissance osint-resources osint-tool osint-tools osinttools web-osint webscraping

Last synced: 04 Dec 2024

https://github.com/roaldarbol/anibehavr

🪲 An R package for Analysis of Animal Behaviour

animal-behavior behavioural-states data-analysis r

Last synced: 28 Oct 2024

https://github.com/iondv/metrics

IONDV. Framework application: Metrics is to collect and show the metrics data.

collecting data data-analysis iondv iondv-app metrics

Last synced: 08 Jan 2025

https://github.com/johnsell620/sentiment-analysis-goodreads-reviews

Document-level sentiment analysis of book reviews scraped from the Goodreads website. Technologies used include TensorFlow, Spark, HDFS, Sqoop, Scrapy, and D3.js.

data-analysis data-visualization recurrent-neural-networks web-scraping

Last synced: 29 Nov 2024

https://github.com/nragland37/event-optimization-tool

R-based Shiny application that maps availability and identifies optimal engagement times to enhance participation within an organization

data-analysis data-cleaning data-preparation heatmap r shiny shiny-app tidyverse

Last synced: 16 Nov 2024

https://github.com/pmsipilot/intercom2dw

Intercom2DW is an attempt at loading all the data available in an Intercom application.

data-analysis docker intercom

Last synced: 14 Jan 2025

https://github.com/gcappon/py_agata

Official Python porting of AGATA (Automated Glucose dATa Analysis) a toolbox to analyse glucose data.

continuous-glucose-monitoring data-analysis hacktoberfest python toolbox

Last synced: 12 Nov 2024

https://github.com/pottekkat/matplotlib-basics

A jupyter notebook that walks you through the basics of matplotlib for data science applications

data-analysis data-visualization machine-learning matplotlib pyplot python

Last synced: 07 Jan 2025

https://github.com/EmirhanServeren/NFT-CollaBot

NFT CollaBot is a data-oriented project designed by the requirements of NFT ecosystem and aims to strengthen community.

data data-analysis data-analytics nft streamlit streamlit-webapp tezos tezos-api tezos-blockchain tezoswallet

Last synced: 08 Nov 2024

https://github.com/wtbates99/stock-indicators

A comprehensive self-hosted stock analysis platform combining FastAPI and React. Features interactive stock visualizations, real-time data aggregation, and customizable technical indicators with a modern, grid-based interface.

backend data-analysis data-mining data-science fastapi financial-analysis frontend investment python reactjs sqlite3 statstistics stock-market stock-price-prediction time-series

Last synced: 07 Jan 2025

https://github.com/lucs1590/strava-analysis

🏃📊 Using strava to do personal analyses and to practice data scientist skills.

data-analysis data-science github jupyter-notebook python3 strava strava-api

Last synced: 10 Feb 2025

https://github.com/omarelgabry/insights.py

A Python package for reading, storing, & analyzing data from Public Data APIs

data-analysis

Last synced: 22 Nov 2024

https://github.com/alhankeser/citibike-analysis

Extracting and Transforming Citi Bike Data for Analysis

citibike data-analysis data-science data-visualization etl sql

Last synced: 17 Oct 2024

https://github.com/brews/riverpca

Companion repository to Malevich S.B., and C.A. Woodhouse (2017), Pacific SSTs, mid-latitude atmospheric circulation, and widespread interannual anomalies in Western US streamflow, Geophys. Res. Lett., 44, doi:10.1002/2017GL073536.

analysis data-analysis paper pca python river streamflow visualization

Last synced: 03 Feb 2025

https://github.com/grand-27-master/data-science-course

One-stop repo for learning data science along with roadmap!

data-analysis data-science machine-learning python statistics

Last synced: 06 Jan 2025

https://github.com/bishopce16/stroke_prediction_analysis

The purpose of this project is to derive insight on characteristics and statistics regarding the dataset to see which factors influence whether or not a patient has had a stroke.

data-analysis data-visualization machine-learning nump pandas plpgsql python seaborn tableau

Last synced: 11 Nov 2024

https://github.com/dcs-training/summerschool2024-stream2

Welcome to the repository for all the materials for Stream 2 of the CDCS 2024 summer school. Go to the readme file

data-analysis data-visualisation data-wrangling r sentiment-analysis statistics text-analysis web-scraping

Last synced: 10 Nov 2024

https://github.com/msyriac/alhazen

Tools for gravitational lensing: now deprecated by pixell+orphics+symlens. Leaving it here in case anyone needs to reproduce old stuff.

analysis cmb data-analysis gravitational-lensing lensing

Last synced: 27 Oct 2024

https://github.com/petulla/readroper

Read single and multi-card ASCII polling datasets in R

ascii data-analysis polling-data r

Last synced: 06 Feb 2025