Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Data analysis

Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.

https://github.com/super-lou/exstat

🌾 R package to provide an efficient and simple solution to aggregate and analyze the stationarity of time series

climate climate-change climate-data data-analysis data-visualization environment hydrology inrae low-water mann-kendall mann-kendall-tests r stationarity-test statistics time-series

Last synced: 23 Dec 2024

https://github.com/jonzeolla/lab-securitydataanalysis

An introductory lab to Security Data Analysis (using Apache Metron (incubating)).

apache-metron data-analysis lab metron security

Last synced: 18 Nov 2024

https://github.com/dcs-training/datavisualisationwithr

Data Visualisation with R Workshop (delivered by the Centre in December 2020). This workshop is focusing on visualising your data. Go to the readme file

data-analysis data-visualisation data-wrangling r

Last synced: 10 Nov 2024

https://github.com/palewire/baseball-notebooks

Python notebooks exploring Major League Baseball data

baseball baseball-statistics data-analysis jupyter-notebook pandas python

Last synced: 18 Oct 2024

https://github.com/sowinskibraeden/schedulegeneratorapp

The Desktop Application for my schedule-generator algorithm, allowing users to easily interact with the algorithm and its variables to generate schedules as documents for students individually as well as the master timetable

algorithm csv data-analysis dataclasses python-docx python-typing python311 xlsxwriter

Last synced: 20 Nov 2024

https://github.com/quantumudit/movie-ratings-analysis

This project focuses on analyzing and finding correlations between the audience and critic ratings for some of the popular movies released between 2009-2011 using Python & Power BI

data-analysis data-visualization jupyter-notebook power-bi python

Last synced: 26 Dec 2024

https://github.com/scottgriv/river-charts

🌊 📉 A Python, Django, Plotly, and Pandas web application that visualizes river data in real time, pulled using an API from the United States Geological Survey (USGS).

api charts data data-analysis data-visualization dataset django pandas plotly python usgs usgs-api visualization webapp

Last synced: 14 Dec 2024

https://github.com/jimbrig/lossrunAnalyzer

R Package and Shiny App to Analyze Insurance Lossruns

actuarial data-analysis data-mining data-science insurance r record-linkage risk-management shiny

Last synced: 04 Dec 2024

https://github.com/quantumudit/regional-sales-analysis

This project focuses on analyzing and visualizing the United States regional sales for a fictitious company in between 2018-2020 using Python & Power BI.

data-analysis data-visualization databases jupyter-notebook power-bi python sqlite

Last synced: 26 Dec 2024

https://github.com/quantumudit/consumer-goods-sales-analysis

This project focuses on analyzing and visualizing the consumer goods sales in the United States between 2015-2016 using Python & Power BI.

data-analysis data-visualization database jupyter-notebook python sqlite

Last synced: 26 Dec 2024

https://github.com/quantumudit/basketball-players-analysis

The project focuses on analyzing salaries and various other in-game metrics of top NBA basketball players from 2005-14 by performing exploratory data analysis with Python and Jupyter Notebook and by visualizing the data in an insightful dashboard made with Power BI

data-analysis jupyter-notebook power-bi python

Last synced: 26 Dec 2024

https://github.com/theengineeringworld/numpy-data-science

NumPy Data Science Essential Traing COurse. Part of Youtube Course Offered by TheEngineeringWorld.

data-analysis data-science numpy numpy-exercises numpy-library numpy-tutorial python python-3-6 python3 scipy2018

Last synced: 08 Nov 2024

https://github.com/alyssonmach/9-data-science-apps-with-python

[freeCodeCamp Course]: build interactive and data-driven web apps in Python using the Streamlit library.

data-analysis data-science data-visualization machine-learning streamlit-webapp

Last synced: 06 Nov 2024

https://github.com/maksimekin/umd_data_challange_2020

Ocean Clean up data analysis project for the UMD Data Challenge 2020. Data Exploration for a Sustainable Planet.

cleanup competition data-analysis data-science folium geolocation machine-learning ocean planet pollution sklearn sustainability time-series trash umd

Last synced: 15 Dec 2024

https://github.com/yusufcinarci/web-scraping-projects

In these project files, I will host the web scraping examples that I will make day by day.

data-analysis data-science jupyter-notebook python web-scraping

Last synced: 26 Dec 2024

https://github.com/thecoderpinar/earthquake_prediction_analysis_project

🌍 Welcome to the Earthquake Prediction Analysis Project! 🚀 This project aims to predict earthquake magnitudes using LSTM neural networks and analyze seismic data. Explore, analyze, and forecast earthquakes with ease! 📈🔮

analysis data-analysis data-science earthquake-prediction geocoding geology lstm lstm-neural-networks machine-learning matlab matlab-deep-learning open-source time-series visualization

Last synced: 16 Dec 2024

https://github.com/richiejp/jdp

Automatically collect and normalise data, then run algorithms on it.

automation-framework data-analysis suse-qa

Last synced: 19 Nov 2024

https://github.com/franpog859/top-of-the-world

🌍🔝 Proof that your country is the top of the world using GeoTIFF images and a little bit of geometry. Data mining project

data data-analysis data-mining elevation geometry geotiff image-processing matplotlib nvector rasterio

Last synced: 12 Nov 2024

https://github.com/ptyadana/dv-data-visualization-with-python

Data analysis and Data Visualization of Countries's GDP, Life Expectancy comparison across continents, GDP per Capita Relative Growth, Population Reative Growth comparison etc using Pandas, Matplotlib.

csdojo data-analysis data-visualization datavisualization matplotlib matplotlib-pyplot numpy pandas pluralsight python python3

Last synced: 15 Nov 2024

https://github.com/negativenagesh/spam-ham_email_detection_machine_learning

This project focuses on classifying spam/ham emails, using machine learning algorithms like LGR, NB, RF, DT etc.. and based on the accuracy score and precision score I chose logistic regression for the classification. And I have used streamlit for frontend.

app data-analysis data-cleaning data-engineering data-science data-visualization data-visualizations jupyter-notebook logistic-regression machine-learning modeling naive-bayes-classifier nlp python

Last synced: 23 Dec 2024

https://github.com/ethan-wickstrom/rrrs

Welcome to RRRS, a rapid, hyper-optimized CSV random sampling tool designed with performance and efficiency at its core. Crafted meticulously in Rust, RRRS offers an unparalleled solution for extracting random data samples from CSV files swiftly and effortlessly.

analytics cli command-line command-line-tool data data-analysis data-science dataset rust rust-lang sample samples

Last synced: 19 Nov 2024

https://github.com/i10mm/gpt-arxiv-fetcher

Revolutionize your research with our GitHub repository, where GPT meets arXiv API for seamless access and analysis of the latest academic papers!

artificial data-analysis intelligence llm machine-learning

Last synced: 22 Nov 2024

https://github.com/sondosaabed/introduction-to-data-analysis-with-pandas-and-numpy

Learning the data analysis process of questioning, wrangling, exploring, analyzing, and communicating data. Working with data in Python using libraries like NumPy and pandas.

data-analysis data-analyst-nanodegree data-wrangling numpy pandas python

Last synced: 06 Nov 2024

https://github.com/anaagg/data-installations-windows

Are you thinking about using the Linux subsystem on Windows? This is your repo, along this repository you will find the necessary steps to install on your computer what you need to work with Python, SQL, miniconda on Windows using the Linux subsystem.

conda conda-environment data-analysis hyper jupyter jupyter-notebook jupyter-notebooks linux miniconda pycharm pycharm-ide python sql vscode windows workbench

Last synced: 09 Oct 2024

https://github.com/rikard-helgegren/leverage_analysis_tool

Analyst tool for portfolio construction. How can levereged certificates be used to increase returns in a portfolio while keeping the risk as low as possible. Use the tool and find out.

cpp data-analysis investment kivy-framework python3

Last synced: 14 Oct 2024

https://github.com/tzerk/gammaspec

A collection of functions to analyse gamma spectra

data-analysis gamma-ray-spectrometry r

Last synced: 08 Nov 2024

https://github.com/accurat/react-dataviz

⚛📊🚀 React components to build powerful interactive data visualizations

d3 data-analysis data-visualization react react-components

Last synced: 06 Nov 2024

https://github.com/davidzajac1/reptoro

A Data Visualization and Analytics Platform for the Reptile Industry

analytics data-analysis data-visualization plotly-dash python

Last synced: 11 Oct 2024

https://github.com/orkunaktas/sofascore-webscraping

⚽️I scraped the shot data of the Fenerbahçe - Adana Demirspor match from Sofascore⚽️

beautifulsoup data-analysis football-analytics football-data selenium webscraping

Last synced: 11 Oct 2024

https://github.com/negativenagesh/whatsapp_chat_analyzer

This is an end to end project of whatsapp chat analysis, here I have used my hostel whatsapp group's chat data.

data-analysis data-science frontend modeling python streamlit

Last synced: 23 Dec 2024

https://github.com/irfanchahyadi/ml-notes

Complete personal notes for performing Data Analysis, Preprocessing, and Training ML model.

data-analysis machine-learning plotting python

Last synced: 07 Nov 2024

https://github.com/globeandmail/startr-cli

A command-line scaffolder for the startr R project template

data-analysis data-journalism data-visualization journalism r

Last synced: 26 Dec 2024

https://github.com/waveform80/structa

A small utility for analyzing data structures (e.g. JSON files)

csv data-analysis data-visualization datajournalism datawrangling json yaml

Last synced: 11 Oct 2024

https://github.com/moataz-elmesmary/data-analysis-using-excel-and-power-bi-sessions

Excel & Power BI Sessions from basics to advanced for Data Analysis

data-analysis dax excel powerbi powerpivot powerquery

Last synced: 19 Nov 2024

https://github.com/dermatologist/goscar-export

:fire: CSV to FHIR (For OSCAR EMR EForm Export)

data-analysis data-warehouse fhir fhir-r4 fhir-server hacktoberfest oscar-emr

Last synced: 27 Oct 2024

https://github.com/femtotrader/dukascopyticksreader.jl

A Julia library to download tick data from Dukascopy https://www.dukascopy.com/swiss/english/marketwatch/historical/

data data-analysis dataset dukascopy html julia stock-data

Last synced: 12 Oct 2024

https://github.com/srinivasrm/mutual-funds-analysis-and-prediction

In this project I have performed analysis and prediction on 1,3,and 5 year returns on 1064 mutual funds in India. I have scraped data from a website which is the most visited website for mutual fund investments.I have tested regression models linear model,SGD Regressor , Random Forest Regressor,Decision Tree Regressor,Ridge,MLP Regressor and linear model (Lasso).After which I have selected the best perorming model and performed Hyper parameter tuning and then deployed an interactive application which can generate the visualization and send an email with the visualization to the users email address.

beautifulsoup data-analysis data-base data-cleaning data-science deployment etl finanace frontend funds machine-learning mutual mutual-funds pgsql python scikit-learn sql streamlit web webapplication

Last synced: 11 Oct 2024

https://github.com/virajbhutada/spotify-track-analysis-and-recommendation

Experience a comprehensive exploration of Spotify's musical landscape seamlessly transitioned from Tableau visualizations to SQL analysis. Dive into track inventory, streaming metrics, and sonic trends via interactive dashboards, while leveraging SQL queries for deeper insights into KPIs and cross-platform rankings.

audio-analysis data-analysis data-analytics data-science data-visualization eda machine-learning-library ml-models mysql recommendation-system spotify spotify-data spotify-dataset sql-database sql-server streaming-metrics tableau tableau-public trends-analysis

Last synced: 11 Nov 2024

https://github.com/mjunaidca/sql-auditor-pro-gpt

Get quick insights from your Data Sources in tables, charts and graphs with with AI-driven audits and optimize your SQL databases.

ai ai-agent auditing-data cloudflared compound-ai-systems custom-gpt data-analysis fastapi-template gpt sql sqlmodel

Last synced: 03 Nov 2024

https://github.com/ogoodness/vbreaker-js

CSC 483 Project - Ciphers: Caeser, Multiplicitive, Affine, Vigenere, Hill, Columnar Transposition

affine-cipher caesar-cipher columnar-transposition-cipher cryptography data-analysis decoder decryption encoder encryption hill-cipher parsing vigenere-cipher

Last synced: 14 Nov 2024

https://github.com/msyriac/alhazen

Tools for gravitational lensing: now deprecated by pixell+orphics+symlens. Leaving it here in case anyone needs to reproduce old stuff.

analysis cmb data-analysis gravitational-lensing lensing

Last synced: 27 Oct 2024

https://github.com/pmsipilot/intercom2dw

Intercom2DW is an attempt at loading all the data available in an Intercom application.

data-analysis docker intercom

Last synced: 14 Nov 2024

https://github.com/lucs1590/strava-analysis

🏃📊 Using strava to do personal analyses and to practice data scientist skills.

data-analysis data-science github jupyter-notebook python3 strava strava-api

Last synced: 17 Dec 2024

https://github.com/mszell/fyp2021

INSTRUCTOR's VERSION: Lectures of First Year Project 2021, Project 1

data-analysis data-science education python teaching-materials

Last synced: 12 Nov 2024

https://github.com/forhadulislam/sna-project

A project for Social Network Analysis. Analyzed yahoo query logs

data-analysis data-mining social-network

Last synced: 12 Nov 2024

https://github.com/juicedata/juicefs-deeplearning-tutorials

Deep Learning and Data Analytics Techniques with the help of JuiceFS.

data-analysis deep-learning filesystem juicefs machine-learning

Last synced: 14 Nov 2024

https://github.com/agungbudiwirawan/e-commerce_analysis_using_sql

The objective of this project is to provide an analysis of Olist Store (Brazilian E-commerce) sales.

data-analysis data-science e-commerce e-commerce-project microsoft-sql-server sql sql-server

Last synced: 02 Dec 2024

https://github.com/ahammadmejbah/ibm-project-03-analyzing-spreadsheet-data-with-python

Spreadsheets are computer programs that allow users to enter, view, and change information in a gridlike format. One of the most useful programs on a PC is a spreadsheet, which allows users to organize data in tabular form. A spreadsheet's primary purpose is to store numerical data and sometimes a few words of text.

data-analysis data-analysis-python data-analyst data-science excel python spreadsheet

Last synced: 11 Nov 2024

https://github.com/alanmenchaca/ml-books-compilation

Repository of books that I have been collecting about machine learning, deep learning, data science, and more ...

data-analysis data-science deep-learning machine-learning statistics

Last synced: 17 Dec 2024

https://github.com/pnnl/archive_walker

Archive Walker Software to read and examine PMU data to detect events and conditions for further analysis.

data-analysis pmu synchrophasor

Last synced: 25 Nov 2024

https://github.com/jesussantana/ibm-data-analysis-with-python-da0101en

This course will take you from the basics of Python to exploring many different types of data.

anova correlation data-analysis model-evaluation numpy pandas prepare-data python regression-models statistics

Last synced: 12 Nov 2024

https://github.com/lebrancconvas/personal-saved-datasets

Saved Datasets for doing stuff in the area of Statistics & Data Science.

data-analysis data-science database datasets excel kaggle kaggle-dataset microsoft-excel sample-dataset

Last synced: 11 Nov 2024

https://github.com/elkronos/anovatoolbox

This GitHub repository contains a collection of functions for performing various statistical analyses and generating visualizations. The functions are designed to work with different types of data and provide comprehensive outputs for data analysis.

anova anova-model data-analysis r statistics

Last synced: 23 Nov 2024

https://github.com/arnavk-09/world-population

🌏 Taipy demo to explore world population data...

data-analysis python taipy world-population

Last synced: 13 Dec 2024

https://github.com/ktmud/github-life

A data explorer for GitHub projects' life cycles

data-analysis github scraper time-series

Last synced: 22 Nov 2024

https://github.com/easonlai/chat_with_csv_streamlit_with_chart

In this repository, you will find an example code for creating an interactive chat experience that allows you to ask questions about your CSV data with chart visualization capabilities.

azure-openai azure-openai-api azure-openai-service chart-visualization data-analysis data-visualization gpt gpt-3 langchain langchain-python matplotlib openai python streamlit streamlit-webapp

Last synced: 10 Nov 2024

https://github.com/petulla/readroper

Read single and multi-card ASCII polling datasets in R

ascii data-analysis polling-data r

Last synced: 13 Dec 2024

https://github.com/agungbudiwirawan/socioeconomic_analysis

The objective of this project is to analyze the socio-economic in Chicago.

chicago-crime crime-data data-analysis data-science microsoft-sql-server project sql sql-project sql-server

Last synced: 02 Dec 2024

https://github.com/cosmoduende/r-ufo-sightings

Are we alone in the universe? - Data Analysis and Data Visualization of UFO sightings with R. How to analyze and visualize data of UFO sightings of the last century in the USA and the rest of the world with R language.

data-analysis data-analytics data-science data-visualisation data-visualization data-visualizations dataviz ovni ovni-dataset r-code r-language r-programming r-stats ufo ufo-analysis ufo-dataset ufo-sighting ufo-sightings

Last synced: 07 Nov 2024

https://github.com/darsan-in/rumour-monger-spotter

Rumour Monger Spotter is a prototype developed during a national-level cyber hackathon to identify false information on Twitter. Using the Google Fact Check API and a Multinomial Naive Bayes classifier, the tool analyzes tweet content to assess the likelihood of misinformation. Despite a development window of less than 24 hours, the project won a t

ai data-analysis fact-checking hackathon india naive-bayes national-competition natural-language-processing prototype real-time-analysis social-media text-classification tweet-content twitter

Last synced: 12 Dec 2024

https://github.com/danielpuentee/outdpik

The fundamental toolkit for outliers search and visualization. It aims to be the fundamental high-level package for this purpose.

data-analysis matplotlib numpy python

Last synced: 17 Dec 2024

https://github.com/fatihilhan42/data-science-projects

In this repo, there are (beginner-upper) level projects in the field of data science. I will host these projects that I have done in this field every day in this repo. With the hope that it will be useful to those who are interested in the field of data science like me and will just start...

data-analysis data-engineering data-mining data-science data-structures data-visualization database datascience fatihilhan fortytwo fortytwofficial jupyter-notebook python

Last synced: 01 Dec 2024

https://github.com/dcs-training/summerschool2024-stream2

Welcome to the repository for all the materials for Stream 2 of the CDCS 2024 summer school. Go to the readme file

data-analysis data-visualisation data-wrangling r sentiment-analysis statistics text-analysis web-scraping

Last synced: 10 Nov 2024

https://github.com/mykhode/python-sic-mini-project

SAMSUNG SIC Finish Project Course - Python

data-analysis python-analysis samsung-sic

Last synced: 12 Nov 2024

https://github.com/andreantonacci/eu2019

Social Network Analysis of Twitter Topic-Network Structures during the 2019 European Elections

data-analysis election-analysis european-elections gephi latex master-thesis social-network-analysis

Last synced: 24 Nov 2024

https://github.com/dcs-training/digital-method-of-the-month

In this repository you are going to find the documents we produced to support the discussion in our Digital Methods of the Month. These documents will help you orienting yourself if you want to pickup the method in your research. Go to the readme file

3d-data data-analysis data-visualisation data-wrangling geographical-data gis good-practices-digital-research machine-learning network-analysis open-research preregistration statistics text-analysis

Last synced: 10 Nov 2024

https://github.com/super-lou/aeag_toolbox

🛠️ R toolbox to provide a simple way of interacting with all the code necessary to carry out hydrological stationnarity analysis for the Agence de l'Eau Adour-Garonne (AEAG)

climate climate-change climate-data data-analysis data-visualization environment hydrology inrae low-water mann-kendall mann-kendall-tests r stationarity-test statistics

Last synced: 23 Dec 2024

https://github.com/trainingbypackt/splunk-7-essentials-elearning

Build an elaborate Splunk enterprise environment that will extract powerful insights from your machine-generated big data

data-analysis eventgen indexing machine-learning splunk sub-search visualization

Last synced: 14 Nov 2024

https://github.com/ihabbendidi/diamond-analysis

Exploratory statistical analysis of a Diamond dataset

data-analysis data-visualization exploratory-data-analysis machine-learning r

Last synced: 16 Dec 2024

https://github.com/ouhscbbmc/data-science-practices-1

Collection of publicly available practices of data science and analysis

bbmc-investigation data-analysis data-science python r sql

Last synced: 18 Nov 2024

https://github.com/johnsell620/sentiment-analysis-goodreads-reviews

Document-level sentiment analysis of book reviews scraped from the Goodreads website. Technologies used include TensorFlow, Spark, HDFS, Sqoop, Scrapy, and D3.js.

data-analysis data-visualization recurrent-neural-networks web-scraping

Last synced: 29 Nov 2024

https://github.com/nghorbani/neuraldataanalysis

Download Data from https://bit.ly/3g8RUmi

data-analysis matlab spike-detection spike-rate

Last synced: 09 Nov 2024