Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2025-02-10 00:07:12 UTC
https://github.com/slgobinath/wisdom
An adaptive and self-boosting stream processor
cep complex-event-processing data-analysis distributed self-tuning stream-processing wisdom
Last synced: 15 Oct 2024
https://github.com/lisa-ho/three-investigators
Repository for scraping and analysing fan data on a German audio drama called 'Die Drei Fragezeichen' (The Three Investigators).
data-analysis data-viz datawrapper python webscraping
Last synced: 10 Feb 2025
https://github.com/ynikitenko/lena
Lena is an architectural framework for data analysis
analysis-framework analysis-pipeline data-analysis data-science
Last synced: 11 Nov 2024
https://github.com/ttiagojm/ground-truth-vs-prediction
Machine and Deep Learning notebooks
data-analysis data-science deep-learning kaggle machine-learning opencv tensorflow
Last synced: 10 Nov 2024
https://github.com/hevalhazalkurt/exploring_the_data_of_lego_history
A data exploration project on LEGO history in Python with pandas, matplotlib etc. (WIP)
data data-analysis data-science data-visualization datascience datasets lego lego-history matplotlib pandas python python3
Last synced: 21 Jan 2025
https://github.com/palewire/baseball-notebooks
Python notebooks exploring Major League Baseball data
baseball baseball-statistics data-analysis jupyter-notebook pandas python
Last synced: 18 Oct 2024
https://github.com/aby-ss/automate
Automate: AI-powered automation software revolutionizing businesses. Streamline operations, boost productivity, and drive growth with our intuitive, user-friendly solutions.
automation b2b b2b-connect business-growth business-intelligence business-service business-solutions data-analysis python python-app saas services services-as-a-software task-automation
Last synced: 13 Jan 2025
https://github.com/tirendazacademy/pandasai-tutorials
Tutorials for PandasAI
ai data-analysis data-science data-visualization llms openai pandas pandasai python
Last synced: 08 Nov 2024
https://github.com/scottgriv/river-charts
🌊 📉 A Python, Django, Plotly, and Pandas web application that visualizes river data in real time, pulled using an API from the United States Geological Survey (USGS).
api charts data data-analysis data-visualization dataset django pandas plotly python usgs usgs-api visualization webapp
Last synced: 14 Dec 2024
https://github.com/woctezuma/regression
Gaussian Process Regression vs. Relevance Vector Machine.
data-analysis data-science gaussian-process-regression machine-learning python regression relevance-vector-machine statistics
Last synced: 06 Dec 2024
https://github.com/femtotrader/dukascopyticksreader.jl
A Julia library to download tick data from Dukascopy https://www.dukascopy.com/swiss/english/marketwatch/historical/
data data-analysis dataset dukascopy html julia stock-data
Last synced: 12 Oct 2024
https://github.com/alyssonmach/9-data-science-apps-with-python
[freeCodeCamp Course]: build interactive and data-driven web apps in Python using the Streamlit library.
data-analysis data-science data-visualization machine-learning streamlit-webapp
Last synced: 06 Nov 2024
https://github.com/misaghmomenib/data-analysis-projects
A Repository Featuring a Collection of Data Analysis Projects, Showcasing Various Techniques and Tools for Extracting Insights From Data. Explore, Learn, and Utilize These Projects to Enhance Your Data Analysis Skills and Workflows.
data-analysis data-analysis-python data-visualization jupyter-notebook python
Last synced: 21 Jan 2025
https://github.com/ojasphansekar/Data-Warehouse-and-Business-Intelligence
SSIS, Talend, Tableau, Power BI, PostgreSQL, MySQL, Oracle, SQL Server, Toad Data Modeler
azure-sql-database business-intelligence data-analysis data-conversion data-ingestion data-integration data-mapping data-visualization excel mysql-database oracle-18c postgresql-database power-bi sql-server-database sql-server-management-studio ssis-packages tableau-desktop talend-dataintegration toad-modeler
Last synced: 27 Nov 2024
https://github.com/negativenagesh/whatsapp_chat_analyzer
An end-to-end WhatsApp chat analysis project, using chat data from my hostel's WhatsApp group.
data-analysis data-science frontend modeling python streamlit
Last synced: 23 Dec 2024
https://github.com/shervinnd/bazar_app_store_eda
Bazar App Data analysis code to find the most downloaded category and most popular installed apps
data data-analysis data-science dataanalysis eda python
Last synced: 18 Jan 2025
https://github.com/psyteachr/quant-fun-v2
Fundamentals of Quantitative Analysis
data-analysis data-visualization datawrangling r statistics
Last synced: 29 Jan 2025
https://github.com/yusufcinarci/web-scraping-projects
In these project files, I will host the web scraping examples that I will make day by day.
data-analysis data-science jupyter-notebook python web-scraping
Last synced: 26 Dec 2024
https://github.com/ahammadmejbah/different-types-of-data-splitting-methods
In order to prevent overfitting and guarantee that our model can generalize to new data, data splitting is essential in machine learning.
data data-analysis data-engineering data-mining data-science data-visualization
Last synced: 09 Jan 2025
https://github.com/quantumudit/movie-ratings-analysis
This project focuses on analyzing and finding correlations between the audience and critic ratings for some of the popular movies released between 2009-2011 using Python & Power BI
data-analysis data-visualization jupyter-notebook power-bi python
Last synced: 26 Dec 2024
https://github.com/astrodynamic/retailanalitycs-in-postgresql
Develop a SQL script to create a database with tables, views, roles, and functions. Form personalized offers to increase average check, frequency of visits, and cross-selling.
bd csv data-analysis data-export data-input data-manipulation data-validation database-management functions git margin offers postgresql retail role-permission-management selling sql transaction tsv views
Last synced: 12 Jan 2025
https://github.com/quantumudit/basketball-players-analysis
The project focuses on analyzing salaries and various other in-game metrics of top NBA basketball players from 2005-14 by performing exploratory data analysis with Python and Jupyter Notebook and by visualizing the data in an insightful dashboard made with Power BI
data-analysis jupyter-notebook power-bi python
Last synced: 26 Dec 2024
https://github.com/pythondeveloper6/udemy-courses-full-eda
simple EDA for Udemy courses
data-analysis eda matplotlib numpy pandas python seaborn
Last synced: 19 Jan 2025
https://github.com/sonigarima/donation-management-system
A donation management system for NGOs and Donors. The project is designed for Cognizance IITR 2021 - Salesforce Codathon.
data-analysis donation-management reactjs
Last synced: 02 Jan 2025
https://github.com/quantumudit/regional-sales-analysis
This project focuses on analyzing and visualizing United States regional sales for a fictitious company between 2018 and 2020 using Python & Power BI.
data-analysis data-visualization databases jupyter-notebook power-bi python sqlite
Last synced: 26 Dec 2024
https://github.com/quantumudit/consumer-goods-sales-analysis
This project focuses on analyzing and visualizing the consumer goods sales in the United States between 2015-2016 using Python & Power BI.
data-analysis data-visualization database jupyter-notebook python sqlite
Last synced: 26 Dec 2024
https://github.com/super-lou/exstat
🌾 R package to provide an efficient and simple solution to aggregate and analyze the stationarity of time series
climate climate-change climate-data data-analysis data-visualization environment hydrology inrae low-water mann-kendall mann-kendall-tests r stationarity-test statistics time-series
Last synced: 23 Dec 2024
https://github.com/maksimekin/umd_data_challange_2020
Ocean Clean up data analysis project for the UMD Data Challenge 2020. Data Exploration for a Sustainable Planet.
cleanup competition data-analysis data-science folium geolocation machine-learning ocean planet pollution sklearn sustainability time-series trash umd
Last synced: 15 Dec 2024
https://github.com/jimmymugendi/email-sms-spam_classifier
Email SMS Spam Classifier is a cutting-edge machine learning solution designed to combat spam messages in email and SMS communications. Leveraging advanced Natural Language Processing (NLP) techniques, the system preprocesses text data by tokenizing, removing stopwords, and stemming, ensuring the most accurate classification results.
data-analysis data-visualization machine-learning-algorithms pandas seaborn-plots sklearn-library
Last synced: 16 Jan 2025
https://github.com/pythondeveloper6/store-sales-eda
simple EDA with some insights on Store Sales
data-analysis eda matplotlib numpy pandas seaborn
Last synced: 19 Jan 2025
https://github.com/asepscareer/visualization-using-plotly-and-covid-19-data
Covid-19 Analysis using plotly : Choropleth Maps
choropleth-map covid-19 data-analysis data-visualization plotly python
Last synced: 25 Nov 2024
https://github.com/jonzeolla/lab-securitydataanalysis
An introductory lab to Security Data Analysis (using Apache Metron (incubating)).
apache-metron data-analysis lab metron security
Last synced: 18 Nov 2024
https://github.com/tirendazacademy/hands-on-data-science-with-gcp
Google BigQuery Tutorial
big-data big-data-analytics bigdata bigquery bigquery-ml bigqueryml cloud-computing data-analysis data-analytics data-engineering data-science dataanalysis dataengineering google-bigquery google-cloud-platform machienlearning machine-learning
Last synced: 01 Jan 2025
https://github.com/pythondeveloper6/supermarket-eda-seaborn-for-beginners
learn Seaborn basics using a simple EDA
data-analysis eda numpy pandas seaborn visualization
Last synced: 19 Jan 2025
https://github.com/waveform80/structa
A small utility for analyzing data structures (e.g. JSON files)
csv data-analysis data-visualization datajournalism datawrangling json yaml
Last synced: 01 Jan 2025
https://github.com/ogoodness/vbreaker-js
CSC 483 Project - Ciphers: Caesar, Multiplicative, Affine, Vigenere, Hill, Columnar Transposition
affine-cipher caesar-cipher columnar-transposition-cipher cryptography data-analysis decoder decryption encoder encryption hill-cipher parsing vigenere-cipher
Last synced: 14 Nov 2024
https://github.com/depressioncenter/mden
Mobile technologies code from the University of Michigan's Mobile Data Experts Network (MDEN), featuring data cleaning automations, REDCap project templates, and links to useful external modules. [DOI: 10.6084/m9.figshare.25438714]
automation data-analysis data-cleaning fitness-tracker heart-rate-data mobile-data mobile-development mquery powerautomate powerbi powerquery python r sleep-data smartwatch-data tableau
Last synced: 25 Nov 2024
https://github.com/sayakpaul/analysis-of-college-database-of-2017-passouts
Contains my analysis of a database containing information about the students of an engineering college.
data-analysis data-visualization matplotlib python-3
Last synced: 08 Feb 2025
https://github.com/ethan-wickstrom/rrrs
Welcome to RRRS, a rapid, hyper-optimized CSV random sampling tool designed with performance and efficiency at its core. Crafted meticulously in Rust, RRRS offers an unparalleled solution for extracting random data samples from CSV files swiftly and effortlessly.
analytics cli command-line command-line-tool data data-analysis data-science dataset rust rust-lang sample samples
Last synced: 19 Nov 2024
https://github.com/sowinskibraeden/schedulegeneratorapp
The Desktop Application for my schedule-generator algorithm, allowing users to easily interact with the algorithm and its variables to generate schedules as documents for students individually as well as the master timetable
algorithm csv data-analysis dataclasses python-docx python-typing python311 xlsxwriter
Last synced: 20 Nov 2024
https://github.com/thecoderpinar/earthquake_prediction_analysis_project
🌍 Welcome to the Earthquake Prediction Analysis Project! 🚀 This project aims to predict earthquake magnitudes using LSTM neural networks and analyze seismic data. Explore, analyze, and forecast earthquakes with ease! 📈🔮
analysis data-analysis data-science earthquake-prediction geocoding geology lstm lstm-neural-networks machine-learning matlab matlab-deep-learning open-source time-series visualization
Last synced: 16 Dec 2024
https://github.com/globeandmail/startr-cli
A command-line scaffolder for the startr R project template
data-analysis data-journalism data-visualization journalism r
Last synced: 26 Dec 2024
https://github.com/1uc1f3r616/dark-net-websites-dataset
Dataset of Onion Websites
crawler darknet data-analysis dataset onion search-engine website
Last synced: 10 Jan 2025
https://github.com/fatihilhan42/data-science-projects
In this repo, there are (beginner-upper) level projects in the field of data science. I will host these projects that I have done in this field every day in this repo. With the hope that it will be useful to those who are interested in the field of data science like me and will just start...
data-analysis data-engineering data-mining data-science data-structures data-visualization database datascience fatihilhan fortytwo fortytwofficial jupyter-notebook python
Last synced: 29 Jan 2025
https://github.com/data-engineering-community/data-engineering-meetup-in-a-box
A collection of guides, resources, and support for DE meetup organizers.
data data-analysis data-engineering data-mining data-structures database meetups
Last synced: 23 Dec 2024
https://github.com/vikas-ukani/data-analysis-with-python---zero-to-pandas
Attend Complete Data Analysis Course from freecodecamp.com
data-analysis data-science data-visualization machine-learning numpy numpy-arrays pandas
Last synced: 23 Jan 2025
https://github.com/ptyadana/dv-data-visualization-with-python
Data analysis and visualization of countries' GDP, life expectancy comparison across continents, GDP per capita relative growth, population relative growth comparison, etc., using Pandas and Matplotlib.
csdojo data-analysis data-visualization datavisualization matplotlib matplotlib-pyplot numpy pandas pluralsight python python3
Last synced: 15 Nov 2024
https://github.com/qin-yu/r-global-financial-crisis
2018 [R] Data analysis: What happened during the 2007-09 financial crisis?
data-analysis data-visualization finance financial-analysis industry r risk-management statistical-analysis statistical-methods
Last synced: 21 Nov 2024
https://github.com/nafisalawalidris/911-call-analysis
The 911 Call Analysis project explores and visualises emergency call data to uncover patterns and trends. It includes data preparation, exploratory analysis, visualizing call volume and reasons and generating heatmaps. Users can customize the code for their dataset. The project relies on libraries like Pandas, NumPy, Matplotlib, Seaborn, and SciPy
cluster-analysis data-analysis data-visualization decision-making emergency-calls emergency-services exploratory-data-analysis heatmaps matplotlib numpy pandas patterns-and-trends resource-allocation scipy seaborn
Last synced: 23 Jan 2025
https://github.com/afsalashyana/whatsapp-chat-analyzer
Analyze WhatsApp chats with beautiful graphs. Written in JavaFX
data-analysis data-visualization javafx javafx-14 javafx-application whatsapp
Last synced: 31 Dec 2024
https://github.com/avinashkranjan/basic-data-analysis-and-visualization-in-python
📊 Some of the most important python tools in data science for Data Analysis and Data Visualization.
data-analysis data-science matplotlib matplotlib-pyplot numpy pandas plotly seabourne
Last synced: 13 Dec 2024
https://github.com/stimulsoft/stimulsoft.dashboards.php
Dashboards.PHP is a complete software package for designing and viewing dashboards. Includes the JS data analysis engine, dashboard designer and viewer. Supports PHP 5, PHP 7, and PHP 8.
charts dashboard-builder dashboards data-analysis data-grid data-visualization datatable dynamic-dashboard interactive-dashboards live-data mysql-data php php-bi-tools php-dashboard php-kpi php7 php8 pivot-tables sql-datasources statistics
Last synced: 30 Jan 2025
https://github.com/nafisalawalidris/predicting-credit-card-approvals
Explore credit card approval prediction through data analysis and machine learning. Preprocess data, train logistic regression models, and optimize hyperparameters. Learn data preprocessing, feature engineering, model training, and evaluation. Dive into the world of machine learning with Python and popular libraries.
approval-prediction credit-card data-analysis data-preprocessing feature-engineering hyperparameter-optimization libraries logistic-regression machine-learning model-evaluation model-training python python3
Last synced: 23 Jan 2025
https://github.com/heiderjeffer/misalignment-between-ownership-and-contribution-affects-system-reliability
Research Proposals RP
archtecture data-analysis data-collection nvivo-software python qualitative-analysis quantative-analysis reliability-engineering software-engineering
Last synced: 08 Feb 2025
https://github.com/kwokhing/exploratory-data-analysis-on-smrt-tweets
Demo of exploratory data analysis (EDA) on train service disruptions, based on scraped (user-generated content) tweets from the train operator's (SMRT) Twitter account
data-analysis data-cleaning data-collection data-preparation exploratory-data-analysis exploratory-data-visualizations folium geospatial-data leaflet-map python python3 regex scraping selenium selenium-python social-media text-processing user-generated-content web-scraping webscraping
Last synced: 02 Dec 2024
https://github.com/mohammadkarbalaee/python-for-data-analysis-book
All the practice exercises and code I write while reading the book Python for Data Analysis
data-analysis data-science python
Last synced: 01 Feb 2025
https://github.com/jcm-ai/standard-bank-data-science-virtual-experience-programme
This repository has all of the assignments I had to do for the Standard Bank Data Science Virtual Experience Program. 📉👨‍💻📊📈
automl business-analysis business-solutions client-communication data-analysis data-mining data-science data-visualization machine-learning machine-learning-algorithms matplotlib-pyplot model-evaluation model-interpretation power-point presentation-slides programming-language python3 seaborn sql statical-analysis
Last synced: 09 Jan 2025
https://github.com/agungbudiwirawan/socioeconomic_analysis
The objective of this project is to analyze socio-economic data for Chicago.
chicago-crime crime-data data-analysis data-science microsoft-sql-server project sql sql-project sql-server
Last synced: 02 Dec 2024
https://github.com/ktmud/github-life
A data explorer for GitHub projects' life cycles
data-analysis github scraper time-series
Last synced: 23 Jan 2025
https://github.com/agungbudiwirawan/e-commerce_analysis_using_sql
The objective of this project is to provide an analysis of Olist Store (Brazilian E-commerce) sales.
data-analysis data-science e-commerce e-commerce-project microsoft-sql-server sql sql-server
Last synced: 30 Jan 2025
https://github.com/vidhi1290/robust-yield-prediction-
"Predicting a Greener Future 🌾📊 Delve into the world of agriculture and data science with our Yield Prediction project. We harness machine learning and weather data to forecast crop yields accurately. Join us in cultivating smarter farming practices for a sustainable tomorrow."
artificial-intelligence data-analysis data-cleaning-and-preprocessing data-science data-visualization dataexploration devops docker machine-learning machine-learning-algorithms matplotlib matplotlib-pyplot pandas python scikit-learn scikitlearn-machine-learning streamlit yield-prediction-for-food-processing
Last synced: 02 Feb 2025
https://github.com/lussierc/foodborneillnessdataanalysis
A data analysis of foodborne illnesses using R Scripting methods.
data-analysis database foodborne-disease-outbreaks foodborne-illnesses rstudio
Last synced: 01 Feb 2025
https://github.com/mdh266/wikimedia_challenge
Analyzing click-through rates from Wikimedia
data-analysis data-challenge exploratory-data-analysis matplotlib pandas python
Last synced: 31 Jan 2025
https://github.com/pnnl/archive_walker
Archive Walker: software to read and examine PMU data to detect events and conditions for further analysis.
data-analysis pmu synchrophasor
Last synced: 25 Jan 2025
https://github.com/splch/qbs
An effective and flexible Quantile-Based Balanced Sampling algorithm for addressing class imbalance in datasets while preserving the underlying data distribution, improving model performance across various machine learning applications.
classification data-analysis imbalanced-classification imbalanced-data machine-learning resampling
Last synced: 07 Feb 2025
https://github.com/saka7/data-science-sandbox
Data Science, Data Analysis, Web Scraping Sandbox
ai artificial-intelligence data-analysis data-science jupyter-notebook machine-learning neural-network sandbox web-crawling web-scraping
Last synced: 24 Jan 2025
https://github.com/dataform-co/dataform-example-project
Example project on Dataform
data-analysis data-pipeline data-transformation elt sql sqlx
Last synced: 13 Nov 2024
https://github.com/benjamindpb/wikidata-preprocessing
Wikidata dump preprocessing & analysis of georeferenced entities
data-analysis preprocessing wikidata wikidata-dump
Last synced: 19 Jan 2025
https://github.com/neelshah18/neelshah18.github.io
Neel Shah's Website
article blog data-analysis data-science data-visualization deep-learning machine-learning personal-website
Last synced: 28 Jan 2025
https://github.com/michaelnabil230/laravel-analytics
A Laravel package to retrieve pageviews and other data from Database
data-analysis data-structures database laravel php
Last synced: 15 Nov 2024
https://github.com/ndleah/self-quantified
😊 EDA, self-quantified data analysis
data-analysis personal-data-analysis r self-quantify
Last synced: 12 Jan 2025
https://github.com/ndleah/school-donation
💰 Top school donors analysis
cufflinks data-analysis data-science data-visualization dataset exploratory-analysis python python-library python3
Last synced: 12 Jan 2025
https://github.com/lit26/trump_tweet_analysis
Analysis of Trump's original tweets.
data-analysis lda-model topic-modeling
Last synced: 30 Jan 2025
https://github.com/surajv311/data_analysis-food_recipes_ds
Data preprocessing, cleaning, analysis & plotting 📊 of the Food Recipes Dataset (from Kaggle). 🐍 Libraries used: Pandas, Matplotlib, Seaborn, Plotly. 📈
data-analysis kaggle-dataset matplotlib numpy pandas plotly seaborn
Last synced: 21 Jan 2025
https://github.com/mrigankpawagi/exop
Quest for a Habitable Exoplanet
css data-analysis font-awesome googlefonts html5 javascript jquery materializecss planets pwa space webapp
Last synced: 18 Jan 2025
https://github.com/neutrinoceros/gpgi
A Generic Interface for Grid + Particle data
data-analysis grid particles performance
Last synced: 16 Nov 2024
https://github.com/amrrs/introduction-to-eda-with-python
Introduction to EDA with Python Session Files
Last synced: 15 Nov 2024
https://github.com/nirmalnishant645/python-programming
Basic Python Programs
algorithms algorithms-and-data-structures algorithms-datastructures big-data data-analysis data-cleaning data-mining data-mining-algorithms data-science data-structure data-structures datastructures-algorithms geeksforgeeks geeksforgeeks-python geeksforgeeks-solutions hackerearth hackerearth-python hackerearth-solutions python python3
Last synced: 24 Jan 2025
https://github.com/reiniiriarios/squirrel-table
Desktop application to run MySQL queries over SSH and generate CSV and XLSX files. Useful for QA where queries need to be run repeatedly and files handed off.
csv csv-files data-analysis desktop-application electron excel mariadb mysql nodejs qa quality-assurance quality-control quality-control-assurance sql xlsx
Last synced: 18 Jan 2025
https://github.com/elkronos/anovatoolbox
This GitHub repository contains a collection of functions for performing various statistical analyses and generating visualizations. The functions are designed to work with different types of data and provide comprehensive outputs for data analysis.
anova anova-model data-analysis r statistics
Last synced: 24 Jan 2025
https://github.com/mrankitgupta/mrankitgupta
I'm Ankit Gupta. This repository contains a short & interesting introduction about me.
ai ankit ankit-gupta ankitgupta artificial-intelligence awesome-readme data-analysis data-science data-visualisation data-visualization github github-profile github-profile-readme machine-learning mrankitgupta profile python readme readme-profile social
Last synced: 17 Nov 2024
https://github.com/mykhode/python-sic-mini-project
SAMSUNG SIC Finish Project Course - Python
data-analysis python-analysis samsung-sic
Last synced: 11 Jan 2025
https://github.com/DataHerb/dataherb-python
Python Package for DataHerb: create, search, and load datasets.
data data-analysis data-mining database dataset python
Last synced: 15 Nov 2024
https://github.com/asifdotexe/sentimentscoringmodel
This project focuses on performing sentiment analysis on Amazon reviews using natural language processing (NLP) techniques. It includes various steps, from data exploration and preprocessing to building and evaluating sentiment models.
data-analysis data-visualization natural-language-processing sentiment-analysis
Last synced: 15 Nov 2024
https://github.com/emmanuel10701/matplotlib
Matplotlib
data-analysis data-science data-visualization matplotlib python
Last synced: 31 Jan 2025
https://github.com/storopoli/r_scripts
A couple of handy R scripts that I use on a daily basis for scientific research
data-analysis data-science data-visualization r scientific
Last synced: 20 Nov 2024
https://github.com/quantumudit/analyzing-whiskyexchange-whisky
This project focuses on scraping data on Japanese whisky from The Whisky Exchange website, performing the necessary transformations on the scraped data, and then analyzing & visualizing it using Jupyter Notebook and Power BI.
data-analysis data-science data-transformation data-visualization etl jupyter-notebook power-bi python webscraping
Last synced: 26 Dec 2024
https://github.com/fx2y/datanarrate
[WIP] LLM-powered agent for adaptive data analysis across multiple sources. Uses natural language for complex queries, visualizations, and insights. Features autonomous planning, SQL/Elasticsearch generation, and AI storytelling. Built with LangChain, GPT-4, FastAPI, and React.
ai data-analysis data-visualization elasticsearch fastapi gpt-4 langchain machine-learning nlp react sql
Last synced: 16 Jan 2025
https://github.com/martinboller/cc-build
Builds the latest version of CyberChef and installs it with NGINX on another system. CyberChef is a simple, intuitive web app for analyzing and decoding data without having to deal with complex tools or programming languages.
analysis blueteam compression cyberchef data-analysis data-manipulation decode encode encryption hashing parsing virtual-machine
Last synced: 16 Nov 2024
https://github.com/romac/adaproject
🔬 Project proposal for the Applied Data Analysis course at EPFL
Last synced: 24 Dec 2024
https://github.com/danielpuentee/outdpik
The fundamental toolkit for outliers search and visualization. It aims to be the fundamental high-level package for this purpose.
data-analysis matplotlib numpy python
Last synced: 17 Dec 2024
https://github.com/darsan-in/rumour-monger-spotter
Rumour Monger Spotter is a prototype developed during a national-level cyber hackathon to identify false information on Twitter. Using the Google Fact Check API and a Multinomial Naive Bayes classifier, the tool analyzes tweet content to assess the likelihood of misinformation. Despite a development window of less than 24 hours, the project won a t
ai data-analysis fact-checking hackathon india naive-bayes national-competition natural-language-processing prototype real-time-analysis social-media text-classification tweet-content twitter
Last synced: 12 Dec 2024
https://github.com/ihabbendidi/diamond-analysis
Exploratory statistical analysis of a Diamond dataset
data-analysis data-visualization exploratory-data-analysis machine-learning r
Last synced: 09 Feb 2025
https://github.com/super-lou/aeag_toolbox
🛠️ R toolbox to provide a simple way of interacting with all the code necessary to carry out hydrological stationarity analysis for the Agence de l'Eau Adour-Garonne (AEAG)
climate climate-change climate-data data-analysis data-visualization environment hydrology inrae low-water mann-kendall mann-kendall-tests r stationarity-test statistics
Last synced: 23 Dec 2024
https://github.com/cafferychen777/microbiomestat-turtorial-professional-version
MicrobiomeStat Tutorial Repository: This is a comprehensive resource for learning how to use the MicrobiomeStat package. It provides a step-by-step guide to effectively analyze complex microbiome data.
16s 16s-rrna 16s-seq analysis data-analysis metagenomes metagenomic-analysis metagenomic-pipeline metagenomics microbiome microbiome-analysis microbiome-analysis-pipelines microbiome-data microbiome-workflow omics omics-data omics-data-analysis omics-data-integration visualization
Last synced: 23 Jan 2025
https://github.com/dylan-profiler/tangled-up-in-unicode
Access to the Unicode Character Database (UCD)
data-analysis data-quality exploration linguistic-analysis linguistics python unicode
Last synced: 16 Nov 2024
https://github.com/thecoderpinar/credit-card-fraud-detection-project
This project focuses on the detection of credit card fraud using various data science and machine learning techniques. The dataset includes a record of credit card transactions over a specific period, with the goal of accurately identifying fraudulent activities. 🚀✨
anamoly-detection classification-algorithms credit-card-transactions data-analysis data-preprocessing data-science data-visualization fraud-detection machine-learning python
Last synced: 09 Feb 2025
https://github.com/c0deta1ker/arpescape
ARPEScape is a MATLAB-based app that contains a set of tools and functions for analysing the electronic structure of materials using photoelectron spectroscopy (PES) techniques, such as X-ray photoelectron spectroscopy (XPS) and angle-resolved photoelectron spectroscopy (ARPES).
analysis analysis-package angle-resolved-photoemission angle-resolved-spectroscopy arpes condensed-matter-physics data-analysis lcn matlab photoelectron-spectra photoelectron-spectroscopy photoemission psi sls ucl xps
Last synced: 30 Nov 2024
https://github.com/juangesino/behaviouraleconomics
All the files and data for the experiment performed during the course Behavioural Economics @ University of Amsterdam
behavioral-economics behavioural-economics data-analysis economics game-theory statistics
Last synced: 23 Jan 2025