Data analysis
Data analysis is a process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.
- GitHub: https://github.com/topics/data-analysis
- Wikipedia: https://en.wikipedia.org/wiki/Data_analysis
- Last updated: 2026-06-20 00:07:30 UTC
https://github.com/implicitlayer/alphaanalysis
Full library for analyzing financial data using ML and classical approaches
data-analysis data-science deep-learning finance machine-learning risk-analysis time-series-analysis trading transformers
Last synced: 12 Mar 2026
https://github.com/linuxto5re/gateiodatafilteration
Gate.io API analysis: spot, margin, futures; filters high volume, identifies common cryptos.
arbitrage-bot centralized-exchanges cryptocurrency data-analysis gate gateio-api python rest trading-strategies
Last synced: 23 Oct 2025
https://github.com/splch/qbs
An effective and flexible Quantile-Based Balanced Sampling algorithm for addressing class imbalance in datasets while preserving the underlying data distribution, improving model performance across various machine learning applications.
classification data-analysis imbalanced-classification imbalanced-data machine-learning resampling
Last synced: 01 Apr 2025
https://github.com/nhsdigital/sde_example_analysis
Example of what you can do in Databricks in the Secure Data Environment (SDE) using Python, SQL, and R.
data-analysis data-science databricks-notebooks machine-learning mlflow
Last synced: 25 Oct 2025
https://github.com/llnl/nddav
N-Dimensional Data Analysis and Visualization
data-analysis data-viz high-dimensional-data topological-data-analysis visual-analytics visualization
Last synced: 29 Apr 2025
https://github.com/nmicht/food-language-variety
A map to display distribution of food names using their synonyms in different regions.
data-analysis language-distribution language-variety map
Last synced: 31 Jan 2026
https://github.com/Lightning-Chart/lcjs-analysis
A data analysis library.
data-analysis javascript-library lcjs lightningchart-js typescript-library
Last synced: 12 May 2025
https://github.com/dylan-profiler/tangled-up-in-unicode
Access to the Unicode Character Database (UCD)
data-analysis data-quality exploration linguistic-analysis linguistics python unicode
Last synced: 15 Apr 2025
https://github.com/khuyentran1401/suicide-rates
data-analysis data-science kaggle machine-learning python
Last synced: 13 Apr 2026
https://github.com/narius2030/sakila-datawarehouse-ssis
Implement a simple data warehouse to store Sakila data; create data pipelines to extract, transform, and load data from source to warehouse; retrieve data from the warehouse to explore and run several analyses.
data-analysis data-integration data-modeling data-visualization excel microsoft-sql-server power-bi ssas ssis
Last synced: 10 Oct 2025
https://github.com/sayakpaul/datacamp-blogs
Jupyter notebooks of my DataCamp blogs
data-analysis data-science jupyter-notebooks machine-learning python sql
Last synced: 16 May 2026
https://github.com/bts-cm/airdrop_tool
Fetch & analyse blockchain tickets. View leaderboards and user tickets. Calculate and perform provably fair airdrops.
airdrop bitshares bitsharesjs blockchain crypto data-analysis data-science electron etl javascript nodejs react ticket tusc
Last synced: 17 Jan 2026
https://github.com/aadya940/numpyai
A Natural Language Interface to the Numpy Library using LLMs.
ai data-analysis data-science library llm machine-learning numpy python
Last synced: 12 Apr 2025
https://github.com/carlosvinimsouza/dataanalysiswithpython
Learn data analysis using Python libraries (NumPy, pandas, Matplotlib, and Seaborn)
data-analysis data-science free-code-camp matplotlib numpy pandas python python3 seaborn
Last synced: 11 Apr 2026
https://github.com/thecoderpinar/earthquake-explorer
🌍🔍 Explore and analyze earthquake data worldwide using this interactive data science project. Visualize earthquake occurrences, patterns, and geographical distribution. Dive deep into seismic data, perform exploratory data analysis, and gain insights into earthquake trends.
data-analysis data-science data-visualization earthquake-analysis exploratory-data-analysis geospatial-analysis interactive-maps jupyter-notebook machine-learning python seismic-data
Last synced: 23 Aug 2025
https://github.com/pmsipilot/intercom2dw
Intercom2DW is an attempt at loading all the data available in an Intercom application.
Last synced: 11 Apr 2026
https://github.com/nghorbani/neuraldataanalysis
Download Data from https://bit.ly/3g8RUmi
data-analysis matlab spike-detection spike-rate
Last synced: 22 Apr 2025
https://github.com/silveirinhajuan/rotinapy
RotinaPy: Simplify your daily life and maximize productivity with an integrated app for task management, study tracking, flashcards, and more. Built with Streamlit and Python.
data-analysis flashcards llm-integration llm-ui machine-learning ollama productivity python streamlit study study-project study-tracker task-management task-manager
Last synced: 13 Feb 2026
https://github.com/yasminezaatour/heart-disease-predictions
Heart attack predictions using Python
data-analysis data-modeling data-visualization machine-learning
Last synced: 28 Nov 2025
https://github.com/datalayer/desktop
Ξ 🖥️ Datalayer Desktop.
ai data data-analysis data-science datalayer desktop electron
Last synced: 25 Oct 2025
https://github.com/tushar2704/instagram-user-analytics
This project revolves around the exploration and analysis of user engagement patterns on the popular social media platform, Instagram. By delving into user data and interaction metrics, this project aims to provide valuable insights into user behavior, content performance, and trends.
artificial-intelligence data-analysis data-science instagram project tushar2704
Last synced: 23 Jan 2026
https://github.com/singhkunall/-india-census-2011-population-demographics-dashboard
Interactive Excel dashboard visualizing India's 2011 Census Population Demographics using charts, pivot tables, and slicers.
data-analysis data-visualization excel-dashboard india-census population-data
Last synced: 04 Feb 2026
https://github.com/enzon3/tws-dataset_gen
Program that slowly generates a dataset of news titles and the sentiment of the titles.
data-analysis data-science dataset
Last synced: 14 Apr 2026
https://github.com/dra1ex/mind-net.js
Fast and simple to use neural network implementation in pure TypeScript with GPU support!
artificial-intelligence classification data-analysis deep-learning gan generative-adversarial-network gpu machine-learning ml neural-network neural-network-engine neural-networks regression sequential-network supervised-learning unsupervised-learning vae variational-autoencoder
Last synced: 09 Apr 2025
https://github.com/ihabbendidi/diamond-analysis
Exploratory statistical analysis of a Diamond dataset
data-analysis data-visualization exploratory-data-analysis machine-learning r
Last synced: 17 Oct 2025
https://github.com/roaldarbol/anibehavr
🪲 An R package for Analysis of Animal Behaviour
animal-behavior behavioural-states data-analysis r
Last synced: 11 Oct 2025
https://github.com/garrettj403/albertaenergysources
Get grid data from Alberta Electric System Operator (AESO)
alberta canada data-analysis energy-data
Last synced: 10 Oct 2025
https://github.com/agungbudiwirawan/socioeconomic_analysis
The objective of this project is to analyze socio-economic data for Chicago.
chicago-crime crime-data data-analysis data-science microsoft-sql-server project sql sql-project sql-server
Last synced: 04 Jul 2025
https://github.com/heiderjeffer/misalignment-between-ownership-and-contribution-affects-system-reliability
Research Proposals RP
archtecture data-analysis data-collection nvivo-software python qualitative-analysis quantative-analysis reliability-engineering software-engineering
Last synced: 23 Feb 2026
https://github.com/julianolaurentino/sql_sample
Studies focused on writing SQL queries. This repository is updated weekly.
data-analysis sql sql-query sql-server
Last synced: 05 Mar 2025
https://github.com/alyssonmach/9-data-science-apps-with-python
[freeCodeCamp Course]: build interactive and data-driven web apps in Python using the Streamlit library.
data-analysis data-science data-visualization machine-learning streamlit-webapp
Last synced: 12 Feb 2026
https://github.com/ehtisham-sadiq/final-year-project-material-
This repository contains all the material and data related to final year project.
data-analysis data-science data-visualization deep-learning implementation-of-algorithms machine-learning machine-learning-algorithms natural-language-processing python3
Last synced: 02 Jan 2026
https://github.com/wtbates99/stock-indicators
A comprehensive self-hosted stock analysis platform combining FastAPI and React. Features interactive stock visualizations, real-time data aggregation, and customizable technical indicators with a modern, grid-based interface.
backend data-analysis data-mining data-science fastapi financial-analysis frontend investment python reactjs sqlite3 statstistics stock-market stock-price-prediction time-series
Last synced: 15 Oct 2025
https://github.com/tillbiskup/labinform
Python components of the laboratory information system LabInform
data-analysis data-storage data-storage-infrastructure electronic-lab-notebook good-practices reproducible-research reproducible-science unique-identifier
Last synced: 06 Sep 2025
https://github.com/mszell/fyp2021
INSTRUCTOR's VERSION: Lectures of First Year Project 2021, Project 1
data-analysis data-science education python teaching-materials
Last synced: 02 May 2025
https://github.com/mykhode/python-sic-mini-project
SAMSUNG SIC Finish Project Course - Python
data-analysis python-analysis samsung-sic
Last synced: 10 Oct 2025
https://github.com/ekosaputro09/Data-Science-References
Some useful resources to learn about Data Science
cheatsheet data-analysis data-science data-visualization machine-learning statistical-learning
Last synced: 22 Nov 2025
https://github.com/ziaeemehr/itng_nest
Nest Simulator quick guides and examples, adding new model using NESTML
computational-neuroscience data-analysis nest-simulator neuroscience
Last synced: 30 Aug 2025
https://github.com/arzan101/ola-data-analytics
Ola: identified the reasons and trends behind ride cancellations. Process: cleaned and processed data from multiple sources, applied SQL queries, and visualized the data using Power BI. Motive: to reduce the cancellation rate.
dashboard data-analysis data-mining data-visualization dataanalytics excel powerbi sql
Last synced: 06 Jan 2026
https://github.com/martial2023/bank-performance-analysis
Analysis of banking data from the Berka Dataset (1993-1998) to compute and visualize key KPIs
dashboard data-analysis data-visualization nextjs pandas plotly-express pymongo python recharts-js sqlalchemy
Last synced: 26 Aug 2025
https://github.com/DCS-training/intromachinelearning
This course is aimed at providing an introduction to machine learning for those with some beginner-level Python/RStudio skills. Go to the readme file
data-analysis data-wrangling machine-learning python statistics
Last synced: 25 Apr 2025
https://github.com/haloapping/malas-ngetik-clf
I'm too lazy to type, so I just made a template for Kaggle competition projects 😜. This template is specifically for classification cases.
data-analysis exploratory-data-analysis feature-engineering kaggle kaggle-competition machine-learning python3 scikit-learn
Last synced: 12 Apr 2026
https://github.com/fenghaojiang/ethereum-etl
ETL (Extract, Transform, Load) for data from Ethereum-like EVM blockchains
Last synced: 14 Jan 2026
https://github.com/leechristophermurray/parquetframe
Unlocking the power of Parquets
data data-analysis dataframe entity-framework etl graph interactive python rust workflow worklow zanzibar
Last synced: 28 May 2026
https://github.com/adriens/endoflife-date-snapshots
Daily consolidated and enriched snapshots of endoflife.date
apache-parquet csv csv-export data-analysis data-science database datavisualization dataviz duckdb duckdb-database end-of-life endoflife eol jupyter-notebook kaggle kaggle-notebook olap python release-policy release-schedule
Last synced: 11 Apr 2026
https://github.com/adirthaborgohain/community-data-analysis
Data and Visual Analysis on several different communities generated using Louvain Algorithm in Neo4j on the dblp dataset.
Last synced: 10 May 2026
https://github.com/mainakverse/ml-algorithms-starter
List of machine learning algorithms needed to start ML projects and lay a foundation in data science
data-analysis data-science jupyter-notebooks machine-learning-algorithms practice
Last synced: 19 Apr 2025
https://github.com/ernestaroozoo/memestocks.net
MemeStocks.net is a Python web app that tracks the historical popularity of specific stocks by monitoring Reddit mentions. Users can search for a stock symbol and view information such as the stock's name, price, and historical popularity data. The data is gathered using the Pushshift API and stored in a PostgreSQL database.
dashboard data-analysis financial-data meme-stock python reddit-scraper scraping sql stock streamlit
Last synced: 06 Mar 2026
https://github.com/iamfoysal/data-analysis
This repository contains various examples and exercises to help learn data science using Python.
data-analysis data-science database jupyter-notebook python3
Last synced: 10 Feb 2026
https://github.com/mrjxtr/tokyo_airbnb_analysis_project
Full project case study and analysis to show potential opportunities to start an Airbnb business in Tokyo, Japan.
data-analysis data-cleaning data-science data-visualization pandas python3
Last synced: 24 Feb 2026
https://github.com/jpcadena/onemetric-plus
OneMetric+ project: an analytical tool for demand forecasting and outlier detection
black-formatter data-analysis data-analytics data-science data-visualization demand-forecasting isort machine-learning matplotlib mypy numpy outlier-detection pandas pre-commit-hook pydantic python ruff scikit-learn seaborn solid-principles
Last synced: 10 Apr 2026
https://github.com/camille-maslin/simulfcimage
🔍 SimulFCImage: A professional multispectral image processing application developed for ImViA Laboratory.
academic-project computer-vision data-analysis gui-application image-processing image-viewer multispectral-images pyqt5 python scientific-visualization spectral-analysis
Last synced: 05 Feb 2026
https://github.com/goggle/dataisbeautiful
Some data analysis Jupyter notebooks, mainly intended for submissions on the subreddit /r/dataisbeautiful.
data-analysis data-visualisation jupyter-notebook notebook reddit
Last synced: 16 May 2025
https://github.com/quantumudit/python-projects
Consists of various projects that are primarily powered by Python
data-analysis data-science data-visualization jupyter-notebook projects python pythonapplication pythonprojects
Last synced: 09 Apr 2025
https://github.com/kevinschoon/qviz
QViz Interactive Plotting
data-analysis data-visualization go gonum qframe yaegi
Last synced: 01 Jun 2026
https://github.com/walidbosso/r_data_mining
Extract knowledge from data using different techniques, including association rules, hierarchical agglomerative clustering (HAC), k-means clustering, and decision trees.
association-rule-mining association-rules clustering data-analysis data-mining data-science data-visualization decision-tree-classifier decision-trees exportation extract-data hac hierarchical-clustering k-means k-means-clustering k-means-r r-programming r-studio
Last synced: 23 Mar 2025
https://github.com/shlokashah/student-depression-and-suicide-rate-prediction
https://shlokashah.github.io/Student-Depression-And-Suicide-Rate-Prediction/
data-analysis data-visualization machine-learning student suicide-rate-prediction
Last synced: 19 Nov 2025
https://github.com/grburgess/gbm_kitty
Database, reduce, and analyze GBM data without having to know anything. Curiosity killed the catalog.
3ml catalogue data-analysis fermi-science grbs pipelines
Last synced: 29 Jun 2025
https://github.com/llnl/hdtopology
High-dimensional topological data analysis library for NDDAV
analysis cpp data-analysis data-viz high-dimensional-data topological-data-analysis visualization
Last synced: 29 Apr 2025
https://github.com/avs/avs-go
Common web components for AVS data visualization products
avs big-data charting-library csharp csharp-library data-analysis data-science data-visualization datamanagement high-performance high-performance-computing java java-library javascript multi-dimensional multi-platform multi-threading rendering-3d-graphics rendering-engine webcomponents
Last synced: 10 Apr 2026
https://github.com/aravind-selvam/covid_dashboard
A dashboard I created from COVID death and vaccine data.
covid-19 data-analysis data-science data-visualization tableau tableau-public visualization
Last synced: 08 Mar 2026
https://github.com/rmnldwg/liver-smart
Data and analysis pipeline for a study on the potential advantages of daily adaptive liver SBRT performed at the University Hospital Zurich.
data-analysis fractionation jupyter-notebook liver-cancer metastasis radiation-oncology stereotactic
Last synced: 27 May 2026
https://github.com/kylekirkby/cardatasnatch
CarDataSnatch allows you to quickly find information about a car in the UK using a valid number plate. Grab an image of the car in question along with a multitude of other data. Compare two cars' data for fast and easy analysis.
beautifulsoup cars command-line-tool data data-analysis data-mining ethical-hacking python python3 requests scraper social-engineering
Last synced: 15 Apr 2025
https://github.com/luminati-io/Amazon-dataset-samples
A sample dataset of over 1,000 Amazon product listings, extracted using the Bright Data API, perfect for competitive analysis, market trends, and eCommerce insights.
amazon api data-analysis data-science dataset ecommerce products web-scraping
Last synced: 09 Apr 2025
https://github.com/lacerbi/vbmc
Variational Bayesian Monte Carlo (VBMC) algorithm for posterior and model inference in MATLAB (old location)
bayesian-inference data-analysis gaussian-processes machine-learning matlab variational-inference
Last synced: 11 Oct 2025
https://github.com/agungbudiwirawan/supermarket_sales_dashboard-sales-analysis-using-pivot-table-and-chart
The objective of this project is to analyze supermarket sales using pivot tables and charts in Microsoft Excel. After doing the analysis, I created an interactive dashboard that aims to make it easier for the audience to explore the data.
dashboard data-analysis data-science data-visualization excel sales-dashboard spreadsheet
Last synced: 08 Jan 2026
https://github.com/erictleung/erictleung.github.io
📝 Source code for my website, portfolio of projects, and more
bioinformatics blog data data-analysis data-science github-jekyll github-page jekyll lanyon open-science open-source software-engineering
Last synced: 21 Jan 2026
https://github.com/fernandezfran/exma
A Python library with C extensions to analyze and manipulate molecular dynamics trajectories and electrochemical data
computational-physics data-analysis molecular-dynamics oop python science
Last synced: 16 Jan 2026
https://github.com/cosmoduende/r-arduino
Interoperability Data-IoT: How to send and receive data and take control of your Arduino, from R. How to establish interoperability between R and Arduino (Data and IoT) using a data flow between the two
arduino arduino-data arduino-dataflow arduino-serial arduino-serial-data arduino-serial-led arduino-uno data-analysis data-arduino data-cleaning data-iot data-visualization interoperability iot-rstudio r-analytics r-data-visualization r-iot rstudio-arduino serial-read serialport
Last synced: 24 Apr 2026
https://github.com/wiseaidev/corona-virus-data-analysis-modeling-and-visualization
Data analysis of covid-19 and SEIRD model implementation.
coronavirus coronavirus-tracking covid-19 data-analysis data-analysis-python data-visualization folium-maps modeling-dynamic-systems numpy ploty population python3 science science-research seird-model seird-simulator simulation
Last synced: 14 Apr 2025
https://github.com/timzatko/fifa-19-dataset-machine-learning
Player's value prediction and game position classification on FIFA 19 dataset.
data-analysis fifa19 machine-learning scikit-learn
Last synced: 04 May 2026
https://github.com/ahmednasef3/titanic-full-eda
Simple EDA for Titanic Dataset.
data-analysis data-visualization eda exploratory-data-analysis matplotlib pandas seaborn titanic titanic-data-analytics
Last synced: 27 Jan 2026
https://github.com/its-kanii/predictive-maintenance-for-healthcare-equipment
Predictive Maintenance for Healthcare Equipment utilizes machine learning to analyze operational metrics and predict equipment failures. This project leverages a dataset of usage hours, temperature, and maintenance history to enhance equipment reliability and reduce downtime.
data-analysis data-science failure-prediction feature-engineering healthcare-equipment jupyter-notebook machine-learning predictive-maintenance python time-series-analysis
Last synced: 09 May 2026
https://github.com/lastancientone/amd-vs-nvda
Analyzing 2 technology stocks using Master Analyst Program (MAP).
data data-analysis data-structures data-visualization excel forecasting time-series-analysis
Last synced: 15 May 2025
https://github.com/ronylpatil/whatsapplib
WhatsApp Group Chat Analysis Python Package.
data-analysis open-source pypi-package python-library python-package
Last synced: 02 Jan 2026
https://github.com/dcs-training/r-qgisintegratingspatialanalysis
This was an intermediate course of three sessions with a focus on developing skills in data visualisation, analysis and integration using both RStudio and QGIS. Go to the readme file
data-analysis data-visualisation data-wrangling gis qgis r spatial-analysis
Last synced: 28 Jan 2026
https://github.com/luizassimoes/fitness-report
Create a personalized Fitness Wrapped report using your Apple Health data with this Streamlit application. Generate comprehensive, detailed summaries of your annual fitness activities, providing valuable insights into your year-long progress and achievements.
data-analysis data-visualization python streamlit
Last synced: 12 Feb 2026
https://github.com/a-r-j/npview
CLI tools for quickly inspecting CSV/TSV & NumPy (.npy) array files
cli csv data-analysis inspector npy numpy python tsv
Last synced: 18 Jan 2026
https://github.com/quantumudit/alteryx-weekly-challenges
This repository contains Alteryx solutions to the weekly challenges published in Alteryx Community
alteryx alteryx-workflow data-analysis data-science data-transformation data-visualization etl
Last synced: 27 Jan 2026
https://github.com/rvalla/chessevolution
Some code to analyze my chess games using the Lichess API.
chess data-analysis lichess lichess-api python
Last synced: 23 Oct 2025
https://github.com/dcs-training/bayesian-statistics
Materials for the CDCS Introduction to Bayesian Statistics course. Go to the readme file
bayesian-statistics data-analysis r statistics
Last synced: 05 Feb 2026
https://github.com/jshinm/web-scrapper
Web scraper used to extract NeuroData GitHub repo stats
Last synced: 04 Apr 2025
https://github.com/henrylin03/video-games
Using Python and SQL to clean, analyse and visualise video games' data from Metacritic. Includes scraping using BeautifulSoup.
analysis beautifulsoup beautifulsoup4 data data-analysis data-science eda jupyter-notebook pandas python sql sqlite3 video-game video-games
Last synced: 14 Apr 2026
https://github.com/aditiiprasad/whatsstat
A fun and insightful WhatsApp chat analyzer that turns your conversations into beautiful stats, juicy graphs, and quirky insights.
chat-analyzer data-analysis data-visualization nlp streamlit text-processing whatsapp
Last synced: 02 Sep 2025
https://github.com/briatte/asr
Applied Stats with R and RStudio (first-year social-science tutorials)
course data-analysis data-science data-visualization r statistics
Last synced: 14 Apr 2026
https://github.com/code-jl/nfl-point-kicker-data-scraper
A Python-based web scraping toolkit that extracts and processes NFL kicking statistics from Pro-Football-Reference. This project automates the collection of comprehensive game data, with a particular focus on field goal attempts and environmental conditions.
automation beautifulsoup csv data-analysis data-collection field-goals football-statistics kicking-stats nfl python selenium sports-analysis statistics weather-data web-scraping
Last synced: 06 Sep 2025
https://github.com/virajbhutada/cliquebait-digital-marketing-analysis-using-sql
This GitHub repository contains the CliqueBait Digital Marketing Analysis project, utilizing SQL for comprehensive analysis of marketing campaigns, user engagement, product performance, and website interactions within the Clique Bait food app. The project offers actionable insights for optimizing marketing strategies in a competitive landscape.
campaign-website data-analysis data-extraction data-science digital-marketing food-store microsoft-excel mysql product-performance sql sql-database sql-project user-engagement website-analytics
Last synced: 27 Feb 2025
https://github.com/devexpress-examples/web-forms-pivot-grid-bind-to-sql-data-source
This example demonstrates how to create an ASPxPivotGrid and bind it to data via code.
asp-net-web-forms data-analysis dotnet pivot-grid pivot-grid-for-web-forms
Last synced: 06 Jul 2025
https://github.com/gxelab/tutorials
Tutorials of frequently used software packages and libraries in the lab
bioinformatics data-analysis evolution genetics genomics julia python3 r-language statistics visualization
Last synced: 18 Jan 2026
https://github.com/naso7y/students-performance-analysis
A project analyzing students' academic performance to identify trends and factors affecting outcomes. Built with Python, using data visualization and statistical techniques to derive actionable insights.
data-analysis data-visualization machine-learning python
Last synced: 23 Feb 2026
https://github.com/giatraskon/clustering-countries-socioeconomic-health-analysis
Exploration and analysis of socio-economic and health data from 167 countries using MATLAB. Application of clustering algorithms to identify development patterns, visualize disparities, and understand global trends.
calinski-harabasz-index clustering country-data data-analysis data-visualization davies-bouldin-index elbow-method feature-selection health-indicators human-development-index k-means-clustering k-median-clustering k-medoids-clustering machine-learning matlab pca pearson-correlation silhouette-score socio-economic-indicators unsupervised-learning
Last synced: 29 Jan 2026
https://github.com/yeisonmontoya1815/machine-learning_prediction_can_inflation
We aim to predict trends in the Canadian market basket using sentiment analysis techniques. Sentiment analysis involves analyzing text data to determine the sentiment expressed, whether positive, negative, or neutral.
algorithms-and-data-structures data data-analysis data-science data-visualization feature-engineering machine-learning matplotlib-pyplot numerical-analysis numpy pandas pipelines python sklearn structured-data super unsupervised-learning
Last synced: 05 Feb 2026
https://github.com/narius2030/hive-datawarehouse-analysis
Implement a Hive data warehouse to store meaningful data, and apply machine learning techniques such as clustering or regression to business problems
apache-hadoop apache-hive data-analysis etl-pipeline hiveql machine-learning statistics
Last synced: 01 Apr 2025
https://github.com/ivangrigorov/neutrino-search-engine
A Java search engine for both HTML and document files
data data-analysis data-knowledge information-extraction information-retrieval java-language search-engine
Last synced: 31 Mar 2025
https://github.com/cbg-ethz/scdna-pipe
Python data analysis pipeline for single cell copy number event history reconstruction
bioinformatics bioinformatics-pipeline data-analysis genomics python snakemake snakemake-workflows workflow
Last synced: 05 Jan 2026
https://github.com/fabienarcellier/qjoin
qjoin is a data manipulation library that provides simple and efficient joining and collection processing functionality
composable data-analysis developer-tools functools python
Last synced: 01 Mar 2026
https://github.com/tillbiskup/cwepr
A Python package based on the ASpecD framework for handling cwEPR data.
continuous-wave data-analysis data-processing electron-paramagnetic-resonance reproducible-research reproducible-science
Last synced: 06 Sep 2025
https://github.com/trybnetic/tu7-acceleration-sleep-wake-classification
Supporting material for the paper ''Discrimination of sleep and wake periods from a hip-worn raw acceleration sensor using recurrent neural networks''
accelerometer accelerometry actigraphy data-analysis sensors sleep
Last synced: 01 Jun 2026