data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-06-29 00:07:49 UTC
- JSON Representation
https://github.com/madhuresh2011/genai-powered-data-analytics-by-tata
I recently participated in Tata iQ's job simulation on the Forage platform, and it was incredibly useful to understand what it might be like to be on a data analytics team in an AI transformation consulting role.
chatgpt data dataanalytics eda excel gemini generative-ai internships powerpoint presentation
Last synced: 14 Feb 2026
https://github.com/cognitixe/metamask-wallet-recovery-funds-phrase-data-seed-token
This repository provides tools and guidelines for securely recovering MetaMask Wallet funds using recovery phrases, seed data, and tokens. It ensures safe and reliable methods for recovering access to your wallet and managing your cryptocurrency assets.
bitcoin blockchain cryptocurrencies cryptocurrency data ethereum funds metamask metamask-bot metamask-desktop metamask-extension metamask-plugin metamask-snap metamask-wallet phrase recovery seed token wallet wallet-security
Last synced: 13 May 2026
https://github.com/dawidolko/datafusion-app-python
Project as part of the Data Warehousing subject.
academic-project data dataprocessing extraction gui loading project pysimplegui python transformation
Last synced: 15 Feb 2026
https://github.com/charon25/weatherdata
17 000 weather measurements collected by a weather station created for a college project.
csv data dataset datasets json measurements strasbourg weather weather-data
Last synced: 16 Jan 2026
https://github.com/basemax/okala-product-ids
A PHP script to fetch and save product IDs from Okala's online store API across multiple categories and store branches.
crawler crawler-okala crawler-php crawlers data database ids ir iran json okala okala-crawler php php-crawler product
Last synced: 09 May 2026
https://github.com/abhroroy365/market_analysis
This project explores customer segmentation and market analysis in the context of online retail using an online retail dataset. By applying advanced analytics, we aim to uncover insights that can drive strategic decisions and enhance business performance.
clustering data data-analysis data-visualization kmeans-clustering machine-learning market-analysis python silhouette-analysis
Last synced: 09 May 2026
https://github.com/naitiknayak196/tech-layoffs-cleaning-sql-vs-python
This project cleans and analyzes a tech layoffs dataset using MySQL and Python (Pandas) to compare their efficiency in data processing. It provides business insights into workforce trends, industry stability, and economic impacts to support data-driven decision-making.
data datacleaning dataset jyputer-notebook layoffdata layoffs mysql python sql
Last synced: 09 May 2026
https://github.com/keziatbnn/supervised-regression-salaryprediction
Make salary predictions based on years of experience using supervised regression.
data data-analysis-python data-prediction data-science python
Last synced: 11 Aug 2025
https://github.com/dms-codes/scrape-kesaintblanc-id
Kesaintblanc Data Scraper This Python script is designed to scrape product data from the Kesaintblanc website. It collects information about products, including product name, URL, price, image URLs, status, stock, and more. The scraped data is saved to a CSV file for further analysis.
data kesaintblanc python webscraper
Last synced: 27 May 2026
https://github.com/yorkearwaker/data
Data things; representation, transformation, pipelines, governance,
actuality data epistemology information knowledge ontology
Last synced: 07 Apr 2025
https://github.com/reshmaaiman/liver-patient-prediction
Liver Disease Prediction
data data-science data-visualization dataanalysis jupyter-notebook numpy pandas python seaborn
Last synced: 16 Apr 2026
https://github.com/satyam4229/iit-and-nit-college-dataset
The dataset for IITs and NITs typically includes information related to these premier engineering institutions in India, such as their names, locations, rankings, academic programs offered, faculty details, student information, admission process, infrastructure and facilities, placements.
college-data csv data excel iit nit
Last synced: 04 Jan 2026
https://github.com/hupili/djworkshop-cuc2018
data data-journalism data-visualization
Last synced: 27 Mar 2026
https://github.com/oroszgy/hunlp-resources
Scripts and resources for making spaCy understand Hungarian.
corpus-linguistics data hungarian hungarian-language hunlp magyarlanc model natural-language-processing nlp resources script spacy wikipedia
Last synced: 18 May 2026
https://github.com/wyattowalsh/proxywhirl
rotating proxy system
data data-extraction dataextraction proxy proxy-checker proxy-list proxy-scraper proxy-server proxypool python python3 rotating-proxy sqlite sqlite3 web-data-extraction
Last synced: 03 Mar 2026
https://github.com/andrii04/andreamonforte-bi-assignment
Automated Data Pipeline that ingests daily GA4-formatted CSV files from a private Google Cloud Storage bucket, validates and loads them into BigQuery, and prepares analysis-ready views. The solution is built for deployment as a Cloud Function triggered by Cloud Scheduler and uses Python with the Google Cloud Storage and BigQuery client libraries.
automation bigquery cloud cloudfunctions data data-analysis data-engineering etl etlpipeline gcp google googlecloudplatform pipeline python sql
Last synced: 09 Nov 2025
https://github.com/yashkp1234/movie-recommendation-engine
My project on analyzing the movie data set, and creating a recommendation engine using that analysis.
analysis data notebook python recommendation-engine
Last synced: 04 May 2025
https://github.com/apostolissiampanis/weather-app-api
WeatherApp is a Java-based console application that retrieves and processes weather data using the wttr.in web service.
api data hibernate java json lombok objected-orientated-programing oop spring-boot spring-data-jpa sqlite webflux
Last synced: 05 May 2026
https://github.com/diegoperea20/datos-secuenciales-con-ia
Realizacion de procesamiento de señales unidimensionales con modelos auto regresivos, convolución 1d, convolución 2d usando el espectrograma y redes recurrentes
ai artificial-intelligence convolutional-neural-networks data ia secuential-data spectrogram uao
Last synced: 06 Feb 2026
https://github.com/0xhericles/ufcg-geojson
GeoJSON file containing the blocks and buildings of the Federal University of Campina Grande.
data data-visualization geojson map open-source ufcg university
Last synced: 09 Feb 2026
https://github.com/thomasjewson/cci-data-science-textbook
This is a short, interactive textbook aimed at introducing data science to non-IT university undergraduates. Funded by Erasmus+.
data data-science learning python textbook
Last synced: 16 Apr 2026
https://github.com/jigyasag18/power-bi-dashboard-project
The Ecommerce Sales Analysis Dashboard project utilizes Power BI to provide detailed insights into ecommerce sales data, enabling stakeholders to track key performance metrics and uncover trends. This interactive dashboard allows users to explore the data in real-time, offering features such as drill-down capabilities, customizable filters.
dashboard data data-visualization datacleaning datanalysis datanalytics datapreprocessing powerbi visulaization
Last synced: 04 Mar 2026
https://github.com/sehgal-vishal/world-population-
World Population Sql Analysis
data dataanalysis population sql
Last synced: 05 Mar 2026
https://github.com/martinius96/meteostanica-odosielacie-scripty
Meteostanica - Arduino, ESP8266, ESP32 - odosielanie sketche pre reprezentáciu dát vo webovom rozhraní.
arduino bme280 bmp280 data dht22 ds18b20 esp32 esp8266 espressif html meteo meteostanica mysel nodemcu php stanica teplota tlak vlhkost webstranka
Last synced: 11 Apr 2026
https://github.com/fuzzt/location-analyzer
The Location Data Analyzer is a Spring Boot application that offers insights on location data, such as counting locations by type, calculating average ratings, and identifying the most reviewed and incomplete entries. It features a simple frontend (HTML, CSS, JavaScript) and is deployed on Render.
analysis api average css data deployment docker fetch-api frontend html javascript location maven ratings render restful-api reviews spring-boot techstack
Last synced: 11 Apr 2026
https://github.com/taeefnajib/ibm-applied-data-science-capstone
This repository is for my IBM Applied Data Science Capstone Project. All the notebooks and other files are uploaded. If you are benefited by this repository by any means, please feel free to "Star" it and follow me. Thanks.
advance capstone capstone-project data data-science ibm ibm-watson jupyter jupyter-notebook notebook notebook-jupyter project science spacex spacex-api
Last synced: 14 Mar 2025
https://github.com/climate-resource/input4mips_validation
Validation of input4MIPs data
cmip data forcing input4mips validation
Last synced: 20 Jan 2026
https://github.com/squareslab/frameworkstudytranscripts
archived data human-study zackc
Last synced: 06 Mar 2026
https://github.com/ashfaqalizardariofficial/databasehelper
A C# database helper library to connect with the database server and perform actions insert, update, delete, select data and select multiple data from the database.
ashfaq-ali-zardari ashfaq-ali-zardari-official data database delete helper insert ms-sql-server multiple select-data server sql-server update
Last synced: 02 Apr 2026
https://github.com/mnazlukhanyan/da-projects
Портфолио с работами по аналитике данных, показывающие мои навыки, умения и опыт
data data-vizualisation hypothesis-tests matplotlib pandas plotly postgresql product-metrics python scipy seaborn sql visualization
Last synced: 11 Apr 2026
https://github.com/scjoaoantonio/trab_datascience
Este projeto tem como objetivo analisar os posts da rede social Bluesky. A aplicação interativa foi desenvolvida utilizando Streamlit e permite a coleta e visualização de dados, além de oferecer análises avançadas como previsão de engajamento, modelagem de tópicos e análise de sentimentos.
bluesky data data-science streamlit
Last synced: 09 May 2026
https://github.com/thicclatka/tetration
New file format for tensors
cli data fileformat mmap tensors
Last synced: 26 May 2026
https://github.com/foreteternelle/pokemonstudiodataapi
The GitHub repository of the Pokémon Studio Data Api
Last synced: 02 Apr 2026
https://github.com/snacks02/wobbling-statistics
Audio equipment statistics using Squiglink data
audio data data-visualization headphones iems speakers squiglink statistics
Last synced: 17 Apr 2026
https://github.com/ashita-ai/ashita-ai.github.io
Ashita AI - The island of misfit data tools
Last synced: 19 Feb 2026
https://github.com/cloud-shuttle/drover-sqlforge
The Data Automation Engine. A blazing-fast, pure Go alternative to dbt for data transformations.
ast data drover sql transformation
Last synced: 03 Jun 2026
https://github.com/awhipp/forex-api-export
API Service that pulls forex data and returns CSV file based on the parameters
data forex forex-trading oanda oanda-api-v20 trading
Last synced: 04 Jun 2026
https://github.com/stdlib-js/dstructs
Data structures.
containers data data-structures javascript namespace node node-js nodejs ns stdlib structs structures
Last synced: 18 Apr 2026
https://github.com/nicholas-owen/rdm-calendar
A small utility to manage conference and event information
calendar conference data event research
Last synced: 26 May 2026
https://github.com/mipacd/holochatstats
A VTuber chat log (and general) analytics platform
data flask hololive postgresql python visualization vtuber youtube
Last synced: 05 Apr 2026
https://github.com/kenatsf/basic_data_analysis
Basic data science project: ETL, forecast and data visualization.
analysis data data-analysis data-science logistic-regression matplotlib matplotlib-pyplot numpy pandas powerbi python scikit-learn time-series time-series-analysis time-series-forecasting
Last synced: 05 Apr 2026
https://github.com/iamfrerot/userverse
creating api for data analysis
data data-analytics spring-boot users
Last synced: 23 Mar 2025
https://github.com/sourceduty/language_barriers
🔤 Language barriers between the world's 7,000 languages.
communication concept data idea info information language language-barrier language-barriers languages project research
Last synced: 11 Feb 2026
https://github.com/montanaz0r/suicide-rate-analysis
Testing a significance of the correlation between a suicide rate and a number of psychiatrists and psychologists working in the mental health sector
analysis correlation data data-analysis data-science jupyter-notebook jupyter-notebooks matplotlib numpy pandas psychology python python-3 seaborn statistics suicide-rate
Last synced: 20 Apr 2026
https://github.com/anjaliwork20/moodify
Mood-based music recommendation system that considers a user's emotional state to recommend songs, genres, artists and playlists using Machine learning
artificial-intelligence cnn-keras cnn-model convolutional-neural-networks data data-analysis data-science data-structures data-visualization database deep-learning machine-learning machine-learning-algorithms python recommended song songs
Last synced: 20 Apr 2026
https://github.com/fatihemres/Fruits
Fruit Details app by SwiftUI. Using data, models, animation and practically onboarding usage.
animations data models onboarding swift swiftui
Last synced: 31 Aug 2025
https://github.com/sebastianbrzustowicz/flight-quality-overview-microservice
Go + Docker. Microservice with parallel computations to convert raw vehicle flight data into overview raport with visualisation.
container control csv data docker drone flight go goroutines http microservice parallel-computing pdf quadcopter raport rms sse vehicle
Last synced: 10 May 2026
https://github.com/aneeshmurali-n/nlp-emotion-classification-in-text
Develop machine learning models to classify emotions in text samples.
bag-of-words data emotion-classification feature-extraction machine-learning naive-bayes natural-language-processing nlp nltk preprocessing python scikit-learn svm text-classification tf-idf tokenizer vectorizer
Last synced: 10 May 2026
https://github.com/datasqlsantosh/global-energy-consumption-renewable-generation-python-data-analysis-portfolio
This project focuses on analyzing global energy consumption patterns and trends in renewable energy generation using Python data analysis libraries such as Seaborn and NumPy. The analysis aims to explore energy consumption data from various regions worldwide and examine the contribution of renewable energy sources over time
data data-analysis data-visualization pandas seaborn
Last synced: 10 May 2026
https://github.com/zhukovanan/stepik_
The completed tasks of different data or computer science related fields on stepik
data statistical-learning statistics stepik-course
Last synced: 21 Apr 2026
https://github.com/nmelgar/birthday_sports_dataviz
We will analyze how the Matthew Effect has influenced in professional sports players.
analysis csv data data-analysis data-science data-visualization datavisualization dataviz probability research tableau
Last synced: 08 Jan 2026
https://github.com/fastpix/android-data-kaltura
This SDK enables seamless integration with Kaltura Player, offering advanced video analytics via the FastPix Dashboard
analytics android-sdk data fastpix kaltura kaltura-player metrics sdk video video-metrics
Last synced: 21 Apr 2026
https://github.com/amethyst-php/alias
alias amethyst amethyst-libary amethyst-package api data laravel library package
Last synced: 21 Apr 2026
https://github.com/zawaung7791/streamlit-data-viewer
Data previewer using streamlit, plotly and python
Last synced: 21 Apr 2026
https://github.com/notthestallion/pca__3d-and-from-scratch__principal-component-analysis
In this project, I will be implementing Principal Component Analysis (PCA) from scratch on an ecological footprint consummation database for countries and a three-dimensional scale using a movie database. The goal of this project is to gain a deeper understanding of PCA and to demonstrate its capabilities in exploring complex datasets.
data data-science database pca pca-analysis principal-component-analysis principal-component-analysis-pca principle-component-analysis
Last synced: 10 May 2026
https://github.com/schijioke-uche/data-analysis-with-python-an-spss-model
With this Python notebook algorithm, you can use SPSS Model notebook to build machine learning pipelines that you can use to iterate rapidly during the model building process in data analysis. Whether you're trying to find the right algorithm or experimenting with different ways of preparing your data, you can create reproducible research that's easily understood by any member of your team with Hypothesis definition.
anova cp4a cp4d cp4i cp4s data ibm ibm-cloud jeffrey-chijioke-uche jeffrey-solomon-chijioke-uche openshift python python3 redhat t-test
Last synced: 22 Apr 2026
https://github.com/hemangsharma/assignment-2---classification-models
Assignment 2 - Classification Models repository contains project for 36106 Machine Learning Algorithms and Applications
data datascience-machinelearning machine-learning ml
Last synced: 10 Jun 2026
https://github.com/tks18/xl-pq-handler
A Pythonic Power Query (.pq) File Manager for Excel & Power BI Automation
analytics automation data excel power-query powerbi python xlwings
Last synced: 20 Jan 2026
https://github.com/aminnairi/node-decode
Check that your data meet your expectations
check data decode expectations schema
Last synced: 22 Apr 2026
https://github.com/elcarrillo/structpy
StructPy is a Python-based command-line tool designed for academics and scientists to manage data projects effectively. It simplifies workflows by creating structured project directories, generating timestamped filenames, validating datasets, and backing up projects seamlessly.
command-line-tool data database file-structure organization python science-tool
Last synced: 24 Apr 2026
https://github.com/hruth-vik/sales-analysis-report
SalesScope is a powerful sales analytics dashboard that extracts insights, reveals trends, and drives strategy from raw data.
analytics data powerbi-report powerbi-visuals python
Last synced: 24 Apr 2026
https://github.com/badranalyst/data-cleaning-and-exploratory-data-analysis-project
This project uses SQL to clean and analyze a layoffs dataset. Data cleaning tasks include removing duplicates, standardizing values, and handling missing data. Exploratory analysis is performed to identify trends in layoffs across companies, industries, and time periods.
cleaning-data data database dataset mysql mysql-database sql
Last synced: 07 Apr 2025
https://github.com/marielachirinosr/cyclistic-data-analytics-project
This project explores user behavior within a fictional bike-sharing system, modeled after Cyclistic, operating in Chicago.
data data-visualization pandas powerbi-report powerbi-visuals python
Last synced: 24 Apr 2026
https://github.com/infinitode/crsd
A synthetic customer review sentiment dataset for sentiment analysis generated using different AI models.
ai data dataset datasets huggingface-datasets mit-license ml nlp open-source python sentiment sentiment-analysis sentiment-classification text-data
Last synced: 10 Jun 2026
https://github.com/miniql/miniql-json
A MiniQL query resolver that loads data from JSON files.
data json query query-language
Last synced: 11 May 2026
https://github.com/thinkphp/my-react-tictactoeai-app
App React Tic Tac Toe Component based on Artificial Intelligence
ai algoirthms data datastructures games javascript react
Last synced: 25 Apr 2026
https://github.com/marielachirinosr/bellabeat-wellness-data-trends
Analyzing smart device data for insights on user activity patterns to optimize interventions for better health outcomes.
data data-analysis data-visualization pandas python python3 tableau tableau-public
Last synced: 25 Apr 2026
https://github.com/sungchun12/demotron
CLI to delight real people with live demos
Last synced: 26 Feb 2025
https://github.com/marielachirinosr/hotel-data-analysis
Pandas & Matplotlib Learning Analysis. Repository featuring data analysis projects using Pandas and Matplotlib libraries
data data-analysis matplotlib pandas python
Last synced: 25 Apr 2026
https://github.com/anuraganalog/blog
Data Science Blog
anuraganalog blog data science
Last synced: 26 Apr 2026
https://github.com/bilgehangecici/datatypeconverter
Converting integer and floating numbers to appropriate bit-level representation.
data datatypeconverter java machine-level variables
Last synced: 30 Mar 2025
https://github.com/jigyasag18/multiple-disease-detection-app
This repository contains the implementation of a Multiple Disease Detection System, which employs advanced machine learning techniques for early detection and prediction of prevalent diseases, including diabetes, heart disease, and Parkinson's disease. The system utilizes a variety of patient health metrics such as demographics and medical history.
data datapreprocessing machine-learning machine-learning-algorithms machinelearningmodel prediction python streamlit streamlit-webapp
Last synced: 07 Jun 2026
https://github.com/rylan12/apscores
A quick way to visualize how the AP score distributions have changed from year to year.
advanced-placement analysis ap-exam data scores
Last synced: 19 Jun 2026
https://github.com/schoolsquirrel/holiday-data
Automatically updated holiday data for SchoolSquirrel
data holidays schoolsquirrel scripts vacation
Last synced: 03 Oct 2025
https://github.com/fatihemres/africa
Africa app by SwiftUI. Using AVFoundation, MapKit, data, models, animations, stickers.
animations avfoundation data mapkit models swift swift-animations swiftui
Last synced: 27 Apr 2026
https://github.com/debjyotisaha/tableau-projects-phase-2
Published interactive dashboards on Tableau Public, highlighting expertise in data visualization and storytelling through analyses of transportation patterns, sales trends, and demographic studies. These projects showcase the ability to transform complex datasets into actionable, intuitive visuals for decision-making.
dashboards data data-analysis data-visualisation tableau
Last synced: 26 Aug 2025
https://github.com/schenkd/tweetminer
Data Miner for Twitter Streaming API
data dataminer datamining java twitter twitter-api twitter4j
Last synced: 07 Jun 2026
https://github.com/santiagoenriquega/custom_database
Python-based database library for database management, indexing, transactions, and constraints, showcasing foundational database concepts.
data data-engineering database database-design python
Last synced: 27 Apr 2026
https://github.com/chocoscoding/fakeapi
A fake API with nice functionalities for testing
api data express fetch fetch-api frontend javascript js json json-api json-server nodejs testing typescript
Last synced: 09 Apr 2026
https://github.com/drkane/area-profiles
Produce UK area profiles based on various data sources
dash-plotly data flask statistics uk
Last synced: 27 Apr 2026
https://github.com/n4en/python-for-data-engineers
Python for data engineers
data data-engineer data-engineering dataengineering python python-notebooks python3 tutorial
Last synced: 26 Aug 2025
https://github.com/davorg/towerbridge
When is Tower Bridge lifting?
data hacktoberfest london perl web-scraping
Last synced: 29 Jun 2026
https://github.com/leonardomusini/mbe-growth-nexus-converter
Python tool to convert laboratory text files into NeXus files for Molecular Beam Epitaxy (MBE) data.
data data-engineering nexus python
Last synced: 28 Apr 2026
https://github.com/hoijui/osh-dir-std
Open Source Hardware directory standard(s)
data fchh interfacer-project-eu interfacer-project-eu-wp4-3 oseg specification standard
Last synced: 28 Apr 2026
https://github.com/franckalbinet/maris-crawlers
Automated data harvesting of MARIS data sources
automation data marine-radioactivity
Last synced: 25 Aug 2025
https://github.com/sagarkhese40/python-assginment
python assignment
assignment data data-science data-visualization python seaborn-plots
Last synced: 28 Apr 2026
https://github.com/sebastian-diaz-berdecia/analisis-popularidad-de-series-y-generos-de-series
Consultas SQL para el análisis de la popularidad de series y géneros series de la base de datos NetflixDB.
business-analytics bussiness-intelligence data data-analysis database mysql mysql-database sql
Last synced: 12 May 2026
https://github.com/darrendavy12/earthquake-events-and-risks-project---azure-data-pipeline---api-connection-
Earthquake Events and Risks Project - Azure Data Pipeline - API Connection
azure blob-storage cloud cloudstorage data databricks databricks-notebooks databricks-workspace dataengineer dataengineering microsoft python
Last synced: 28 Apr 2026