data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-01 00:07:35 UTC
- JSON Representation
https://github.com/alex0x4b/akutils
High-level Python library for recurring data manipulation (Pandas, Python data structure, API, file manipulation, etc.).
Last synced: 08 Mar 2026
https://github.com/kenanbek/youtube-data
YouTube stats data over YouTube Data API v3 using Python.
data python youtube youtube-api
Last synced: 13 May 2026
https://github.com/shubhamsoni98/analysis-with-sql
This project focuses on creating and managing a database for a music record company to perform various analyses on bands, albums, and songs. Using SQL, the goal is to create a structured relational database with relevant tables, insert necessary data, and perform queries that provide insights into the relationships between bands, albums, and songs.
analys analysis data data-science database dbms mysql mysqlworkbench project query schema sql
Last synced: 03 Jan 2026
https://github.com/mumtaz4118/nlp-course
Programming Assignments and Lectures for Stanford's CS 224: Natural Language Processing with Deep Learning
course data data-analysis data-analytics data-science data-visualization deep-learning education machine-learning natural-language-processing neural-network transfer-learning
Last synced: 24 Nov 2025
https://github.com/shahsuvarli/election-voters-data-analysis-pandas
Educational project analyzing Azerbaijan voter demographics with pandas, focusing on data cleaning, grouping, and visualization.
cleaning data grouping matplotlib numpy pandas python visualization
Last synced: 12 Apr 2026
https://github.com/pew-pew-team/hydrator
Hydrator kernel component
data deserializer dto hydrator kernel mapper mapping serializer structure
Last synced: 24 Mar 2025
https://github.com/arthurdanjou/studies
💼 This is the repository containing all my projects done during my studies in Python and R.
ai data data-science data-visualization jupyter jupyter-notebook ml python r
Last synced: 08 Apr 2025
https://github.com/realbxnnie/accountservice
A Simple DataStoreService wrapper with session backuping and session locking.
Last synced: 29 Jul 2025
https://github.com/infinitode/pyautoplot
PyAutoPlot is an open-source Python library designed to make dataset analysis much easier by generating helpful detailed plots using matplotlib. It automatically generates appropriate plots based on the dataset you feed it.
analysis automatic csv data dataset dataset-analysis generation matplotlib pandas plots plotting-in-python plotting-library python
Last synced: 16 Mar 2025
https://github.com/ot-code/sql-sabor-y-tradicion
A SQL-driven project that integrates menu and order data to reveal insights on dish performance, customer preferences, and spending trends. It informs pricing strategies, menu adjustments, and targeted promotions, ultimately enhancing the overall customer experience and driving business growth.
analytical-queries data data-aggregation data-analysis database-design join-queries mysql order-analytics relational-databases restaurant-data sql sql-script
Last synced: 08 Apr 2025
https://github.com/anuraganalog/twitter-data-analysis
My internship work during the 2020 summer
analysis data eda exploratory-data-analysis jupyter-notebook nlp spotle textblob twitter wordcloud
Last synced: 20 May 2026
https://github.com/aniruddha-biswas/shield-insurance-business-insights
Shield Insurance Business Insights
data data-visualization dataanalysis excel mysql powerbi sql
Last synced: 01 Apr 2025
https://github.com/mobinx/easymeet-js
EasyMeetjs is a robust and versatile TypeScript library that provides a solid foundation for building WebRTC-based applications. It simplifies the complexities of WebRTC, enabling developers to easily incorporate real-time communication features into their projects.From simple audio video calling to real time peer to peer file transfer , everything
data meeting react realtime screensharing streaming-video webrtc zoom
Last synced: 03 Jan 2026
https://github.com/yashaswitir28/yashaswitir28.github.io
This is my Portfolio Website
data data-analysis-python data-analyst data-cleaning data-science data-visualization excel html-css ms office365 portfolio-website powerbi python sql
Last synced: 29 May 2026
https://github.com/darkogamerz/dhis2heat
A Comprehensive data management and Health Equity Assessment and Analysis platform that fetches data from DHIS2, optimize, calculate, clean and visualize inequality data.
analytics data data-science dhis2 equality equity health heat inequality r shiny shinydashboard visualization
Last synced: 01 Apr 2025
https://github.com/eudesgccunha/automated-management-panel
Automated management panel using Power BI
data data-analysis data-visualization database excel powerbi
Last synced: 04 Feb 2026
https://github.com/suchi25sathavara/data-wrangling-with-r
Analyzing Road Accidents in Victoria, Australia
data r reporting rstudio wrangling-data
Last synced: 01 Apr 2025
https://github.com/suchi25sathavara/r-projects
R projects in Real world Scenerios for Data Analysis
data data-analysis datavisualization r
Last synced: 01 Apr 2025
https://github.com/wraith13/systematic-metasyntactic-variables
This is a list for that you can express the existence of different serieses when using metasyntax variables.
Last synced: 14 Jun 2025
https://github.com/trollmii/bunnybase
An efficient data managing system
bunnybase data data-science data-structures database datascience python python3
Last synced: 22 Apr 2025
https://github.com/rickyarians/practical-statistic-car-emission
Practical Statistic Project- Car Emission in Canada - 2022
data data-science dataanalysis r rmarkdown rpubs statistics
Last synced: 22 May 2026
https://github.com/karensaraimoralesmontiel/8-week-sql-challenge
Case Studies Solutions for the 8-Week-SQL-Challenge.
Last synced: 02 Jan 2026
https://github.com/nodamu/apache-beam-studies
Personal Apache Beam studies repository
apachebeam batch-processing data dataeng dataengineering datapipeline stream-processing
Last synced: 04 Nov 2025
https://github.com/makcymal/silvera
My researches on ML and statistics, optimization methods, CS algoritms and numerical methods
algorithms data data-structures machine-learning numerical-methods statistics
Last synced: 01 Apr 2025
https://github.com/inist-cnrs/ws-data
Modèles et données pour les web services
Last synced: 03 Sep 2025
https://github.com/shubhamsoni98/project_using_knn
This project applies the K-Nearest Neighbors (KNN) algorithm to predict iPhone purchases based on customer data. Using features like age, salary, and previous purchase behavior, the KNN model classifies customers into buyers and non-buyers.
anaconda analytics data data-science eda knn knn-classification machine-learning-algorithms predict project python scikit-learn tableau
Last synced: 03 Jan 2026
https://github.com/lightdash/quickstart-github
Instant analytics for Github
analytics business-intelligence data dbt github
Last synced: 14 Sep 2025
https://github.com/alextanhongpin/node-github-api
:page_with_curl: sample github api queries with nodejs for scraping purposes
Last synced: 06 May 2026
https://github.com/rickstaa/ai-compute-visualizer
A StreamLit-based web application to visualize GPU inventory and AI capabilities on the Livepeer network.
Last synced: 28 Jun 2025
https://github.com/karajmiglani-datascientist/karajmiglanifake-news-detection
FAKE_NEWS_PREDICTION
algorithms data data-science flask machine-learning probability-statistics python statistics structure
Last synced: 22 May 2026
https://github.com/ompreetham/data-structures
binary-search-tree c data data-structures datastructures graph linked-list list stack structures tree
Last synced: 25 Mar 2025
https://github.com/cody-scott/arclint
A flexible tool to validate and improve your data in ArcGIS using regex and other methods
arcgis arcgispro data lint regex validation
Last synced: 14 May 2025
https://github.com/thingston/extractor
Collection of PHP classes to extract data from HTML pages.
Last synced: 14 Jan 2026
https://github.com/jacoblincool/moodle-export
A streamlined library for retrieving data from Moodle.
Last synced: 07 May 2025
https://github.com/rishabhmathur06/data_analysis-netflix
data data-analytics data-science matplotlib-pyplot numpy pandas python seaborn
Last synced: 12 Apr 2026
https://github.com/srvanderplas/statistical_atlas
Framed Charts and the Statistical Atlas of 1870
census data ggplot2 graphics r statistics visualization
Last synced: 29 May 2026
https://github.com/interzoid/typescript-examples
Provides TypeScript examples for consuming several of the Cloud APIs available from Interzoid, including company name matching, individual name matching, weather, page performance, email validation, currency rates/FOREX, and global telephone information.
angular api cloud data database matching nodejs quality typescript
Last synced: 12 Jan 2026
https://github.com/powersyang/visualization
data visualization templates 数据可视化模板
Last synced: 24 Mar 2025
https://github.com/nadahamdy217/movies-data-etl-using-python-gcp
Developed a comprehensive ETL pipeline for movie data using Python, Docker, and a GCP Pub/Sub emulator. Successfully processed and published the data in a local Docker environment, showcasing advanced data engineering skills.
analytics data data-engineering data-ingestion data-preparation data-preprocessing data-processing data-project docker etl etl-pipeline gcp matplotlib matplotlib-pyplot numpy pandas pubsub python scipy seaborn
Last synced: 06 Jan 2026
https://github.com/igor-starostenko/sabre
Slice your files like a champ with **sabre**
Last synced: 28 Mar 2025
https://github.com/lut-ful/e-commerce-sales-report
This dashboard provides a visual analysis of e-commerce sales data
data data-analytics data-science data-visualization power-bi statics
Last synced: 28 Jun 2025
https://github.com/j-hagedorn/locals
:globe_with_meridians: A collection of tidied, neighborhood-level public datasets
address-dataset census-data census-tract data neighborhood social-sciences
Last synced: 03 Feb 2026
https://github.com/jneidel/nationalities
Dataset of 100 common nationalities
data dataset json nationalities nationality opendata
Last synced: 25 Mar 2025
https://github.com/cosmos-loops/cosmos-data
Cosmos.Data is a inline project of COSMOS LOOPS PROGRAMME to provide several SQL-Query, RMDB/ORM and No-SQL components' extensions.
connection-pool data mysql mysqlconnector oracle postgresql sqlite sqlkata sqlserver transaction uow
Last synced: 12 Apr 2026
https://github.com/eva-kaushik/data-clustering
Clustering Accelerators for hard and soft clustering, including implementations of K-means, K-medoids, hierarchical clustering, fuzzy C-means, and Gaussian mixture models. Demonstrates text clustering using both hard and soft clustering algorithms.
clustering clustering-algorithm data datascience machine-learning-algorithms
Last synced: 09 Apr 2025
https://github.com/denisecase/620-mod6-web-scraping
Notes on how to get started scraping content from the web
beautifulsoup4 data mining python
Last synced: 11 Apr 2025
https://github.com/cassandrajm/reddit-dashboard
INTERACTIVE DASHBOARD: Analyzing Political Discourse on Reddit: A Multi-Faceted NLP Approach to Toxicity, Bias, and Political Stance
capstone data data-analysis data-science politics python reddit
Last synced: 09 Apr 2025
https://github.com/plandes/datdesc
Describe and optimize data
data hyperparameter-optimization hyperparameter-tuning latex table
Last synced: 04 Sep 2025
https://github.com/yash-chauhan-dev/sf_analytics
Business teams often rely on data analysts to extract insights using SQL. This tool eliminates that dependency by bridging the gap between humans and data using AI.
aiml analytics data dbt langchain llm python snowflake streamlit
Last synced: 07 May 2026
https://github.com/rohitblaze10/netflix_analysis_using_tableau
The Netflix dashboard in Tableau provides a professional and visually captivating interface for users to explore a vast collection of TV shows and series. With seamless navigation and interactive filters, users can easily personalize their recommendations based on release year, genre, duration, and rating.
data data-analysis data-science data-visualization netflix tableau
Last synced: 04 Feb 2026
https://github.com/astridlyre/offhand
A Random Data Generator Library for JavaScript.
data generator javascript library random typescript
Last synced: 20 May 2026
https://github.com/badawy403/egy.list
A Node.js package providing access to official Egyptian data including universities, governorates, cities, and more. This package makes it easy for developers to integrate Egypt-specific information into their applications.
city data egypt javascript nodejs npm package
Last synced: 08 Mar 2026
https://github.com/parmsam/rweekly.data
R package containing data on Rweekly posts
Last synced: 21 May 2026
https://github.com/canadaluke888/terminaltablebuilder
Build and edit tabular data all from the terminal.
cli data data-manipulation excel json ods rich spreadsheets sqlite3 tables
Last synced: 20 Apr 2026
https://github.com/gagolews/clustering-data-v0
Datasets for Clustering [DEPRECATED – A NEW VERSION IS AVAILABLE]
clustering data dataset machine-learning
Last synced: 15 Sep 2025
https://github.com/muhammed-fazal/student-success-and-early-intervention-analytics-system
To consolidate scattered student performance records into a unified Data Warehouse in SQL Server. Engineer an Interactive Power BI dashboards that visualize academic trends, identifying student performance and implement predictive analytics.
analysis analytics dashboard data data-analysis data-engineering data-science data-visualization database etl etl-pipeline power-bi powerbi python sql sql-server
Last synced: 29 May 2026
https://github.com/renebentes/2808
Curso 2808 - Fundamentos do Entity Framework
Last synced: 27 Jun 2025
https://github.com/kahlery/my-jupyter-notebook-projects
🐊 collection of my data science analysis, actually I store most of my data science projects in my google drive because of google colab
Last synced: 12 Apr 2026
https://github.com/vojtech-dobes/php-conformance
constraint data input normalization php sanitization schema validation
Last synced: 23 Jul 2025
https://github.com/afnanenayet/ds-a
Some interview prep I've been doing. This repo is reimplementations of algorithms and data structures in Python3
algorithms data interview prep python structures
Last synced: 05 Apr 2025
https://github.com/amethyst-php/user
amethyst amethyst-package api data laravel user
Last synced: 12 Apr 2026
https://github.com/nsandoya/python_scrp_project
This is a tool specially made for Dipaso ecommerce website. You can extract data from there, analyze it and see keywords, brands, and categories frecuency, prices distribution and other market tendencies as well —all in a group of friendly stadistic tables and graphics (exported from a Jupyter notebook) :)
beautifulsoup4 data data-analysis jupyter-notebook pandas python3
Last synced: 28 Apr 2026
https://github.com/ournet/topics-data
Ournet topics data package
data ournet storage topic topics topics-data topics-storage
Last synced: 12 Jun 2025
https://github.com/jerboaburrow/uk-counties-and-unitary-authorities-may-2023-geojson
UK "Counties" Extracted from Office for National Statistics data
Last synced: 29 Mar 2025
https://github.com/rishikesh-jadhav/track_deep_learning
Data collected from the Udacity simulator comprising RGB images with steering and throttle annotations for each frame, specifically gathered for behavioral cloning purposes.
data datacollection udacity-self-driving-car
Last synced: 03 Jan 2026
https://github.com/nadahamdy217/harvest-gaurd-plant-disease-detection-web-application
web application that help people grow healthy plants
classification-confidential cnn cnn-classification css data data-science detection html javascript keras machine-learning model plant-disease-detection supervised-learning tensorflow web-application
Last synced: 13 Apr 2026
https://github.com/iliyasalve/cyclistic_case_study
Analysis of the Bike-Sharing System for the following question: "How do annual members and casual riders use Cyclistic bikes differently?"
bike-sharing data data-analysis data-visualisation r
Last synced: 06 Apr 2025
https://github.com/stefanpietrusky/factsv2
Repository for the article in the online magazine TDS.
ai arxiv-papers beautifulsoup data flask-application gensim llama matplotlib ollama plotly pyldavis python selenium webdriver
Last synced: 09 Apr 2025
https://github.com/kiing-dom/data-structures-algorithms
data structures and algorithms
algorithms-and-data-structures data data-structures java leetcode
Last synced: 09 Aug 2025
https://github.com/gabboraron/datacamp_projects
Here you can find my DataCamp Projects
data datacamp datacamp-projects
Last synced: 14 Jun 2026
https://github.com/etmendz/mendz.data.oracle
Provides a generic Mendz.Data-aware context for ADO.Net-compatible access to Oracle databases.
ado-net context data database datasettings mendz oracle
Last synced: 13 Apr 2026
https://github.com/unknownsoup/budget_tracker
A personal budget tracker to build my knowledge of working with databases and data analysis. In this case using SQL and python for the analysis.
data data-science databases python sql
Last synced: 26 Jan 2026
https://github.com/shadmanshaikh/data-analysis-and-ml-work
All of my work in Data Analysis and Machine learning
analytics artificial-intelligence data machine-learning
Last synced: 05 Jul 2025
https://github.com/rorylshanks/devdb-client
This is the repository for the official command line client for DevDB (https://devdb.cloud)
cloud data database-management development
Last synced: 29 May 2026
https://github.com/jun-labs/algorithm
📝 자료구조, 알고리즘 학습 저장소.
algorithm data data-structures leetcode problem-solving programmers ps structure
Last synced: 14 Mar 2025
https://github.com/mustafaozvardar/selenium-eksisozluk
This project is a simple web scraper built with Python using Selenium. It extracts and prints the content of popular entries from a specific EksiSozluk page.
data python selenium selenium-python
Last synced: 29 Apr 2026
https://github.com/raghavendranhp/youtube_data_harvesting
The "YouTube Data Analyzer" is a versatile tool for businesses and content creators, enabling them to gather, analyze, and harness valuable insights from multiple YouTube channels. With streamlined data collection, storage in MongoDB, migration to SQL, and a user-friendly Streamlit interface, it empowers users to make data-driven decisions
apiintegration data datacollection eda googleapi googleapiclient matplotlib mongodb mysql mysqlconnector numpy oops pandas pymongo python pythonoops sql sqlalchemy streamlit youtube-api
Last synced: 13 Apr 2026
https://github.com/amethyst-php/token
amethyst amethyst-package api data laravel token
Last synced: 21 May 2026
https://github.com/amethyst-php/product
An item that is made to be sold or bought
amethyst amethyst-package api data laravel product
Last synced: 21 May 2026
https://github.com/soenkekluth/micromitter
minimal and performant event emitter / dispatcher
data dispatch dispatcher emit emitter event eventdriven handler on send trigger
Last synced: 02 Nov 2025
https://github.com/sakshamarora07/blinkit-sales-report-power-bi
This dashboard provides Blinkit with insights to optimize its grocery delivery operations and understand customer preferences. It evaluates sales trends, outlet performance, and item categories to identify key areas for improvement. The interactive visuals allow detailed exploration of sales distribution, customer ratings, and product popularity.
data data-science dataanalytics datavisualization excel powerbi sql
Last synced: 08 Jan 2026
https://github.com/living-with-machines/zoonyper
Code to make it easy to import and process Zooniverse annotations and their metadata in Python/Jupyter Notebooks
crowdsourcing data data-processing data-science python zooniverse
Last synced: 04 Jul 2025
https://github.com/kashirin-alex/thither.direct-onamove
an android skeleton-example application for using data from Thither.Direct platform on mobile applications
android-application data data-analysis data-structures data-visualization mobile-development mobility query research-data-management
Last synced: 27 Apr 2026
https://github.com/danpoynor/data-pagination-and-filtering-project
Data pagination exercise using 'vanilla' JavaScript. This script consumes a JSON array containing any number of objects and adds buttons to a page that users can click to navigate to different pages of data.
data javascript json navigation pagination vanilla-javascript
Last synced: 20 Apr 2026
https://github.com/prishabhanot/facial_recognition_pca
A face recognition system using Principal Component Analysis (PCA) for dimensionality reduction and a Support Vector Machine (SVM) classifier for classification. PCA extracts essential features (eigenfaces) from facial images, significantly reducing computational complexity while retaining critical information for accurate recognition.
data eigenfaces facial-recognition pca python reducing-computational-complexity reducing-data-dimensions svm-classifier
Last synced: 01 Mar 2025
https://github.com/stdlib-js/ndarray-vector-uint32
Create an unsigned 32-bit integer vector (i.e., a one-dimensional ndarray).
constructor ctor data javascript ndarray node node-js nodejs stdlib structure types uint32 vec vector
Last synced: 25 Apr 2026