data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-02 00:07:45 UTC
- JSON Representation
https://github.com/ahmad-mtr/prjkt_exam_schedule_test
I hate scrolling in a list of 300+ courses of my Uni exam schedule, so I'm creating this. this's a test btw :)
Last synced: 11 Apr 2025
https://github.com/octoenergy/tentaclio-databricks
Module to give tentaclio support to databricks
Last synced: 24 Jun 2025
https://github.com/ezeparziale/analisis-uso-bicicletas-caba
:biking_man: Análisis de como afecto la pandemia el uso de las bicicletas en CABA.
data data-science data-visualization
Last synced: 14 Mar 2025
https://github.com/ezeparziale/analisis-data-delitos
:gun: Analsis de delitos de CABA
Last synced: 14 Mar 2025
https://github.com/official-imvoiid/multifetch
A high-performance web scraper for bulk image and GIF extraction from reliable sources — built for AI/ML data pipelines and large-scale media collection
aiml data dataset gifscraper imagescraper python pythontool tools webscraper windows
Last synced: 19 May 2026
https://github.com/1sumer/mass-mail-automation
Mass Emailer is a Python-based application designed to send bulk emails efficiently using an SMTP server. Leveraging the power of the Tkinter library for the graphical user interface (GUI), this tool provides a user-friendly platform for managing and dispatching large volumes of emails with ease.
data oops-in-python python smtp-server tkinter
Last synced: 20 Aug 2025
https://github.com/salman-khan-mohammed/youtube-data-analysis-sentiment-analysis
Analyzing the YouTube Data
data data-visualization plotly-express scrapping-data sentiment-analysis
Last synced: 26 Mar 2025
https://github.com/yassin522/health-insurance-cross-sell-prediction
Prediction of Vehicles Health Insurance
data data-analysis data-science machine-learning plotly python
Last synced: 15 May 2026
https://github.com/ahabdel/amazon-web-scraper
Amazon Web Scraper to scrape pricing adjustments and provide updates on a day to day basis
Last synced: 29 Oct 2025
https://github.com/kingabzpro/5-airflow-alternatives-for-data-orchestration-tutorial
Code examples of Luigi, Prefect, Kedro, Dagster, and MageAI
dagster data data-orchestration kedro luigi mageai prefect
Last synced: 18 Apr 2026
https://github.com/jorgermduarte/mongo-replication
cluster data mongo mongodb mongoose replica replica-set replication
Last synced: 03 Mar 2025
https://github.com/randomgamingdev/randomgamingdev.github.io.data
The data for RandomGamingDev.github.io (feel free to build your own website off of mine :D)
blog custom data projects projects-list
Last synced: 02 Jan 2026
https://github.com/maulanakavaldo/tri-hita-karana
Project Tri Hita Karana - Future Knowledge G20 Bali. DTS Kominfo x Binar Academy.
bali data data-science g20 science
Last synced: 02 Mar 2025
https://github.com/aliasgarsogiawala/dashboards
Power BI dashboards , each folder contains a pbix file and a pdf file with explanation of the dashboard
analysis dashboards data data-visualization powerbi
Last synced: 12 Feb 2026
https://github.com/colonelbundy/martenmigrator
Data migrator for Marten
data database documentdb marten migration postgres
Last synced: 04 May 2026
https://github.com/gui-sitton/prepaid
In this project I work as an analyst for the telecommunications company Megaline. The company offers its customers prepaid plans, Surf and Ultimate. The sales department wants to know which plans bring in the most revenue in order to adjust the advertising budget
data data-analysis data-analysis-python data-science data-visualization python
Last synced: 22 May 2026
https://github.com/stdlib-js/dstructs-stack
Stack.
collection data data-structure data-structures first-out javascript last-in lifo node node-js nodejs stack stdlib structure
Last synced: 14 May 2026
https://github.com/octoenergy/tentaclio-gdrive
A python project containing all the dependencies for the gdrive tentaclio schema
Last synced: 24 Jun 2025
https://github.com/amethyst-php/office
amethyst amethyst-package api data laravel office
Last synced: 17 May 2026
https://github.com/Greatwoman23/Sentiment-Analysis-on-Amazon-Products-Review
Sentiment_Analysis_On_Amazon_Product_Review
analysis dashboard-application data data-science datascientistproject machine-learning publication python remotejob
Last synced: 04 May 2025
https://github.com/domarps/grad-project-reports
Write-ups of a few key semester-long projects I have worked during my Masters
circuit data deeplearning graph-algorithms matlab question-answering
Last synced: 26 Mar 2025
https://github.com/huspacy/huspacy-resources
Resources for building and evaluating huspacy
Last synced: 21 Mar 2025
https://github.com/cemoktra/data_series
time series handling
data lazy-evaluation time-series
Last synced: 29 Oct 2025
https://github.com/hivesolutions/crossline
Simple event pipping and storing infra-structure
Last synced: 15 May 2026
https://github.com/GAMELEIRA/studies-database
Esse repositório têm como objetivo alocar todo e qualquer script para aprender e praticar gerenciamento de banco de dados SQL e NoSQL. Nesse projeto, serão consolidados os principais fundamentos e princípios, além da prática de exercícios e desenvolvimento de projetos.
data database mongodb mssql mysql nosql sql
Last synced: 03 May 2025
https://github.com/greatwoman23/sentiment-analysis-on-amazon-products-review
Sentiment_Analysis_On_Amazon_Product_Review
analysis dashboard-application data data-science datascientistproject machine-learning publication python remotejob
Last synced: 17 May 2026
https://github.com/md-emranhossen/leetcode-practice
This repository stores my solutions to LeetCode problems, organized by problem number and title.
cpp data datastructures-algorithms leetcode-solutions
Last synced: 26 Jun 2025
https://github.com/topunix/hackerrank
:green_book: HackerRank Solutions
algorithm-challenges algorithms algorithms-and-data-structures data data-structures hackerrank hackerrank-algorithms-solutions hackerrank-challenges hackerrank-python hackerrank-solutions python
Last synced: 17 May 2026
https://github.com/nonsignificantp/enfermedades-inmunoprevenibles
Analisis sobre el efecto de las vacunas y la incidencia de casos de enfermedades inmunoprevenibles en la Ciudad de Buenos Aires entre los años 1995 y 2016
a analysis argentina buenosaires data hepatitis science vaccination
Last synced: 18 Jun 2026
https://github.com/engineeringmadness/gaming-ai-analytics
Using Databricks to analyze game reviews from Steam web store
data databricks llama pyspark semantic-layer
Last synced: 15 May 2026
https://github.com/eslamdyab21/apara-data-gui
Custom application for Apara's data wrangling scripts, Technologies used are Qt-designer, PyQt5 for the GUI and Pandas, Numpy for the data work.
csv data data-analysis data-wrangling gui pandas pyqt5-desktop-application qt5-gui
Last synced: 17 May 2026
https://github.com/amethyst-php/value
amethyst amethyst-package api data laravel value
Last synced: 17 May 2026
https://github.com/prernarohra/todo-webapp
Simple Todo App for practice.
axios css data fastapi html json python typescript
Last synced: 06 Apr 2026
https://github.com/peter7775/mysql-graph-visualizer
SQL database conversion and visualisation as graph / in development
analytics analyzer conversion converter data database go golang graph graphql mysql neo4j neo4j-graph refactoring sql visualization
Last synced: 14 Mar 2025
https://github.com/jor-/measurements
Python functions to handle, statistically analyze and plot measurement data.
Last synced: 17 Mar 2025
https://github.com/ayushman0511/data-warehouse-project1
A comprehensive guide to building a data warehouse with SQL Server, including ETL processes, data modeling, and analytics.
data data-ana data-anal data-cleaning data-enginee data-lakehou datalake datasci dataware datawarehouse datawarehousi etl etl-job etl-pipeline medallion sql sql-quer sql-query sql-server sqlserver
Last synced: 26 Jun 2025
https://github.com/majorcluster/clj-data-adapter
A Clojure library designed to convert data
Last synced: 12 Jul 2025
https://github.com/woctezuma/humble-choice-leak
Retrieve leaks for Humble Choice.
data datamining humble-bundle humble-bundle-games humble-bundle-leak humble-choice humble-choice-leak humblebundle humblebundle-leak leak leaks steam steam-games
Last synced: 27 Mar 2025
https://github.com/dsietz/daas-workshop
Workshop for building a Data as a Service platform using the DaaS SDK.
archconf daas daas-pattern data dataprivacy nfjs rust rust-lang
Last synced: 20 May 2026
https://github.com/vatshayan/ip-address-data-analysis-
Extraction of 100's of IP Address and using Machine Learning algorithm for detecting threats
data data-analysis data-science data-visualization dataset ip ipconfig ipv4-address jupyter-notebook machine-learning machine-learning-algorithms supervised-machine-learning unsupervised-learning
Last synced: 15 Jul 2025
https://github.com/bcodmo/workshop_bios_oceanographic_data
Repository holding lesson on Data Management Basics. See webpage for rendered view: https://bcodmo.github.io/workshop_bios_oceanographic_data/
bco-dmo data datamanagement fair workshop
Last synced: 08 Apr 2026
https://github.com/nabilaagha/chest-x-ray-medical-diagnosis-using-deep-learning
This project uses deep learning to classify chest X-ray images for disease detection. It involves data preprocessing, pre-trained CNN models, and the ChestX-ray8 dataset to enhance medical diagnostics with AI.
computer-vision data data-processing deep-learning juypter-notebook medical-image-processing x-ray-images
Last synced: 15 Dec 2025
https://github.com/jigyasag18/orders-sales-analysis-report-using-power-bi
This repository analyzes and visualizes office supply sales data to improve profitability. It examines sales performance by various factors, using charts to provide insights and actionable recommendations for sales optimization, market research, and product mix.
data dataanalysis dataanalytics dataset powerbi powerbi-dashboards powerbi-report powerbi-reports powerbi-visuals powerbidashboard
Last synced: 18 Feb 2026
https://github.com/stdlib-js/array-base-assert-any-has-property
Test whether at least one element in a provided array has a specified property, either own or inherited.
any array assert data generic has javascript node node-js nodejs prop property stdlib structure test types validate
Last synced: 07 May 2025
https://github.com/codehub001/ai-driven-automation-for-data-quality-monitoring-in-cloud-data-warehouses
This project focuses on leveraging AI to automate data quality monitoring in cloud data warehouses. Traditional data validation methods often require manual intervention and fail to scale with increasing data complexity. By integrating machine learning models, this approach enables real-time anomaly detection, automated data cleansing.
csv-export csv-import dashboard data datacleaning lib modeltraining python testing-library visualization
Last synced: 13 May 2025
https://github.com/theanujsinha01/data-analytics-portal-
Data Analytics Portal Built a web-based data analytics tool using Streamlit, Pandas, and Plotly. Supported CSV and Excel uploads (up to 200MB) for data exploration. Features included statistical summaries, group-by aggregation, and frequency counts. Integrated interactive charts (bar, pie, line, scatter) for visual insights. This tool is live now.
Last synced: 28 Apr 2026
https://github.com/ubeydgur/car-price-prediction
Predicting the price of a used car
ai artificial-intelligence data data-science data-visualization machine-learning machine-learning-algorithms
Last synced: 08 Jun 2026
https://github.com/wolfchamane/amjs-data-types
Data types for your OOP javascript project
cjs data javascript modules nodejs oop types
Last synced: 20 May 2026
https://github.com/thesfinox/fit-the-data
Data analysis using Wolfram Mathematica
analysis data data-analysis lab mathematica wolfram wolfram-mathematica
Last synced: 24 Jan 2026
https://github.com/circlexo/circlexo
Open-source project to seamlessly integrate and manage your business workflow, connecting Jira, GitHub, Discord, Stripe, RevenueCat, and OpenAI all in one intuitive platform.
bussiness-intelligence data discord-bot forge github google jira kpis ploi revenuecat stripe vapor
Last synced: 20 May 2026
https://github.com/shimul-zahan/all-practices-tukitaki
This is repository for all the practice tasks or learning new things. Cause environment are setup and no need to setup a new project or environments.
data data-science datapreprocessing deep-learning machine-learning neural-network practice python visualization
Last synced: 12 Jan 2026
https://github.com/furkankarakuz/turkey_earthquake
This project focuses on analyzing and visualizing earthquake data specific to Turkey. It aims to provide insightful visualizations on topics such as earthquake frequency, location, and magnitude using data obtained from Boğaziçi University Kandilli Observatory and Earthquake Research Institute.
api data data-visualization earthquake python python3 request streamlit turkey turkey-earthquake
Last synced: 20 May 2026
https://github.com/patrikcze/meshtatic_data
Meshtastic Data Transfer - Trying some stupid thing, like transferring files over LORA network.
data meshtastic meshtastic-python
Last synced: 03 Feb 2026
https://github.com/heshamalsaqqaf2/python-projects
Beginner Level Python Projects
Last synced: 22 Jul 2025
https://github.com/jorgeatgu/dataset-elecciones-28a
Datasets generados a partir del dataset de elecciones generales de El País
28a data elecciones2019 elections spain
Last synced: 16 May 2026
https://github.com/clagiordano/marketplaces-data-export
LIbrary that share the same interface and provide adapters for online marketplaces services
adapter amazon api clagiordano data ebay ebay-api export marketplaces mws mws-api rest soap
Last synced: 22 Mar 2025
https://github.com/cmutel/jester
Import data from the olca-schema JSON-LD format into the HESTIA JSON-LD schema
agriculture data json-ld life-cycle-assessment ontology
Last synced: 26 Jul 2025
https://github.com/krescruz/pegaso-data
Utilerías para el analisis de datos del Proveedor de Certificación de Factura Pegaso
Last synced: 29 Apr 2026
https://github.com/amethyst-php/shipment
amethyst amethyst-package api data laravel shipment
Last synced: 20 May 2026
https://github.com/jodus-melodus/queue
Simple Queue
data datastructures linear queue queues
Last synced: 10 Sep 2025
https://github.com/chocolateboy/corrigenda
Corrections, addenda, and deltas for data that's wrong on the Internet
addenda api corrections corrigenda data json json-data
Last synced: 27 Mar 2025
https://github.com/andygeiss/pipeline-example
This is a basic example of using a pipeline in data science.
data data-pipeline data-science example go golang iris-dataset pipeline protobuf
Last synced: 17 Jul 2025
https://github.com/johndelatto/-universities-to-pursue-a-master-s-degree-in-machine-learning
Best Master’s Programs in Machine Learning (ML) for 2021 These are the best universities to pursue a master’s degree in machine learning, with research rankings in AI and machine learning
ai api data education project school
Last synced: 17 Jun 2025
https://github.com/amethyst-php/setting
Give the user the ability to configure his own settings
amethyst amethyst-package api data laravel setting
Last synced: 19 May 2026
https://github.com/pyrustic/litedao
Intuitive interaction with SQLite database
auto-init dao data database database-access library lightweight pyrustic python sql sqlite
Last synced: 09 May 2026
https://github.com/jigyasag18/fake-news-prediction-project
The Fake News Prediction App Repository offers a machine learning project that focuses on identifying the authenticity of news articles as fake or real. It uses a dataset of 20,000 articles and employs methods such as TF-IDF vectorization and the Porter stemming algorithm, achieving around 97% classification accuracy with logistic regression model.
data datapreprocessing logistic-regression machine-learning machine-learning-algorithms numpy pandas prediction stemming vectorization
Last synced: 08 Jun 2026
https://github.com/campiohe/geomask
A very simple lib for creating geometric masks from spatial data using regular grids.
Last synced: 30 Dec 2025
https://gitlab.com/sean-c/pdf_rules
Turn PDFs into CSVs by defining rules
Data Cleaning automation data data parsing
Last synced: 14 Apr 2025
https://github.com/rameshaditya/dynamic-hybrid-data-grid
Facilitates faster read-and-write of large ordered collections of data.
algorithms data data-structures storage
Last synced: 30 Jun 2026
https://github.com/vijaykumar1303/sales-data-analysis-and-dashboard-development
To analyze sales data to uncover insights into sales performance, trends, and patterns, and to develop an interactive dashboard that provides a comprehensive view of sales metrics and KPIs.
data dataanalysis datacleaning datavisualisation dax-query powerbi powerquery sql sqldataanalysis
Last synced: 11 Feb 2026
https://github.com/pyfig/s21_data-science-bootcamp
School21 Bootcamp Data Science
data data-science numpy pandas python school21
Last synced: 26 Jun 2025
https://github.com/amethyst-php/price
Define prices and attach them to any model
amethyst amethyst-package api data laravel price
Last synced: 17 May 2026
https://github.com/eloyhere/semantic-java
Semantic-Java is a modern, maven Java stream processing framework with zero dependencies. It elegantly blends the fluency of Java Streams, the laziness of JavaScript generators, and intelligent index-based control inspired by database indexing — perfect for time-series, event streams, and high-performance data pipelines as a maven pendency.
data functional functional-programming java pipeline stream
Last synced: 07 Apr 2026
https://github.com/ahmad-ali-rafique/random-forest-classifier-modeling
Detailed exploration of random forest classifiers, including data cleaning, model building, and performance evaluation on various datasets.
classification classification-models data dataanalytics datamodel dataset model-checking models random-forest random-forest-classifier
Last synced: 01 Jun 2026
https://github.com/shailu2004/azure_big_data_project
This project demonstrates a comprehensive Azure Data Engineering workflow using multiple Azure resources to process and analyze an e-commerce dataset. The dataset consists of 8 files containing details about customers, payments, orders, and other key information
ai azure cloud data data-engineering
Last synced: 08 Jul 2025
https://github.com/ahmad-ali-rafique/random-forest-regressor-modeling
Detailed exploration of random forest regressors, including data cleaning, model building, and performance evaluation on various datasets.
data dataanalytics datacleaning evaluation-metrics modeling random-forest random-forest-regression regression regression-analysis
Last synced: 05 Mar 2025
https://github.com/vaxdata22/foresight-pharmaceutical
This is a Data Analysis case study done on the Foresight Pharmaceutical Company dataset.
actionable-insights business-analytics business-intelligence data data-analytics data-cleaning data-mining data-visualization data-wrangling exploratory-data-analysis spreadsheets sql sql-server sql-server-management-studio statistical-analysis t-sql transact-sql
Last synced: 05 Mar 2025
https://github.com/danielrosehill/ghg-ebitda-correlations
Streamlit data visualisation examining correlation between emissions & profitability
data sustainability sustainability-data
Last synced: 14 Mar 2025
https://github.com/theduardomaciel/cc-pe
Conteúdos, scripts em R e datasets utilizados durante a matéria de Probabilidade e Estatística.
Last synced: 27 Mar 2025
https://github.com/ahmad-ali-rafique/electricity-consumption-analysis-household-dataset
This repository contains analysis and predictive modeling of household electricity consumption using Python. It includes data cleaning, exploratory data analysis (EDA), time series forecasting (ARIMA, SARIMA, LSTM), and model evaluation to optimize energy usage.
arima-forecasting artificial-intelligence artificial-neural-networks data data-science dataanalytics datacleaning evaluation-metrics exploratory-data-analysis long-short-term-memory lstmmodel modeling time-series timeseries-forecasting
Last synced: 23 Jun 2025
https://github.com/truongnhatbui/automatidata
Automatidata
data data-analysis data-science data-visualization python tableau
Last synced: 08 Jul 2025
https://github.com/ethenkem/pygraphsurvey
A python base web app that provide graphical analysis on data collected from surveys and the system has its on built in form fiiling where admin can set question and sent a link for the forms to be filled and then the system provide anylysis on the collected data. Form feature include selection options, range values file inputs etc
Last synced: 12 Jan 2026
https://github.com/amethyst-php/data-view
amethyst amethyst-package api data data-view laravel
Last synced: 19 May 2026
https://github.com/stdlib-js/dstructs-circular-buffer
Circular buffer.
buffer circular collection cyclic data data-structure data-structures fifo first-in-first-out javascript node node-js nodejs queue ring stdlib structure
Last synced: 20 May 2026
https://github.com/ournet/embed-providers-data
Embed provides data
data embed embed-providers json providers
Last synced: 03 May 2026
https://github.com/gui-sitton/carsells
In this project I am an analyst on the Crankshaft List. Hundreds of free vehicle advertisements are published on the site every day. I need to study the data collected over the last few years and determine which factors influence the price of a vehicle.
data data-analysis data-analysis-python data-science data-visualization python
Last synced: 20 May 2026
https://github.com/amethyst-php/contact
amethyst amethyst-package api contact data laravel
Last synced: 20 May 2026
https://github.com/avijeetpandey/quizzez
Implementation of quizzez application using kotlin
Last synced: 20 May 2026
https://github.com/amethyst-php/shipment-zone
amethyst amethyst-package api data laravel shipment-zone
Last synced: 20 May 2026
https://github.com/prcharan592/olympic-insights-historical-data-analytics-in-r
This project analyzes 120 years of Olympic history (1896–2016), uncovering trends and insights from the data
data data-analytics data-science data-visualization kaggle r-programming
Last synced: 03 Apr 2025