data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-03 00:07:49 UTC
- JSON Representation
https://github.com/potlock/data
data research for other funding mechanisms and PotLock related data.
data flipsidecrypto near-protocol potlock
Last synced: 07 Mar 2026
https://github.com/rllyhz/mini-data-center
This repo is to fulfill my internship assignment at the Office of Communication and Information (Kominfo), Balai Kota, Semarang, Indonesia
chartjs country-information data information-visualization laravel laravel-application
Last synced: 06 Nov 2025
https://github.com/nimomach/amazon-sales-data
This is a small dataset containing Amazon sales data analysis for few regions.
dashboards data data-analysis data-visualization
Last synced: 08 Mar 2026
https://github.com/umstek/sampler
Generate elaborate random data instantly.
data faker javascript json sample
Last synced: 20 Jul 2025
https://github.com/mawiegand/automatic-point-label-placement-data
Test instances for the automatic point label placement problem.
data datastructures generator javascript labeling problem ruby
Last synced: 16 May 2026
https://github.com/jeugregg/deeplearningpicturedogs
Classify dogs pictures by Deep Learning CNN neural networks
classez-des-images cnn-keras data data-science ipynb neural-network vision
Last synced: 24 Jul 2025
https://github.com/agusk/ilmudata-book-excel-analytics
Hallo Microsoft Excel: Mastering Data Analytics
analytics data data-analytics excel power-query-editor
Last synced: 06 Jan 2026
https://github.com/topunix/hackerrank
:green_book: HackerRank Solutions
algorithm-challenges algorithms algorithms-and-data-structures data data-structures hackerrank hackerrank-algorithms-solutions hackerrank-challenges hackerrank-python hackerrank-solutions python
Last synced: 17 May 2026
https://github.com/rosa-lpz/data-analysis-handbook
Data Analysis base knowledge and practical applications
data data-analysis data-visualization database dax documentation power-bi python r sql tableau tableau-public
Last synced: 06 Apr 2026
https://github.com/soenneker/soenneker.timezones.data
Provides TimeZone geometry
csharp data dotnet geometry lookup polygons timezone timezones timezonesdata
Last synced: 30 May 2026
https://github.com/amethyst-php/value
amethyst amethyst-package api data laravel value
Last synced: 17 May 2026
https://github.com/jodus-melodus/queue
Simple Queue
data datastructures linear queue queues
Last synced: 10 Sep 2025
https://github.com/nel-zi/zipco_foods
Developed an automated ETL pipeline using Python and Apache Airflow to consolidate fragmented CSV sales data into a normalized Azure SQL database for Zipco Foods.
airflow apache-spark data dataengineering etl pyspark wsl
Last synced: 03 May 2026
https://github.com/citizenlabsgr/data.world
Work with data sets prior to uploading to data.world
Last synced: 26 Mar 2025
https://github.com/andygeiss/pipeline-example
This is a basic example of using a pipeline in data science.
data data-pipeline data-science example go golang iris-dataset pipeline protobuf
Last synced: 17 Jul 2025
https://github.com/dms-codes/scrape_tripsantai
Trip Santai Tour Data Scraper This Python script is a web scraper designed to extract and collect information about tours from the Trip Santai website. It utilizes the requests library to fetch web pages, BeautifulSoup for parsing HTML, and writes the collected data to a CSV file.
beautifulsoup4 data python requests scraper webscraper
Last synced: 21 May 2026
https://github.com/kwame-mintah/ml-data-copy-to-aws-s3
Automatically copy new data to an AWS S3 bucket for Machine Learning.
Last synced: 14 May 2026
https://github.com/madhuresh2011/kulturehire-internship
☺️Hi folk, During my internship at KultureHire, I completed a real-world Data Analyst project. I created an interactive dashboard using pivot tables, conducted a thorough analysis, and provided actionable recommendations. I'm excited to share my work and the insights I discovered.
data data-analytics data-cleaning data-standardization data-visualization excel excel-pivot-charts excel-pivot-tables genz-aspirations my-sql
Last synced: 17 Feb 2026
https://github.com/bfontaine/datatools
:triangular_ruler: Some scripts I use to work with data
Last synced: 23 Jul 2025
https://github.com/sibeux/redesigned-broccoli
Repositori untuk menyimpan data file musik
data data-center nasrulwahabi sibeux
Last synced: 24 Jan 2026
https://github.com/qubitpi/wiktionary-data
Wiktionary data in simple parsable formats hosted on 🤗 Datasets
ancient-greek data german huggingface huggingface-datasets language latin natural-language-processing nlp old-persian python wiktionary wiktionary-data
Last synced: 17 Jul 2025
https://github.com/luminati-io/zoominfo-dataset-samples
A sample dataset of over 1000 ZoomInfo companies, extracted using the Bright Data API, ideal for market growth, lead generation, and market analysis.
b2b business companies data data-extraction database dataset datasets web-scraping zoominfo
Last synced: 17 Mar 2025
https://github.com/shoaib1522/database-systems
📚💾 Master the fundamentals of database systems with this all-in-one lab repository, featuring ERD design diagrams 🧠🗺️, Oracle SQL 🌐📝, relational schema practice, and complete PowerPoint lectures 🖥️📑. Perfect for revision, exams, or quick reference! 💡📘
data database database-management databases databases-course db dbms-project erd notes oracle oracle-database sql
Last synced: 21 Aug 2025
https://github.com/birjemin/wxgameod
wxgame 开放数据 weixin 微信小游戏 关系链数据
data interactive-data relation user-storage
Last synced: 16 Jul 2025
https://github.com/null-none/py-fear-and-greed
Fear & Greed Index
data fear-and-greed python trading
Last synced: 16 Jul 2025
https://github.com/germanpaul12/flights-data-sky-scraper-api
Sky Scraper - Python app for searching flight information using the Sky Scrapper API.
data flights flights-api scraping
Last synced: 15 Jul 2025
https://github.com/peternaydenov/data-pool
Data layer for node apps and single page applications
Last synced: 29 Apr 2025
https://github.com/omari-kd/environmental-impact-on-food-production
The goal of this project is to assess the environmental impact of food production at both macro and micro levels and propose data-driven insights to mitigate the negative effects of food production on the environment.
data data-analysis data-science data-visualization environmental-impact-analysis r
Last synced: 30 Mar 2025
https://github.com/omari-kd/recommendation-system-analysis-and-modelling
This project aims to develop a recommendation system that leverages historical user data to provide tailored recommendations across different domains, such as product recommendations, content suggestions and service optimisation.
data data-science data-science-in-r machine-learning-algorithms recommendation-system
Last synced: 08 Jan 2026
https://github.com/j-hagedorn/locals
:globe_with_meridians: A collection of tidied, neighborhood-level public datasets
address-dataset census-data census-tract data neighborhood social-sciences
Last synced: 03 Feb 2026
https://github.com/ishansurdi/data-visualisation-empowering-business-with-effective-insights
The following tasks are completed for Data Visualization: Empowering Business with Effective Insights on Forage in October 2024. It is important to note that this should not be interpreted as an endorsement.
chart communicating-insights-and-analysis dashboard data data-analysis forage powerbi powerbi-visuals tableau tata tata-group virtual-internship visual visualization
Last synced: 17 Feb 2026
https://github.com/priyanshubiswas-tech/farmlab-report-and-case-study-iot
This project was developed through live interviews and case studies with farmers in the year 2023 to address key agricultural challenges. The device provides real-time farm insights for better decision-making. Future plans include a digital portal, increased range, more sensors, and improved design. Open to collaboration!
arduino-ide c case case-study data data-analysis iot iot-device serialization
Last synced: 15 Jul 2025
https://github.com/rubidev68/citadelai-community
Community version of citadelai.app
ai ai-assistant chatbot chatbot-framework data knowledge-management silo-digital
Last synced: 03 Feb 2026
https://github.com/ayush1999/data-mining
data mining natural-language-processing
Last synced: 10 Sep 2025
https://github.com/athari22/statistics-from-stock-data
Statistics from Stock Data
cvs data data-science dataanalysis datacleaning dataframe jupyter pandas pandas-python python statistics stock table
Last synced: 16 Feb 2026
https://github.com/ims94/ballerina-tsv-querying
An example Ballerina project to query tsv data using Ballerina language integrated queries
ballerina ballerina-lang data olympics query sql
Last synced: 03 Feb 2026
https://github.com/lut-ful/e-commerce-sales-report
This dashboard provides a visual analysis of e-commerce sales data
data data-analytics data-science data-visualization power-bi statics
Last synced: 28 Jun 2025
https://github.com/pooja-manjunatha/nyc_parking_violations_dbt
This project uses dbt to transform NYC parking violations data through a layered architecture: Bronze: Raw ingested data Silver: Cleaned and enriched data Gold: Aggregated tables for analytics Using DuckDB as the warehouse backend, it ensures data quality with tests and documentation. The project enables reliable analysis of parking violations
data data-analysis data-engineering dbt duckdb python sql
Last synced: 14 May 2026
https://github.com/kobowood1/data-analysis-alpha
My first data analysis project
data data-analysis data-analytics data-science
Last synced: 06 May 2025
https://github.com/valyaevgeorgiy/r_basic
Работа с основами среды R и тем самым изучения нового языка программирования, связанного непосредственно с анализом данных и построением графиков и диаграмм.
coding data data-analysis r rstudio
Last synced: 12 Dec 2025
https://github.com/onemoredavid/python-like-a-boss
This is where I stash my Python study material.
data data-analysis data-engineering data-science data-visualization datascience ipynb ipynb-jupyter-notebook ipynb-notebook numpy pandas python python3
Last synced: 04 Apr 2025
https://github.com/vedantwalia/google-data-analytics-capstone-case-study
This is a repository of my work on data analysis as a part of the Google Data Analytics Capstone
bigquery data data-viz datavisualization-project divvy-bikes google googledataanalytics sql tableau tableau-public
Last synced: 02 Jan 2026
https://github.com/cmdrvl/profile
profile manages column-scoping configurations for report tools — defining which columns to include, key alignment, and normalization rules for rvl, compare, and shape.
cli configuration csv data data-quality open-source ops rust tooling
Last synced: 07 Mar 2026
https://github.com/tadiusfrank2001/data_mining_projects_labs_cs145
A collection of data mining course assignments to implement advanced predictive statistical analysis models
algorithms data data-mining data-science deep-learning predictive-modeling python3 wide-learning
Last synced: 16 May 2026
https://github.com/skygenesisenterprise/aether-calendar
Aether Calendar is a lightweight, open-source client built for privacy, speed, and seamless integration within the Aether Office ecosystem
applications calendar capacitorjs data javascript linux macos nextjs typescript windows
Last synced: 12 Apr 2026
https://github.com/ivanshero400/kutub-al-salaf-database
أضخم مكتبة مفتوحة المصدر للكتب الإسلامية التراثية | 7,878 كتابا | 40 تصنيفا | المصدر: مكتبة كيزانه (Kizanah) | تحميل مباشر من بايثون بسطر واحد
arabic books-database data hadith islamic-books islamic-heritage kizanah open-source python sqlite
Last synced: 02 Jul 2026
https://github.com/interzoid/typescript-examples
Provides TypeScript examples for consuming several of the Cloud APIs available from Interzoid, including company name matching, individual name matching, weather, page performance, email validation, currency rates/FOREX, and global telephone information.
angular api cloud data database matching nodejs quality typescript
Last synced: 12 Jan 2026
https://github.com/interzoid/php-examples
Provides PHP examples for consuming several of the Cloud APIs available from Interzoid, including company name matching, individual name matching, weather, page performance, email validation, currency rates/FOREX, and global telephone information.
api cloud data database php quality
Last synced: 12 Jan 2026
https://github.com/charlieroth/exoexplo
Exploring NASA Exoplanet Archive Data
Last synced: 03 Apr 2025
https://github.com/cody-scott/arclint
A flexible tool to validate and improve your data in ArcGIS using regex and other methods
arcgis arcgispro data lint regex validation
Last synced: 14 May 2025
https://github.com/advisors-excel-llc/angular-datafree
angularjs data data-visualization datafree-directive
Last synced: 30 Sep 2025
https://github.com/karajmiglani-datascientist/karajmiglanifake-news-detection
FAKE_NEWS_PREDICTION
algorithms data data-science flask machine-learning probability-statistics python statistics structure
Last synced: 22 May 2026
https://github.com/stdlib-js/array-base-banded-filled2d-by
Create a filled two-dimensional banded nested array according to a provided callback function.
alloc allocate array callback data fill filled foreach generic javascript map matrix multidimensional node node-js nodejs stdlib strided structure types
Last synced: 19 May 2026
https://github.com/rickstaa/ai-compute-visualizer
A StreamLit-based web application to visualize GPU inventory and AI capabilities on the Livepeer network.
Last synced: 28 Jun 2025
https://github.com/matheussoranco/how-to-estimate-required-sample-size-for-model-training
Modeling the relationship between training set size and model accuracy.
artificial-intelligence data jupyter-notebook machine-learning python
Last synced: 22 May 2026
https://github.com/push-protocol/push-google-bigquery
The Power of Web3 Big Data: A Guide to Using Google BigQuery and Push Protocol for Data Communication and Analysis
bigquery data push push-notifications web3
Last synced: 26 Mar 2025
https://github.com/shrutakeerti/eye-gaze-detection
This repo contains everything that I have done at IIT Jodhpur Summer Internship May 15 - July 15
ai aiml data eda eeg eeg-signals eye jodhpur mlflow
Last synced: 17 Mar 2025
https://github.com/afeiship/data-pagination
Raw data(items) pagination.
data next page pagination previous total
Last synced: 18 May 2026
https://github.com/mksingh431/sql-complete-notes
SQL, or Structured Query Language, is a robust and specialized programming language designed for efficient management and manipulation of relational databases. With SQL, you can seamlessly interact with databases like MySQL, PostgreSQL, Microsoft SQL Server, Oracle,.
Last synced: 21 Apr 2026
https://github.com/redatargaoui/dataconverter
Data conversion functionality to integrate into the software used for autism detection research.
apache-poi data dataconversion excel java
Last synced: 06 Sep 2025
https://github.com/e-kotov/albofr
alboFr: Get French Data on Tiger Mosquito Colonisation
aedes-albopictus data france tiger-mosquito
Last synced: 11 Jun 2026
https://github.com/shubhamsoni98/classification-with-decision-tree
This project predicts iPhone purchases using demographic data (gender, age, salary). A Decision Tree Classifier was used, achieving 88.16% accuracy. Insights from the model can refine marketing strategies, optimize product offerings, and boost sales by targeting key customer segments.
algorithms anaconda classification data data-science descision-tree jupyter-notebook machine-learning prediction python
Last synced: 19 Jan 2026
https://github.com/clagiordano/weblibs-data-export
Library for generic data export to various formats
clagiordano data export weblibs xlsx
Last synced: 01 Jul 2026
https://github.com/skygenesisenterprise/api-service
The Official Sky Genesis Enterprise API Service Ecosystem
api-service client cryptography data dns docker javascript nextjs service stalwart typescript websocket
Last synced: 31 Dec 2025
https://github.com/rajesh9943/web-scraping-analysis-of-top-us-company-revenue-growth-in-2023
Explore the landscape of US business growth in 2023 with our dynamic project, 'Web Scraping for US 2023 Revenue Growth.' Utilizing advanced web scraping techniques, we unveil insights into the top companies driving economic expansion.
cleaning-data data data-analysis data-visualization manipulation numpy pandas pre-fill
Last synced: 16 Aug 2025
https://github.com/lu-sketch/chocolate-imports-dataset
Chocolate Imports for South Africa
Last synced: 18 May 2026
https://github.com/gui-sitton/y.music
In this project I compared the musical preferences of the citizens of Springfild and Shelbyville. I examined real Y.Music data to test hypotheses and compare the behavior of users in these two cities.
data data-analysis data-analysis-python data-science data-visualization python
Last synced: 18 May 2026
https://github.com/RedInfinityPro/ScientificSharp
Rating: (5/10) The code is a Windows Forms application for a basic scientific calculator, allowing users to perform mathematical operations like addition, subtraction, multiplication, division, trigonometrics, and logarithms.
componentmodel cryptography data drawing forms generic linq system tasks text
Last synced: 30 Sep 2025
https://github.com/amethyst-php/account
account amethyst amethyst-package api data laravel
Last synced: 18 May 2026
https://github.com/yadavkaushal/datascience-e-commerce-shopping-details
This project analyzes customer purchase data including details such as location, company, credit card usage, browser info, job roles and purchase price. It explores patterns in payment methods, spending behavior and online transactions. Using Pandas, Matplotlib and Seaborn, we clean analyze and visualize key trends to derive actionable insights.
data datacleaning dataframe datapreprocessing dataset libraries matplotlib numpy pandas plots visulaization
Last synced: 06 May 2026
https://github.com/davecumin/ancir_next
analysis chronobiology circadian d3 data data-analysis data-visualization svelte timeseries
Last synced: 18 May 2026
https://github.com/muneeb1030/webscrapper_politifact
This initiative seeks to extract and analyze fact-checking data from Politifact.com, providing valuable insights into political statements, rulings, and the evolving information landscape.
data data-collection dataanalysis python3 scrapy scrapy-spider webscraping
Last synced: 09 Sep 2025
https://github.com/joshuadeguzman/xcraper
Python based stocks exchange data scraper
data pandas python stock-market
Last synced: 18 May 2026
https://github.com/nodamu/apache-beam-studies
Personal Apache Beam studies repository
apachebeam batch-processing data dataeng dataengineering datapipeline stream-processing
Last synced: 04 Nov 2025
https://github.com/dscamilo/gestion-clientes-springboot
Proyecto de gestión de clientes aplicando Java y Springboot, haciendo uso de Lombok, uso de interface, inyección de dependencias, uso de anotaciones Service, Data, RestController . Consumo de API haciendo uso de Postman.
data interface java lombok-maven restcontroller spring-boot
Last synced: 15 May 2026
https://github.com/fastpix/flutter-core-data-sdk
A comprehensive Flutter SDK for video player analytics and event tracking, designed to provide detailed insights into video playback behavior and user engagement metrics.
Last synced: 15 May 2026
https://github.com/analyticslover/salifort-motors-turnover-project
The Salifort Motors H.R. Project serves as the capstone for the Google Advanced Analytics Program on Coursera. This project presents a business scenario and a problem on the scnario context, employee turnover. In this project, essential techniques as EDA and Data Modeling are used to analyze and predict the employee turnover rates in the company.
data data-analysis datamodeling eda machine-learning pandas python sklearn
Last synced: 10 Apr 2026
https://github.com/luminati-io/google-search-api
Two methods to collect real Google SERP data—a free scraper for basic use and the enterprise-grade Bright Data API for high-volume demands.
data google-scraper html python serp-api web-scraping
Last synced: 25 Jun 2025
https://github.com/nitheshgoutham/singapore-resale-flat-prices-predicting
To Predict the Resale Price of a Flat
data data-visualization machine-learning python3 sql streamlit
Last synced: 09 May 2026
https://github.com/hadarsharon/grizzlys
User-friendly Python DataFrames 🔵🟡 powered by Julia 🔴🟢🟣
big-data data data-analysis data-engineering data-frame data-frames data-science dataframe dataframe-library dataframes dataframes-jl julia python
Last synced: 18 May 2026
https://github.com/jlee9503/excel-projects
Fitness tracker dashboard, displaying users workout type, calories burned, and steps taken with multiple filters (gender, age, and workout intensity). Implemented using MS Excel.
Last synced: 16 Jan 2026
https://github.com/the-universal-linux-society/sysreport
Bash script to give you a full system report. Just by running the script it offers insight into CPU data, disk space, temperature readings, network configuration, MAC addresses, firewall status, and system logs for error analysis.
analysis bash bash-script bash-scripting data report reporting system
Last synced: 15 May 2026
https://github.com/xuender/kstats
Golang statistics library package that supports v1.18+.
algorithms analytics data go golang kstats machine-learning math rounding statistics
Last synced: 20 Jul 2025
https://github.com/thibautre/dataipsum
Configurable data generator (with crumbles inside)
algorithm data random-generation
Last synced: 21 Jul 2025
https://github.com/vladandreitoma/igisol_jyvaskyla_xept_experimental_campaign
A simulation toolkit together with data analysis for the Xe&Pt Exotic Nuclei Generation experiment @ Jyvaskyla December 2022. Helping dr.Paul Constantin with simulation development. Simulation is done using Geant4 provided by CERN. Data anlysis is done using ROOT by Cern. Both C++ based. Job distributors to run the sim are coded in pearl
analysis architecture-design cplusplus data oop oop-principles pearl simulations
Last synced: 05 Sep 2025
https://github.com/shreedata/data-analysis-using-python-libraries-
The COVID-19 pandemic has significantly impacted India, necessitating a detailed analysis of the virus’s spread within the country. In this project, we explore an India-specific COVID-19 dataset, leveraging Python libraries such as Pandas, NumPy, Matplotlib, and Seaborn.
covid-19 data data-cleaning data-visualization datana kaggle-dataset matplotlib numpy pandas-python python3 pythonlibrarires scikit seaborn
Last synced: 28 Mar 2025
https://github.com/bakangmonei/is_final_assignment
My intelligent systems assignment
data data-science intelligent-systems python
Last synced: 02 May 2026
https://github.com/raufjatoi/electricity-consumption-prediction
arima-model customize data kinda-dynamic ml
Last synced: 25 Jul 2025
https://github.com/Axnjr/csv-parser-utils
Homework task for SWE position at Redhat.
csv data dataanalysis datatools pandas python
Last synced: 30 Oct 2025
https://github.com/Sikessem/Typed
Convert PHP values to objects of strict types.
cast converter data object-oriented-programming oop php poo programmation-orientee-objets strict-types value-object variable-object
Last synced: 11 May 2025