data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-06-27 00:07:33 UTC
https://github.com/nimomach/amazon-sales-data
A small dataset containing Amazon sales data analysis for a few regions.
dashboards data data-analysis data-visualization
Last synced: 08 Mar 2026
https://github.com/kenjyco/mongo-helper
Helper funcs and tools for working with MongoDB
aggregation-pipeline data database kenjyco mongo mongodb python
Last synced: 28 Jan 2026
https://github.com/amethyst-php/data-view
amethyst amethyst-package api data data-view laravel
Last synced: 19 May 2026
https://github.com/aliaksandr-master/unipipeline
A simple way to build declarative and distributed data pipelines with Python
Last synced: 11 Jul 2025
https://github.com/josecsotomorales/dataform
Repository for testing dataform
cli data data-engineering data-transformation
Last synced: 27 Mar 2025
https://github.com/soenneker/soenneker.timezones.data
Provides TimeZone geometry
csharp data dotnet geometry lookup polygons timezone timezones timezonesdata
Last synced: 30 May 2026
https://github.com/dakostu/grabbag.h
A data structure for non-deterministic element selection in C++11
cpluscplus cpp cpp-component cpp-library cpp11 data data-structure data-structures generics non-deterministic random randomization template
Last synced: 19 Oct 2025
https://github.com/jeugregg/deeplearningpicturedogs
Classify dog pictures using deep learning CNN neural networks
classez-des-images cnn-keras data data-science ipynb neural-network vision
Last synced: 24 Jul 2025
https://github.com/styd/sd_struct
Searchable Deep Struct
activesupport data gem openstruct rails ruby structure
Last synced: 18 May 2026
https://github.com/aguven6/inmemory-data-processor
Convert tabular data to columnar data with an index. The aim is to process large datasets faster, especially in aggregation operations.
columnar-storage data data-structures parallel-computing parallel-programming processing
Last synced: 17 May 2026
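The tabular-to-columnar idea in the entry above can be illustrated with a minimal Python sketch. It is not taken from the repository: the function names `to_columnar`, `build_index`, and `sum_by`, and the sample rows, are hypothetical, and the sketch assumes every row has the same keys.

```python
from collections import defaultdict


def to_columnar(rows):
    """Turn a list of row dicts into a dict of column lists."""
    columns = defaultdict(list)
    for row in rows:
        for name, value in row.items():
            columns[name].append(value)
    return dict(columns)


def build_index(column):
    """Map each distinct value to the row positions where it occurs."""
    index = defaultdict(list)
    for position, value in enumerate(column):
        index[value].append(position)
    return dict(index)


def sum_by(columns, key, measure):
    """Sum the `measure` column per distinct value of the `key` column."""
    index = build_index(columns[key])
    values = columns[measure]
    return {k: sum(values[i] for i in positions) for k, positions in index.items()}


# Hypothetical sample data, for illustration only.
rows = [
    {"region": "north", "sales": 10},
    {"region": "south", "sales": 5},
    {"region": "north", "sales": 7},
]
columns = to_columnar(rows)
totals = sum_by(columns, "region", "sales")
```

With the columns stored as separate lists, an aggregation only touches the two columns it needs, which is the usual argument for columnar layouts.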
https://github.com/nikashj/pizza-sales-dashboard-analysis
Pizza sales analysis using Power BI
data data-analysis data-visualization dax-expression excel powerbi
Last synced: 06 Apr 2026
https://github.com/fastbolt/entity-importer
Entity importing library for importing data from files (CSV and Excel currently) or API into doctrine.
data doctrine2 excel excel-import
Last synced: 17 Feb 2026
https://github.com/mekramy/ircity
Iran province, county, and city data in JSON format.
Last synced: 05 Apr 2025
https://github.com/naithikjorapur/practive-tanstacktsx
Practice TanStack with React, Vite, and TypeScript to build fast, type-safe apps. Leverage tools like TanStack Query for data management and Vite for a streamlined development experience.
data exercise fetching html-css-javascript json learning-by-doing practice query router tsx
Last synced: 05 Apr 2025
https://github.com/1sumer/mass-mail-automation
Mass Emailer is a Python-based application designed to send bulk emails efficiently using an SMTP server. Leveraging the power of the Tkinter library for the graphical user interface (GUI), this tool provides a user-friendly platform for managing and dispatching large volumes of emails with ease.
data oops-in-python python smtp-server tkinter
Last synced: 20 Aug 2025
https://github.com/Sikessem/Typed
Convert PHP values to objects of strict types.
cast converter data object-oriented-programming oop php poo programmation-orientee-objets strict-types value-object variable-object
Last synced: 11 May 2025
https://github.com/nanis/unitedat
Unify data sets which consist of separate files with a common header repeated in each one.
Last synced: 12 Apr 2025
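As a rough illustration of the entry above (merging files that each repeat a common header), here is a minimal Python sketch. It does not reflect the repository's own code: the `unify` function is hypothetical, and the sketch assumes every line, including the last line of each file, ends with a newline.

```python
import io


def unify(sources):
    """Yield lines from several line iterables, keeping only the first header."""
    header = None
    for source in sources:
        lines = iter(source)
        first = next(lines, None)
        if first is None:
            continue  # skip empty inputs
        if header is None:
            header = first
            yield first
        elif first != header:
            raise ValueError("header mismatch between input files")
        yield from lines


# In-memory stand-ins for two files sharing the same header.
part1 = io.StringIO("id,name\n1,a\n2,b\n")
part2 = io.StringIO("id,name\n3,c\n")
merged = "".join(unify([part1, part2]))
```

The same generator would work on open file handles, since both yield one line per iteration.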
https://github.com/bakangmonei/is_final_assignment
My intelligent systems assignment
data data-science intelligent-systems python
Last synced: 02 May 2026
https://github.com/amethyst-php/source
The source of information. It can be used to record the origin of any piece of information (news, books, etc.).
amethyst amethyst-package api data laravel source
Last synced: 27 Apr 2026
https://github.com/dms-codes/scrape_tripsantai
Trip Santai Tour Data Scraper. This Python script is a web scraper designed to extract and collect information about tours from the Trip Santai website. It uses the requests library to fetch web pages and BeautifulSoup to parse HTML, and writes the collected data to a CSV file.
beautifulsoup4 data python requests scraper webscraper
Last synced: 21 May 2026
https://github.com/sibeux/redesigned-broccoli
Repository for storing music file data
data data-center nasrulwahabi sibeux
Last synced: 24 Jan 2026
https://github.com/eryks1999/data-collection-project_python
This project allowed me to practice using classes, populating JSON files, and extracting data.
Last synced: 16 Apr 2026
https://github.com/ember-nexus/reference-dataset
Ember Nexus API backup containing different standardized scenarios
Last synced: 25 Jan 2026
https://github.com/nitheshgoutham/singapore-resale-flat-prices-predicting
To Predict the Resale Price of a Flat
data data-visualization machine-learning python3 sql streamlit
Last synced: 09 May 2026
https://github.com/luminati-io/google-search-api
Two methods to collect real Google SERP data—a free scraper for basic use and the enterprise-grade Bright Data API for high-volume demands.
data google-scraper html python serp-api web-scraping
Last synced: 25 Jun 2025
https://github.com/basemax/okala-database-crawler
A robust, UTF-8 compliant PHP-based crawler designed to extract structured product data from Okala. This tool efficiently scrapes and saves store information, category slugs, and detailed product listings into organized JSON files. Ideal for data analysis, backup, or integration into other systems.
crawler crawler-php curl data json okala okala-com okalacom php php-crawler scraper
Last synced: 01 May 2026
https://github.com/gsmithun4/expressjs-field-validator
Plugin for validating JSON requests; middleware for Express.js
data express-js expressjs json-request middleware nodejs request rest-api validation
Last synced: 06 Mar 2026
https://github.com/e22m4u/ts-projection
A module for working with data projection in TypeScript
Last synced: 12 Apr 2025
https://github.com/mvuorre/osfdatasette
Harvest, wrangle, and serve preprint data from OSF API with Datasette
data datasette open-science preprints
Last synced: 11 Apr 2025
https://github.com/juniorreisx/movelo-logstica
Movelo is a lightweight logistics simulator built with TypeScript that provides mock order and delivery data for developing and testing UIs, dashboards, and backend features without external APIs.
data hooks lucide-react react tailwindcss typescript
Last synced: 12 Apr 2025
https://github.com/Vidya-Vijay/Vid2501
About me
analytics data data-science machinelearning python r spss sql statistics tableau visualization
Last synced: 19 Jul 2025
https://github.com/sap-samples/sap-bdc-explore-hyperscaler-data
The repository contains detailed steps to integrate external hyperscaler data sources into SAP Datasphere in the SAP Business Data Cloud, per the Open data ecosystem integration principles.
aws azure business cloud data databricks datasphere gcp hyperscalers sap
Last synced: 16 May 2026
https://github.com/dimaa1608/azurecontent
AzureContent is a repository on GitHub containing documentation and resources related to Microsoft Azure services and features. It provides clear and concise information for users seeking guidance on Azure cloud computing solutions.
azure azurecontent cloud computing content data deployment integration management networking platform security service storage virtualization
Last synced: 10 Apr 2025
https://github.com/bfontaine/datatools
📐 Some scripts I use to work with data
Last synced: 23 Jul 2025
https://github.com/davecumin/ancir_next
analysis chronobiology circadian d3 data data-analysis data-visualization svelte timeseries
Last synced: 18 May 2026
https://github.com/ournet/news-data
Ournet news data package
data news news-data news-storage ournet storage
Last synced: 04 Apr 2025
https://github.com/ournet/quotes-data
Ournet quotes data package
data ournet ournet-quotes quotes
Last synced: 04 Apr 2025
https://github.com/yadavkaushal/datascience-e-commerce-shopping-details
This project analyzes customer purchase data, including details such as location, company, credit card usage, browser info, job roles, and purchase price. It explores patterns in payment methods, spending behavior, and online transactions. Using Pandas, Matplotlib, and Seaborn, we clean, analyze, and visualize key trends to derive actionable insights.
data datacleaning dataframe datapreprocessing dataset libraries matplotlib numpy pandas plots visulaization
Last synced: 06 May 2026
https://github.com/jigyasag18/ibm-power-bi-dashboard-project
IBM Power BI Dashboard Project is a data-driven analysis of employees using IBM's comprehensive dataset, providing insights into key factors contributing to employee turnover and enabling organizations to strategize effectively towards improved employee retention and satisfaction.
data data-visualization dataanalysis dataanalytics dataset datavisualisation datavisualization-project powerbi powerbi-dashboards powerbi-report powerbi-visuals powerbidashboard
Last synced: 07 Mar 2026
https://github.com/amethyst-php/account
account amethyst amethyst-package api data laravel
Last synced: 18 May 2026
https://github.com/gui-sitton/y.music
In this project I compared the musical preferences of the citizens of Springfield and Shelbyville. I examined real Y.Music data to test hypotheses and compare the behavior of users in these two cities.
data data-analysis data-analysis-python data-science data-visualization python
Last synced: 18 May 2026
https://github.com/webobite/fact-chatbot
A fact chatbot that reads a text file containing all the facts ahead of time and answers the user with useful information based on the facts in that file.
chatbot chatgpt chatgpt3 data data-visualization embedding-vectors generativeai nlp
Last synced: 04 May 2026
https://github.com/ngupta23/data_prep_helper
A helper package for preparing and combining data from a variety of sources
data data-science dataprep datapreparation dataprocessing helpers python
Last synced: 03 Apr 2025
https://github.com/afeiship/data-pagination
Raw data (items) pagination.
data next page pagination previous total
Last synced: 18 May 2026
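The pagination concept in the entry above (page, next, previous, total) can be sketched in a few lines of Python. This is an illustration under assumptions, not the library's API: the `paginate` function, its parameters, and the shape of the returned dict are all hypothetical.

```python
import math


def paginate(items, page, per_page):
    """Return one page of `items` plus navigation metadata (1-based pages)."""
    total = len(items)
    pages = max(1, math.ceil(total / per_page))
    page = min(max(page, 1), pages)  # clamp out-of-range page numbers
    start = (page - 1) * per_page
    return {
        "items": items[start:start + per_page],
        "page": page,
        "total": total,
        "pages": pages,
        "previous": page - 1 if page > 1 else None,
        "next": page + 1 if page < pages else None,
    }


# Ten raw items, four per page, second page requested.
result = paginate(list(range(1, 11)), page=2, per_page=4)
```

Clamping the page number is one possible design choice; raising an error for out-of-range pages would be equally reasonable.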
https://github.com/byndyusoft/byndyusoft.data.relational.specifications
byndyusoft data relational specifications
Last synced: 12 Sep 2025
https://github.com/redatargaoui/dataconverter
Data conversion functionality to integrate into the software used for autism detection research.
apache-poi data dataconversion excel java
Last synced: 06 Sep 2025
https://github.com/e-kotov/albofr
alboFr: Get French Data on Tiger Mosquito Colonisation
aedes-albopictus data france tiger-mosquito
Last synced: 11 Jun 2026
https://github.com/shubhamsoni98/classification-with-decision-tree
This project predicts iPhone purchases using demographic data (gender, age, salary). A Decision Tree Classifier was used, achieving 88.16% accuracy. Insights from the model can refine marketing strategies, optimize product offerings, and boost sales by targeting key customer segments.
algorithms anaconda classification data data-science descision-tree jupyter-notebook machine-learning prediction python
Last synced: 19 Jan 2026
https://github.com/istinnew/cook-me-up
[In Progress] Welcome to Cook-Me-Up! This project aims to analyze and organize cooking recipes using data analysis (Python, BigQuery SQL, Looker Studio etc.) and machine learning techniques. The goal is to simplify meal preparation and offer users a comprehensive database of culinary delights.
bigquery clustering cookme culinary data data-science dataanalysis datavisualization looker-studio machine-learning python recipe-search recipes unsupervised-learning
Last synced: 16 May 2026
https://github.com/onemoredavid/python-like-a-boss
This is where I stash my Python study material.
data data-analysis data-engineering data-science data-visualization datascience ipynb ipynb-jupyter-notebook ipynb-notebook numpy pandas python python3
Last synced: 04 Apr 2025
https://github.com/amethyst-php/value
amethyst amethyst-package api data laravel value
Last synced: 17 May 2026
https://github.com/kobowood1/data-analysis-alpha
My first data analysis project
data data-analysis data-analytics data-science
Last synced: 06 May 2025
https://github.com/ffatahillah7/snowflake-tastybytes-data-warehouses
Build Snowflake Tasty Bytes Warehouses
data data-warehouse mysql snowflake sql warehouse
Last synced: 26 Mar 2025
https://github.com/madhuresh2011/kulturehire-internship
☺️ Hi folks, during my internship at KultureHire, I completed a real-world Data Analyst project. I created an interactive dashboard using pivot tables, conducted a thorough analysis, and provided actionable recommendations. I'm excited to share my work and the insights I discovered.
data data-analytics data-cleaning data-standardization data-visualization excel excel-pivot-charts excel-pivot-tables genz-aspirations my-sql
Last synced: 17 Feb 2026
https://github.com/ashishsingh789/data_visualization
Data visualization project using Python to analyze categorical and continuous variables. Includes bar charts, histograms, and scatter plots. Libraries used: pandas, matplotlib, and seaborn.
analysis barchart data data-science data-visualization histogram matplotlib pandas-dataframe scatter-plot seaborn
Last synced: 07 Sep 2025
https://github.com/nel-zi/zipco_foods
Developed an automated ETL pipeline using Python and Apache Airflow to consolidate fragmented CSV sales data into a normalized Azure SQL database for Zipco Foods.
airflow apache-spark data dataengineering etl pyspark wsl
Last synced: 03 May 2026
https://github.com/erkylima/algorithms
Python project to refresh knowledge on algorithms and data structures. Interactive examples of Bubble, Merge, Quick Sort, along with Lists, Stacks, Queues, and Trees. Challenges included. Recycle your expertise! 🚀 #Python #Algorithms #DataStructures
algorithms algorithms-and-data-structures data data-structures
Last synced: 19 Jan 2026
https://github.com/ericmaddox/nyc-crime-analytics
Analyzes and visualizes crime data from the NYC Police Department using interactive maps and heatmaps, leveraging the NYC Open Data API.
crime-analysis crimedata data datavisualization esri folium heatmap nycopendata python python3 rtcc
Last synced: 24 Jun 2025
https://github.com/thetacom/byteclasses
A Python package to manage and interact with binary data in a simple and structured manner.
binary-data bytes data dataclasses package python python3
Last synced: 11 Jul 2025
https://github.com/joshuadeguzman/xcraper
Python-based stock exchange data scraper
data pandas python stock-market
Last synced: 18 May 2026
https://github.com/omari-kd/environmental-impact-on-food-production
The goal of this project is to assess the environmental impact of food production at both macro and micro levels and propose data-driven insights to mitigate the negative effects of food production on the environment.
data data-analysis data-science data-visualization environmental-impact-analysis r
Last synced: 30 Mar 2025
https://github.com/omari-kd/recommendation-system-analysis-and-modelling
This project aims to develop a recommendation system that leverages historical user data to provide tailored recommendations across different domains, such as product recommendations, content suggestions and service optimisation.
data data-science data-science-in-r machine-learning-algorithms recommendation-system
Last synced: 08 Jan 2026
https://github.com/vaibhavmojidra/data-structures---hashtable-using-array-and-linked-list-in-java
A hash table is a data structure that stores data in an associative manner. Data is stored in an array, where each value has its own index. Access is very fast when the index of the desired data is known, so insertion and search operations are fast on average regardless of the size of the data. A hash table uses an array as its storage medium and a hashing technique to compute the index at which an element is inserted or looked up.
arrays data data-structures hashing java linked-list mojidra vaibhav vaibhav-mojidra vaibhavmojidra
Last synced: 12 Apr 2025
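The array-plus-linked-list design described in the entry above can be sketched as follows. The repository itself is in Java; this is an independent minimal Python illustration, and the class names `Node` and `HashTable`, the method names, and the default capacity are assumptions made for the example. Resizing is deliberately left out.

```python
class Node:
    """One entry in a bucket's singly linked list."""

    def __init__(self, key, value, next_node=None):
        self.key = key
        self.value = value
        self.next = next_node


class HashTable:
    """Fixed-capacity hash table with separate chaining (no resizing)."""

    def __init__(self, capacity=8):
        self.buckets = [None] * capacity  # the backing array

    def _index(self, key):
        # Hash the key, then map it onto an array slot.
        return hash(key) % len(self.buckets)

    def put(self, key, value):
        i = self._index(key)
        node = self.buckets[i]
        while node is not None:
            if node.key == key:
                node.value = value  # key already present: overwrite
                return
            node = node.next
        # Key not found: prepend a new node to this bucket's list.
        self.buckets[i] = Node(key, value, self.buckets[i])

    def get(self, key, default=None):
        node = self.buckets[self._index(key)]
        while node is not None:
            if node.key == key:
                return node.value
            node = node.next
        return default


# A capacity of 2 forces collisions, so the linked lists are exercised.
table = HashTable(capacity=2)
table.put("a", 1)
table.put("b", 2)
table.put("c", 3)
table.put("a", 10)
```

Keys that hash to the same slot share a linked list, so lookups stay correct under collisions but slow down as the chains grow; a production table would resize the array to keep chains short.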
https://github.com/j-hagedorn/locals
🌐 A collection of tidied, neighborhood-level public datasets
address-dataset census-data census-tract data neighborhood social-sciences
Last synced: 03 Feb 2026
https://github.com/halyusa16/basic-sql-employee-analysis
This project focuses on analyzing employee data through querying, performing table joins to connect related information, aggregating salary statistics, and using subqueries to extract meaningful insights.
data data-analytics data-exploration database mysql self-project sql
Last synced: 16 May 2026
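The three techniques named in the entry above (joins, salary aggregation, subqueries) can be illustrated with a small sketch. The project uses MySQL; this example swaps in Python's built-in `sqlite3` module so that it runs without a server, and the table names, column names, and rows are hypothetical.

```python
import sqlite3

# In-memory database with made-up sample data.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE departments (id INTEGER PRIMARY KEY, name TEXT);
CREATE TABLE employees (
    id INTEGER PRIMARY KEY,
    name TEXT,
    salary REAL,
    dept_id INTEGER REFERENCES departments(id)
);
INSERT INTO departments VALUES (1, 'Engineering'), (2, 'Sales');
INSERT INTO employees VALUES
    (1, 'Ana', 100.0, 1),
    (2, 'Ben', 80.0, 1),
    (3, 'Cy', 60.0, 2);
""")

# Join employees to departments, then aggregate salaries per department.
avg_by_dept = conn.execute("""
    SELECT d.name, AVG(e.salary)
    FROM employees AS e
    JOIN departments AS d ON d.id = e.dept_id
    GROUP BY d.name
    ORDER BY d.name
""").fetchall()

# Subquery: employees earning more than the overall average salary.
above_avg = conn.execute("""
    SELECT name
    FROM employees
    WHERE salary > (SELECT AVG(salary) FROM employees)
    ORDER BY name
""").fetchall()
```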
https://github.com/jensostertag-archive/charts.js
A JavaScript Plugin to draw Charts to visualize Data and Statistics on Websites
charts data javascript statistics webapplication
Last synced: 22 Jun 2025
https://github.com/sumansuhag/prediction_model
This repository features a collection of Jupyter notebooks designed to showcase the practical applications of machine learning, data preprocessing, feature engineering, and recommendation systems. These notebooks enable users to explore, analyze, and predict business events.
algotithms artificial-intelligence data logistic-regression machine-learning-algorithms science sckiit-learn
Last synced: 28 Mar 2025
https://github.com/hadarsharon/grizzlys
User-friendly Python DataFrames 🔵🟡 powered by Julia 🔴🟢🟣
big-data data data-analysis data-engineering data-frame data-frames data-science dataframe dataframe-library dataframes dataframes-jl julia python
Last synced: 18 May 2026
https://github.com/jlee9503/excel-projects
Fitness tracker dashboard displaying users' workout type, calories burned, and steps taken, with multiple filters (gender, age, and workout intensity). Implemented using MS Excel.
Last synced: 16 Jan 2026
https://github.com/trevorhobenshield/psychopath
Path Utils for ML Data Prep.
audio data data-science deep-learning filesystem images machine-learning text videos
Last synced: 25 Jul 2025
https://github.com/xuender/kstats
Golang statistics library package that supports v1.18+.
algorithms analytics data go golang kstats machine-learning math rounding statistics
Last synced: 20 Jul 2025
https://github.com/thibautre/dataipsum
Configurable data generator (with crumbles inside)
algorithm data random-generation
Last synced: 21 Jul 2025
https://github.com/kulgan/justobjects
It's all just objects
data json-schema justobjects objects parsing python python3 validation
Last synced: 10 Jul 2025
https://github.com/rubidev68/citadelai-community
Community version of citadelai.app
ai ai-assistant chatbot chatbot-framework data knowledge-management silo-digital
Last synced: 03 Feb 2026
https://github.com/erictleung/tidytuesdays
📈 My attempts at #tidytuesday
data data-science data-visualization r rstats tables tidytuesday tidyverse
Last synced: 19 Sep 2025
https://github.com/denisecase/cintel-03-data
Getting started with interactive data analytics in Python
analytics data interactive python shiny
Last synced: 11 Apr 2025
https://github.com/fintech-lsi/fintech-credit-risk-prediction
This repository provides a machine learning model for predicting credit risk in the financial sector. The model uses borrower information, such as age, income, employment length, loan amount, and credit history, to assess the likelihood of loan repayment or default.
data fintech machine-learning model prediction risk
Last synced: 12 Oct 2025
https://github.com/Axnjr/csv-parser-utils
Homework task for SWE position at Red Hat.
csv data dataanalysis datatools pandas python
Last synced: 30 Oct 2025
https://github.com/denisecase/buzzline-04-case
Adding live visualizations to streaming data applications
animation data kafka matplotlib python streaming
Last synced: 11 Apr 2025
https://github.com/praveendecode/data-analysis
Implemented data analysis projects with interactive Streamlit UI for user-friendly data exploration and insights presentation
data data-science dataanalysis exploratory-data-analysis insights python streamlit-dashboard tableau tableau-public
Last synced: 04 Apr 2025
https://github.com/giosil/export-as
A convenience library for exporting data in different formats.
data data-export export exporter java
Last synced: 26 Jul 2025
https://github.com/ims94/ballerina-tsv-querying
An example Ballerina project to query TSV data using Ballerina language-integrated queries
ballerina ballerina-lang data olympics query sql
Last synced: 03 Feb 2026
https://github.com/stkisengese/numpy-data-fundamentals
A comprehensive collection of NumPy exercises covering array manipulation, slicing, broadcasting, random data generation, and real-world data analysis applications.
data data-analysis numpy pre-processing
Last synced: 16 May 2026
https://github.com/rd-uk/rduk-data-sqlite
SQLite Data Provider implementation for rduk-data
Last synced: 16 May 2026
https://github.com/naufalbasara/superstores-pipeline
Data Pipeline on Dummy E-commerce with Apache Airflow
airflow data data-engineering data-pipeline data-warehouse postgresql
Last synced: 16 May 2026
https://github.com/paulveillard/cybersecurity-analytics
An ongoing collection of awesome software, libraries, learning tutorials, documents and books, technical resources and cool stuff about Analytics Engineering in Cybersecurity.
analytics bigdata bigquery cybernetics cybersecurity data data-engineering data-science encryption encryption-decryption seo seo-friendly seo-optimization
Last synced: 28 Mar 2025
https://github.com/lut-ful/e-commerce-sales-report
This dashboard provides a visual analysis of e-commerce sales data
data data-analytics data-science data-visualization power-bi statics
Last synced: 28 Jun 2025
https://github.com/hyfi06/unam-careers
A utility package for retrieving career information from UNAM.
Last synced: 16 May 2026