data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-03 00:07:49 UTC
- JSON Representation
https://github.com/tadiusfrank2001/data_mining_projects_labs_cs145
A collection of data mining course assignments to implement advanced predictive statistical analysis models
algorithms data data-mining data-science deep-learning predictive-modeling python3 wide-learning
Last synced: 16 May 2026
https://github.com/skygenesisenterprise/api-service
The Official Sky Genesis Enterprise API Service Ecosystem
api-service client cryptography data dns docker javascript nextjs service stalwart typescript websocket
Last synced: 31 Dec 2025
https://github.com/muneeb1030/webscrapper_politifact
This initiative seeks to extract and analyze fact-checking data from Politifact.com, providing valuable insights into political statements, rulings, and the evolving information landscape.
data data-collection dataanalysis python3 scrapy scrapy-spider webscraping
Last synced: 09 Sep 2025
https://github.com/interzoid/typescript-examples
Provides TypeScript examples for consuming several of the Cloud APIs available from Interzoid, including company name matching, individual name matching, weather, page performance, email validation, currency rates/FOREX, and global telephone information.
angular api cloud data database matching nodejs quality typescript
Last synced: 12 Jan 2026
https://github.com/interzoid/php-examples
Provides PHP examples for consuming several of the Cloud APIs available from Interzoid, including company name matching, individual name matching, weather, page performance, email validation, currency rates/FOREX, and global telephone information.
api cloud data database php quality
Last synced: 12 Jan 2026
https://github.com/push-protocol/push-google-bigquery
The Power of Web3 Big Data: A Guide to Using Google BigQuery and Push Protocol for Data Communication and Analysis
bigquery data push push-notifications web3
Last synced: 26 Mar 2025
https://github.com/cody-scott/arclint
A flexible tool to validate and improve your data in ArcGIS using regex and other methods
arcgis arcgispro data lint regex validation
Last synced: 14 May 2025
https://github.com/rajesh9943/web-scraping-analysis-of-top-us-company-revenue-growth-in-2023
Explore the landscape of US business growth in 2023 with our dynamic project, 'Web Scraping for US 2023 Revenue Growth.' Utilizing advanced web scraping techniques, we unveil insights into the top companies driving economic expansion.
cleaning-data data data-analysis data-visualization manipulation numpy pandas pre-fill
Last synced: 16 Aug 2025
https://github.com/karajmiglani-datascientist/karajmiglanifake-news-detection
FAKE_NEWS_PREDICTION
algorithms data data-science flask machine-learning probability-statistics python statistics structure
Last synced: 22 May 2026
https://github.com/RedInfinityPro/ScientificSharp
Rating: (5/10) The code is a Windows Forms application for a basic scientific calculator, allowing users to perform mathematical operations like addition, subtraction, multiplication, division, trigonometrics, and logarithms.
componentmodel cryptography data drawing forms generic linq system tasks text
Last synced: 30 Sep 2025
https://github.com/gui-sitton/y.music
In this project I compared the musical preferences of the citizens of Springfild and Shelbyville. I examined real Y.Music data to test hypotheses and compare the behavior of users in these two cities.
data data-analysis data-analysis-python data-science data-visualization python
Last synced: 18 May 2026
https://github.com/clagiordano/weblibs-data-export
Library for generic data export to various formats
clagiordano data export weblibs xlsx
Last synced: 01 Jul 2026
https://github.com/rickstaa/ai-compute-visualizer
A StreamLit-based web application to visualize GPU inventory and AI capabilities on the Livepeer network.
Last synced: 28 Jun 2025
https://github.com/matheussoranco/how-to-estimate-required-sample-size-for-model-training
Modeling the relationship between training set size and model accuracy.
artificial-intelligence data jupyter-notebook machine-learning python
Last synced: 22 May 2026
https://github.com/afeiship/data-pagination
Raw data(items) pagination.
data next page pagination previous total
Last synced: 18 May 2026
https://github.com/darshjasani/claims-analysis
This repository contains a comprehensive analysis of claims data, detailing the workflow from data preprocessing to model evaluation. The goal of this analysis is to build predictive models to improve claims prediction and management.
analysis data linear machine-learning python
Last synced: 16 May 2026
https://github.com/redatargaoui/dataconverter
Data conversion functionality to integrate into the software used for autism detection research.
apache-poi data dataconversion excel java
Last synced: 06 Sep 2025
https://github.com/e-kotov/albofr
alboFr: Get French Data on Tiger Mosquito Colonisation
aedes-albopictus data france tiger-mosquito
Last synced: 11 Jun 2026
https://github.com/shubhamsoni98/classification-with-decision-tree
This project predicts iPhone purchases using demographic data (gender, age, salary). A Decision Tree Classifier was used, achieving 88.16% accuracy. Insights from the model can refine marketing strategies, optimize product offerings, and boost sales by targeting key customer segments.
algorithms anaconda classification data data-science descision-tree jupyter-notebook machine-learning prediction python
Last synced: 19 Jan 2026
https://github.com/amethyst-php/account
account amethyst amethyst-package api data laravel
Last synced: 18 May 2026
https://github.com/mksingh431/sql-complete-notes
SQL, or Structured Query Language, is a robust and specialized programming language designed for efficient management and manipulation of relational databases. With SQL, you can seamlessly interact with databases like MySQL, PostgreSQL, Microsoft SQL Server, Oracle,.
Last synced: 21 Apr 2026
https://github.com/mvuorre/osfdatasette
Harvest, wrangle, and serve preprint data from OSF API with Datasette
data datasette open-science preprints
Last synced: 11 Apr 2025
https://github.com/kobowood1/data-analysis-alpha
My first data analysis project
data data-analysis data-analytics data-science
Last synced: 06 May 2025
https://github.com/vladandreitoma/igisol_jyvaskyla_xept_experimental_campaign
A simulation toolkit together with data analysis for the Xe&Pt Exotic Nuclei Generation experiment @ Jyvaskyla December 2022. Helping dr.Paul Constantin with simulation development. Simulation is done using Geant4 provided by CERN. Data anlysis is done using ROOT by Cern. Both C++ based. Job distributors to run the sim are coded in pearl
analysis architecture-design cplusplus data oop oop-principles pearl simulations
Last synced: 05 Sep 2025
https://github.com/yadavkaushal/datascience-e-commerce-shopping-details
This project analyzes customer purchase data including details such as location, company, credit card usage, browser info, job roles and purchase price. It explores patterns in payment methods, spending behavior and online transactions. Using Pandas, Matplotlib and Seaborn, we clean analyze and visualize key trends to derive actionable insights.
data datacleaning dataframe datapreprocessing dataset libraries matplotlib numpy pandas plots visulaization
Last synced: 06 May 2026
https://github.com/shrutakeerti/eye-gaze-detection
This repo contains everything that I have done at IIT Jodhpur Summer Internship May 15 - July 15
ai aiml data eda eeg eeg-signals eye jodhpur mlflow
Last synced: 17 Mar 2025
https://github.com/shreedata/data-analysis-using-python-libraries-
The COVID-19 pandemic has significantly impacted India, necessitating a detailed analysis of the virusβs spread within the country. In this project, we explore an India-specific COVID-19 dataset, leveraging Python libraries such as Pandas, NumPy, Matplotlib, and Seaborn.
covid-19 data data-cleaning data-visualization datana kaggle-dataset matplotlib numpy pandas-python python3 pythonlibrarires scikit seaborn
Last synced: 28 Mar 2025
https://github.com/joshuadeguzman/xcraper
Python based stocks exchange data scraper
data pandas python stock-market
Last synced: 18 May 2026
https://github.com/srindot/average_flightdata_collection_fwuav
This repository is designed for collecting average data for a flapping wing UAV. The script acg_coeff_data_collection.py runs the necessary data collection, and the resulting data is saved into a CSV file called AverageFlightData.csv.
Last synced: 18 Sep 2025
https://github.com/davecumin/ancir_next
analysis chronobiology circadian d3 data data-analysis data-visualization svelte timeseries
Last synced: 18 May 2026
https://github.com/luminati-io/google-search-api
Two methods to collect real Google SERP dataβa free scraper for basic use and the enterprise-grade Bright Data API for high-volume demands.
data google-scraper html python serp-api web-scraping
Last synced: 25 Jun 2025
https://github.com/nodamu/apache-beam-studies
Personal Apache Beam studies repository
apachebeam batch-processing data dataeng dataengineering datapipeline stream-processing
Last synced: 04 Nov 2025
https://github.com/nitheshgoutham/singapore-resale-flat-prices-predicting
To Predict the Resale Price of a Flat
data data-visualization machine-learning python3 sql streamlit
Last synced: 09 May 2026
https://github.com/dscamilo/gestion-clientes-springboot
Proyecto de gestiΓ³n de clientes aplicando Java y Springboot, haciendo uso de Lombok, uso de interface, inyecciΓ³n de dependencias, uso de anotaciones Service, Data, RestController . Consumo de API haciendo uso de Postman.
data interface java lombok-maven restcontroller spring-boot
Last synced: 15 May 2026
https://github.com/analyticslover/salifort-motors-turnover-project
The Salifort Motors H.R. Project serves as the capstone for the Google Advanced Analytics Program on Coursera. This project presents a business scenario and a problem on the scnario context, employee turnover. In this project, essential techniques as EDA and Data Modeling are used to analyze and predict the employee turnover rates in the company.
data data-analysis datamodeling eda machine-learning pandas python sklearn
Last synced: 10 Apr 2026
https://github.com/bakangmonei/is_final_assignment
My intelligent systems assignment
data data-science intelligent-systems python
Last synced: 02 May 2026
https://github.com/hadarsharon/grizzlys
User-friendly Python DataFrames π΅π‘ powered by Julia π΄π’π£
big-data data data-analysis data-engineering data-frame data-frames data-science dataframe dataframe-library dataframes dataframes-jl julia python
Last synced: 18 May 2026
https://github.com/jlee9503/excel-projects
Fitness tracker dashboard, displaying users workout type, calories burned, and steps taken with multiple filters (gender, age, and workout intensity). Implemented using MS Excel.
Last synced: 16 Jan 2026
https://github.com/Sikessem/Typed
Convert PHP values to objects of strict types.
cast converter data object-oriented-programming oop php poo programmation-orientee-objets strict-types value-object variable-object
Last synced: 11 May 2025
https://github.com/xuender/kstats
Golang statistics library package that supports v1.18+.
algorithms analytics data go golang kstats machine-learning math rounding statistics
Last synced: 20 Jul 2025
https://github.com/thibautre/dataipsum
Configurable data generator (with crumbles inside)
algorithm data random-generation
Last synced: 21 Jul 2025
https://github.com/naithikjorapur/practive-tanstacktsx
Practice TanStack with React, Vite, and TypeScript to build fast, type-safe apps. Leverage tools like TanStack Query for data management and Vite for a streamlined development experience.
data exercise fetching html-css-javascript json learning-by-doing practice query router tsx
Last synced: 05 Apr 2025
https://github.com/tkxwaweru/python_data_manipulation
Manipulating the MASSIVE dataset using python
data dataanalysis excel python
Last synced: 11 Jan 2026
https://github.com/tusharios/weatherappwithmoya
binding data moyaexampleswift mvvm-architecture swift5 weather-app
Last synced: 28 Mar 2025
https://github.com/Axnjr/csv-parser-utils
Homework task for SWE position at Redhat.
csv data dataanalysis datatools pandas python
Last synced: 30 Oct 2025
https://github.com/pcpp94/elexon_pipeline_gb_demand
Guidelines and code snippets for extracting and processing Elexon gross demand data on Databricks. Provides half-hourly GB demand at sectoral (Domestic, Non-domestic), GSP-area granularity, settlement demand, and embedded generation. Supports non-commodity cost calculations for CfD, RO, and FiT.
data electricity elexon gb octopusenergy power powerdata pypsa uk
Last synced: 12 Jul 2025
https://github.com/mekramy/ircity
Iran province, county and city data in json format.
Last synced: 05 Apr 2025
https://github.com/fastbolt/entity-importer
Entity importing library for importing data from files (CSV and Excel currently) or API into doctrine.
data doctrine2 excel excel-import
Last synced: 17 Feb 2026
https://github.com/phtrempe/l2a
This is a small project which aims to show an example of applied machine learning in Python 3 with the Keras library and its TensorFlow backend to train a neural network model for it to learn to add two integers.
applied data data-science deep-learning keras machine-learning neural-network tensorboard tensorflow
Last synced: 05 May 2026
https://github.com/styd/sd_struct
Searchable Deep Struct
activesupport data gem openstruct rails ruby structure
Last synced: 18 May 2026
https://github.com/fastpix/flutter-core-data-sdk
A comprehensive Flutter SDK for video player analytics and event tracking, designed to provide detailed insights into video playback behavior and user engagement metrics.
Last synced: 15 May 2026
https://github.com/echang1802/normandy
Normandy is a python framework for data pipelines, which main objective is standardizing your team code and provide a data treatment methodology flexible to your team needs.
analytics business-intelligence data dataengineering datascience etl pipeline
Last synced: 11 Mar 2026
https://github.com/ppmim/papi4k_old2
PAPI: the PANIC data reduction pipeline
data near-infrared pipeline processing
Last synced: 23 Jun 2025
https://github.com/the-universal-linux-society/sysreport
Bash script to give you a full system report. Just by running the script it offers insight into CPU data, disk space, temperature readings, network configuration, MAC addresses, firewall status, and system logs for error analysis.
analysis bash bash-script bash-scripting data report reporting system
Last synced: 15 May 2026
https://github.com/eryks1999/data-collection-project_python
This project allowed me to practice classes, populating json files as well as extracting data.
Last synced: 16 Apr 2026
https://github.com/allanotieno254/spss-nutrition-research
This repository contains the results of statistical analyses performed in IBM SPSS Statistics on a child nutrition dataset.
data data-preprocessing dataanalysis spss
Last synced: 17 Feb 2026
https://github.com/yvandana/pwc-power-bi-job-simulation
Projects pursued during my Job Simulation
dashboard data dataanalysis powerbi pwc-forage-switzerland
Last synced: 06 Mar 2026
https://github.com/ahmedkhaled404/data-cleaning-and-eda-layoffs-mysql
This project involves cleaning a dataset containing information about layoffs from companies around the world.
data data-analysis data-cleaning data-preprocessing datacleaning eda exploratory-data-analysis mysql sql
Last synced: 08 Jun 2026
https://github.com/anthonysanalysis/bellabeat-analysis
Bellabeat Tech Case Study Capstone Project
analysis capstone case-study data data-analysis data-visualization md r rmd rstudio
Last synced: 20 Apr 2026
https://github.com/e22m4u/ts-projection
ΠΠΎΠ΄ΡΠ»Ρ Π΄Π»Ρ ΡΠ°Π±ΠΎΡΡ Ρ ΠΏΡΠΎΠ΅ΠΊΡΠΈΠ΅ΠΉ Π΄Π°Π½Π½ΡΡ Π΄Π»Ρ TypeScript
Last synced: 12 Apr 2025
https://github.com/ashishsingh789/quantium_data-analysis-_virtual-internship
Completed a job simulation focused on Data Analytics and Commercial Insights for the data science team. Developed expertise in data preparation and customer analytics, utilizing transaction datasets to extract valuable insights and deliver data-driven commercial recommendations
data datawrangling matplotlib pandas pandas-dataframe presentation programming python python-library
Last synced: 07 Apr 2026
https://github.com/yvandana/brain-tumor-detection-and-classification
Bachelor's Major Project- Presented at ICMISC 2022
2d-cnn brain-tumor-classification brain-tumor-detection cnn-model data data-augmentation keras-tensorflow sklearn-metrics
Last synced: 16 Jun 2025
https://github.com/himanshub16/lekhpal
Monitor and catalog Twitter feed matching your desired keywords
analytics data data-catalog data-filtering mongodb twitter twitter-streaming-api
Last synced: 14 May 2026
https://github.com/iota-pico/data
IOTA Pico Framework Data Structures and Helpers
data iota iota-pico-framework javascript typescript
Last synced: 18 May 2026
https://github.com/onemoredavid/python-like-a-boss
This is where I stash my Python study material.
data data-analysis data-engineering data-science data-visualization datascience ipynb ipynb-jupyter-notebook ipynb-notebook numpy pandas python python3
Last synced: 04 Apr 2025
https://github.com/jigyasag18/data-analysis-using-ms-excel
This project is on analyzing real-time data from Ambuvians Healthcare, a health products startup. It included data cleaning, such as removing duplicates and addressing missing values, followed by analyses to reveal insights into sales trends, customer demographics, and purchasing behaviors. Visualizations in MS-Excel including bar and pie charts.
analysis data data-visualization dataanalysis datacleaning datapreprocessing dataset msexcel visualization
Last synced: 07 Mar 2026
https://github.com/jigyasag18/amazon-power-bi-dashboard
The Amazon Power BI Dashboard Project repository provides an interactive analytics dashboard for visualizing and analyzing sales performance across various product categories within Amazon's ecosystem. Utilizing comprehensive sales data, it empowers stakeholders with actionable insights to enhance decision-making and improve business strategies.
data data-visualization dataanalysis dataanalytics dataset datasets datavisualization-project powerbi powerbi-report powerbi-visuals powerbidashboard
Last synced: 07 Mar 2026
https://github.com/yusuf4030/the-data-analyst-toolkit
π Explore essential data analysis tools organized by role and task, empowering users from students to professionals with quick access to valuable resources.
budget budget-management business-intelligence charts cookbook cureated-list data data-analysis-python data-visualization internet-of-everything internet-of-transport large-language-models nse open-source python selenium stock-market traffic-analysis
Last synced: 18 May 2026
https://github.com/juniorreisx/movelo-logstica
Movelo is a lightweight logistics simulator built with TypeScript that provides mock order and delivery data for developing and testing UIs, dashboards, and backend features without external APIs.
data hooks lucide-react react tailwindcss typescript
Last synced: 12 Apr 2025
https://github.com/jigyasag18/ibm-power-bi-dashboard-project
IBM Power BI Dashboard Project is a data-driven analysis of employees using IBM's comprehensive dataset, providing insights into key factors contributing to employee turnover and enabling organizations to strategize effectively towards improved employee retention and satisfaction.
data data-visualization dataanalysis dataanalytics dataset datavisualisation datavisualization-project powerbi powerbi-dashboards powerbi-report powerbi-visuals powerbidashboard
Last synced: 07 Mar 2026
https://github.com/byndyusoft/byndyusoft.data.relational.specifications
byndyusoft data relational specifications
Last synced: 12 Sep 2025
https://github.com/cannt39t/data-mining-spider-vk
ΠΠ°ΡΠΊ ΠΊΠΎΡΠΎΡΡΠΉ ΡΠΎΠ±ΠΈΡΠ°ΡΡ Π²ΡΡ ΠΈΠ½ΡΠΎΡΠΌΠ°ΡΠΈΡ ΠΎ ΡΠ΅ΠΊΠ»Π°ΠΌΠ½ΡΡ ΠΏΠΎΡΡΠ°Ρ Π² Π³ΡΡΠΏΠΏΠ΅ VK
data data-mining python3 vk vkontakte
Last synced: 05 Apr 2025
https://github.com/axafrance/azureml-to-openshift-talk
Scale your dev IA: From dev AzureML to prod OpenShift in one click
ai axa azureml data learn ml openshift raise-the-bar talk
Last synced: 16 Feb 2026
https://github.com/The-Tech-Idea/Beep.winform.Sample
Application for Managing your Different DataSources . Still in Alpha.please be patient
application data data-science database dataset integeration mysql nosql oracle postgres sqlite sqlserver workflow-engine workflows
Last synced: 04 Nov 2025
https://github.com/karensaraimoralesmontiel/8-week-sql-challenge
Case Studies Solutions for the 8-Week-SQL-Challenge.
Last synced: 02 Jan 2026
https://github.com/michael-sebero/data-recovery-tools
This tool suite recovers sensitive data.
algiz-linux archive corruption data data-recovery linux recover recovery rust tool tool-suite tools
Last synced: 18 May 2026
https://github.com/azaz9026/loan_approval_prediction
Welcome to the Loan Approval Prediction repository! This project aims to build a predictive model that can determine whether a loan application should be approved or denied based on various features. Purpose The goal of this repository is to develop a machine learning model that can accurately predict loan approval decisio
data data-analysis data-visualization eda machine-learning numpy pandas python statistics
Last synced: 06 Apr 2026
https://github.com/bastianolea/servel_elecciones
Resultados electorales desde Servel (2024)
chile comunas data elecciones genero
Last synced: 08 Jul 2025
https://github.com/melvinjwallace/melvinjw.github.io
A portfolio of a host of projects completed using python and sql.
data data-analysis data-cleaning data-loading data-mining data-preparation data-processing data-science data-transformation data-visualization dataset matplotlib microsoft-sql-server pandas-python seaborn
Last synced: 02 Apr 2026
https://github.com/rellyson/data-engineering-tools
This repository holds examples and documentation about the most used tools in the data engineering ecosystem.
apache-airflow apache-spark data data-engineering jupyter-notebook python tools
Last synced: 17 Jan 2026
https://github.com/gui-sitton/games
Identify patterns that determine whether a game is successful or not. This will allow you to identify potential big winners and plan advertising campaigns.
data data-analysis data-analysis-python data-science data-visualization python
Last synced: 18 May 2026
https://github.com/amethyst-php/issue
amethyst amethyst-package api data issue laravel task ticket
Last synced: 18 May 2026
https://github.com/ahadly/sql-data-analytics-project
This repository contains a collection of SQL scripts demonstrating various analytical techniques, such as changes over time, cumulative, performance, data segmentation, part-to-whole analysis.
analytics business-analytics business-intelligence data data-analysis data-analyst data-analytics data-engineering data-science data-scientist database datascience query reporting sql sql-queries sql-query sql-server window-functions window-functions-in-sql
Last synced: 18 May 2026
https://github.com/xjwllmsx/profitable-app-profiles
Analyzes Google Play & App Store data to recommend profitable profiles for free, ad-supported mobile apps
data data-analysis data-cleaning jupyter pandas python
Last synced: 18 May 2026
https://github.com/yugoff/ml-kaggle-regression-with-a-mohs-hardness-dataset
Your Goal: For this Episode of the Series, your task is to use regression to predict the Mohs hardness of a mineral, given its properties
data gradient-boosting kaggle kaggle-competition regression-models
Last synced: 18 May 2026
https://github.com/shamaz332/ecomrace-data-analysis-in-datascience
data data-science matplotlib pandas
Last synced: 15 May 2026
https://github.com/jmcph4/rpdb
rpdb
automation data database dataset db real-estate rpdata sql
Last synced: 12 Apr 2025
https://github.com/rid17pawar/friendscircle
Friends Circle is a console based application developed in cpp using Graph Data Structure.
cpp data graph graph-algorithms oop
Last synced: 08 Jun 2026
https://github.com/mkshah605/personal-brand-development
A data-driven approach to a personal brand development project.
branding data data-science growth music personal
Last synced: 12 Sep 2025
https://github.com/raghavendranhp/attrition-alchemy
This project uses machine learning to predict and analyze employee attrition in Company.By developing three predictive models,it identifies key factors influencing turnover,providing actionable insights to mitigate attrition challenges.The analysis focuses on enhancing job satisfaction,work-life balance and career growth opportunities.
data datawrangling decision-trees eda gradient-boosting logistic-regression macine-learning pandas preprocessing random-forest-classifier skicit-learn svm
Last synced: 18 May 2026
https://github.com/lisakey/lisakey
I am passionate about Python π and SQL ποΈ for data analysis π, and I actively develop projects in these languages.
analysis analyst data dataanalysis dataanalyst java python sql
Last synced: 02 May 2026