data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-07-02 00:07:45 UTC
https://github.com/pyrustic/litedao
Intuitive interaction with SQLite database
auto-init dao data database database-access library lightweight pyrustic python sql sqlite
Last synced: 09 May 2026
https://github.com/jigyasag18/fake-news-prediction-project
A machine learning project that classifies news articles as fake or real. It uses a dataset of 20,000 articles, TF-IDF vectorization, and the Porter stemming algorithm, and achieves around 97% classification accuracy with a logistic regression model.
data datapreprocessing logistic-regression machine-learning machine-learning-algorithms numpy pandas prediction stemming vectorization
Last synced: 08 Jun 2026
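The TF-IDF vectorization mentioned in the entry above can be sketched in a few lines. This is a minimal illustration and not the repository's code: the `tf_idf` function, the whitespace tokenizer, and the weighting variant (term count divided by document length, times the natural log of N over document frequency) are all assumptions chosen for brevity.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Return one {term: weight} dict per document.

    Assumed weighting: tf = count / document length, idf = ln(N / df).
    Smoothed or normalized variants are equally common in practice.
    """
    # Naive tokenizer (assumption): lowercase and split on whitespace.
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    weights = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens)
        weights.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return weights


# Hypothetical two-document corpus, for illustration only.
docs = ["breaking news today", "fake news spreads fast"]
vectors = tf_idf(docs)
```

A term that appears in every document ("news" here) gets weight zero under this variant, which is why TF-IDF tends to emphasize words that distinguish one article from another.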
https://github.com/campiohe/geomask
A very simple lib for creating geometric masks from spatial data using regular grids.
Last synced: 30 Dec 2025
https://gitlab.com/sean-c/pdf_rules
Turn PDFs into CSVs by defining rules
Data Cleaning automation data data parsing
Last synced: 14 Apr 2025
https://github.com/rameshaditya/dynamic-hybrid-data-grid
Facilitates faster read-and-write of large ordered collections of data.
algorithms data data-structures storage
Last synced: 30 Jun 2026
https://github.com/vijaykumar1303/sales-data-analysis-and-dashboard-development
Analyzes sales data to uncover insights into sales performance, trends, and patterns, and develops an interactive dashboard that provides a comprehensive view of sales metrics and KPIs.
data dataanalysis datacleaning datavisualisation dax-query powerbi powerquery sql sqldataanalysis
Last synced: 11 Feb 2026
https://github.com/pyfig/s21_data-science-bootcamp
School21 Bootcamp Data Science
data data-science numpy pandas python school21
Last synced: 26 Jun 2025
https://github.com/amethyst-php/price
Define prices and attach them to any model
amethyst amethyst-package api data laravel price
Last synced: 17 May 2026
https://github.com/eloyhere/semantic-java
Semantic-Java is a modern Java stream processing framework with zero dependencies, available as a Maven dependency. It blends the fluency of Java Streams, the laziness of JavaScript generators, and index-based control inspired by database indexing, and is aimed at time-series, event streams, and high-performance data pipelines.
data functional functional-programming java pipeline stream
Last synced: 07 Apr 2026
https://github.com/ahmad-ali-rafique/random-forest-classifier-modeling
Detailed exploration of random forest classifiers, including data cleaning, model building, and performance evaluation on various datasets.
classification classification-models data dataanalytics datamodel dataset model-checking models random-forest random-forest-classifier
Last synced: 01 Jun 2026
https://github.com/shailu2004/azure_big_data_project
This project demonstrates a comprehensive Azure Data Engineering workflow using multiple Azure resources to process and analyze an e-commerce dataset. The dataset consists of 8 files containing details about customers, payments, orders, and other key information.
ai azure cloud data data-engineering
Last synced: 08 Jul 2025
https://github.com/ahmad-ali-rafique/random-forest-regressor-modeling
Detailed exploration of random forest regressors, including data cleaning, model building, and performance evaluation on various datasets.
data dataanalytics datacleaning evaluation-metrics modeling random-forest random-forest-regression regression regression-analysis
Last synced: 05 Mar 2025
https://github.com/vaxdata22/foresight-pharmaceutical
This is a Data Analysis case study done on the Foresight Pharmaceutical Company dataset.
actionable-insights business-analytics business-intelligence data data-analytics data-cleaning data-mining data-visualization data-wrangling exploratory-data-analysis spreadsheets sql sql-server sql-server-management-studio statistical-analysis t-sql transact-sql
Last synced: 05 Mar 2025
https://github.com/danielrosehill/ghg-ebitda-correlations
Streamlit data visualisation examining correlation between emissions & profitability
data sustainability sustainability-data
Last synced: 14 Mar 2025
https://github.com/theduardomaciel/cc-pe
Course materials, R scripts, and datasets used in the Probability and Statistics course.
Last synced: 27 Mar 2025
https://github.com/ahmad-ali-rafique/electricity-consumption-analysis-household-dataset
This repository contains analysis and predictive modeling of household electricity consumption using Python. It includes data cleaning, exploratory data analysis (EDA), time series forecasting (ARIMA, SARIMA, LSTM), and model evaluation to optimize energy usage.
arima-forecasting artificial-intelligence artificial-neural-networks data data-science dataanalytics datacleaning evaluation-metrics exploratory-data-analysis long-short-term-memory lstmmodel modeling time-series timeseries-forecasting
Last synced: 23 Jun 2025
https://github.com/truongnhatbui/automatidata
Automatidata
data data-analysis data-science data-visualization python tableau
Last synced: 08 Jul 2025
https://github.com/ethenkem/pygraphsurvey
A Python-based web app that provides graphical analysis of data collected from surveys. The system has its own built-in form filling: an admin can set questions and send a link for the forms to be filled in, and the system then provides analysis of the collected data. Form features include selection options, range values, file inputs, etc.
Last synced: 12 Jan 2026
https://github.com/amethyst-php/data-view
amethyst amethyst-package api data data-view laravel
Last synced: 19 May 2026
https://github.com/stdlib-js/dstructs-circular-buffer
Circular buffer.
buffer circular collection cyclic data data-structure data-structures fifo first-in-first-out javascript node node-js nodejs queue ring stdlib structure
Last synced: 20 May 2026
https://github.com/ournet/embed-providers-data
Embed providers data
data embed embed-providers json providers
Last synced: 03 May 2026
https://github.com/gui-sitton/carsells
In this project I am an analyst on the Crankshaft List. Hundreds of free vehicle advertisements are published on the site every day. I need to study the data collected over the last few years and determine which factors influence the price of a vehicle.
data data-analysis data-analysis-python data-science data-visualization python
Last synced: 20 May 2026
https://github.com/amethyst-php/contact
amethyst amethyst-package api contact data laravel
Last synced: 20 May 2026
https://github.com/avijeetpandey/quizzez
Implementation of the Quizzez application using Kotlin
Last synced: 20 May 2026
https://github.com/amethyst-php/shipment-zone
amethyst amethyst-package api data laravel shipment-zone
Last synced: 20 May 2026
https://github.com/prcharan592/olympic-insights-historical-data-analytics-in-r
This project analyzes 120 years of Olympic history (1896–2016), uncovering trends and insights from the data
data data-analytics data-science data-visualization kaggle r-programming
Last synced: 03 Apr 2025
https://github.com/amethyst-php/source
The source of information. It can be used to record the origin of any piece of information (news, books, etc.).
amethyst amethyst-package api data laravel source
Last synced: 27 Apr 2026
https://github.com/ressuman/next-blog-1-project
Next.js with TypeScript: Fetching Data and Setting Up Routes. This project demonstrates my first experience with Next.js using TypeScript. It involves fetching posts from the JSON Placeholder dummy API, setting up pages, and linking routes.
api-rest data html-css-javascript jsx nextjs14 routing typescript
Last synced: 15 May 2026
https://github.com/basis-company/data-player.js
In-memory data layer for fast access to plain normalized data
collection data model traversal
Last synced: 25 Feb 2025
https://github.com/szc126/metadata-nnd-vocalo-twitter
Collects tweets announcing newly posted VOCALOID-related videos ("new VOCALOID/UTAU videos" tweet collection)
data nico-nico-douga niconico vocaloid
Last synced: 20 May 2026
https://github.com/estherslabbert/final-capstone-unsupervised-ml
Exploration of USArrests data using unsupervised machine learning
arrests correction data data-analysis data-clustering data-visualization jupyter-notebook machine-learning pca-analysis standardised-data usa
Last synced: 26 Jun 2025
https://github.com/raruto/cockpit-sample-data
Sample data installer addon for Cockpit CMS
Last synced: 17 Mar 2025
https://github.com/ranjeetj06/insighthub
InsightHub is a data analytics project that helps automate the entire process of preparing, analyzing, and reporting on CSV data.
analysis begineer data springboot
Last synced: 17 May 2026
https://github.com/disruptek/bloom
bloom filters
bloom data filter hash membership nim probability set structure
Last synced: 04 Apr 2025
https://github.com/stdlib-js/array-base-any-has-property
Test whether at least one element in a provided array has a specified property, either own or inherited.
any array assert data generic has javascript node node-js nodejs prop property stdlib structure test types validate
Last synced: 20 May 2026
https://github.com/ellisvalentiner/legislation-embeddings
Embeddings for U.S. Congress legislation
data embeddings machine-learning nlp python
Last synced: 12 Aug 2025
https://github.com/tuscanicz/doctrine-data-applier
Symfony bundle for Doctrine Migrations of data using doctrine entities
data database doctrine entity migrations symfony symfony-bundle
Last synced: 02 Feb 2026
https://github.com/bala-1409/sales-forecasting-datascience-project
Develop a data science project using historical sales data to build a regression model that accurately predicts future sales. Preprocess the dataset, conduct exploratory analysis, select relevant features, and employ regression algorithms for model development. Evaluate model performance, optimize hyperparameters, and provide actionable insights.
data data-analysis data-science data-visualization datacleaning exploratory-data-analysis machine-learning-algorithms modelfitting prediction predictive-analytics predictive-modeling python3 regression-models salesforecast supervised-learning
Last synced: 26 Apr 2026
https://github.com/bala-1409/loan-classification-data-science-projects
This project uses machine learning algorithms to predict the classification of loan status. The dataset is loaded and then transformed with SQL to obtain a clean dataset containing valid information.
data data-analysis datacleaning datascience datavisualization exploratory-data-analysis loan machine-learning machine-learning-algorithms modelfitting sql supervised-learning visualization
Last synced: 22 Mar 2025
https://github.com/sharmadhiraj/plot-pi
Graphical Representation of PI
data data-visualization html javascript js mathematics plot
Last synced: 28 Mar 2025
https://github.com/itsmeyogesh22/solved-8-weeks-sql-challenge-correct-solutions
Part of the Serious SQL virtual apprenticeship program, this repository contains solutions for all eight case studies crafted by Danny Ma. For more information, visit: https://8weeksqlchallenge.com/
8weeksqlchallenge data dataanalytics datawithdanny postgresql sql sqlserver-2022 t-sql
Last synced: 07 Apr 2025
https://github.com/kunalkumar2001/coffee_sales_project_using_excel_power-bi_and_sql
Coffee Shop Sales Dashboard built using Power BI for visualization and SQL for data extraction and transformation. The project dives deep into sales performance, providing actionable insights for data-driven decisions.
analytics data dataanalytics mssql powerbi sql
Last synced: 26 Jun 2025
https://github.com/amliyanage/data-structures
arrays binary-tree data data-structures graph hashtable linked-list stack
Last synced: 06 Apr 2025
https://github.com/jun-labs/json-handling
🔍 JSON data handling examples.
data gson jackson json json-object
Last synced: 15 May 2026
https://github.com/dolanmiu/mclaren-task
A front-end assessment task for McLaren
angular data observable observables rxjs
Last synced: 16 May 2026
https://github.com/badr-moufad/dashboard-agriedge-data
Prepare data for dashboard. This is part of my research internship.
acquisition dashboard data data-morocco data-science data-visualisation weather weather-dashboard weather-data
Last synced: 04 Apr 2025
https://github.com/luminati-io/linkedin-dataset-samples
Sample dataset of 1001 LinkedIn companies, extracted via Bright Data API, featuring essential data points for competitive analysis and market insights.
data database dataset linkedin linkedin-api linkedin-data linkedin-dataset linkedin-scraper sample web-scraping
Last synced: 17 Mar 2025
https://github.com/amethyst-php/taxonomy
amethyst amethyst-package api data laravel taxonomy
Last synced: 18 Jan 2026
https://github.com/preranarao03/madhav_e-commerce_dashboard
This repository features the Madhav_E-Commerce_Dashboard built with Power BI. It provides interactive visualizations for analyzing e-commerce sales performance, product categories, customer segments, and geographic data, aiding in data-driven business decisions.
Last synced: 30 Jan 2026
https://github.com/jszafran/personal-aws-data-lake
Personal, cloud based (AWS), data lake for experimenting with cloud services.
aws cloud data data-engineering dataengineering datalake etl terraform
Last synced: 20 May 2026
https://github.com/pulipulichen/pts-local-news-dataset
A dataset containing local news from Public Television Service.
Last synced: 27 Mar 2026
https://github.com/amethyst-php/geolocation
amethyst amethyst-package api data geolocation laravel
Last synced: 20 May 2026
https://github.com/qubitpi/wiktionary-data
Wiktionary data in simple parsable formats hosted on 🤗 Datasets
ancient-greek data german huggingface huggingface-datasets language latin natural-language-processing nlp old-persian python wiktionary wiktionary-data
Last synced: 17 Jul 2025
https://github.com/redinfinitypro/scientificsharp
Rating: (5/10) The code is a Windows Forms application for a basic scientific calculator, allowing users to perform mathematical operations like addition, subtraction, multiplication, division, trigonometric functions, and logarithms.
componentmodel cryptography data drawing forms generic linq system tasks text
Last synced: 06 Apr 2025
https://github.com/xylambda/data-structures-algorithms
This repository provides implementations of popular algorithms and abstract data types using Java.
algorithm algorithms array arraylist avl-tree data data-structures graph heap iterative java linked list netbeans queue recursive set stack tree
Last synced: 30 Jun 2026
https://github.com/kashyap-prabhat/sigma
A Scala library for probability and statistics formulas, including rules for probability calculations.
data formulas library mathematics probability scala statistics
Last synced: 30 Jun 2026
https://github.com/ciscorn/japanmesh-rs
A Rust library for handling Japanese Grid Square Code (JIS X 0410:2002 地域メッシュコード)
census data geospatial japan rust
Last synced: 11 Jan 2026
https://github.com/yusufterzii/weather-data-analysis-with-pandas
Pandas Example
data dataanalysis pandas pandas-dataframe pandas-python
Last synced: 20 May 2026
https://github.com/anzerr/storage.ts
Util to store data used in a service
data nodejs storage typescript util
Last synced: 20 May 2026
https://github.com/deva-246/excel-power-query-data-cleaning-dashboard
dashboard data datacleaning excel pivottable powerquery slicer
Last synced: 22 Mar 2025
https://github.com/mx51/data-dictionary-action
GitHub Action for generating and checking freshness of data dictionaries
Last synced: 17 Jan 2026
https://github.com/chompfoods/stub-jaxrs-jersey
JAX-RS Jersey server stub for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
api branded chomp data database food grocery ingredients jax-rs jersey nutrition raw recipe-api recipes server server-stub stub stub-server
Last synced: 02 May 2026
https://github.com/bho0920/crime-data-analysis-eu
Crime Data Analysis for Self-Defense Tool Market Entry in the EU.
data data-analysis sql sqlite tableau
Last synced: 21 Jun 2025
https://github.com/shubhamsoni98/classification-with-random-forest---2
Fraud detection is a critical task for financial institutions and businesses. This document outlines the end-to-end process of predicting fraudulent activities using a Random Forest model. The process includes data preparation, exploration, model training, and evaluation.
algorithms anaconda data data-science dataflow feature-engineering jupyter-notebook machine-learning model modeltraining prediction python random-forest sql visualization
Last synced: 20 Jan 2026
https://github.com/simranjeet97/kaggle_pokemon_datset_eda-dashboard
Full EDA and Dashboard of Kaggle Pokemon Dataset with Live Streaming Data and Images
cloud data data-science dataanalytics machine-learning machine-learning-algorithms pokemon pokemon-dataset pokemon-prediction python science
Last synced: 07 May 2026
https://github.com/muhammadadilnaeem/data-science-materials
This repository will contain basic source code and materials related to data science.
artificial-intelligence artificial-neural-networks calculus data data-science deep-learning deep-neural-networks machine-learning machine-learning-algorithms mathematics nlp-machine-learning projects statistics
Last synced: 07 May 2025
https://github.com/harrisonwelch/pythondatascience
Repo of code from the LinkedIn lesson "Python: Data Analysis"
data data-science matplotlib notes numpy python tutorial
Last synced: 12 Apr 2026
https://github.com/moons-14/datapot
Incorporate and serve all information.
ai aiogram api data infomation news newspaper rss video
Last synced: 04 Jan 2026
https://github.com/umstek/sampler
Generate elaborate random data instantly.
data faker javascript json sample
Last synced: 20 Jul 2025
https://github.com/aguven6/inmemory-data-processor
Converts tabular data to indexed columnar data. The aim is to process very large datasets faster, especially in aggregation operations.
columnar-storage data data-structures parallel-computing parallel-programming processing
Last synced: 17 May 2026
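The row-to-column conversion described in the entry above can be sketched as follows. This is only an illustration under stated assumptions, not the repository's implementation: the helper names `to_columnar`, `build_index`, and `sum_by` are hypothetical, and every row is assumed to carry the same keys.

```python
from collections import defaultdict


def to_columnar(rows):
    """Turn a list of row dicts into a {column name: list of values} mapping.

    Assumes every row has the same keys, so all columns end up equally long.
    """
    columns = defaultdict(list)
    for row in rows:
        for name, value in row.items():
            columns[name].append(value)
    return dict(columns)


def build_index(column):
    """Map each distinct value in a column to the row positions holding it."""
    index = defaultdict(list)
    for position, value in enumerate(column):
        index[value].append(position)
    return dict(index)


def sum_by(columns, key, measure):
    """Sum `measure` per distinct value of `key`, reading only those two columns."""
    index = build_index(columns[key])
    values = columns[measure]
    return {k: sum(values[i] for i in positions) for k, positions in index.items()}


# Hypothetical sample data, for illustration only.
rows = [
    {"region": "north", "sales": 10},
    {"region": "south", "sales": 5},
    {"region": "north", "sales": 7},
]
columns = to_columnar(rows)
totals = sum_by(columns, "region", "sales")
```

The aggregation reads only the `region` and `sales` columns and never touches the other fields of a row, which is the usual argument for columnar layouts in aggregation-heavy workloads.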
https://github.com/jigyasag18/credit-card-fraud-detection-using-machine-learning
This repository presents a credit card fraud detection system utilizing a Logistic Regression model trained on a dataset of 284,807 transactions with significant class imbalance. After employing under-sampling for balance, the model achieves a test accuracy of around 93.40%, showcasing the effectiveness of ML in identifying fraudulent transactions.
credit-card-fraud creditcardfrauddetection data dataset logistic-regression logisticregression machine-learning machine-learning-algorithms mlproject mlprojects
Last synced: 02 Sep 2025
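The under-sampling step mentioned in the entry above can be illustrated with a short standard-library sketch. It is an assumption about the general technique (random under-sampling of the larger classes down to the size of the smallest), not the repository's code; the `undersample` function and the toy data are hypothetical.

```python
import random
from collections import defaultdict


def undersample(samples, labels, seed=0):
    """Randomly drop samples so every class keeps as many as the smallest class."""
    by_label = defaultdict(list)
    for sample, label in zip(samples, labels):
        by_label[label].append(sample)
    # Size of the rarest class; every class is cut down to this many samples.
    target = min(len(group) for group in by_label.values())
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    balanced = []
    for label, group in by_label.items():
        for sample in rng.sample(group, target):
            balanced.append((sample, label))
    rng.shuffle(balanced)
    return balanced


# Hypothetical imbalanced data: 2 positive samples versus 8 negative ones.
samples = list(range(10))
labels = [1, 1] + [0] * 8
balanced = undersample(samples, labels)
```

After balancing, each class contributes two samples. Accuracy measured on a balanced subset is not directly comparable to accuracy on the original imbalanced data, so precision and recall are usually reported alongside it.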
https://github.com/mbiushelix/soilresp
Geoscience 1 (Geofag 1) fieldwork from Vg2
data data-visualization geology global-warming norwegian-language soil-quality-testing soil-respiration
Last synced: 23 Jul 2025
https://github.com/ntnn/dataparse
Parsing, transforming and unmarshalling data.
data data-parser data-parsing data-transformation golang golang-lib
Last synced: 30 Jun 2026
https://github.com/khansasafira19/sk-cool-storytelling
Source Code for Data Storytelling with HTML5
data html5 javascript storytelling
Last synced: 13 May 2026
https://github.com/arthurcfranklin/acervo-musical
This project creates a relational database to help a DJ organize and catalog their music collection. The goal is to provide an efficient system for storing and managing information about singers, bands, songs, and their remixed versions.
data database mysql mysql-database sql
Last synced: 22 Mar 2025
https://github.com/kinshukjainn/dclue-v1
Dsainone is a highly optimized Data Structures and Algorithms (DSA) library designed to provide efficient implementations of graph algorithms, trees, hashing, and linked lists while maintaining exceptional memory efficiency. The library is designed to be as fast and optimized as possible.
Last synced: 20 May 2026
https://github.com/sibeux/redesigned-broccoli
Repository for storing music file data
data data-center nasrulwahabi sibeux
Last synced: 24 Jan 2026
https://github.com/ishansurdi/data-visualisation-empowering-business-with-effective-insights
The following tasks are completed for Data Visualization: Empowering Business with Effective Insights on Forage in October 2024. It is important to note that this should not be interpreted as an endorsement.
chart communicating-insights-and-analysis dashboard data data-analysis forage powerbi powerbi-visuals tableau tata tata-group virtual-internship visual visualization
Last synced: 17 Feb 2026
https://github.com/shoaib1522/database-systems
📚💾 Master the fundamentals of database systems with this all-in-one lab repository, featuring ERD design diagrams 🧠🗺️, Oracle SQL 🌐📝, relational schema practice, and complete PowerPoint lectures 🖥️📑. Perfect for revision, exams, or quick reference! 💡📘
data database database-management databases databases-course db dbms-project erd notes oracle oracle-database sql
Last synced: 21 Aug 2025
https://github.com/piyushkumar2025/analytical-sql-project-exploring-trends-segmentation-kpis
A complete SQL analytics project using a simulated data warehouse. It analyzes sales, customer, and product data with CTEs, joins, window functions, subqueries, and views to deliver insights on trends, segmentation, and KPIs, showing how SQL enables data-driven decisions without BI tools.
advanced-sql analytics business-intelligence data data-science-projects datascience joins kpi mysql query sql window-functions-in-sql
Last synced: 02 Jul 2025
https://github.com/ressuman/csv-writer-project
CSV Writer with TypeScript. This project demonstrates my implementation of a CSV writer using plain TypeScript and JavaScript, without relying on any frameworks.
Last synced: 15 May 2026
https://github.com/xmen3em/kaggle-competitions
This collection contains various projects and notebooks developed to tackle a range of Kaggle competitions, showcasing different machine learning techniques, data preprocessing methods, and model optimizations.
data data-science data-visualization deep-learning deployment ensemble-learning machine-learning-algorithms python streamlit
Last synced: 09 Apr 2026
https://github.com/birjemin/wxgameod
wxgame open data: Weixin (WeChat) mini game relationship-chain data
data interactive-data relation user-storage
Last synced: 16 Jul 2025
https://github.com/dhi13man/rca_ace
RCA Ace is designed for organizations seeking to enhance their understanding and utilization of insights derived from Root Cause Analyses (RCAs).
analytics data enterprise open-source python python3 rca
Last synced: 10 Sep 2025
https://github.com/shivamsharma32/ipl-2022-analysis
The IPL 2022 Analysis project is a data-driven exploration of the Indian Premier League (IPL) 2022 cricket tournament. The analysis focuses on utilizing Python programming and various libraries to analyze and visualize the performance of teams, players, and key metrics in the IPL 2022 season.
data dataana dataanalytics datavi matplotlib python
Last synced: 17 May 2026
https://github.com/ayush1999/data-mining
data mining natural-language-processing
Last synced: 10 Sep 2025
https://github.com/weecology/updating-data
Hugo website for instructions on how to make a regularly updating data pipeline
continuous-analysis continuous-integration data gh-actions living-data netlify travis-ci
Last synced: 17 Feb 2026
https://github.com/tadiusfrank2001/data_mining_projects_labs_cs145
A collection of data mining course assignments to implement advanced predictive statistical analysis models
algorithms data data-mining data-science deep-learning predictive-modeling python3 wide-learning
Last synced: 16 May 2026