data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-02 00:07:45 UTC
- JSON Representation
https://github.com/cloud-shuttle/drover-sqlforge
The Data Automation Engine. A blazing-fast, pure Go alternative to dbt for data transformations.
ast data drover sql transformation
Last synced: 03 Jun 2026
https://github.com/shsiddhant/womens-wc
ML project to predict match outcomes for Women's Cricket World Cup 2025.
cricket-prediction data feature-engineering postgresql python
Last synced: 04 Apr 2026
https://github.com/holo-nim/flue
data streaming options
data nim reader-writer streams
Last synced: 04 Apr 2026
https://github.com/stdlib-js/dstructs
Data structures.
containers data data-structures javascript namespace node node-js nodejs ns stdlib structs structures
Last synced: 18 Apr 2026
https://github.com/opdev1004/crumbdbjs
JSON files based database Javascript
data data-storage data-store database database-management nodejs
Last synced: 18 Apr 2026
https://github.com/zurd46/zurdsynthdatagen
This Electron project uses the OpenAI ChatCompletion API to generate synthetic datasets in either German (DE) or English (EN).
data data-structures dataset electron json jsonl nodejs openai synthetic
Last synced: 04 Apr 2026
https://github.com/rd-uk/rduk-data-pg
PostgreSQL Data Provider implementation for rduk-data
Last synced: 18 Apr 2026
https://github.com/neelamraikwar9/bookdata
This is my 1st assignment git repository. I have worked with Book Data and by using Express Js created routes and API's for Post, Update, Delete, and Get.
api books data database deployment expressjs node nodejs postman postman-api
Last synced: 05 Apr 2026
https://github.com/mipacd/holochatstats
A VTuber chat log (and general) analytics platform
data flask hololive postgresql python visualization vtuber youtube
Last synced: 05 Apr 2026
https://github.com/codbex/codbex-number-generator-data
Number Generator for Documents Module - Data
Last synced: 05 Apr 2026
https://github.com/codbex/codbex-hestia-data-sample
Sample data for codbex-hestia
Last synced: 05 Apr 2026
https://github.com/kenatsf/basic_data_analysis
Basic data science project: ETL, forecast and data visualization.
analysis data data-analysis data-science logistic-regression matplotlib matplotlib-pyplot numpy pandas powerbi python scikit-learn time-series time-series-analysis time-series-forecasting
Last synced: 05 Apr 2026
https://github.com/josericodata/josericodata.github.io
Welcome to my portfolio website. This site showcases my skills, experience, education, and projects as a Data Analyst.
awesine-latex big-data career-development data data-analyst data-science database dublin ireland job-seeking jose-maria-rico-leal jose-rico jose-rico-data latex latex-cv portfolio portfolio-website python sql
Last synced: 18 Apr 2026
https://github.com/prakashjha1/loan-eligibility-prediction
This repository contains the codebase and resources for a machine learning-based project aimed at predicting loan eligibility for individuals. The project utilizes various algorithms and data preprocessing techniques to build predictive models that assess the likelihood of an applicant being eligible for a loan based on historical data.
data data-visualization exploratory-data-analysis loan-prediction-analysis machine-learning-algorithms naive-bayes-classification parameter-tuning python random-forest
Last synced: 19 Apr 2026
https://github.com/phelipe-sempreboni/certificates
Tutorial intended for information about my licenses and certificates acquired over time.
certificate certificates certification course data database datascience licences license-management marketing marketing-analytics python sql
Last synced: 16 May 2026
https://github.com/ahmad-ali-rafique/decision-tree-classifier-modeling
👏Comprehensive exploration of decision tree classifiers, including data cleaning, model building🏩, and performance evaluation on various datasets.
analytics classification classification-models data data-science dataanalytics datacleaning dataset decision-tree-classifier models
Last synced: 20 Apr 2026
https://github.com/montanaz0r/suicide-rate-analysis
Testing a significance of the correlation between a suicide rate and a number of psychiatrists and psychologists working in the mental health sector
analysis correlation data data-analysis data-science jupyter-notebook jupyter-notebooks matplotlib numpy pandas psychology python python-3 seaborn statistics suicide-rate
Last synced: 20 Apr 2026
https://github.com/omers/sre-devops-tools
Tools and useful sources for SRE and DevOps
awsome awsome-list data devops monitoring sre tools
Last synced: 20 Apr 2026
https://github.com/crypt596-rubykz/metaai-data-explorer-scraping-tool
MetaAI data explorer tool
api-research automation data explorer html-parsing metaai playwright python rate-limiting scraping
Last synced: 20 Apr 2026
https://github.com/stdlib-js/array-base-symmetric-banded-filled2d-by
Create a filled two-dimensional symmetric banded nested array according to a provided callback function.
alloc allocate array callback data fill filled foreach generic javascript map matrix multidimensional node node-js nodejs stdlib strided structure types
Last synced: 20 Apr 2026
https://github.com/hormcodes/data
Terraform configuration for public data storage hosted on data.horm.codes
aws cloudfront content-management data github-actions s3-bucket terraform
Last synced: 20 Apr 2026
https://github.com/sdspot2034/data-lemur-solutions
Solutions to SQL Problems on DataLemur
competitive-programming data data-analytics data-science database postgresql query sql
Last synced: 20 Apr 2026
https://github.com/nikoheikkila/maps
A TypeScript collection of specialized map implementations
data javascript maps typescript
Last synced: 20 Apr 2026
https://github.com/zhukovanan/stepik_
The completed tasks of different data or computer science related fields on stepik
data statistical-learning statistics stepik-course
Last synced: 21 Apr 2026
https://github.com/vidya-vijay/vidya-vijay
About me
analytics data data-science machinelearning python r spss sql statistics tableau visualization
Last synced: 21 Apr 2026
https://github.com/fastpix/android-data-kaltura
This SDK enables seamless integration with Kaltura Player, offering advanced video analytics via the FastPix Dashboard
analytics android-sdk data fastpix kaltura kaltura-player metrics sdk video video-metrics
Last synced: 21 Apr 2026
https://github.com/rahulpatel0615/sales-analysis-project
Sales Data Analysis Dashboard with Python, Pandas, and Matplotlib. Features 12+ visualizations and comprehensive insights.
data data-analysis data-visualization matplotlib pandas portfolio python
Last synced: 21 Apr 2026
https://github.com/vishwas-chakilam/movies-review-scraping-analysis
A project for collecting, cleaning, and analyzing movie data. Includes scripts for web scraping (deprecated) and using the OMDb API to fetch movie details. Analyze and visualize data with Python and Power BI to uncover insights and trends in movie ratings and genres.
data dataanalysis datacleaning datavisualization matplotlib-python numpy-library pandas python webscraping
Last synced: 21 Apr 2026
https://github.com/amethyst-php/alias
alias amethyst amethyst-libary amethyst-package api data laravel library package
Last synced: 21 Apr 2026
https://github.com/stefen-taime/llm-rag-mtl-public-hospital
Ce projet développe un modèle de type Retrieve-Augment-Generate (RAG) pour répondre aux questions en utilisant les données publiques des avis laissés sur Google pour des hôpitaux à Montréal
data google-reviews hopital hospital hub ia llm montreal open-source quebec rag
Last synced: 21 Apr 2026
https://github.com/jdenn0514/surveycore
Core Survey Analysis Infrastructure
Last synced: 21 Apr 2026
https://github.com/vck9521/traffic-accidents
In this project, we analyze the effects of various factors that correlate to traffic fatalities in the United States. Logistic regression is used, with the y variable being Fatality Rate (coded 0 for Survived, 1 for Fatality).
analysis data fatalities r regression rstudio traffic visualization
Last synced: 05 Jun 2026
https://github.com/schijioke-uche/data-analysis-with-python-an-spss-model
With this Python notebook algorithm, you can use SPSS Model notebook to build machine learning pipelines that you can use to iterate rapidly during the model building process in data analysis. Whether you're trying to find the right algorithm or experimenting with different ways of preparing your data, you can create reproducible research that's easily understood by any member of your team with Hypothesis definition.
anova cp4a cp4d cp4i cp4s data ibm ibm-cloud jeffrey-chijioke-uche jeffrey-solomon-chijioke-uche openshift python python3 redhat t-test
Last synced: 22 Apr 2026
https://github.com/rbcavi/factorio-mod-data
The modpacke data for factorio-viewer
data factorio factorio-data factorio-mod-data
Last synced: 23 Apr 2026
https://github.com/syed-nihaal/car-price-prediction-and-performance-analysis
A data science notebook project focused on analyzing car features and building a model for car price prediction.
data data-analysis data-visualization jupyter-notebook python
Last synced: 23 Apr 2026
https://github.com/ppatrzyk/heatmap
Display CSV as a heatmap in terminal
csv data data-visualization terminal
Last synced: 24 Apr 2026
https://github.com/elcarrillo/structpy
StructPy is a Python-based command-line tool designed for academics and scientists to manage data projects effectively. It simplifies workflows by creating structured project directories, generating timestamped filenames, validating datasets, and backing up projects seamlessly.
command-line-tool data database file-structure organization python science-tool
Last synced: 24 Apr 2026
https://github.com/coryson/osm-mla-finder
Python script to locate institutions employing Medical Laboratory Assistants in Germany, developed for BTZ – Berufliche Bildung Köln GmbH. It uses OpenStreetMap, SerpAPI, and web scraping to find and verify relevant labs, clinics, and diagnostic centers.
beautifulsoup data openstreetmap osm python scraping serpapi webscraping
Last synced: 24 Apr 2026
https://github.com/yuvrajsaraogi/-iris-flower-classification
Iris flower has three species; setosa, versicolor, and virginica, which differs according to their measurements. Now assume that you have the measurements of the iris flowers according to their species, and the task is to train a machine learning model that can learn from the measurements of the iris species and classify them.
classification data data-analysis data-science data-visualization flower flower-classification iris iris-classification iris-flower iris-flower-classification knn knn-classification machine-learning machine-learning-algorithms ml natural-language-processing nlp python
Last synced: 24 Apr 2026
https://github.com/cyberoctane29/python-for-data-analysis
A repository dedicated to learning Python for data analysis, data science, and data analytics. This collection of Jupyter notebooks covers practical exercises and concepts from the Google Advanced Data Analytics Professional Certificate program.
data data-analysis data-analytics data-science python
Last synced: 24 Apr 2026
https://github.com/hruth-vik/sales-analysis-report
SalesScope is a powerful sales analytics dashboard that extracts insights, reveals trends, and drives strategy from raw data.
analytics data powerbi-report powerbi-visuals python
Last synced: 24 Apr 2026
https://github.com/sanogotech/open-source-data-stack
modern open source data stack
airbyte airflow data data-science dbt docker postgresql python
Last synced: 11 Apr 2026
https://github.com/indhra/cats-ijcnn-data-2004
CATS IJCNN Data 2004 Competition of Artificial Time Series
2004 artificial cats data ijcnn time-series
Last synced: 22 Mar 2025
https://github.com/vidushibhadana/eda-on-nyc-taxi-data
About Conducting an Exploratory Data Analysis (EDA) on New York City taxi data and visualizing it through countplots, distribution plots (displot), and histograms using Python and it's libraries.
data data-visualization jupyter-notebook matplotlib numpy pandas python seaborn
Last synced: 11 Apr 2026
https://github.com/oliver021/helppad-net
Versatile .NET Toolkit: A Comprehensive Set of Miscellaneous Helpers, Classes, and Utilities
assert async checks cryptographic-algorithms data date dotnet fluent functional functional-programming hash helpers parallel pipe pipeline pointers review supports tasks
Last synced: 15 Jun 2026
https://github.com/parmsam/rweekly.data
R package containing data on Rweekly posts
Last synced: 21 May 2026
https://github.com/canadaluke888/terminaltablebuilder
Build and edit tabular data all from the terminal.
cli data data-manipulation excel json ods rich spreadsheets sqlite3 tables
Last synced: 20 Apr 2026
https://github.com/gagolews/clustering-data-v0
Datasets for Clustering [DEPRECATED – A NEW VERSION IS AVAILABLE]
clustering data dataset machine-learning
Last synced: 15 Sep 2025
https://github.com/mai-space/design-concept-sharing-recipes
🖼️ Concept for a framework based on state of the art technology and libaries for secure data sharing and online collaboration, as well as focus on the ux and ui of said framework
concept content-map data datasharing framework hci mci mock-up navigation-map peer-to-peer screendesign userstories
Last synced: 14 May 2025
https://github.com/lakshyakumar266/jee-dpp-manager-app
DPP manager app for JEE preparing Students
data expo javascript management react-native
Last synced: 07 May 2026
https://github.com/purarue/blizzard_gdpr_parser
Parses date-related information from my blizzard GDPR export.
blizzard data gdpr webscraping
Last synced: 06 Apr 2025
https://github.com/emna-chebbi/student-performance
Predictive model for student exam scores based on student performance factors
ai computer-vision data kaggle machine-learning ml mse regression regression-models
Last synced: 15 May 2026
https://github.com/colour-science/colour-streamlit-tm-30-18
Generates the "ANSI/IES TM-30-18 Colour Rendition Report" using Colour and Streamlit
color color-science color-space color-spaces colorspace colorspaces colour colour-science colour-space colour-spaces colourspace colourspaces data dataset datasets streamlit
Last synced: 29 Jun 2026
https://github.com/dug22/jjournal
A Jupyter like notebook software for Java
data data-analysis data-science java jshell jshell-repl notebook swing swing-application
Last synced: 11 Apr 2026
https://github.com/fridex/real-estate
My machine learning in real estate
data machine-learning real-estate
Last synced: 27 Jun 2025
https://github.com/neha-adnani/sql_music-store-analysis
SQL-based data analysis of a digital music store's sales and customer data.
business-analysis data data-analysis database follow-along-projects pgadmin4 portfolio-project postgres queries sql
Last synced: 18 Jun 2025
https://github.com/moscatellimarco/webscrap-imdb
🎬 Python scraper for IMDB: Extract movie/TV details for 📊 analysis & 🗃️ storage. Easy setup, 🔧 customizable, with 🖥️ CLI.
css data datascience html movies python scrapy scrapy-crawler scrapy-spider web web-scraping webdata webscraping
Last synced: 15 May 2026
https://github.com/muhamedlabs/muhamed_onedrive
Muhamed_OneDrive - це надійне і зручне хмарне сховище для файлів, розроблене для безпечного зберігання і легкого обміну даними.
data html5 onedrive programming style
Last synced: 04 Jan 2026
https://github.com/devbigboy/iti-database
This course will cover the following Topics: joins, Normalization, Aggregate function, Group By, Order By, Select, Ranking Functions, Built-In Functions
analytics data data-analytics mssql-database sql sql-server
Last synced: 03 Nov 2025
https://github.com/jun-labs/algorithm
📝 자료구조, 알고리즘 학습 저장소.
algorithm data data-structures leetcode problem-solving programmers ps structure
Last synced: 14 Mar 2025
https://github.com/amethyst-php/token
amethyst amethyst-package api data laravel token
Last synced: 21 May 2026
https://github.com/zevio/acl
ACL Anthology corpus sample
data dataset scholarly-articles
Last synced: 01 Mar 2026
https://github.com/kashirin-alex/thither.direct-onamove
an android skeleton-example application for using data from Thither.Direct platform on mobile applications
android-application data data-analysis data-structures data-visualization mobile-development mobility query research-data-management
Last synced: 27 Apr 2026
https://github.com/errea/vet_clinic_database
For this project you need special preparation. As the goal of this project is to solve some performance issue, first we need to introduce those issues. In order to do that, you will populate your database with a significant number of data.
data data-analysis data-structures data-visualization database
Last synced: 21 May 2026
https://github.com/tearth/test-data-generator
The generator of test data for the school project.
Last synced: 05 Jul 2025
https://github.com/vdoninav/real_estate_analysis
real estate analysis
data data-analysis data-analysis-python data-science pandas pandas-dataframe pandas-python plotly plotly-express scipy seaborn streamlit streamlit-application streamlit-dashboard streamlit-webapp
Last synced: 12 Apr 2026
https://github.com/xenoverseup/data-structures
Data structures in every language I know.
cpp data data-science data-structures data-structures-and-algorithms doubly-linked-list linked-list
Last synced: 14 May 2026
https://github.com/shysolocup/fndt
JavaScript package allowing you to see function data like body and arguments from outside of the function
aepl data fndt functions javascript javascript-tools js js-function js-functions lightweight nodejs nodejs-modules package stews
Last synced: 30 Apr 2026
https://github.com/jigyasag18/employee-salary-prediction-jigyasa
PayNexus is a machine learning-powered web app that predicts employee salaries based on role, education, and experience. Built using Python, Streamlit, and scikit-learn, it supports both single and batch predictions. The app includes advanced features like resume parsing via NLP and interactive visual analytics. Ideal for job seekers, HR profession
data dataset decision-tree-regressor gradient-boosting-classifier knearest-neighbor-classifier labelencoder lasso-regression linear-regression machine-learning machine-learning-algorithms machinelearning onehot-encoder pipeline random-forest random-forest-classifier ridge-regression standardscaler svr-regression-prediction xgboost xgboost-classifier
Last synced: 15 May 2026
https://github.com/eshan-sud/secureit
A Blockchain-based Data Sovereignty Platform
blockchain data decentralised-application platform sovereignty
Last synced: 21 Jan 2026
https://github.com/vaxdata22/foresight-institution
This is a Data Analysis case study done on the Foresight Institution dataset.
actionable-insights business-analytics business-intelligence data data-analytics data-cleaning data-mining data-processing data-visualization data-wrangling exploratory-data-analysis spreadsheets sql sql-server sql-server-management-studio statistical-analysis t-sql transact-sql
Last synced: 28 May 2026
https://github.com/rsc-labs/see-open-data
Show www.dane.gov.pl in user friendly format. Generate flourish data or other data visualizations.
data data-visualization flourish government poland
Last synced: 04 Apr 2025
https://github.com/kinshukjainn/dclue-v1
Dsainone is a highly optimized Data Structures and Algorithms (DSA) library designed to provide efficient implementations of graph algorithms, trees, hashing, and linked lists while maintaining exceptional memory efficiency. The library is designed to be as fast and optimized as possible
Last synced: 20 May 2026
https://github.com/anzerr/storage.ts
Util to store data used in a service
data nodejs storage typescript util
Last synced: 20 May 2026
https://github.com/amethyst-php/geolocation
amethyst amethyst-package api data geolocation laravel
Last synced: 20 May 2026
https://github.com/ashu3291/blinkit-app-store-
conducted a comprehensive analysis of Blinkit's sales performance, customer satisfaction and inventory distribution to improve the sales performance.
cleaning-data data dataanalysis-projects powerbi-visuals powerbidashboard sql
Last synced: 05 Jan 2026
https://github.com/amliyanage/data-structures
arrays binary-tree data data-structures graph hashtable linked-list stack
Last synced: 06 Apr 2025
https://github.com/stdlib-js/array-base-any-has-property
Test whether at least one element in a provided array has a specified property, either own or inherited.
any array assert data generic has javascript node node-js nodejs prop property stdlib structure test types validate
Last synced: 20 May 2026
https://github.com/ffatahillah7/snowflake-tastybytes-data-warehouses
Build Snowflake Tasty Bytes Warehouses
data data-warehouse mysql snowflake sql warehouse
Last synced: 26 Mar 2025
https://github.com/fiddlydigital/anonimizer
A lib to replace and rehydrate sensitive data in text
anonimize anonymize data data-security prompt sanitize string string-manipulation text
Last synced: 15 Mar 2025
https://github.com/dms-codes/scrape-kesaintblanc-id
Kesaintblanc Data Scraper This Python script is designed to scrape product data from the Kesaintblanc website. It collects information about products, including product name, URL, price, image URLs, status, stock, and more. The scraped data is saved to a CSV file for further analysis.
data kesaintblanc python webscraper
Last synced: 27 May 2026
https://github.com/estherslabbert/final-capstone-unsupervised-ml
Exploration of USArrests data using unsupervised machine learning
arrests correction data data-analysis data-clustering data-visualization jupyter-notebook machine-learning pca-analysis standardised-data usa
Last synced: 26 Jun 2025
https://github.com/basis-company/data-player.js
in memory data layer for fast access to plain normalized data
collection data model traversal
Last synced: 25 Feb 2025