data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-06-30 00:07:50 UTC
https://github.com/chompfoods/stub-inflector
Inflector server stub for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
api branded chomp data database food grocery inflector ingredients nutrition raw recipe-api recipes server stub stub-inflector stub-server
Last synced: 27 Apr 2026
https://github.com/shreeparab1890/indian-elections-2019-analysis-eda
This IPython notebook is an exploratory data analysis (EDA) of the 2019 Indian Lok Sabha elections.
data data-analysis data-science data-visualization eda exploratory-data-analysis matplotlib numpy pandas plotly python python3 visualization
Last synced: 28 Apr 2026
https://github.com/epomatti/az-e2e-data-eng-proj
Data engineering with Azure services
azure data data-engineering databricks datafactory datalake lake synapse terraform
Last synced: 28 Apr 2026
https://github.com/codegeekr/test_datasciencestarter
test Data Science Starter
analytics data data-science data-visualization machine-learning python science starter-kit statistics test
Last synced: 28 Apr 2026
https://github.com/n-ce/localstorage-data-interchange-manager
Implementation of local storage data interchange using a map data structure.
data export import javascript js-maps json localstorage
Last synced: 28 Apr 2026
https://github.com/mumtaz4118/scraping-medium-and-data-analytics
The file DataExtraction.py extracts information from the JSON files scraped by the scraper medium_scrapper_post.py. To extract information from JSON files scraped by medium_scrapper_tag_archive.py (which scrapes the tags archive), use Data_Extraction_Archive_Tags.py.
data data-analysis data-analytics data-extraction data-preprocessing data-science data-scraping deep-learning machine-learning python
Last synced: 29 Apr 2026
https://github.com/stdlib-js/array-struct-factory
Return a constructor for creating arrays having a fixed-width composite data type.
array composite data factory javascript node node-js nodejs stdlib struct structure typed typed-array types
Last synced: 29 Apr 2026
https://github.com/apsalverda/ebird-hotspot-menu-bar-python
🪶 Retrieve recent hotspot observations using eBird API
data ebird ebird-api hotspot instructions live livedata macos menubar observations platypus
Last synced: 29 Apr 2026
https://github.com/chandansoren/financial-budget-analysis
Financial budget for 2021
Last synced: 29 Apr 2026
https://github.com/istinnew/eniac_ab_insight
A comprehensive analysis aimed at boosting iPhone 13 sales by optimizing the click-through rate (CTR) of the “SHOP NOW” button. It compares different button designs to determine the most effective strategy for increasing engagement.
ab-testing data data-analysis data-engineering data-science data-visualization google googlecolab libraries python testing testing-tools visual-studio-code
Last synced: 29 Apr 2026
https://github.com/wireservice/workbench-lookup
A port of `agate-lookup` to Workbench
data journalism lookup workbench
Last synced: 08 Jun 2026
https://github.com/syed-bakhtawar-fahim/datavisualization
Data Visualization with Python
big-data-analytics data data-analysis data-analysis-python data-science data-visualization pandas pyspark
Last synced: 30 Apr 2026
https://github.com/omarsaad21/it-salary-eda
A Python EDA project on IT department salary data, with data exploration and data visualization for several questions about the dataset.
data explotary-data-analysis juypter-notebook numpy pandas python visualization
Last synced: 30 Apr 2026
https://github.com/priyam-hub/covid-19-data-analysis
Explore COVID-19 case numbers and deaths from the 2019/2020 coronavirus outbreak using pandas in a Jupyter notebook.
analysis data data-visualization jupyter-notebook machine-learning python
Last synced: 08 Jun 2026
https://github.com/chompfoods/sdk-jaxrs-cxf
JAXRS-CXF SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
apache-cxf api branded chomp cxf data database food grocery ingredients java jax-rs nutrition raw recipe-api recipes sdk
Last synced: 30 Apr 2026
https://github.com/lugolbis/data-immo
End-to-end ETL pipeline
data data-engineering dbt dremio duckdb etl-pipeline lakehouse rust
Last synced: 08 Jun 2026
https://github.com/dhimmel/hgnc
Extracting human gene families from HGNC
data gene-families genes hgnc hugo human
Last synced: 01 May 2026
https://github.com/syedzaheerabbas/jamboree-education-linear-regression
Using data from Jamboree, this project explores the relationship between applicant profiles (GRE, TOEFL, GPA, etc.) and their chances of admission to Ivy League graduate programs. Linear regression, Ridge, and Lasso regression are employed to build predictive models and identify key factors.
data eda linear-regression python visualization
Last synced: 01 May 2026
https://github.com/ahmed-naserelden/astro-success-analytics
This project analyzes key factors influencing success in the Space Race using data science techniques. It includes data collection, machine learning modeling, and insightful visualizations to predict mission outcomes.
Last synced: 01 May 2026
https://github.com/shauryauppal/mydatatoolkit
A toolkit for data scientists to get work done faster, easier, and in a smarter way.
analytics awesome-list data data-science hacktoberfest
Last synced: 08 Jun 2026
https://github.com/gabrielf7/relogiohd
:watch: Clock with Time and Date
clock css data horario html javascript relogio relogio-hd relogio-javascript watch
Last synced: 01 May 2026
https://github.com/vbshuliar/ktor-http-request-response
This project is part of my Android Development Specialization provided by Meta on Coursera. In this project I practised HTTP requests and responses using Ktor.
android compose data http https json kotlin ktor request response
Last synced: 01 May 2026
https://github.com/sandygcabanes/etl-earthquake-data-from-usgs-google-cloud-composer-airflow
Airflow, Google Cloud Composer, GCS, BigQuery, Python. This automated pipeline pulls daily earthquake data from a trusted public source, stores it securely in the cloud, and organizes it into clean, searchable tables for analysis.
cloud composer dag data engineering etl etl-pipeline google json python
Last synced: 01 May 2026
https://github.com/anchanung/til
Computer Science
computer-science data database docker infra k8s kafka operating-system
Last synced: 01 May 2026
https://github.com/muhammadadilnaeem/bcg-data-science-job-simulation-on-forage-august-2024
This repository contains all the tasks, code, and documentation completed during the BCG Data Science job simulation on The Forage platform. The simulation focused on analyzing customer churn, building predictive models, and presenting insights for a major utility company.
bcg customer-churn-prediction-with-machine-learning data data-science forage numpy pandas
Last synced: 01 May 2026
https://github.com/gcoronelc/cepsuni-disbd-64505
Database Modeling Workshop with Gustavo Coronel
data database databases db2 db2-database modeling oracle oracle-database relational-database relational-database-design relational-databases relationships sql sql-server
Last synced: 02 May 2026
https://github.com/0xhericles/spamdetector
:email: A Simple Python Spam Detector with Scikit-Learn
data ham machine-learning python sklearn spam
Last synced: 02 May 2026
https://github.com/hafs96/prediction_consommation-de-carburant
In this project, the goal is to develop a model that predicts whether a car has high or low fuel consumption based on its technical characteristics.
analysis data data-visualization machine-learning testing training
Last synced: 09 Jun 2026
https://github.com/sonwaneshivani/data-science-learning
Basics of python
css data data-science deep-learning flask gen-ai html ml nlp
Last synced: 02 May 2026
https://github.com/radekbednarik/covid-czech-data-api
Library to make it easy to work with REST API of official Czech Covid data.
api covid-19 data deno library typescript
Last synced: 02 May 2026
https://github.com/jesuscc1993/data-cleaner-extension
Clears browser data in a single click.
application-data chrome chrome-extension data
Last synced: 02 May 2026
https://github.com/viniddev/active_finance
In this project I set out to solve a common problem: the difficulty of keeping up to date with fluctuations in the stock and real estate investment fund markets. I used Selenium WebDriver to gather information and a Telegram API to send reports to the user.
automation data data-analisis rpa selenium-webdriver telegram-bot
Last synced: 03 May 2026
https://github.com/amethyst-php/price-rule
amethyst amethyst-package api data laravel price price-rule rule
Last synced: 03 May 2026
https://github.com/tn3w/moviedb-json
A JSON library with 981,530 films.
data database db json movie movie-database movies
Last synced: 03 May 2026
https://github.com/yugsumeet17/churn-analysis-project--power-bi-sql-machine-learning
Dataset Explained, Project Goals & Metrics Required, SQL Server ETL & Data Cleaning, Power BI Data Load, Transformation, Blueprint & Measures, Power BI Visualization - Summary Page, Building Machine Learning Model - Random Forest, Power BI Visualization - Churn Prediction Page
data data-visualization dataanalytics excel postgresql powerbi python3
Last synced: 03 May 2026
https://github.com/soham7998/data-analysis-projects
Data analysis projects I completed, gaining hands-on experience from each one. The projects showcase different concepts, visualizations, and more.
data data-analysis data-science machine-learning nlp python soham visualization
Last synced: 04 May 2026
https://github.com/jdanielgoh/cobertura-campanias
In a democracy, is there room for every voice? A project to visualize the INE's radio and TV monitoring of the 2024 presidential candidacies.
d3js data datavisualization vue
Last synced: 09 Jun 2026
https://github.com/farhad2415/job_scraper
Job-site-based job scraping with Python
automation bash-script data data-scraping data-structures python selenium selenium-python
Last synced: 05 May 2026
https://github.com/a-poor/datatransform.jl
A package for defining (and performing) tabular-data transformations with JSON.
data data-science data-transformation etl feature-engineering json julia julia-package tabular-data
Last synced: 05 May 2026
https://github.com/muthupillai1204/diwali_sales_analysis
The Diwali sales analysis reviews past data to identify trends, peak buying times, popular products, and customer demographics. It assesses sales volume, revenue growth, and promotional effectiveness, helping businesses optimize marketing and inventory for future seasons.
data datacleaning eda excel jupyter-notebook matlplotlib numpy pandas python seaborn visualization
Last synced: 05 May 2026
https://github.com/julienmalka/shiftgenerator
ShiftGenerator WeSki 2018
data data-science latex python
Last synced: 06 May 2026
https://github.com/ksm26/ml-ai-data-science-jobs-in-canada
Explore the latest machine learning, artificial intelligence, and data science job opportunities in Canada. Stay informed about Canadian tech job market trends and find your next career move.
ai-canada ai-careers canada canadian-tech-companies canadian-tech-job-market data data-analysis data-engineering data-science data-science-careers machine-learning prompt-engineering robotics
Last synced: 06 May 2026
https://github.com/parthds02/analyzing-student-success-with-data
Discover key factors influencing student performance through data analysis and visualization. Explore gender, parental education, sports, and ethnicity impacts.
data datascience jupyter-notebook kaggle python pythonlibraries
Last synced: 06 May 2026
https://github.com/ralzz/dibimbing_datascience
This project contains an Exploratory Data Analysis (EDA) of the Estonia Passenger List dataset. I handled missing values, removed duplicate data, and created basic visualizations to find insights.
data data-science eda google-colab kaggle pandas python
Last synced: 06 May 2026
https://github.com/iv4n-ga6l/functional-dataprocessing-pipeline
A functional data processing pipeline that accepts an input file, allows specifying both input and output formats, applies specified transformations, and produces a resulting output file.
csv data datapreprocessing excel json pandas parquet pipeline python
Last synced: 06 May 2026
https://github.com/ashleydavis/brisjs-web-scraping-talk
Code to accompany my talk on web scraping for the Brisbane JavaScript meeting in September 2018
cheerio data data-acquisition data-acquisiton electron headless-browsers javascript nightmare nightmarejs nodejs web-scraping
Last synced: 06 May 2026
https://github.com/whis99/data_analysis_journey
A repository of my data analysis projects.
data data-analysis data-analysis-python data-visualization dataset jupyter-notebook matplotlib python visualization
Last synced: 07 May 2026
https://github.com/chardos/get-git-data
Access git repository data in node.
Last synced: 07 May 2026
https://github.com/safwan2003/randomforest_heart_disease_prediction
A machine learning project using Random Forest Classifier to predict heart disease. Includes data preprocessing (with binning), feature selection, and model evaluation.
binning data data-science datapipeline datapreprocessing datavisaulization deep-learning machine-learning python random-forest-classifier streamlit
Last synced: 07 May 2026
https://github.com/jigyasag18/iit-guhawati
Empower Sakhi is a data-driven platform that uses machine learning to identify women at risk of domestic violence in India. It offers confidential self-assessments, survivor stories, and emergency resources through a trauma-informed, privacy-focused web app. The project also provides NGOs with actionable insights via Power BI dashboard for support.
aiml data dataset datavisualization domestic-violence eda jupyter-notebook label-encoding machine-learning machine-learning-algorithms machine-learning-models machinelearning machinelearningprojects powerbi python python-app random-forest random-forest-classifier streamlit streamlit-webapp
Last synced: 08 May 2026
https://github.com/liolb/sql2csv
Export SQL Server Table data to CSV
automation csv data database export extraction powershell scripting sql sql-server sql-table
Last synced: 08 May 2026
https://github.com/zsvoboda/olympics
Self service analytics of 120 years of Olympics data
analytics dashboards data datavisualization dataviz olympics open-data open-datasets opendata reports
Last synced: 08 May 2026
https://github.com/juanpablo70/pgad-assignment01
Breast Cancer Coimbra data set analysis
data data-science dataframe dataset jupyter-notebook matplotlib numpy pandas python
Last synced: 08 May 2026
https://github.com/taquece/goals-per-match
Basic script to calculate average football goals per match from a CSV file
beginner csv data football nodejs python sports-analytics
Last synced: 09 May 2026
https://github.com/basemax/okala-product-ids
A PHP script to fetch and save product IDs from Okala's online store API across multiple categories and store branches.
crawler crawler-okala crawler-php crawlers data database ids ir iran json okala okala-crawler php php-crawler product
Last synced: 09 May 2026
https://github.com/flexthink/matricize
A convenience library to convert between pure Python objects and their vectorized representations
data machine-learning numpy python
Last synced: 09 May 2026
https://github.com/mohamedbilal1800/olympic_history_data_analysis
This project delves into the 120 Years of Olympic History: Athletes and Results dataset, analyzing athlete demographics, medal achievements, and country performances across the Summer and Winter Olympics from 1896 to 2016.
analysis data eda matplotlib-pyplot pandas python seaborn visulaization
Last synced: 09 May 2026
https://github.com/yashkp1234/movie-recommendation-engine
My project on analyzing the movie data set, and creating a recommendation engine using that analysis.
analysis data notebook python recommendation-engine
Last synced: 04 May 2025
https://github.com/sebastianbrzustowicz/flight-quality-overview-microservice
Go + Docker. Microservice with parallel computations that converts raw vehicle flight data into an overview report with visualisation.
container control csv data docker drone flight go goroutines http microservice parallel-computing pdf quadcopter raport rms sse vehicle
Last synced: 10 May 2026
https://github.com/notthestallion/pca__3d-and-from-scratch__principal-component-analysis
In this project, I implement Principal Component Analysis (PCA) from scratch on an ecological footprint consumption database for countries and a three-dimensional scale using a movie database. The goal of this project is to gain a deeper understanding of PCA and to demonstrate its capabilities in exploring complex datasets.
data data-science database pca pca-analysis principal-component-analysis principal-component-analysis-pca principle-component-analysis
Last synced: 10 May 2026
https://github.com/gurpreet0022/airbnb-eda
EDA on Airbnb booking data to uncover valuable insights, trends, and patterns
data data-science dataanalytics insights jupyter-notebook matplotlib numy pandas projects python3 seaborn visualization
Last synced: 11 May 2026
https://github.com/sehaj003/boston-bruins-roster-planning-mysql-nosql
Repository for Data Management project, Boston Bruins Roster Planning using MySQL and NoSQL along with data analysis using Python
data data-management mongodb mysql project-repository python
Last synced: 11 May 2026
https://github.com/vbhatsaccnt/retail-strategy-and-analytics-optimization-of-control-stores-for-sales-enhancement
In this project, we aim to optimize the performance of retail chain stores by establishing control stores based on their performance compared to selected trial stores. By leveraging data analytics and strategic insights, we seek to enhance sales revenue and drive growth within the retail chain.
customer-segmentation data data-science risk-analysis
Last synced: 13 May 2026
https://github.com/gregorybchris/pca
PCA assignment for Park Tudor
analysis component data display embedding pca principal projection teach
Last synced: 13 May 2026
https://github.com/prakhargpt/sql-data-warehouse-project
Building a data warehouse with SQL Server, including ETL processes, data modelling, and analytics.
analytics data data-analysis data-cleaning data-engineering data-engineering-pipeline data-lakehouse data-science data-warehouse etl etl-job etl-pipeline medallion-architecture sql sql-server
Last synced: 12 Jun 2026
https://github.com/iannil/one-data-studio
one-data-studio integrates a data governance and development platform, a cloud-native MLOps platform, and a large model application development platform. It connects the entire value chain from raw data governance to model training and deployment, and further to the construction of generative AI applications.
Last synced: 12 Jun 2026
https://github.com/word2vect/beijing-new-house-data-visualization
Beijing New House Data Visualization for Python Programming 2024 Fall Data Visualization Lab
Last synced: 13 Jun 2026
https://github.com/bastianolea/plebiscitos_chile
Electoral results data from the constitutional plebiscites of 2022 and 2023
chile comunas data elecciones politica social
Last synced: 15 Jun 2026
https://github.com/spiraldb/spiraldb-nemo-curator
SpiralDB connectors for NVIDIA NeMo Curator
computer-vision data data-curation data-prep data-preparation data-processing data-quality datacuration datarecipes deduplication fast-data-processing multimodal multimodal-ai nvidia-nemo physical-ai python spiral vortex
Last synced: 15 Jun 2026
https://github.com/arch-fan/pokedata
Pokemon Data in CSV format for whatever you need!
Last synced: 17 Jun 2026
https://github.com/mbagalman/lattice-doe
Python code to create experimental designs optimized to meet statistical power targets
abtesting data datascience designofexperiments experimentaldesign statistics
Last synced: 19 Jun 2026
https://github.com/svetlanam/kbl-to-csv-s3
Keboola extractor that converts Excel files to CSV based on input mapping criteria and uploads them to an S3 bucket
data data-cleaning data-transformation etl keboola s3-bucket
Last synced: 20 Jun 2026
https://github.com/svetlanam/etl-transformation
ETL data cleaning and transformation for a specific use case in my own Keboola project
cleaning data etl keboola python rest-api transformation
Last synced: 20 Jun 2026
https://github.com/petzi53/repairdata
Open Repair Alliance Datasets 2021
data open-data open-datasets r repair repair-cafe repairs
Last synced: 22 Jun 2026
https://github.com/hlan22/2025-03-18-data-validation
(no longer useful) DSCI 310 Lecture about Data validation and code testing! Made in tandem with:
Last synced: 23 Jun 2026
https://github.com/fgazzelloni/20240930-dwpwr
Data Wrangling Practice with R - 30 September Tutorial for R-Ladies Rome
data data-science data-structures data-wrangling
Last synced: 28 Jun 2026