data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-06-29 00:07:49 UTC
- JSON Representation
https://github.com/nika2811/new-york-city-taxi-fare-prediction
About In this project using New York dataset we will predict the fare price of next trip. The dataset can be downloaded from https://www.kaggle.com/kentonnlp/2014-new-york-city-taxi-trips The dataset contains 8 features along with GPS coordinates of pickup and dropoff
data data-preprocessing data-visualization decision-trees feature-engineering kaggle kaggle-competition linear-regression machine-learning neural-network nyc polynomial-regression ridge-regression scikit-learn taxi taxi-data tensorflow xgboost
Last synced: 06 Apr 2025
https://github.com/hidayathamir/get-telegram-group-data
With these project you can get data in csv file from your telegram group.
bahasa-indonesia data python3 scrape telegram telethon
Last synced: 13 Sep 2025
https://github.com/debjyotisaha/hands-on-sql
My Learning Path towards SQL
cte data data-analysis insert joins select sql subqueries update
Last synced: 04 Apr 2025
https://github.com/opengeoshub/vdownload
A Powerful Geospatial Data Downloader
Last synced: 19 May 2026
https://github.com/prasad-chavan1/bank_data_analysis_r
Bank data analysis in R language
data data-analysis data-science r
Last synced: 24 Feb 2025
https://github.com/furkantosun1607/cse201-data-structure
This repository contains implementations of various data structures completed as part of the CSE201 (Data Structures) course. Each week, a different data structure was implemented during lab sessions.
array arraylist bfs-search binarytree data dfs-search java linkedlist queue stack structure tree-structure
Last synced: 26 Jun 2025
https://github.com/djdhairya/black-friday-sale
csv data data-analytics data-science data-visualization visualization
Last synced: 30 Oct 2025
https://github.com/gunn/covid-19-scripts
Scripts for processing COVID-19 data - e.g. converting from absolute to per capita numbers, adding fine-grained data from more countries
covid-19 data geography typescript
Last synced: 17 May 2026
https://github.com/tomwhite/misp-2017
MISP camp 2017 materials and code
bioinformatics data data-visualization hackathon
Last synced: 18 Apr 2026
https://github.com/tkxwaweru/python_data_manipulation
Manipulating the MASSIVE dataset using python
data dataanalysis excel python
Last synced: 11 Jan 2026
https://github.com/sweta-kaundilya/911-calls-capstone-project
For this capstone project we will be analyzing some 911 call data from Kaggle.
data data-analysis data-visualization jupyter-notebook matplotlib numpy pandas python seaborn
Last synced: 28 Apr 2026
https://github.com/pcpp94/elexon_pipeline_gb_demand
Guidelines and code snippets for extracting and processing Elexon gross demand data on Databricks. Provides half-hourly GB demand at sectoral (Domestic, Non-domestic), GSP-area granularity, settlement demand, and embedded generation. Supports non-commodity cost calculations for CfD, RO, and FiT.
data electricity elexon gb octopusenergy power powerdata pypsa uk
Last synced: 12 Jul 2025
https://github.com/casbin/node-casbin-data-permission
Data Permissions Example for Casbin
abac acl auth authorization casbin data data-permission example go node-casbin nodejs permission policy rbac
Last synced: 24 Feb 2025
https://github.com/germanpaul12/automating-hacker-news-and-weather-mails
Project for my Raspberry Pi to send me mails when it rains and to inform with hot tech news
beautifulsoup beautifulsoup4 data hacker-news openweather-api raspberry-pi requests
Last synced: 05 May 2026
https://github.com/phtrempe/l2a
This is a small project which aims to show an example of applied machine learning in Python 3 with the Keras library and its TensorFlow backend to train a neural network model for it to learn to add two integers.
applied data data-science deep-learning keras machine-learning neural-network tensorboard tensorflow
Last synced: 05 May 2026
https://github.com/echang1802/normandy
Normandy is a python framework for data pipelines, which main objective is standardizing your team code and provide a data treatment methodology flexible to your team needs.
analytics business-intelligence data dataengineering datascience etl pipeline
Last synced: 11 Mar 2026
https://github.com/ahmedkhaled404/data-cleaning-and-eda-layoffs-mysql
This project involves cleaning a dataset containing information about layoffs from companies around the world.
data data-analysis data-cleaning data-preprocessing datacleaning eda exploratory-data-analysis mysql sql
Last synced: 08 Jun 2026
https://github.com/anthonysanalysis/bellabeat-analysis
Bellabeat Tech Case Study Capstone Project
analysis capstone case-study data data-analysis data-visualization md r rmd rstudio
Last synced: 20 Apr 2026
https://github.com/yvandana/brain-tumor-detection-and-classification
Bachelor's Major Project- Presented at ICMISC 2022
2d-cnn brain-tumor-classification brain-tumor-detection cnn-model data data-augmentation keras-tensorflow sklearn-metrics
Last synced: 16 Jun 2025
https://github.com/amethyst-php/consume-rule
amethyst amethyst-package api consume-rule data laravel
Last synced: 19 May 2026
https://github.com/himanshub16/lekhpal
Monitor and catalog Twitter feed matching your desired keywords
analytics data data-catalog data-filtering mongodb twitter twitter-streaming-api
Last synced: 14 May 2026
https://github.com/yourdataarchitect/french-realestate-data-pipeline
This repository contains a fully automated data pipeline built with Apache Airflow to extract, clean, analyze, and report real estate listings from Seloger. It pushes data to MongoDB, Elasticsearch, and Google Sheets, with real-time Slack alerts for monitoring.
airlfow data datanalysis datapipeline market-intelligence real-estate
Last synced: 31 Dec 2025
https://github.com/coderooz/hr-dashboard
The goal of this project is to create a power bi dashboard to showcase the attrition data within the company.
Last synced: 07 Jan 2026
https://github.com/pyrustic/jayson
Intuitive interaction with JSON files [DEPRECATED, check the project Shared]
Last synced: 17 May 2026
https://github.com/fliplet/fliplet-widget-data-source-query
Data Source Query Provider
Last synced: 11 Apr 2025
https://github.com/axafrance/azureml-to-openshift-talk
Scale your dev IA: From dev AzureML to prod OpenShift in one click
ai axa azureml data learn ml openshift raise-the-bar talk
Last synced: 16 Feb 2026
https://github.com/encoreshao/data-science
Data analyze examples, using Jupyter notebook and Python!!!
data dataanalysis encore jupyter-notebook
Last synced: 29 Mar 2025
https://github.com/pulgamecanica/d3examples
https://www.oreilly.com/library/view/d3-for-the/9781492046783/
d3 d3-visualization d3js d3v4 data javascript
Last synced: 19 May 2026
https://github.com/kameronbrooks/datalys2-reporting
Datalys2 Reports allows you to create rich, interactive reports by simply defining a JSON configuration embedded in your HTML. It handles the layout, data visualization, and interactivity, so you don't need to write custom React code for every report.
data data-visualization html react
Last synced: 08 Apr 2026
https://github.com/mumtaz4118/employee-satisfaction-and-attrition
Analysis of attrition based on environmental satisfaction from a Kaggle dataset.
data data-analysis data-science data-visualization ipynb jupyter-notebook machine-learning python statistical-analysis statistical-models
Last synced: 19 May 2026
https://github.com/azaz9026/loan_approval_prediction
Welcome to the Loan Approval Prediction repository! This project aims to build a predictive model that can determine whether a loan application should be approved or denied based on various features. Purpose The goal of this repository is to develop a machine learning model that can accurately predict loan approval decisio
data data-analysis data-visualization eda machine-learning numpy pandas python statistics
Last synced: 06 Apr 2026
https://github.com/shahules786/titanic-analysis
different analysis of titanic accident (data from kaggle)
Last synced: 26 Jun 2025
https://github.com/jigyasag18/financial-risk-analysis-project
The Credit Card Financial Risk Analysis Dashboard is a real-time Power BI tool designed to provide insights into credit card transactions and customer demographics. It features interactive visualizations, efficient data processing, and actionable insights to support decision-making. Utilizing data from SQL database, the dashboard tracks key metrics
data dataanalysis database datacleaning datapreprocessing dataprocessing datavisualization financial-analysis financialriskanalysis mysql powerbi sql statistical-analysis
Last synced: 06 Mar 2026
https://github.com/melvinjwallace/melvinjw.github.io
A portfolio of a host of projects completed using python and sql.
data data-analysis data-cleaning data-loading data-mining data-preparation data-processing data-science data-transformation data-visualization dataset matplotlib microsoft-sql-server pandas-python seaborn
Last synced: 02 Apr 2026
https://github.com/shamaz332/ecomrace-data-analysis-in-datascience
data data-science matplotlib pandas
Last synced: 15 May 2026
https://github.com/amarlearning/exploring-the-evolution-of-linux
Data Analysis about the development of the Linux operating system by exploring its Git repository history.
cleaning-data data data-analysis data-wrangling datacamp first-commit git-history linux
Last synced: 12 May 2026
https://github.com/jmcph4/rpdb
rpdb
automation data database dataset db real-estate rpdata sql
Last synced: 12 Apr 2025
https://github.com/lisakey/lisakey
I am passionate about Python 🐍 and SQL 🗃️ for data analysis 📊, and I actively develop projects in these languages.
analysis analyst data dataanalysis dataanalyst java python sql
Last synced: 02 May 2026
https://github.com/eyluldursun/data-science-project
This project involves a data science analysis conducted on the Obesity Data Set. The study explores factors influencing obesity, includes data visualization, and develops predictive models. The goal of the project is to gain insights to help prevent obesity.
data data-science obesity r rmarkdown
Last synced: 26 Jun 2025
https://github.com/nxank4/an-augment
A Python library for advanced and novel data augmentation, combining traditional techniques like cropping and blurring with state-of-the-art generative AI methods such as style transfer, image inpainting, and latent space interpolation. It boosts data diversity for robust machine learning applications.
computer-vision data data-augmentation data-augmentation-strategies data-augmentation-techniques generative-ai image image-processing synthetic-data
Last synced: 10 Mar 2026
https://github.com/stuffbymax/game-dependencies-db
data database game games-list json mit-license
Last synced: 15 May 2026
https://github.com/jhwa426/database
SQL, MSSQL, MongoDB Database
data data-warehouse data-wrangling database datamodeling entity-relationship-diagram normalization sql sqlite3 ssms
Last synced: 06 Apr 2025
https://github.com/amethyst-php/opening-hour
amethyst amethyst-package api data laravel opening-hour
Last synced: 19 May 2026
https://github.com/amethyst-php/warehouse
amethyst amethyst-package api data laravel warehosue
Last synced: 19 May 2026
https://github.com/akashlogics/street-data-tracking
Detect, Track and Count number of persons walking across the path(s) making use of YOLO. This Python project tracks people moving across predefined street zones
analysis data excel newdataset object-detection opencv python python3 yolo
Last synced: 19 May 2026
https://github.com/amethyst-php/recipe
amethyst amethyst-package api data laravel recipe
Last synced: 19 May 2026
https://github.com/nouraalgohary/data-scientist-with-python
This repo comprises of my solutions for the tasks assigned in the course.
data data-science data-visualization datacamp datacamp-course datacamp-data-science datacamp-exercises datacamp-solutions-python datascience python
Last synced: 15 Jun 2025
https://github.com/buildinamsterdam/contentful-graphql
Contentful GraphQL connection
Last synced: 05 Jan 2026
https://github.com/ezeparziale/analisis-uso-bicicletas-caba
:biking_man: Análisis de como afecto la pandemia el uso de las bicicletas en CABA.
data data-science data-visualization
Last synced: 14 Mar 2025
https://github.com/ezeparziale/analisis-data-delitos
:gun: Analsis de delitos de CABA
Last synced: 14 Mar 2025
https://github.com/official-imvoiid/multifetch
A high-performance web scraper for bulk image and GIF extraction from reliable sources — built for AI/ML data pipelines and large-scale media collection
aiml data dataset gifscraper imagescraper python pythontool tools webscraper windows
Last synced: 19 May 2026
https://github.com/zeh237/superstore-data-analytics
This is a Flask based data analytics project based on the superstore dataset using flask, pandas, sql and python
analytics data data-analysis data-science data-visualization flask python superstore
Last synced: 04 May 2025
https://github.com/mapi-developer/dapo
Simple, zero-dependency tabular data manipulation and analysis for Python.
Last synced: 06 Mar 2026
https://github.com/kingabzpro/5-airflow-alternatives-for-data-orchestration-tutorial
Code examples of Luigi, Prefect, Kedro, Dagster, and MageAI
dagster data data-orchestration kedro luigi mageai prefect
Last synced: 18 Apr 2026
https://github.com/jorgermduarte/mongo-replication
cluster data mongo mongodb mongoose replica replica-set replication
Last synced: 03 Mar 2025
https://github.com/randomgamingdev/randomgamingdev.github.io.data
The data for RandomGamingDev.github.io (feel free to build your own website off of mine :D)
blog custom data projects projects-list
Last synced: 02 Jan 2026
https://github.com/colonelbundy/martenmigrator
Data migrator for Marten
data database documentdb marten migration postgres
Last synced: 04 May 2026
https://github.com/stdlib-js/dstructs-stack
Stack.
collection data data-structure data-structures first-out javascript last-in lifo node node-js nodejs stack stdlib structure
Last synced: 14 May 2026
https://github.com/piazzai/chess-variants
Analysis of Lichess variant games
analysis chess chess-variant chess-variants data data-mining data-science data-visualization lichess lichess-database logistic-regression logit-model pgn r r-code r-scripts regression regression-analysis shell shell-scripting
Last synced: 15 May 2026
https://github.com/ssiarhei115/cv-dbase-analysis
HeadHunter CVs data base analysis
analysis cv data data-science resume
Last synced: 09 Apr 2025
https://github.com/rrwen/poster-gisci-osmol
Conference poster and short paper titled "Outlier Detection in OpenStreetMap Data using the RandomForest Algorithm and Variable Contributions" for the GIScience Conference in 2016
2016 algorithm conference contribution data detection forest gis giscience learn machine open openstreetmap osm outlier paper poster random short variable
Last synced: 03 Apr 2025
https://github.com/rrwen/geohoods-to
Geospatial dataset of 1000+ aggregated variables for neighbourhoods in Toronto, ON, CA
csv data dataset geo geojson gis neighborhood neighborhoods neighbourhood neighbourhoods open open-data toronto toronto-open-data
Last synced: 25 Jun 2025
https://github.com/codehard8/web-scrapping
In this repository we have provide a web scrapping project through beautifulSoup and related files
beutifulsoup data houses-for-sale python3 requests-library-python webscraping
Last synced: 01 Jul 2025
https://github.com/md-emranhossen/leetcode-practice
This repository stores my solutions to LeetCode problems, organized by problem number and title.
cpp data datastructures-algorithms leetcode-solutions
Last synced: 26 Jun 2025
https://github.com/nonsignificantp/enfermedades-inmunoprevenibles
Analisis sobre el efecto de las vacunas y la incidencia de casos de enfermedades inmunoprevenibles en la Ciudad de Buenos Aires entre los años 1995 y 2016
a analysis argentina buenosaires data hepatitis science vaccination
Last synced: 18 Jun 2026
https://github.com/jonprice99/regional-election-analysis
An analysis of election results in Allegheny County using Pandas and other Python libraries to better understand the voting habits, practices, and preferences of regional voters.
data data-visualization election-analysis election-data pandas python
Last synced: 05 May 2026
https://github.com/abshek7/big-data
A repository for documenting the learning related to theory and practical notes of big data computing.
big-data data data-engineering mapreduce pyspark
Last synced: 15 Jun 2025
https://github.com/peter7775/mysql-graph-visualizer
SQL database conversion and visualisation as graph / in development
analytics analyzer conversion converter data database go golang graph graphql mysql neo4j neo4j-graph refactoring sql visualization
Last synced: 14 Mar 2025
https://github.com/ahmad-mtr/prjkt_exam_schedule_test
I hate scrolling in a list of 300+ courses of my Uni exam schedule, so I'm creating this. this's a test btw :)
Last synced: 11 Apr 2025
https://github.com/ayushman0511/data-warehouse-project1
A comprehensive guide to building a data warehouse with SQL Server, including ETL processes, data modeling, and analytics.
data data-ana data-anal data-cleaning data-enginee data-lakehou datalake datasci dataware datawarehouse datawarehousi etl etl-job etl-pipeline medallion sql sql-quer sql-query sql-server sqlserver
Last synced: 26 Jun 2025
https://github.com/majorcluster/clj-data-adapter
A Clojure library designed to convert data
Last synced: 12 Jul 2025
https://github.com/yassin522/health-insurance-cross-sell-prediction
Prediction of Vehicles Health Insurance
data data-analysis data-science machine-learning plotly python
Last synced: 15 May 2026
https://github.com/dsietz/daas-workshop
Workshop for building a Data as a Service platform using the DaaS SDK.
archconf daas daas-pattern data dataprivacy nfjs rust rust-lang
Last synced: 20 May 2026
https://github.com/vatshayan/ip-address-data-analysis-
Extraction of 100's of IP Address and using Machine Learning algorithm for detecting threats
data data-analysis data-science data-visualization dataset ip ipconfig ipv4-address jupyter-notebook machine-learning machine-learning-algorithms supervised-machine-learning unsupervised-learning
Last synced: 15 Jul 2025
https://github.com/bcodmo/workshop_bios_oceanographic_data
Repository holding lesson on Data Management Basics. See webpage for rendered view: https://bcodmo.github.io/workshop_bios_oceanographic_data/
bco-dmo data datamanagement fair workshop
Last synced: 08 Apr 2026
https://github.com/jigyasag18/orders-sales-analysis-report-using-power-bi
This repository analyzes and visualizes office supply sales data to improve profitability. It examines sales performance by various factors, using charts to provide insights and actionable recommendations for sales optimization, market research, and product mix.
data dataanalysis dataanalytics dataset powerbi powerbi-dashboards powerbi-report powerbi-reports powerbi-visuals powerbidashboard
Last synced: 18 Feb 2026
https://github.com/codehub001/ai-driven-automation-for-data-quality-monitoring-in-cloud-data-warehouses
This project focuses on leveraging AI to automate data quality monitoring in cloud data warehouses. Traditional data validation methods often require manual intervention and fail to scale with increasing data complexity. By integrating machine learning models, this approach enables real-time anomaly detection, automated data cleansing.
csv-export csv-import dashboard data datacleaning lib modeltraining python testing-library visualization
Last synced: 13 May 2025
https://github.com/Greatwoman23/Sentiment-Analysis-on-Amazon-Products-Review
Sentiment_Analysis_On_Amazon_Product_Review
analysis dashboard-application data data-science datascientistproject machine-learning publication python remotejob
Last synced: 04 May 2025
https://github.com/ubeydgur/car-price-prediction
Predicting the price of a used car
ai artificial-intelligence data data-science data-visualization machine-learning machine-learning-algorithms
Last synced: 08 Jun 2026
https://github.com/wolfchamane/amjs-data-types
Data types for your OOP javascript project
cjs data javascript modules nodejs oop types
Last synced: 20 May 2026
https://github.com/circlexo/circlexo
Open-source project to seamlessly integrate and manage your business workflow, connecting Jira, GitHub, Discord, Stripe, RevenueCat, and OpenAI all in one intuitive platform.
bussiness-intelligence data discord-bot forge github google jira kpis ploi revenuecat stripe vapor
Last synced: 20 May 2026
https://github.com/shimul-zahan/all-practices-tukitaki
This is repository for all the practice tasks or learning new things. Cause environment are setup and no need to setup a new project or environments.
data data-science datapreprocessing deep-learning machine-learning neural-network practice python visualization
Last synced: 12 Jan 2026
https://github.com/furkankarakuz/turkey_earthquake
This project focuses on analyzing and visualizing earthquake data specific to Turkey. It aims to provide insightful visualizations on topics such as earthquake frequency, location, and magnitude using data obtained from Boğaziçi University Kandilli Observatory and Earthquake Research Institute.
api data data-visualization earthquake python python3 request streamlit turkey turkey-earthquake
Last synced: 20 May 2026
https://github.com/heshamalsaqqaf2/python-projects
Beginner Level Python Projects
Last synced: 22 Jul 2025
https://github.com/hivesolutions/crossline
Simple event pipping and storing infra-structure
Last synced: 15 May 2026
https://github.com/clagiordano/marketplaces-data-export
LIbrary that share the same interface and provide adapters for online marketplaces services
adapter amazon api clagiordano data ebay ebay-api export marketplaces mws mws-api rest soap
Last synced: 22 Mar 2025
https://github.com/amethyst-php/shipment
amethyst amethyst-package api data laravel shipment
Last synced: 20 May 2026
https://github.com/GAMELEIRA/studies-database
Esse repositório têm como objetivo alocar todo e qualquer script para aprender e praticar gerenciamento de banco de dados SQL e NoSQL. Nesse projeto, serão consolidados os principais fundamentos e princípios, além da prática de exercícios e desenvolvimento de projetos.
data database mongodb mssql mysql nosql sql
Last synced: 03 May 2025