data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum
- Last updated: 2026-06-29 00:07:49 UTC
https://github.com/chandansoren/financial-budget-analysis
Financial budget for 2021
Last synced: 29 Apr 2026
https://github.com/mirzayasirabdullahbaig07/advanced-sql-in-python
This repository covers advanced SQL concepts implemented using Python. It demonstrates how to interact with databases, run complex queries, perform joins, aggregations, window functions, and more using libraries like sqlite3, SQLAlchemy, or pandas. Ideal for data analysts and developers looking to integrate SQL power into Python workflows.
data databases dbms mysql nosql programing-language python sql
Last synced: 29 Apr 2026
https://github.com/tazeenrashid/orders-analysis-using-python-sql-server-and-tableau
I sourced some Orders data through Kaggle; did EDA using Python and then fetched some insights out of cleaned data using SQL Server (SSMS). Then, I built a Tableau Dashboard for some visual insights. Have a look and share your feedback!
analytics data eda jupyter-notebook python sql tableau
Last synced: 29 Apr 2026
https://github.com/danielrosehill/global-value-factors-explorer-dataset
Derivative database of IFVI Global Value Factors for data analysis and visualization use cases.
data environmental-data sustainability-data
Last synced: 23 Feb 2026
https://github.com/ipstack/wizard
Wizard for creating ipstack databases
composer data geo geoip id-database info ip ipstack ipstack-wizard php wizard
Last synced: 29 Apr 2026
https://github.com/fs23yayan/membuatfungsidatapemrosesan
Creating Data Processing Functions for Data Science in Marketing: Customer Segmentation with Python - Part 2
Last synced: 29 Apr 2026
https://github.com/thanh-wutan/chess-opening-comparator
Interactive web app using R to visualize and compare chess opening performance and popularity.
chess-openings data databases datavisualisation r
Last synced: 09 May 2026
https://github.com/khushi-sabarad/data_analysis
LinkedIn Learning capstone project
data data-engineering matplotlib pandas python
Last synced: 10 May 2026
https://github.com/abhinav330/instagram-influencers-analysis
This Jupyter Notebook focuses on preprocessing and visualizing data from an Instagram profiles dataset. It includes data loading, inspection, visualization, and some data preprocessing steps.
data data-science data-visualization exploratory-data-analysis exploratory-data-visualizations influncer-products instagram scikit-learn sklearn
Last synced: 08 Jun 2026
https://github.com/lamouchi-bayrem/data-matrix-scanner
A dual-interface tool that leverages AI to detect and decode QR codes and Data Matrix codes from images using computer vision.
data datamatrix-scanner decoder flask qrcode scanner tkinter-gui webapp
Last synced: 30 Apr 2026
https://github.com/beeracs/llama
Run Llama models in your web browser using JavaScript and WebAssembly. Explore light and dark modes easily. 🌐🐱👤
ai data fine-tuning framework gpt langchain large-language-models llama3 llamaindex llm lora machine-learning nlp peft qlora qwen rlhf vllm
Last synced: 10 May 2026
https://github.com/datasqlsantosh/global-energy-consumption-renewable-generation-python-data-analysis-portfolio
This project focuses on analyzing global energy consumption patterns and trends in renewable energy generation using Python data analysis libraries such as Seaborn and NumPy. The analysis aims to explore energy consumption data from various regions worldwide and examine the contribution of renewable energy sources over time
data data-analysis data-visualization pandas seaborn
Last synced: 10 May 2026
https://github.com/fatihilhan42/olympics-data-analysis-with-python
A data analysis of the Olympic Games from 1896 to 2016, carried out in Python.
data data-science dataanalysis datavisualization jupyter-notebook olympics python
Last synced: 30 Apr 2026
https://github.com/dnut/json-match-finder
Python application used to match listings against openings via authenticated JSON API access.
data data-structures data-wrangling database json-api python-application python-modules
Last synced: 01 May 2026
https://github.com/dineshdhamodharan24/data-analysis
Probability analysis of customers and basic analysis
analysis data powerbi probability python visualization
Last synced: 23 Jun 2026
https://github.com/ahmed-naserelden/astro-success-analytics
This project analyzes key factors influencing success in the Space Race using data science techniques. It includes data collection, machine learning modeling, and insightful visualizations to predict mission outcomes.
Last synced: 01 May 2026
https://github.com/skygenesisenterprise/aether-meet
Aether Meet is a lightweight, open-source client built for privacy, speed, and seamless integration within the Aether Office ecosystem
applications data docker javascript meeting nextjs notes typescript voip
Last synced: 01 May 2026
https://github.com/miniql/miniql-json
A MiniQL query resolver that loads data from JSON files.
data json query query-language
Last synced: 11 May 2026
https://github.com/fatihemres/fruits
Fruit details app built with SwiftUI, using data, models, animation, and a practical onboarding flow.
animations data models onboarding swift swiftui
Last synced: 01 May 2026
https://github.com/anandvai/ai_rag_chatbot_multi_pdf_support
RAG (Retrieval-Augmented Generation) Chatbot built with Streamlit and LangChain, powered by Groq's blazing-fast LLaMA3-8B. It allows you to upload multiple PDFs, ask questions, and get precise, context-aware answers in a conversational format.
ai data data-science data-visualization data-visualizations dataengineering fastapi langchain langgraph python sql streamlit
Last synced: 01 May 2026
https://github.com/cemc-oper/nmc-typhoon-db-client
A CLI client for NMC Typhoon Database.
Last synced: 01 Jun 2026
https://github.com/dilkushsingh/webscraping-with-selenium-and-beautifulsoup
Web-scraped a popular tech gadgets website using Selenium and BeautifulSoup, and performed data analysis on the scraped data.
beautifulsoup data datacleaning datagathering eda exploratory-data-analysis python selenium webscraping
Last synced: 24 Feb 2026
https://github.com/octoenergy/tentaclio-snowflake
A Python project containing all the dependencies for the Snowflake tentaclio schema.
Last synced: 20 Oct 2025
https://github.com/amethyst-php/catalogue
amethyst amethyst-package api catalogue data laravel
Last synced: 20 Oct 2025
https://github.com/erencelik/binance-public-data-node
Node.js downloader and unzipper script for Binance Public Data
binance data downloader nodejs public script
Last synced: 15 May 2026
https://github.com/musamairshad/dsa-python
This repository contains all the material related to Data Structures and Algorithms implemented in Python.
algorithms data datastructures efficiency python searching-algorithms sorting-algorithms
Last synced: 25 Mar 2025
https://github.com/parvezk/d3-fundamentals
D3 library API fundamentals
charts d3 data graphs visualization
Last synced: 19 Oct 2025
https://github.com/psgebeline/harvard-data-science
My work for the nine courses in Harvard's data science program, each with notes/assignments. Work in progress.
data linear-regression machine-learning modeling probability-theory r visualization wrangling
Last synced: 19 Oct 2025
https://github.com/meokullu/colorizenumber
ColorizeNumber - Bodrum Papatya visualizes numeric data as colors, producing an image.
color colorize colors data data-visualization visualization vizualize-data
Last synced: 01 Jun 2026
https://github.com/analyst-amitbisht/pizza-sales-report-
It's a guided project to practice tools like SSMS and Power BI, as well as skills like data cleaning, data exploration, data analysis, and data visualization.
analytics data data-visualization powerbi sql-server
Last synced: 18 Oct 2025
https://github.com/alimghmi/bdlc
Bloomberg API integration, handling data requests, processing, and SQL database insertion.
api-client bloomberg data data-processing financial-data oauth2 python sql-database transformation
Last synced: 10 Jun 2026
https://github.com/mohibmirza-py/email-verifier-script
Streamlit app to verify emails in bulk
Last synced: 29 Apr 2026
https://github.com/enoch208/eventmaster
A user-friendly application that helps you easily record and play back your keyboard and mouse actions. With its modern design using `tkinter` and `ttkthemes`, it provides a smooth and easy-to-use interface. The app combines reliable technical features to give you a great experience.
automation data key keylogging-python replay spy tools
Last synced: 01 Jun 2026
https://github.com/ronknight/user-data-dashboard
📈 A data visualization tool for analyzing user data using an Excel-based data source.
dashboard data excel ga4 screenshot
Last synced: 17 Oct 2025
https://github.com/mat06mat/matbot
My Discord bot code
data discord-bot discord-py py-cord
Last synced: 17 Oct 2025
https://github.com/dpbm/depencies-sets
Append multiple dependencies to your Python project quickly
data dependencies dependencies-list dependencies-manager dependencies-set frameworks libraries lists pip py python python3 web
Last synced: 17 Oct 2025
https://github.com/saboye/sales-performance-analysis
A dashboard that presents monthly sales performance by product segment and product category, helping clients identify the segments and categories that have met or exceeded their sales targets, as well as those that have not.
dashboard data data-science eda tableau visualization
Last synced: 27 Jan 2026
https://github.com/otoneko1102/roulette-base
A collection of roulette colors and numbers in JSON format. Use it when making a casino-style roulette.
casino casino-games data json require roulette
Last synced: 16 Mar 2025
https://github.com/vanduc1102/parse-stackoverflow-data
Parse Stack Overflow data
Last synced: 16 Oct 2025
https://github.com/fatihilhan42/nba-players-data-1950-to-2021
This project examines data on NBA players from 1950 to 2021. The players' seasons, heights, performance, scoring averages, teams, and positions were obtained from CSV files, and key tables and graphs were then created using data cleaning and data visualization techniques.
data data-analysis data-engineering data-science data-visualization
Last synced: 16 Oct 2025
https://github.com/bdr-pro/streamlint
Ultra-cool Streamlit app, where you can interact with widgets, see data in action, and even upload and download files.
Last synced: 14 Apr 2026
https://github.com/politicaargentina/opinar
📈 ICG toolbox for R - Government Confidence Index (Indice de Confianza en el Gobierno) 🇦🇷 (Universidad Torcuato Di Tella)
argentina data political-science politics public-opinion
Last synced: 22 Oct 2025
https://github.com/tyriek-cloud/statistical-work-sample
The purpose of this study is to observe whether a sample of people who have siblings is independent of a sample of people who hold an opinion on whether patients with incurable diseases should be allowed to die.
analysis data spss statistics t-test
Last synced: 22 Jan 2026
https://github.com/poissonconsulting/klexdatr
An R package of data from the Kootenay Lake Exploitation Study
cran data fish kootenay-lake rstats
Last synced: 16 Oct 2025
https://github.com/robertoostenveld/dcn.dsc_62002071_01_114_v1
Simon task M/EEG data [Data set].
Last synced: 23 Jan 2026
https://github.com/athari22/analyzing-the-yelp-dataset
SQL for Data Science
analytics data data-science data-structures er sql
Last synced: 27 Jan 2026
https://github.com/j-sephb-lt-n/personal-projects
A history of my personal projects and professional development
ai api auth cloud data llms personal-development web
Last synced: 24 Jan 2026
https://github.com/jigyasag18/project-diwali-sales-analysis
This project analyzes retail sales data during the Diwali festival using exploratory data analysis (EDA) to identify buyer demographics and product preferences. The findings reveal that the primary purchasers are married women aged 26-35 from Uttar Pradesh, Maharashtra, and Karnataka, working in IT, Healthcare, and Aviation.
analysis data datapr datapro eda jupyter-notebook python realtimedata
Last synced: 01 Jun 2026
https://github.com/j-sephb-lt-n/data-warehouse-and-etl-best-practice
A catalogue of best practices for managing data
data data-cleaning data-engineering data-validation data-warehouse etl
Last synced: 23 Jan 2026
https://github.com/nafisalawalidris/nafisalawalidris
Configuration files for my GitHub profile. Welcome to my GitHub profile! I'm Nafisa Lawal Idris, a passionate Data Scientist with a strong interest in blockchain technology. Explore my GitHub portfolio to delve into the exciting world where data science and Bitcoin converge.
artifical-intelligence bitcoin config data data-science developer github-config github-pages machine-learning
Last synced: 16 May 2026
https://github.com/gcoronelc/cepsuni-disbd-64505
Database Modeling Workshop with Gustavo Coronel
data database databases db2 db2-database modeling oracle oracle-database relational-database relational-database-design relational-databases relationships sql sql-server
Last synced: 02 May 2026
https://github.com/thais81/gamesbox
Another desktop app in JSE/JSwing with a hangman game and a tic-tac-toe game. This project was made at LDNR school with 4 friends.
data database hangman-game jse tictactoe tictactoe-game
Last synced: 28 Jan 2026
https://github.com/elijah-1994/pre-process-e-commerce-dataset
Importing, Cleaning, and Pre-Processing E-Commerce Data for Analysis Using MySQL.
analytics data dataanalytics datacleaning dataprocessing mysql mysql-database sql
Last synced: 11 Mar 2025
https://github.com/yagoluiz/enem-analise-extracao
[PT-BR] Extraction and analysis of performance data for the Central-West region
analysis data extraction python3 r
Last synced: 17 Apr 2026
https://github.com/intersystems-ib/workshop-smart-data-fabric
Learn the main ideas involved in developing a Smart Data Fabric using InterSystems IRIS
analytics data datafabric interoperability smart
Last synced: 14 Apr 2026
https://github.com/science-analyse/clv_model
Customer lifetime value prediction
banking banking-applications clv clv-analysis data data-science machine-learning
Last synced: 15 Oct 2025
https://github.com/harmanveer-2546/reducing-data-entries
A way to delete data entries from a CSV/Excel file. For an Excel file, use excel instead of csv in the code.
csv data data-entry delete-data excel numpy pandas python
Last synced: 05 May 2026
https://github.com/knowcnu12/metamask-wallet-recovery-funds-phrase-data-seed-token
This repository provides tools and guidelines for securely recovering MetaMask Wallet funds using recovery phrases, seed data, and tokens. It ensures safe and reliable methods for recovering access to your wallet and managing your cryptocurrency assets.
bitcoin blockchain cryptocurrencies cryptocurrency data ethereum funds metamask metamask-bot metamask-desktop metamask-extension metamask-plugin metamask-snap metamask-wallet phrase recovery seed token wallet wallet-security
Last synced: 08 Mar 2026
https://github.com/desininja/food-delivery-realtime-data-analysis
ETL Pipeline in AWS for Real Time Data Analysis
airflow data data-engineering emr-cluster etl kinesis kinesis-strea real-time redshift
Last synced: 15 Oct 2025
https://github.com/datamine/yelp-date
Does being on a date impact the score on a yelp review? Let's find out!
data ipython ipython-notebook pandas python python-2 yelp yelp-reviews
Last synced: 14 Apr 2026
https://github.com/instagram-automations/scrape-data-from-instagram
Instagram data scraping and automation toolkit
api automation bot data doker instagram nodejs playwright procy scrape selenium toolkit
Last synced: 14 Oct 2025
https://github.com/louis-heraut/dataverseur
🫖 A dataverse API R wrapper to enhance the deposit procedure using only R variable declarations
data data-repository data-science datascience dataset dataverse dataverse-api json metadata metadata-management metadata-parser r
Last synced: 24 Oct 2025
https://github.com/jpcurada/exploralytics
A Python package for creating intermediate Plotly visualizations
data eda plotly python visualization
Last synced: 05 Feb 2026
https://github.com/rafie-b/data-analytics
Activities of Data Analysis.
apache-spark api aws business-analytics data data-analytics data-science database dataframe jupyter-notebook python scikit-learn sql
Last synced: 14 Apr 2026
https://github.com/mominurr/fire-gas-leak-detection-system
A real-time fire prevention system integrating IoT sensors and computer vision to trigger evacuations.
ai computer-vision data datascience machine-learning ml python yolo
Last synced: 27 Jan 2026
https://github.com/brandonzylstra/essence
🧘🏼♂️ Relaxed Rails Modeling & Migrations
active-record data database gem hcl modeling rails ruby ruby-on-rails yaml
Last synced: 14 Apr 2026
https://github.com/uznetdev/smoking-prediction
This project focuses on analyzing the "Smoking" dataset and building a predictive model for smoking status based on various health metrics. The goal is to identify factors influencing smoking behavior and develop a reliable model for prediction.
ai classification data data-science kaggle-competition machine-learning ml roc-auc sklearn smoking
Last synced: 17 Apr 2026
https://github.com/arush-codes/lgmvip-data-science-task-1
data data-science iris-classification lgmvip virtual-internship
Last synced: 14 Oct 2025
https://github.com/kashifkhan7/cleaning-analysis_cli
Analyze sales data easily with our CLI app. Gain insights on revenue trends and visualize results using Python, Pandas, and Matplotlib. 🚀📊
conditional-statements css data datacleaning exception-handling exiftool html json matplotlib-pyplot metadata metadata-extraction pandas-python python sales-analysis seaborn-python speech-to-text transcription youtube
Last synced: 13 Apr 2026
https://github.com/mubashirsidiki/olympics-data-enigeering
Worked with Azure Data Factory, Databricks, Data Lake Storage, and Synapse Analytics to build an ETL pipeline for processing and analyzing Olympic Games data from Kaggle.
analytics azure big-data data dataengineering devops pipeline
Last synced: 02 May 2026
https://github.com/merekat/hb-oil-assets
An analysis of asset performance in connection with shock-like increases in the oil price since Brent crude entered the market in 1986.
analyze asset data datajournalism oil python
Last synced: 16 Mar 2026
https://github.com/quarkgluant/intro_ml_udemy
Udemy course: Introduction to Machine Learning
anaconda3 data data-preprocessing data-regression machine-learning python-3 udemy-machine-learning
Last synced: 12 May 2026
https://github.com/isandyawan/simplelinearregression
An application to analyze data using simple linear regression. It can build a regression model from variables and advise the user if the model violates regression assumptions.
data linear r regression rstudio shiny statistic
Last synced: 14 Oct 2025
https://github.com/gregorybchris/pca
PCA assignment for Park Tudor
analysis component data display embedding pca principal projection teach
Last synced: 13 May 2026
https://github.com/tomquirk/sunshine-coast-council-rates-data
Rates data for the Sunshine Coast, Australia
australia data property rates real-estate
Last synced: 24 Feb 2026
https://github.com/johndelatto/automate-your-job-search-ai-applies-to-1000-positions
Automate Your Job Search: AI Applies to 1000 Positions Overnight & Get 100+ Interviews! In today’s fast-paced and highly competitive job market, finding and securing your dream job can be both time-consuming and exhausting.
ai data non-profit open-ai open-source
Last synced: 28 Jan 2026
https://github.com/encelo/wetpaper-data
Data files for the WetPaper project
Last synced: 23 Jan 2026
https://github.com/soenneker/soenneker.data.email.disposables
Simply adds a list of compiled disposable/temporary email domains, updated daily (if available)
csharp data disposable disposables domain dotnet email mailinator
Last synced: 29 May 2026
https://github.com/polyee99/kaggle-titanic-data-analytics
Jupyter notebook to predict whether passengers survived or died in the tragic Titanic disaster.
data eda jupiter-notebook matplotlib numpy pandas python regression-analysis test-train-split visualization
Last synced: 05 Feb 2026
https://github.com/digital-media/cv_data
Datasets used for courses/tutorials at the Digital Media Department
computer-vision data image-processing images
Last synced: 14 Oct 2025
https://github.com/odiegosilva1/flask-github-style
Login page using Jinja in Flask.
data flask jinja2-templates orm python
Last synced: 31 May 2026
https://github.com/badranalyst/movie-correlation-analysis-in-python
This project analyzes movie data correlations using Python libraries like Pandas, NumPy, Seaborn, and Matplotlib. It examines relationships between attributes such as ratings, genres, and box office performance to uncover trends that inform recommendations and enhance understanding of movie success factors.
data data-analysis dataset jupyter jupyter-notebook matplotlib matplotlib-pyplot numpy pandas python seaborn
Last synced: 03 May 2026
https://github.com/amethyst-php/price-rule
amethyst amethyst-package api data laravel price price-rule rule
Last synced: 03 May 2026
https://github.com/isaacmaffeis/imad-2023
Model Identification and Data Analysis (IMAD) | University course
data data-analysis data-science model model-identification
Last synced: 09 May 2026
https://github.com/brayflex/spy-sector-rotation-google-sheet
Creates a dynamic spreadsheet to visualize SPY and its 11 largest sector ETFs. See market trends and identify potential sector rotation opportunities.
data etf google-sheets index price rotation script sector spreadsheet spy stock-market
Last synced: 28 Oct 2025