data
Individual facts, statistics, or items of information, often numeric. In a technical sense, data are a set of values of qualitative or quantitative variables about one or more persons or objects. (https://en.wikipedia.org/w/index.php?title=Data&oldid=1093674723, released under CC BY-SA 3.0)
- GitHub: https://github.com/topics/data
- Wikipedia: https://en.wikipedia.org/wiki/Data
- Related Topics: datum,
- Last updated: 2026-07-01 00:07:35 UTC
- JSON Representation
https://github.com/edjoukou/pizza-sales-report
A data analysis project using SQL with MySQL database
analysis data mysql powerbi visualization
Last synced: 05 May 2026
https://github.com/chompfoods/stub-nodejs-server
Node.js server stub for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
api branded chomp data database food grocery ingredients node node-js node-server nodejs nutrtion raw recipe-api recipes server server-stub stub stub-server
Last synced: 05 May 2026
https://github.com/mito-ds/mitosheet_helper_config
The mitosheet_helper_config package used by enterprises to configure the mitosheet package.
data data-analytics data-science data-visualization jupyter pandas python
Last synced: 05 May 2026
https://github.com/donmaruko/python-eda-toolkit
CLI-runned EDA with 30 commands utilizing text-related functions, statistical calculations, data visualization, and data manipulation.
data data-analysis data-science data-visualization matplotlib pandas scipy seaborn statistical-analysis statistics wordcloud
Last synced: 06 May 2026
https://github.com/shibbbbs/fastapi_project
A FastAPI application that reads financial data from an Excel file (capbudg.xls) and provides API endpoints to list available tables (sheet names), fetch row names from a selected table, and calculate the sum of numerical values from a specified row. The API is accessible via a web-based interactive documentation at /docs
data dataanalysis fastapi pandas python
Last synced: 06 May 2026
https://github.com/shreshthvashisht/instgram-user-analytics
SQL Fundamentals
data data-analysis data-science mysql social-network-analysis
Last synced: 09 Jun 2026
https://github.com/ksm26/ml-ai-data-science-jobs-in-canada
Explore the latest machine learning, artificial intelligence, and data science job opportunities in Canada. Stay informed about Canadian tech job market trends and find your next career move.
ai-canada ai-careers canada canadian-tech-companies canadian-tech-job-market data data-analysis data-engineering data-science data-science-careers machine-learning prompt-engineering robotics
Last synced: 06 May 2026
https://github.com/parthds02/analyzing-student-success-with-data
Discover key factors influencing student performance through data analysis and visualization. Explore gender, parental education, sports, and ethnicity impacts.
data datascience jupyter-notebook kaggle python pythonlibraries
Last synced: 06 May 2026
https://github.com/amazenmb/web-scraping
Web Scraping Methods using Python
analytics beautifulsoup data lxml pyautogui-automation python scheduling schedulingscraping selenium webdriver webscraping xpath
Last synced: 06 May 2026
https://github.com/ekoepplin/dbt-bigquery-core
How to get data to BigQuery (or duckDB) and setup dbt tests for SODA cloud monitoring
bigquery data data-quality dbt dlt duckdb gcp soda
Last synced: 06 May 2026
https://github.com/xljones/bugsnag-exporter
Export Bugsnag project, error, and event data easily from a command line call which automatically handles pagination, and API backoffs
bash bugsnag cmd csv data error error-capture error-handling error-reporting event export go golang json project zsh
Last synced: 06 May 2026
https://github.com/iv4n-ga6l/functional-dataprocessing-pipeline
A functional data processing pipeline that accepts an input file, allows specifying both input and output formats, applies specified transformations, and produces a resulting output file.
csv data datapreprocessing excel json pandas parquet pipeline python
Last synced: 06 May 2026
https://github.com/fabsdevx/file-format-converter-handout
Data Engineering project for learning purposes. Credits to itversity
csv csv-import data data-engineering database pandas python
Last synced: 06 May 2026
https://github.com/ashleydavis/brisjs-web-scraping-talk
Code to accompany my talk on web scraping for the Brisbane JavaScript meeting in September 2018
cheerio data data-acquisition data-acquisiton electron headless-browsers javascript nightmare nightmarejs nodejs web-scraping
Last synced: 06 May 2026
https://github.com/juanpablodiaz/beertv
A Next.js Full Stack app to displays funny Beer TV Ads
api-routes data next tailwindcss
Last synced: 07 May 2026
https://github.com/shantanujpk/bigdatacloud
Exploration of PySpark for data processing and interview prep — demonstrates handling corrupted records, applying transformations/actions, and building efficient data pipelines with practical examples.
big-data data jupyter-notebook pipeline pyspark python spark sparksql
Last synced: 07 May 2026
https://github.com/lab5e/loadabledata
Simple framework-agnostic wrapper around loadable data to help encapsulate and use state changes in a UI.
async data loadable state typescript ui
Last synced: 07 May 2026
https://github.com/pocketfullofdata/electric-vehicles-market-size-analysis
This project analyzes the growth, adoption trends, and future projections of the electric vehicle (EV) market. Using data analysis and visualization techniques, it examines key factors like sales trends, and consumer adoption to understand the evolving landscape of the EV industry.
analysis data jupyter-notebook matplotlib numpy python seaborn vscode
Last synced: 07 May 2026
https://github.com/tjas/postgrad-ai-ddv-plotly
Jupyter Notebook to analyze the salaries of Federal District government public servants, using Python, Pandas and Plotly Express, to solve the proposed exercise in "Data Discovery and Visualization" discipline.
analysis analytics data data-analytics data-discovery data-science data-visualization graph graphs jupyter-notebook jupyter-notebooks pandas plotly plotly-express python
Last synced: 07 May 2026
https://github.com/safwan2003/randomforest_heart_disease_prediction
A machine learning project using Random Forest Classifier to predict heart disease. Includes data preprocessing (with binning), feature selection, and model evaluation.
binning data data-science datapipeline datapreprocessing datavisaulization deep-learning machine-learning python random-forest-classifier streamlit
Last synced: 07 May 2026
https://github.com/murtaza-arif/all-you-need-to-know-for-data-engineer
This repository is designed to showcase various aspects of data engineering, including tools, frameworks, and end-to-end projects. It covers everything from data ingestion and transformation to data warehousing and cloud-based solutions.
cassandra data data-engineering data-science kafka kafka-consumer kafka-streams pyarrow spark
Last synced: 07 May 2026
https://github.com/bhenk/msdata-d
MySql DAO
dao data data-layer database mysql mysql-database mysqli
Last synced: 07 May 2026
https://github.com/jigyasag18/iit-guhawati
Empower Sakhi is a data-driven platform that uses machine learning to identify women at risk of domestic violence in India. It offers confidential self-assessments, survivor stories, and emergency resources through a trauma-informed, privacy-focused web app. The project also provides NGOs with actionable insights via Power BI dashboard for support.
aiml data dataset datavisualization domestic-violence eda jupyter-notebook label-encoding machine-learning machine-learning-algorithms machine-learning-models machinelearning machinelearningprojects powerbi python python-app random-forest random-forest-classifier streamlit streamlit-webapp
Last synced: 08 May 2026
https://github.com/aidan-zamfir/the-iliad
Data analysis & relationship network for the characters of Homers Iliad
data data-analysis dataframes networks networkx python selenium spacy webscraping
Last synced: 08 May 2026
https://github.com/abhash-rai/regression-car-price-prediction
This repository contains my first complete data science project from web scrapping for data to data preprocessing, cleaning, exploratory data analysis, model training and deployment.
data data-science data-visualization eda exploratory-data-analysis machine-learning neural-network prediction prediction-model regression
Last synced: 08 May 2026
https://github.com/vanshuchaudhary/flightpriceanalysis-
The uploaded file is a Jupyter Notebook titled "Flight Analysis". It likely involves analyzing flight-related data, potentially exploring trends, patterns, or insights using data science techniques. The analysis might include data visualization, statistical analysis, or predictive modeling.
business-analytics data data-analysis data-visualization datainsights datascience matplotlib-pyplot python seaborn seaborn-plots seaborn-python sns statistical-analysis
Last synced: 08 May 2026
https://github.com/vaxdata22/cyclistic-ride-sharing-company
This is my Google Data Analytics Certificate case study for the Cyclistic ride-sharing company
actionable-insights business-analytics business-intelligence data data-analytics data-cleaning data-mining data-visualization data-wrangling exploratory-data-analysis google-data-analytics spreadsheets sql sql-server sql-server-management-studio statistical-analysis t-sql tableau transact-sql
Last synced: 10 Jun 2026
https://github.com/writetome51/public-data-container-interface
Just a TypeScript interface with 1 property: 'data'
container data interface typescript
Last synced: 15 May 2026
https://github.com/miniql/miniql-csv
A MiniQL query resolver that loads data from CSV files.
comma-separated-values csv data query query-language
Last synced: 08 May 2026
https://github.com/natarizkie2/neurochain-airdrop-bot
🍋 — A smart bot designed to complete data tasks like true/false selections automatically, with multi-account support for extra convenience.
airdrop automated bot data multi-account natarizkie neurochain nodejs web3
Last synced: 10 Jun 2026
https://github.com/chompfoods/sdk-typescript-angular
Angular TypeScript SDK for the Chomp Food & Recipe Database API. Use our API to get high-quality data on recipes and 875,000+ branded/grocery foods plus raw ingredients.
angular api branded chomp data database food grocery ingredients nutrition raw recipe-api recipes sdk typescript
Last synced: 09 May 2026
https://github.com/basemax/okala-product-ids
A PHP script to fetch and save product IDs from Okala's online store API across multiple categories and store branches.
crawler crawler-okala crawler-php crawlers data database ids ir iran json okala okala-crawler php php-crawler product
Last synced: 09 May 2026
https://github.com/flexthink/matricize
A convenience library to convert between pure Python objects and their vectorized representations
data machine-learning numpy python
Last synced: 09 May 2026
https://github.com/master-helix/ibm-data-analyst-certification-stock-analysis-project
This is a mini project repository of my IBM Certification involving stock analysis and plotting of Tesla and GameStop
analytics data data-analysis data-visualization ibm matplotlib pandas python web-scraping
Last synced: 09 May 2026
https://github.com/mohamedbilal1800/olympic_history_data_analysis
This project delves into the 120 Years of Olympic History: Athletes and Results dataset, analyzing athlete demographics, medal achievements, and country performances across the Summer and Winter Olympics from 1896 to 2016.
analysis data eda matplotlib-pyplot pandas python seaborn visulaization
Last synced: 09 May 2026
https://github.com/yashkp1234/movie-recommendation-engine
My project on analyzing the movie data set, and creating a recommendation engine using that analysis.
analysis data notebook python recommendation-engine
Last synced: 04 May 2025
https://github.com/baranasoftware/curricular-api
The design and implementation of a REST API for student and course data for a Higher Ed institution.
aws data data-pipeline go golang lambda rest rest-api sqlite3 system-design terraform
Last synced: 09 May 2026
https://github.com/scjoaoantonio/trab_datascience
Este projeto tem como objetivo analisar os posts da rede social Bluesky. A aplicação interativa foi desenvolvida utilizando Streamlit e permite a coleta e visualização de dados, além de oferecer análises avançadas como previsão de engajamento, modelagem de tópicos e análise de sentimentos.
bluesky data data-science streamlit
Last synced: 09 May 2026
https://github.com/sebastianbrzustowicz/flight-quality-overview-microservice
Go + Docker. Microservice with parallel computations to convert raw vehicle flight data into overview raport with visualisation.
container control csv data docker drone flight go goroutines http microservice parallel-computing pdf quadcopter raport rms sse vehicle
Last synced: 10 May 2026
https://github.com/beeracs/llama
Run Llama models in your web browser using JavaScript and WebAssembly. Explore light and dark modes easily. 🌐🐱👤
ai data fine-tuning framework gpt langchain large-language-models llama3 llamaindex llm lora machine-learning nlp peft qlora qwen rlhf vllm
Last synced: 10 May 2026
https://github.com/aneeshmurali-n/nlp-emotion-classification-in-text
Develop machine learning models to classify emotions in text samples.
bag-of-words data emotion-classification feature-extraction machine-learning naive-bayes natural-language-processing nlp nltk preprocessing python scikit-learn svm text-classification tf-idf tokenizer vectorizer
Last synced: 10 May 2026
https://github.com/notthestallion/pca__3d-and-from-scratch__principal-component-analysis
In this project, I will be implementing Principal Component Analysis (PCA) from scratch on an ecological footprint consummation database for countries and a three-dimensional scale using a movie database. The goal of this project is to gain a deeper understanding of PCA and to demonstrate its capabilities in exploring complex datasets.
data data-science database pca pca-analysis principal-component-analysis principal-component-analysis-pca principle-component-analysis
Last synced: 10 May 2026
https://github.com/repirate/asset-recovery-tool
A simple tool for recovering undrained tokens and NFTs from a compromised wallet on the Ethereum network.
bitcoin blockchain cryptocurrencies cryptocurrency data ethereum funds metamask-desktop metamask-plugin phrase recovery seed token wallet
Last synced: 10 May 2026
https://github.com/infinitode/crsd
A synthetic customer review sentiment dataset for sentiment analysis generated using different AI models.
ai data dataset datasets huggingface-datasets mit-license ml nlp open-source python sentiment sentiment-analysis sentiment-classification text-data
Last synced: 10 Jun 2026
https://github.com/afeiship/data-arary
Data array with some new methods.
array data data-structure js list
Last synced: 11 May 2026
https://github.com/alimghmi/bdlc
Bloomberg API integration, handling data requests, processing, and SQL database insertion.
api-client bloomberg data data-processing financial-data oauth2 python sql-database transformation
Last synced: 10 Jun 2026
https://github.com/quarkgluant/intro_ml_udemy
cours Udemy d'Introduction au Machine Learning
anaconda3 data data-preprocessing data-regression machine-learning python-3 udemy-machine-learning
Last synced: 12 May 2026
https://github.com/mateogiuffra/estrd2024s1
trabajos prácticos realizados en la materia Estructura de Datos de la Universidad Nacional de Quilmes (UNQ)
c cpp data data-structures-and-algorithms eficiency functional-programming haskell unq
Last synced: 12 May 2026
https://github.com/miniql/notebook-example
An example of MiniQL in a JavaScript Notebook
comma-separated-values csv data data-analysis data-science graphql javascript notebook query query-language
Last synced: 13 May 2026
https://github.com/cvinicius987/projetos-bigdata
Estudos de caso envolvendo projetos de BigData e Engenharia de Dados.
bigdata data data-engineering spark
Last synced: 13 May 2026
https://github.com/triboot/ultimate-playerprefs
This repository is only created for **Issue-Tacking** and the **Wiki-Documentation** (Wiki Docs are cooming soon) of Ultimate PlayerPrefs. The plugin source code is fully open and available after the purchase on the Unity Asset Store.
anticheat assetstore data datamanagement easy manager persistent-storage playerprefs playerprefsmanager savegame storage tools unity unity3d unity3d-editor
Last synced: 11 Jun 2026
https://github.com/andygol/andygol.github.io
Andrii Holovin – Product & Project Manager Geospatial Expert / OpenStreetMap Consultant / DevOps practitioner
consultant data data-structures devops experience floss gis mapping navigation openstreetmap personal-site personal-website
Last synced: 13 May 2026
https://github.com/m0nica/datalogues-refresh
:bar_chart: Programming blog focused on data with an emphasis on exploration in Python.
data jekyll python technical-writing
Last synced: 14 May 2026
https://github.com/lulloooo/article-fromfitto55tofittoeveryone
Analysis leading to an article published in the EcoSprinter 2024 Annual edition about an Analysis of EU "Fit for 55" packages under a different perspective 🔎
analysis data environment european-union
Last synced: 12 Jun 2026
https://github.com/iannil/one-data-studio
one-data-studio integrates a data governance and development platform, a cloud-native MLOps platform, and a large model application development platform. It connects the entire value chain from raw data governance to model training and deployment, and further to the construction of generative AI applications.
Last synced: 12 Jun 2026
https://github.com/jurooravec/knwldg
Datasets, scrapers, pipelines
companies crawler data dataset non-profit-organizations scraper scrapy
Last synced: 13 Jun 2026
https://github.com/cannt39t/wylsacom-analysis-reflinks-datamining
data data-analysis data-mining python3 sql
Last synced: 13 Jun 2026
https://github.com/word2vect/beijing-pm2.5-data-process
Beijing PM2.5 Data Process for Python Programming 2024 Fall Data Visualization Lab 2
Last synced: 15 Jun 2026
https://github.com/marielachirinosr/nyc-taxi-trip-exploration-2019-2020
Explores passenger behavior & impact of COVID-19 on NYC taxi industry (Q1 2019-2020).
bigquery data data-analysis data-visualization python sql tableau
Last synced: 15 Jun 2026
https://github.com/lafkpages/minecraft-crafting-info
Scrapes https://www.minecraftcrafting.info for crafting recipes.
Last synced: 17 Jun 2026
https://github.com/dineshram0212/youtube-analysis
This YouTube Analysis Package provides tools for analyzing YouTube video data, including metrics on views, likes, comments, and engagement trends. Ideal for gaining insights into video performance and audience interaction patterns.
data data-visualization pandas python webscraping youtube-api-v3
Last synced: 19 Jun 2026
https://github.com/artcc/coredatademo
Demo for CoreDataGenericModule implementation
core coredata coredata-model data encrypted encrypted-data encryption persist
Last synced: 19 Jun 2026
https://github.com/rileynwong/forecasting-coffee-prices
Predict coffee prices in Kenya
data data-analysis data-scraping data-visualization forecasting forecasting-models forecasting-prices jupyter-notebook prophet prophet-model
Last synced: 20 Jun 2026
https://github.com/svetlanam/etl-transformation
ETL data cleaning and transformation for specific use case in own Keboola project
cleaning data etl keboola python rest-api transformation
Last synced: 20 Jun 2026
https://github.com/supunlakmal/coronavirus-covid-19-status
Covid 19 cases and death count for each country in a json file.
coronavirus count country covid-19 covid-data covid19 data data-science data-visualization geographical geographical-information-system json
Last synced: 21 Jun 2026
https://github.com/peterhellberg/bugsnag-data
Dump Bugsnag data using the Data access API
Last synced: 22 Jun 2026
https://github.com/hlan22/2025-03-18-data-validation
(no longer useful) DSCI 310 Lecture about Data validation and code testing! Made in tandem with:
Last synced: 23 Jun 2026
https://github.com/dineshdhamodharan24/data-analysis
probability Analysis to customers and bascis analysis
analysis data powerbi probability python visualization
Last synced: 23 Jun 2026
https://github.com/chrisabruce/scrapling-rs
Adaptive web scraping, built in Rust. A high-performance port of Python Scrapling.
ai ai-scraping automation crawler crawling crawling-rust data data-extraction mcp mcp-server playwright rust-lang scraping selectors stealth web-scraper web-scraping web-scraping-rust webscraping xpath
Last synced: 26 Jun 2026
https://github.com/matthewgferrari/covid-contextualizer
A Coronavirus Contextualizer for the USA
Last synced: 26 Jun 2026
https://github.com/rodrigojunqueiradev/curso-python-3-do-basico-ao-avancado
Curso de Python 3 do básico ao avançado - com projetos reais
data data-analysis data-science python python-3 python-library python-script python3
Last synced: 27 Jun 2026
https://github.com/sakan811/honkai-star-rail-characters-damage-simulation
Honkai Star Rail Characters' Damage Simulation
data data-science data-visualization honkai honkai-star-rail honkai-starrail powerbi powerbi-visuals python sqlite
Last synced: 29 Jun 2026
https://github.com/nitheshgoutham/sentinel-2-data-processing-for-pichavaram-mangrove-forest-using-cnn
Image Processing using CNN
cnn cnn-classification cnn-keras data deep-learning matplotlib ploty python seaborn-python visualization
Last synced: 29 Jun 2026
https://github.com/vedantwalia/mymusicvisualisationproject
data datavisualisation json jupyter-notebook pandas python xml xml-parser
Last synced: 09 Apr 2026