Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Projects in Awesome Lists tagged with data-discovery
A curated list of projects in awesome lists tagged with data-discovery.
https://github.com/eugeneyan/applied-ml
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
applied-data-science applied-machine-learning computer-vision data-discovery data-engineering data-quality data-science deep-learning machine-learning natural-language-processing production recsys reinforcement-learning search
Last synced: 23 Nov 2024
https://github.com/datahub-project/datahub
The Metadata Platform for your Data Stack
data-catalog data-discovery datahub hacktoberfest linkedin metadata
Last synced: 17 Dec 2024
https://github.com/open-metadata/openmetadata
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.
data-catalog data-collaboration data-contracts data-discovery data-governance data-lineage data-observability data-profiling data-quality data-quality-checks data-science data-validation datadiscovery dataengineering dataquality dbt hacktoberfest metadata metadata-management snowflake
Last synced: 16 Dec 2024
https://github.com/amundsen-io/amundsen
Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.
amundsen data-catalog data-discovery linuxfoundation metadata
Last synced: 16 Dec 2024
https://github.com/open-metadata/OpenMetadata
Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.
data-catalog data-collaboration data-contracts data-discovery data-governance data-lineage data-observability data-profiling data-quality data-quality-checks data-science data-validation datacatalog datadiscovery dataengineering dataquality dbt metadata metadata-management snowflake
Last synced: 27 Oct 2024
https://github.com/marquezproject/marquez
Collect, aggregate, and visualize a data ecosystem's metadata
data-dictionary data-discovery data-ecosystem-metadata data-governance data-lineage data-ops data-provenance marquez metadata metadata-service
Last synced: 17 Dec 2024
https://github.com/MarquezProject/marquez
Collect, aggregate, and visualize a data ecosystem's metadata
data-dictionary data-discovery data-ecosystem-metadata data-governance data-lineage data-ops data-provenance marquez metadata metadata-service
Last synced: 30 Oct 2024
https://marquezproject.github.io/marquez/
Collect, aggregate, and visualize a data ecosystem's metadata
data-dictionary data-discovery data-ecosystem-metadata data-governance data-lineage data-ops data-provenance marquez metadata metadata-service
Last synced: 13 Nov 2024
https://github.com/reata/sqllineage
SQL Lineage Analysis Tool powered by Python
data-discovery data-governance data-lineage lineage metadata sql
Last synced: 17 Dec 2024
https://github.com/opendatadiscovery/odd-platform
First open-source data discovery and observability platform. We make life easy for data practitioners so you can focus on your business.
alerting bigdata data-catalog data-discovery data-engineering data-exploration data-governance data-lineage data-observability data-pipelines data-platform data-profiling data-quality data-science datacatalog lineage metadata metadata-management observability oss
Last synced: 19 Dec 2024
https://github.com/nasa/earthdata-search
Earthdata Search is a web application developed by NASA EOSDIS to enable data discovery, search, comparison, visualization, and access across EOSDIS' Earth Science data holdings.
data-discovery earthdata-search eosdis hacktoberfest
Last synced: 25 Oct 2024
https://github.com/rsyi/whale
🐳 The stupidly simple CLI workspace for your data warehouse.
data-catalog data-discovery data-documentation
Last synced: 22 Nov 2024
https://github.com/hyperqueryhq/whale
🐳 The stupidly simple CLI workspace for your data warehouse.
data-catalog data-discovery data-documentation
Last synced: 07 Oct 2024
https://github.com/recap-build/recap
Work with your web service, database, and streaming schemas in a single format.
data-catalog data-discovery data-engineering data-integration data-pipelines etl metadata recap
Last synced: 13 Dec 2024
https://github.com/swhl/ai-competition-collections
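A collection of AI competition experience posts and training and testing tips (gathering and organizing experience posts from various AI competitions)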
competition cv data-discovery graph-neural-networks knowledge-graph nlp recommender-system speech
Last synced: 16 Dec 2024
https://github.com/gabledata/recap
Work with your web service, database, and streaming schemas in a single format.
data-catalog data-discovery data-engineering data-integration data-pipelines etl metadata recap
Last synced: 11 Nov 2024
https://github.com/mpostol/opc-ua-ooi
Object Oriented Internet - C# deliverables supporting a new Machine To Machine (M2M) communication architecture
c-sharp communication data-discovery data-oriented-architecture ebook iiot internet iot m2m metadata networking ooi opc-ua opcua publish-subscribe semantic-data
Last synced: 28 Oct 2024
https://github.com/opendatadiscovery/opendatadiscovery-specification
ODD Specification is a universal open standard for collecting metadata.
api big-data big-data-platform data-discovery data-engineering data-governance data-mesh data-platform metadata metadata-management metadata-parser open-source opensource spec specification
Last synced: 14 Nov 2024
https://github.com/tosh2230/stairlight
A data lineage tool that detects table dependencies from rendered SQL statements.
bigquery data-catalog data-discovery data-engineering data-governance data-lineage data-management data-ops dbt gcs lineage redash s3 sql
Last synced: 19 Nov 2024
https://github.com/carte-data/carte
A Python library to generate static data catalog sites. Carte scrapes metadata from your data assets and generates a fully searchable front end that's just HTML.
carte data-catalog data-discovery data-documentation lightweight-data-catalogs python-library
Last synced: 04 Dec 2024
https://github.com/related-sciences/articat
articat: data artifact catalog
data-catalog data-discovery data-management data-platform
Last synced: 15 Nov 2024
https://github.com/unytics/catalog_builder
Data Catalogs Made Easy
bigquery data-catalog data-discovery databricks dbt redshift snowflake
Last synced: 07 Nov 2024
https://github.com/worldbank/wb-nlp-apps
This repository contains the NLP modeling components and web application implementations of a project for knowledge and data discovery funded by the Knowledge for Change Program (KCP) and the Joint Data Center on Forced Displacement (JDC).
data-discovery lda machine-learning nlp python topic-modeling word2vec
Last synced: 10 Nov 2024
https://github.com/slaclab/datacat
A system for managing files and file replicas across many diverse sites
data-catalog data-discovery datacat dataset dataset-catalog metadata metadata-store
Last synced: 11 Dec 2024
https://github.com/michalporeba/odis
Search in decentralised systems. Search federation, result moderation, aggregation and feedback with hypermedia in a ReSTful API to round it all off.
data data-discovery discoverability federated information-discovery mesh-networks search
Last synced: 16 Nov 2024
https://github.com/tjas/postgrad-ai-ddv-plotly
Jupyter Notebook that analyzes the salaries of Federal District government public servants, using Python, Pandas and Plotly Express, to solve the proposed exercise in the "Data Discovery and Visualization" discipline.
analysis analytics data data-analytics data-discovery data-science data-visualization graph graphs jupyter-notebook jupyter-notebooks pandas plotly plotly-express python
Last synced: 07 Nov 2024