Ecosyste.ms: Awesome

An open API service indexing awesome lists of open source software.

Awesome Lists | Featured Topics | Projects

Projects in Awesome Lists tagged with data-discovery

A curated list of projects in awesome lists tagged with data-discovery .

https://github.com/datahub-project/datahub

The Metadata Platform for your Data Stack

data-catalog data-discovery datahub hacktoberfest linkedin metadata

Last synced: 17 Dec 2024

https://github.com/open-metadata/openmetadata

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.

data-catalog data-collaboration data-contracts data-discovery data-governance data-lineage data-observability data-profiling data-quality data-quality-checks data-science data-validation datadiscovery dataengineering dataquality dbt hacktoberfest metadata metadata-management snowflake

Last synced: 16 Dec 2024

https://github.com/amundsen-io/amundsen

Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.

amundsen data-catalog data-discovery linuxfoundation metadata

Last synced: 16 Dec 2024

https://github.com/reata/sqllineage

SQL Lineage Analysis Tool powered by Python

data-discovery data-governance data-lineage lineage metadata sql

Last synced: 17 Dec 2024

https://github.com/nasa/earthdata-search

Earthdata Search is a web application developed by NASA EOSDIS to enable data discovery, search, comparison, visualization, and access across EOSDIS' Earth Science data holdings.

data-discovery earthdata-search eosdis hacktoberfest

Last synced: 25 Oct 2024

https://github.com/rsyi/whale

🐳 The stupidly simple CLI workspace for your data warehouse.

data-catalog data-discovery data-documentation

Last synced: 22 Nov 2024

https://github.com/hyperqueryhq/whale

🐳 The stupidly simple CLI workspace for your data warehouse.

data-catalog data-discovery data-documentation

Last synced: 07 Oct 2024

https://github.com/recap-build/recap

Work with your web service, database, and streaming schemas in a single format.

data-catalog data-discovery data-engineering data-integration data-pipelines etl metadata recap

Last synced: 13 Dec 2024

https://github.com/swhl/ai-competition-collections

AI比赛经验帖子 & 训练和测试技巧帖子 集锦(收集整理各种人工智能比赛经验帖)

competition cv data-discovery graph-neural-networks knowledge-graph nlp recommender-system speech

Last synced: 16 Dec 2024

https://github.com/gabledata/recap

Work with your web service, database, and streaming schemas in a single format.

data-catalog data-discovery data-engineering data-integration data-pipelines etl metadata recap

Last synced: 11 Nov 2024

https://github.com/mpostol/opc-ua-ooi

Object Oriented Internet - C# deliverables supporting a new Machine To Machine (M2M) communication architecture

c-sharp communication data-discovery data-oriented-architecture ebook iiot internet iot m2m metadata networking ooi opc-ua opcua publish-subscribe semantic-data

Last synced: 28 Oct 2024

https://github.com/tosh2230/stairlight

A data lineage tool detects table dependencies from rendered SQL statements.

bigquery data-catalog data-discovery data-engineering data-governance data-lineage data-management data-ops dbt gcs lineage redash s3 sql

Last synced: 19 Nov 2024

https://github.com/carte-data/carte

A Python library to generate static data catalog sites. Carte scrapes metadata from your data assets and generates a fully searchable front end that's just HTML.

carte data-catalog data-discovery data-documentation lightweight-data-catalogs python-library

Last synced: 04 Dec 2024

https://github.com/worldbank/wb-nlp-apps

This repository contains the NLP modeling components and web application implementations of a project for knowledge and data discovery funded by the Knowledge for Change Program (KCP) and the Joint Data Center on Forced Displacement (JDC).

data-discovery lda machine-learning nlp python topic-modeling word2vec

Last synced: 10 Nov 2024

https://github.com/slaclab/datacat

A system for managing files and file replicas across many diverse sites

data-catalog data-discovery datacat dataset dataset-catalog metadata metadata-store

Last synced: 11 Dec 2024

https://github.com/michalporeba/odis

Search in decentralised systems. Search federation, result moderation, aggregation and feedback with hypermedia in ReSTful API to round it all of.

data data-discovery discoverability federated information-discovery mesh-networks search

Last synced: 16 Nov 2024

https://github.com/tjas/postgrad-ai-ddv-plotly

Jupyter Notebook to analyze the salaries of Federal District government public servants, using Python, Pandas and Plotly Express, to solve the proposed exercise in "Data Discovery and Visualization" discipline.

analysis analytics data data-analytics data-discovery data-science data-visualization graph graphs jupyter-notebook jupyter-notebooks pandas plotly plotly-express python

Last synced: 07 Nov 2024