Projects in Awesome Lists tagged with trino
A curated list of projects in awesome lists tagged with trino.
https://github.com/trinodb/trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
analytics big-data data-science database databases datalake delta-lake distributed-database distributed-systems hadoop hive iceberg java jdbc presto prestodb query-engine sql trino
Last synced: 12 May 2025
https://github.com/ibis-project/ibis
the portable Python dataframe library
bigquery clickhouse database datafusion duckdb impala mssql mysql pandas polars postgresql pyarrow pyspark python snowflake sql sqlite trino
Last synced: 13 May 2025
https://github.com/datafold/data-diff
Compare tables within or across databases
data data-diffing data-engineering data-quality data-quality-monitoring data-science database databricks-sql dataengineering dataquality dbt mysql oracle-database postgres postgresql python rdbms snowflake sql trino
Last synced: 24 Mar 2025
https://github.com/evgskv/logica
Logica is a logic programming language that compiles to SQL. It runs on DuckDB, Google BigQuery, PostgreSQL and SQLite.
bigquery datalog language logic-programming logica postgresql presto prolog prolog-implementation sql sqlite trino
Last synced: 13 May 2025
https://github.com/EvgSkv/logica
Logica is a logic programming language that compiles to SQL. It runs on Google BigQuery, PostgreSQL and SQLite.
bigquery datalog language logic-programming logica postgresql presto prolog prolog-implementation sql sqlite trino
Last synced: 20 Mar 2025
https://github.com/devlive-community/datacap
DataCap is integrated software for data transformation, integration, and visualization. It supports a variety of data sources, file types, big-data databases, relational databases, NoSQL databases, etc. The software can manage multiple data sources and apply various conversion operations to the data under each source ...
clickhouse database db2 dremio druid elasticsearch h2 hive ignite kylin kyuubi monetdb mongodb mysql phoenix postgresql presto redis sqlserver trino
Last synced: 14 May 2025
https://github.com/yanagishima/yanagishima
Web UI for Trino, Hive and SparkSQL
elasticsearch hive spark trino
Last synced: 27 Mar 2025
https://github.com/ploomber/jupysql
Better SQL in Jupyter. 📊
bigquery clickhouse data-engineering data-science duckdb hive jupyter mysql polars postgres presto python redshift snowflake spark-sql sql sqlite trino tsql
Last synced: 23 Jan 2025
https://github.com/vmware/versatile-data-kit
One framework to develop, deploy and operate data workflows with Python and SQL.
analytics data data-engineer data-engineering data-engineering-pipeline data-lineage data-pipelines data-science data-structures data-warehouse database dataops elt etl pipeline python snowflake sql trino warehouse
Last synced: 15 May 2025
https://github.com/devlive-community/dbm
Full-platform database management tool supporting ClickHouse, Presto, Trino, MySQL, PostgreSQL, Apache Druid, ElasticSearch...
clickhouse database electron mysql postgresql presto sql trino
Last synced: 05 Apr 2025
https://github.com/dtstack/dt-sql-parser
SQL Parsers for BigData, built with antlr4.
antlr4 autocompletion bigdata flink hive impala mysql parser postgresql spark sql sql-validation trino
Last synced: 14 May 2025
https://github.com/DTStack/dt-sql-parser
SQL Parsers for BigData, built with antlr4.
antlr4 autocompletion bigdata flink hive impala mysql parser postgresql spark sql sql-validation trino
Last synced: 01 Apr 2025
https://github.com/wix-incubator/quix
Quix Notebook Manager
athena bigquery notebook-manager presto trino
Last synced: 16 May 2025
https://wix.github.io/quix
Quix Notebook Manager
athena bigquery notebook-manager presto trino
Last synced: 29 Mar 2025
https://github.com/bitsondatadev/trino-getting-started
docker docker-compose getting-started presto prestosql trino tutorial
Last synced: 16 May 2025
https://github.com/trannhatnguyen2/nyc_taxi_data_pipeline
Nyc_Taxi_Data_Pipeline - DE Project
airflow dbt debezium docker great-expectations kafka minio postgresql spark trino
Last synced: 06 Apr 2025
https://github.com/CybercentreCanada/jupyterlab-sql-editor
A JupyterLab extension providing a SQL formatter, auto-completion, syntax highlighting, and Spark SQL and Trino support
auto-completion dataframe datagrid extension formatter ipython-magic json jupyterlab lsp nested-structures notebook schema sparksql sql syntax-highlighting trino vscode-extension
Last synced: 06 Mar 2025
https://github.com/cybercentrecanada/jupyterlab-sql-editor
A JupyterLab extension providing a SQL formatter, auto-completion, syntax highlighting, and Spark SQL and Trino support
auto-completion dataframe datagrid extension formatter ipython-magic json jupyterlab lsp nested-structures notebook schema sparksql sql syntax-highlighting trino vscode-extension
Last synced: 04 Apr 2025
https://github.com/josephmachado/analytical_dp_with_sql
Code for my "Efficient Data Processing in SQL" book.
Last synced: 15 Apr 2025
https://github.com/aakashnand/trino-ranger-demo
Tutorial on how to set up Trino and Apache Ranger using Docker
dataengineering datagovernance docker hacktoberfest ranger trino tutorial
Last synced: 04 Apr 2025
https://github.com/abeltavares/real-time-data-pipeline
📡 Real-time data pipeline with Kafka, Flink, Iceberg, Trino, MinIO, and Superset. Ideal for learning data systems.
apache-flink apache-iceberg apache-kafka apache-superset aws big-data data-engineering data-pipeline data-visualization docker etl lakehouse minio open-source real-time-data s3 sql-analytics streaming-analytics trino
Last synced: 22 Apr 2025
https://github.com/IBM/trino-db2
Db2 JDBC connector for Trino
db2 presto prestosql trino trino-plugin
Last synced: 10 May 2025
https://github.com/dbsystel/datalake-graphql-wrapper
The DataLake GraphQL Wrapper provides a GraphQL API for presto/trino.
boilerplate cli datalake generator graphql pothos presto prestodb prestosql template trino trinodb typescript wrapper-api yoga-graphql
Last synced: 20 Nov 2024
https://github.com/kestra-io/plugin-jdbc
clickhouse jdbc kestra mysql oracle pinot plugin postgres postgresql redshift rockset snowflake sql sqlserver trino vectorwise vertica
Last synced: 15 Apr 2025
https://github.com/alberttwong/databasecomparison
Comparison of batch and real time OLAP databases
apache-druid clickhouse database olap starrocks trino
Last synced: 05 Apr 2025
https://github.com/wgzhao/presto-udfs
Some useful user-defined functions (UDFs) for both PrestoSQL and TrinoDB
exchange presto securities trino udf
Last synced: 15 Feb 2025
https://github.com/criccomini/hive-metastore-standalone
Apache Hive Metastore in Standalone Mode With Docker
docker github-workflow github-workflows hadoop hcatalog hive hive-metastore presto prestodb trino trinodb
Last synced: 12 May 2025
https://github.com/regadas/sqltools-trino-driver
SQLTools driver for Trino
sqltools sqltools-driver trino trinodb vscode vscode-extension
Last synced: 14 Apr 2025
https://github.com/duhanmin/bigdata-sql-parser
Data lineage: supports Spark SQL, Hive SQL, PG SQL, Presto SQL, MySQL SQL, TiDB SQL, Flink SQL, and DataX lineage, as well as lineage parsing of Spark/Flink jar run commands; supports WITH syntax
datax flink hive mysql postgresql presto spark tidb trino
Last synced: 04 Apr 2025
https://github.com/wgzhao/trino-event-logger
Trino query log storage plugin that saves all Trino query statements and related information
event-listener trino trino-plugin
Last synced: 07 May 2025
https://github.com/os-climate/osc-ingest-tools
Python tools to assist with standardized data ingestion workflows
pandas python sql sqlalchemy trino trinodb
Last synced: 12 Apr 2025
https://github.com/aerospike/trino-aerospike.docker
Trino with the Aerospike connector Docker image
aerospike docker trino trino-plugin
Last synced: 19 Apr 2025
https://github.com/zncdatadev/trino-operator
Operator for Trino, the distributed SQL query engine for big data
Last synced: 14 May 2025
https://github.com/ren294/smarttraffic_lakehouse_for_hcmc
A Smart Traffic Management System for Ho Chi Minh City, Vietnam leveraging batch and real-time data processing, intuitive dashboards, and monitoring tools to optimize traffic flow, enhance safety, and support sustainable urban mobility through advanced analytics and user-friendly applications.
apache-airflow apache-flink apache-hive apache-hudi apache-kafka apache-nifi apache-spark apache-superset apache-zookeeper big-data debezium grafana lakefs metabase minio promotheus redis seatunnel streamlit trino
Last synced: 11 Apr 2025
https://github.com/os-climate/osc-trino-acl-dsl
A declarative format for configuring Trino access control
dsl json jsonschema python trino trino-acl-rules trinodb yaml
Last synced: 24 Mar 2025
https://github.com/apache/pulsar-sql
Pulsar SQL extracted from apache/pulsar
apache-pulsar pulsar pulsar-sql sql trino
Last synced: 09 Apr 2025
https://github.com/regadas/trino-pubsub-event-listener
Trino Google Pub/Sub event listener
google-cloud-platform trino trinodb
Last synced: 13 Apr 2025
https://github.com/grihabor/trino-query-parser
Provides a parser for trino queries
presto presto-parser sql trino trino-parser
Last synced: 04 Mar 2025
https://github.com/sudohainguyen/mini-lakehouse
Data lakehouse at home with docker compose
data-lakehouse hive iceberg trino
Last synced: 21 Mar 2025
https://github.com/pirate-emperor/bigdata-pipeline
BigData Pipeline is a local testing environment for experimenting with various storage solutions (RDB, HDFS), query engines (Trino), schedulers (Airflow), and ETL/ELT tools (DBT). It supports MySQL, Hadoop, Hive, Kudu, and more.
airflow airflow-dags airflow-docker big-data data-lake data-lakestore data-warehouse dbt dbt-core distributed-computing docker docker-compose hadoop hive hiveql kudu mysql mysql-server trino trino-cli
Last synced: 31 Jan 2025
https://github.com/jaceklaskowski/trino-meetups
Learning Trino in a series of Warsaw Data Engineering meetups online!
Last synced: 21 Feb 2025
https://github.com/alex1on/information-systems-ntua
Distributed execution of SQL queries over Trino
cassandra distributed-queries postgresql redis sql-engine tpc-ds-benchmark trino
Last synced: 23 Mar 2025
https://github.com/sonicexedvp/real-time-data-pipeline
📡 Real-time data pipeline with Kafka, Flink, Iceberg, Trino, MinIO, and Superset. Ideal for learning data systems.
apache-flink apache-iceberg apache-kafka apache-superset big-data data-engineering data-pipeline data-visualization docker etl lakehouse minio open-source real-time-data sql-analytics streaming-analytics trino
Last synced: 12 Mar 2025
https://github.com/dain55788/end-to-end-streaming-big-data
End-To-End Streaming Big Data Project makes processing big data easy.
airflow dbt docker kafka minio postgresql spark trino
Last synced: 26 Mar 2025
https://github.com/tcd93/invoice-data-pipeline
A sample data pipeline for transforming invoice images and CSV files into beautiful numbers
airflow data-pipeline kubernetes python trino
Last synced: 01 Apr 2025
https://github.com/okdp/charts
Collection of OKDP helm charts
helm-charts hive-metastore kubernetes okdp spark-history-server spark-kubernetes superset trino
Last synced: 02 Mar 2025
https://github.com/tomkat-cr/data_lakehouse_local_stack
Data Lakehouse local stack with PySpark, Trino, and MinIO. Includes an example that processes Raygun error data and IP address occurrences.
docker-compose hive hive-metastore minio minio-storage python spark spark-sql trino
Last synced: 27 Feb 2025
https://github.com/dilermando-lima/trino-pg-mysql-s3-parquet
Trino cluster that collects data from MySQL and PostgreSQL, processes it, and saves it to S3 as Parquet
bigdata datalake docker mysql postgresql query-engine s3 trino trinodb
Last synced: 07 May 2025
https://github.com/mridang/athena-mongodb
MongoDB connector for AWS Athena Federation
apache-arrow athena aws lambda mongodb trino
Last synced: 29 Mar 2025
https://github.com/vubacktracking/stream-data-processing
Streaming data processing pipeline using Spark, PostgreSQL, Debezium, Kafka, Minio, Delta Lake, Trino and DBeaver
dbeaver debezium delta-lake kafka spark spark-streaming stream-processing trino
Last synced: 06 Mar 2025
https://github.com/yuhexiong/doris-to-trino-data-pipeline-spark-python
apache-doris apache-spark data-pipeline trino
Last synced: 18 Mar 2025
https://github.com/apostolis1/trino-study
A study of the Trino distributed execution engine using the TPCDS benchmark
benchmark databases distributed-systems tpcds trino
Last synced: 26 Feb 2025
https://github.com/ac-gomes/spark-iceberg-hive
apache-iceberg apache-spark hive-metastore iceberg minio spark trino
Last synced: 14 Dec 2024
https://github.com/ebyhr/trino-php-client
Fork from https://github.com/ytake/php-presto-client
Last synced: 21 Mar 2025