An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with impala

A curated list of projects in awesome lists tagged with impala .

https://github.com/apache/linkis

Apache Linkis builds a computation middleware layer to facilitate connection, governance and orchestration between the upper applications and the underlying data engines.

application-manager context-service engine hive hive-table impala jdbc jobserver linkis livy presto pyspark resource-manager rest-api scriptis spark sql storage thrift-server udf

Last synced: 20 Jan 2026

https://github.com/x-ream/sqli

orm sql query builder, API: QB, QB.X, QrB

clickhouse impala jdbc mysql orm presto query-builder tdengine xream

Last synced: 15 May 2025

https://github.com/wgzhao/addax

A fast and versatile ETL tool that can transfer data between RDBMS and NoSQL seamlessly

clickhouse database etl excel hadoop hdfs hive impala influxdb kudu mysql oracle postgresql sqlserver trino

Last synced: 03 Apr 2026

https://github.com/wgzhao/Addax

A fast and versatile ETL tool that can transfer data between RDBMS and NoSQL seamlessly

clickhouse database etl excel hadoop hdfs hive impala influxdb kudu mysql oracle postgresql sqlserver trino

Last synced: 14 Apr 2025

https://github.com/apache/impala

Apache Impala

impala

Last synced: 14 May 2025

https://github.com/google-research/seed_rl

SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's architecture.

atari deepmind-lab gcp google-research-football impala r2d2 rl tf2

Last synced: 18 Mar 2025

https://github.com/huawei-noah/xingtian

xingtian is a componentized library for the development and verification of reinforcement learning algorithms

dqn impala muzero ppo qmix reinforcement-learning-algorithms

Last synced: 05 Apr 2025

https://github.com/automattic/php-thrift-sql

A PHP library for connecting to Hive or Impala over Thrift

database hive impala php sql thrift

Last synced: 06 Apr 2025

https://github.com/ianmcook/implyr

SQL backend to dplyr for Impala

apache dplyr dplyr-sql-backends hadoop impala jdbc odbc r sql tidyverse

Last synced: 07 Apr 2025

https://github.com/turboway/pybigdata

使用 python 操作大数据的各种组件

elasticsearch hadoop hbase hive impala kafka mapreduce spark

Last synced: 23 Jul 2025

https://github.com/ascrus/getl

A tool for developing and testing ETL and ELT processes for automating the capture, delivery and processing of information in data warehouses on the MicroFocus Vertica platform.

csv dsl elt etl excel hdfs hive impala json kafka sql unit-testing vertica xml

Last synced: 14 Jun 2025

https://github.com/tomwhite/docker-impala

Run Impala in a Docker container.

docker hadoop impala

Last synced: 29 Oct 2025

https://github.com/archf/ansible-sql-runner

Run a sequence of sql queries or scripts on a target db

ansible database impala phoenix postgresql role sql

Last synced: 11 Apr 2025

https://github.com/rosacarla/dio-cloud-data-engineer

Reúne atividades e projetos realizados durante o bootcamp Cognizant Cloud Data Engineer #2, promovido pela DIO Inc.

etl hadoop hive impala linux mongodb python sql

Last synced: 12 Apr 2025

https://github.com/kwartile/spark-benchmark

Spark Benchmark suite to evaluate cluster configuration and compare the performance with other big data frameworks.

apache-spark benchmark benchmarking-suite cdh cloudera-hadoop hadoop hive impala performance scala spark

Last synced: 26 Apr 2026

https://github.com/starhe/balm

基于Spring Boot全家桶打造,大数据PAAS组件适配器,一键适配DolphinScheduler、Hadoop、Spark、Hive、Impala、HBase、Kafka、StarRocks、ClickHouse、Neo4j,通过标准REST接口操作,简单易用,方便二次开发和集成

clickhouse dolphinscheduler hadoop hbase hive impala kafka neo4j spark spring starrocks

Last synced: 13 Feb 2026

https://github.com/tranch/impalacli

CLI for Impala Database. With syntax highlighting.

cli dbcli highlight impala

Last synced: 14 Jan 2026

https://github.com/jianzhnie/scalerl

ScaleRL is a simple and scalable distributed reinforcement learning framework based on Python and PyTorch

a3c distributed-systems dppo impala parallel-computing seed-rl

Last synced: 02 Mar 2025

https://github.com/rosacarla/hive-bootcamp

[Forked from vmb / hive - GitLab] para realizar práticas do bootcamp Cognizant Cloud Data Engineer #2.

big-data database hive impala shellscript

Last synced: 07 May 2026

https://github.com/davidkhala/hadoop

hadoop ecosystem index

atlas hbase impala

Last synced: 22 Mar 2025

https://github.com/exasol/impala-virtual-schema

Virtual Schema for connecting Impala as a data source to Exasol

apache-impala exasol exasol-integration impala virtual-schema

Last synced: 08 Aug 2025

https://github.com/awhipp/impala-to-elastic-import

Simple Python Script that imports an Impala Query into an Elastic Search Index

elasticsearch impala python

Last synced: 04 Jun 2026

https://github.com/leisurelyleon/mastercard-lead-data-engineer

A tailored list of exemplified files corresponding to required skills for an appliable career position at Mastercard Inc.

apache apache-spark big-data hive impala java kafka nifi nosql nosql-database nosql-databases object-oriented object-oriented-programming oozie postgresql python scala spark sqoop

Last synced: 08 Apr 2026