An open API service indexing awesome lists of open source software.

Projects in Awesome Lists tagged with apache-hive

A curated list of projects in awesome lists tagged with apache-hive.

https://github.com/hadifadl/hive-cmd

All cmd files needed to run Hive on Windows (taken from https://svn.apache.org/repos/asf/hive/trunk/bin/)

apache-hive hive windows

Last synced: 13 Jul 2025

https://github.com/mvrpl/hiveapi

Hive REST API written in Java

apache-hive json json-api rest-api

Last synced: 04 May 2026

https://github.com/ren294/smarttraffic_lakehouse_for_hcmc

A Smart Traffic Management System for Ho Chi Minh City, Vietnam leveraging batch and real-time data processing, intuitive dashboards, and monitoring tools to optimize traffic flow, enhance safety, and support sustainable urban mobility through advanced analytics and user-friendly applications.

apache-airflow apache-flink apache-hive apache-hudi apache-kafka apache-nifi apache-spark apache-superset apache-zookeeper big-data debezium grafana lakefs metabase minio promotheus redis seatunnel streamlit trino

Last synced: 11 Apr 2025

https://github.com/nuttymoon/nifi-hive3streaming-fixed

A NiFi bundle containing a stable implementation of the PutHive3Streaming processor

apache-hive apache-nifi hive-streaming nifi-processor

Last synced: 31 Mar 2025

https://github.com/narius2030/hive-datawarehouse-analysis

Implement a Hive data warehouse to store meaningful data, and apply machine learning techniques such as clustering or regression to business problems

apache-hadoop apache-hive data-analysis etl-pipeline hiveql machine-learning statistics

Last synced: 01 Apr 2025

https://github.com/tspannhw/table-ddl

DDL for Kudu, Impala, Phoenix, HBase, Hive, MySQL, PostgreSQL, Calcite, ... Tables. SQL.

apache-hbase apache-hive apache-kudu apache-phoenix apache-sparksql cloudera ddl hue nifi sql

Last synced: 19 Mar 2026

https://github.com/yingzhuo/hive-my-functions

My apache-hive User Defined Functions.

apache-hive udf udtf utaf

Last synced: 05 Aug 2025

https://github.com/mikeacosta/san-francisco-crime

SF crime data analysis with Apache Spark

apache-hive apache-spark hadoop hdfs hortonworks

Last synced: 17 May 2026

https://github.com/Narius2030/Hive-DataWarehouse-Analysis

Implement a Hive data warehouse to store meaningful data, and apply machine learning techniques such as clustering or regression to business problems

apache-hadoop apache-hive data-analysis etl-pipeline hiveql machine-learning statistics

Last synced: 12 Aug 2025

https://github.com/cevheri/hive-java-example

Hive Query Language example with Apache Hive, Apache Hadoop, Java

apache-hive hive hive-partition hive-table hiveql java

Last synced: 19 Jun 2026

https://github.com/garystafford/presto-aws-federated-queries

Demonstration of Presto Federated Queries using Ahana PrestoDB Sandbox on AWS

ahana apache-hive aws data-analytics federated-query presto prestodb

Last synced: 27 Mar 2025

https://github.com/bayunova28/spotify_lyrics

This repository contains my personal project to generate MapReduce using Apache Hadoop

apache-derby apache-hadoop apache-hive hadoop-mapreduce mapreduce-python spotify

Last synced: 05 Apr 2025

https://github.com/exasol/hive-virtual-schema

Virtual Schema for connecting Hive as a data source to Exasol

apache-hive exasol exasol-integration hive virtual-schema

Last synced: 04 Jun 2026

https://github.com/mesmacosta/hive-table-metadata-generator

This script generates random metadata for the Hive metastore.

apache-hive bigdata datawarehouse metadata

Last synced: 04 Oct 2025

https://github.com/tashi-2004/apache-hadoop-spark-hive-cyberanalytics

This project utilizes Apache Hadoop, Hive, and PySpark to process and analyze the UNSW-NB15 dataset, enabling advanced query analysis, machine learning modeling, and visualization. The project demonstrates efficient data ingestion, processing, and predictive analytics for network security insights.

ai apache-hadoop apache-hive big-data-analytics big-data-processing data-analysis data-engineering data-science data-security data-visualization hdfs machine-learning network-analysis network-security pyspark python3 threat-detection unsw-nb15-dataset

Last synced: 02 May 2026

https://github.com/lefteris-souflas/apache-drill-and-impala

Explore data virtualization and query performance optimization with Apache Drill, Hive, and Impala. Tasks include comparing virtualization precision, proposing solutions for a bookstore's diverse data formats, creating Impala databases, and addressing query performance issues. The report offers practical insights and commands for implementation.

apache-drill apache-hive apache-hue apache-impala

Last synced: 04 Feb 2026

https://github.com/adbc-drivers/hiveserver2

Common infrastructure for ADBC drivers for the HiveServer2 protocol (Apache Hive, Apache Spark, and friends)

adbc apache-hive apache-spark csharp databases hive

Last synced: 30 Jan 2026

https://github.com/kshru9/hive-data-consumption-app

This application can be used to query a Hive data warehouse in a simplified manner

angularjs apache-hive docker-compose hiveql hiveserver2 html-css-javascript scheduler spring-boot typescript

Last synced: 15 Apr 2026

https://github.com/xtutran/hive-example

Apache Hive practices

apache-hive

Last synced: 11 Feb 2026

https://github.com/vaxdata22/nosql-and-big-data-demonstration

This is a fun assignment I undertook to explore the world of NoSQL and Big Data technologies.

apache-hive cassandra-cql cypher-query-language data-warehouse hadoop-hdfs json mongodb neo4j nosql-databases redis

Last synced: 13 Feb 2026

https://github.com/lifeislearningforever/wikipedia-crawler-hive

Production-ready Wikipedia crawler with PySpark and Apache Hive integration. Extracts article data and stores it in Hive with Parquet format and date partitioning.

apache-hive data-engineering data-pipeline parquet pyspark python web-scraping wikipedia

Last synced: 06 Jun 2026