Projects in Awesome Lists tagged with apache-hive
A curated list of projects in awesome lists tagged with apache-hive.
https://github.com/nielsbasjes/yauaa
Yet Another UserAgent Analyzer
analyzer apache-beam apache-flink apache-hive client-hints flink hive java nifi-processor nifi-processors parse snowflake snowplow snowplowanalytics trino-plugin user-agent user-agent-analysis user-agent-parser useragent-parser useragentparser
Last synced: 16 Jan 2026
https://github.com/hadifadl/hive-cmd
All cmd files needed to run Hive on Windows (taken from https://svn.apache.org/repos/asf/hive/trunk/bin/)
Last synced: 13 Jul 2025
https://github.com/ahmetfurkandemir/minio-hive-example
Kubernetes Hive MinIO connection example
apache-hive hadoop hive hive-metastore hive-server k8s kubernetes kubernetes-cluster kubernetes-deployment minio postgresql s3 s3-bucket
Last synced: 12 Jun 2025
https://github.com/mvrpl/hiveapi
Hive Rest API written in Java
apache-hive json json-api rest-api
Last synced: 04 May 2026
https://github.com/tspannhw/operations-dashboard
Apache NiFi operations-dashboard
apache-hive apache-nifi big-data hortonworks microservices spring-boot tables
Last synced: 11 Jun 2025
https://github.com/ren294/smarttraffic_lakehouse_for_hcmc
A Smart Traffic Management System for Ho Chi Minh City, Vietnam leveraging batch and real-time data processing, intuitive dashboards, and monitoring tools to optimize traffic flow, enhance safety, and support sustainable urban mobility through advanced analytics and user-friendly applications.
apache-airflow apache-flink apache-hive apache-hudi apache-kafka apache-nifi apache-spark apache-superset apache-zookeeper big-data debezium grafana lakefs metabase minio promotheus redis seatunnel streamlit trino
Last synced: 11 Apr 2025
https://github.com/data-integrations/hive-plugins
Hive Export/Import plugins
apache-hive cask-marketplace cdap cdap-plugin hive hive-export
Last synced: 16 Mar 2026
https://github.com/nuttymoon/nifi-hive3streaming-fixed
A NiFi bundle containing a stable implementation of the PutHive3Streaming processor
apache-hive apache-nifi hive-streaming nifi-processor
Last synced: 31 Mar 2025
https://github.com/narius2030/hive-datawarehouse-analysis
Implement a Hive data warehouse to store meaningful data, apply Machine Learning like Clustering or Regression for dealing with business problems
apache-hadoop apache-hive data-analysis etl-pipeline hiveql machine-learning statistics
Last synced: 01 Apr 2025
https://github.com/yingzhuo/logback-flume-appender
logback appender for apache-flume
apache-flume apache-hadoop apache-hive flume logback logback-appender logback-flume-appender slf4j
Last synced: 16 Mar 2025
https://github.com/tspannhw/table-ddl
DDL for Kudu, Impala, Phoenix, HBase, Hive, MySQL, PostgreSQL, Calcite, ... Tables. SQL.
apache-hbase apache-hive apache-kudu apache-phoenix apache-sparksql cloudera ddl hue nifi sql
Last synced: 19 Mar 2026
https://github.com/yingzhuo/hive-my-functions
My apache-hive User Defined Functions.
Last synced: 05 Aug 2025
https://github.com/mikeacosta/san-francisco-crime
SF crime data analysis with Apache Spark
apache-hive apache-spark hadoop hdfs hortonworks
Last synced: 17 May 2026
https://github.com/Narius2030/Hive-DataWarehouse-Analysis
Implement a Hive data warehouse to store meaningful data, apply Machine Learning like Clustering or Regression for dealing with business problems
apache-hadoop apache-hive data-analysis etl-pipeline hiveql machine-learning statistics
Last synced: 12 Aug 2025
https://github.com/durlachert/delta-lake-optimization
BA 2
apache-hive apache-spark big-data delta-lake hdfs jupyter-notebook
Last synced: 25 Jan 2026
https://github.com/cevheri/hive-java-example
Hive Query Language example with Apache Hive, Apache Hadoop, Java
apache-hive hive hive-partition hive-table hiveql java
Last synced: 19 Jun 2026
https://github.com/garystafford/presto-aws-federated-queries
Demonstration of Presto Federated Queries using Ahana PrestoDB Sandbox on AWS
ahana apache-hive aws data-analytics federated-query presto prestodb
Last synced: 27 Mar 2025
https://github.com/bayunova28/spotify_lyrics
This repository contains my personal project to generate MapReduce using Apache Hadoop
apache-derby apache-hadoop apache-hive hadoop-mapreduce mapreduce-python spotify
Last synced: 05 Apr 2025
https://github.com/exasol/hive-virtual-schema
Virtual Schema for connecting Hive as a data source to Exasol
apache-hive exasol exasol-integration hive virtual-schema
Last synced: 04 Jun 2026
https://github.com/mesmacosta/hive-table-metadata-generator
This script generates random metadata for the Hive metastore.
apache-hive bigdata datawarehouse metadata
Last synced: 04 Oct 2025
https://github.com/tashi-2004/apache-hadoop-spark-hive-cyberanalytics
This project utilizes Apache Hadoop, Hive, and PySpark to process and analyze the UNSW-NB15 dataset, enabling advanced query analysis, machine learning modeling, and visualization. The project demonstrates efficient data ingestion, processing, and predictive analytics for network security insights.
ai apache-hadoop apache-hive big-data-analytics big-data-processing data-analysis data-engineering data-science data-security data-visualization hdfs machine-learning network-analysis network-security pyspark python3 threat-detection unsw-nb15-dataset
Last synced: 02 May 2026
https://github.com/lefteris-souflas/apache-drill-and-impala
Explore data virtualization and query performance optimization with Apache Drill, Hive, and Impala. Tasks include comparing virtualization precision, proposing solutions for a bookstore's diverse data formats, creating Impala databases, and addressing query performance issues. The report offers practical insights and commands for implementation.
apache-drill apache-hive apache-hue apache-impala
Last synced: 04 Feb 2026
https://github.com/adbc-drivers/hiveserver2
Common infrastructure for ADBC drivers for the HiveServer2 protocol (Apache Hive, Apache Spark, and friends)
adbc apache-hive apache-spark csharp databases hive
Last synced: 30 Jan 2026
https://github.com/kshru9/hive-data-consumption-app
This application can be used to query a Hive data warehouse in a simplified manner
angularjs apache-hive docker-compose hiveql hiveserver2 html-css-javascript scheduler spring-boot typescript
Last synced: 15 Apr 2026
https://github.com/vaxdata22/nosql-and-big-data-demonstration
This is a fun assignment task I undertook to explore the world of NoSQL and Big Data technologies.
apache-hive cassandra-cql cypher-query-language data-warehouse hadoop-hdfs json mongodb neo4j nosql-databases redis
Last synced: 13 Feb 2026
https://github.com/cevheri/hive-query-language-example
Apache Hive Query Language example
apache-hive hive hive-partition hive-table hiveql
Last synced: 05 Mar 2026
https://github.com/lifeislearningforever/wikipedia-crawler-hive
Production-ready Wikipedia crawler with PySpark and Apache Hive integration. Extracts article data and stores it in Hive with Parquet format and date partitioning.
apache-hive data-engineering data-pipeline parquet pyspark python web-scraping wikipedia
Last synced: 06 Jun 2026