hive-metastore

Here are 11 public repositories matching this topic...

naushadh / hive-metastore

Apache Hive Metastore as a Standalone server in Docker

docker spark presto trino hive-metastore localstack

Updated Aug 22, 2024
Python

thanhENC / e2e-data-platform

End-to-end data platform: a PoC data platform project built on a modern data stack (Spark, Airflow, DBT, Trino, Lightdash, Hive metastore, Minio, Postgres)

airflow spark docker-compose end-to-end data-platform dbt data-pipeline trino hive-metastore adventureworks delta-lake lightdash

Updated Oct 14, 2024
Python

harrydevforlife / building-lakehouse

Building a data lakehouse with open-source technology. Supports an end-to-end data pipeline, from source data on AWS S3 to the lakehouse, plus a visualization and recommendation app.

python airflow spark s3 metabase minio dbt flask-api hive-metastore delta-lake lakehouse

Updated Apr 20, 2024
Python

GoogleCloudPlatform / datacatalog-connectors-hive

Sample code for integrating Data Catalog with a Hive data source.

python hive analytics gcp data-warehouse metadata-management hive-metastore apache-atlas datacatalog

Updated Jan 29, 2025
Python

criccomini / pymetastore

A Python Client for Hive Metastore

python hive thrift data-engineering hcatalog hive-metastore

Updated Dec 19, 2023
Python

Narius2030 / Sakila-Lakehouse

A lakehouse-based data pipeline that uses the Sakila dataset to analyze movie sales and rental trends. The project is designed according to the Delta architecture.

spark-streaming apache-kafka real-time-analytics hive-metastore delta-lake lakehouse trino-dbt

Updated Mar 18, 2025
Python

aaliashraf / airflow-spark-hive-azure-docker-workflow

Foundation Workspace for Airflow, Spark, Hive, and Azure Data Lake Gen2 via Docker

python docker airflow spark apache-spark hive pyspark azure-storage apache-airflow hive-metastore bitnami-image azuredatalakegen2

Updated Mar 31, 2024
Python

pratikSethi / perf-ops

Performance Optimizations and Benchmarks for Distributed SQL Engines

spark presto s3 glue sparksql hive-metastore

Updated Feb 12, 2020
Python

sergio11 / genomic_data_storage_architecture

🧬 Genomic Data Storage Architecture: A proof of concept for securely managing and auditing massive genomic datasets by combining distributed storage 📂, event-driven microservices ⚡, and blockchain ⛓️ (or equivalent notarization) for tamper-proof, traceable, and scalable genomic data workflows.

Updated Sep 22, 2025
Python

tomkat-cr / data_lakehouse_local_stack

Data Lakehouse local stack with PySpark, Trino, and Minio. Includes an example to process Raygun error data and the IP address occurrence.

python spark hive docker-compose minio spark-sql trino hive-metastore minio-storage

Updated Jun 19, 2025
Python

hasancatalgol / iceflow-pipeline

Batch processing pipeline for BI and AI purposes

airflow spark iceberg trino hive-metastore minio-storage

Updated May 7, 2025
Python
