Lakehouse Glossary

A unified source of essential lakehouse-related knowledge.

Open source software projects: Apache Hudi / Hudi; Apache Kafka / Kafka; Apache Parquet / Parquet; Avro

Data architecture terms: data warehouse, data lake, data lakehouse; medallion architecture; ingest; query processing; metadata

Hudi capabilities / data integration: ACID transactions; copy-on-write; merge-on-read; change data capture; streaming data; incremental update; ETL, ELT

AI/ML and data: AI; machine learning; generative AI; retrieval-augmented generation (RAG); vector embeddings; generative AI; vector database; embedding models; Euclidean distance; similarity search

Filters

Clear All
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.
no-search-result

No result found.