anjijava16/Modern_Data_lakes

Modern_Data_lakes

  1. Dremio
  2. Starburst: the commercial distribution of the open-source Trino (formerly PrestoSQL). Trino is a distributed SQL query engine that lets users run analytics on data lakes. It is designed primarily for interactive use cases such as ad-hoc workloads.
  3. Denodo
  4. Amazon Athena

Table Format

  1. Apache Iceberg (Spark query docs: https://iceberg.apache.org/docs/latest/spark-queries/)

  2. Delta Lake (Databricks)

  3. Apache Hudi

  4. Dremio (a query engine rather than a table format; it can query Iceberg tables)
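As a rough illustration of what the Iceberg Spark query docs linked above cover, the sketch below builds Spark SQL strings for a plain read, a time-travel read, and the `snapshots` metadata table. The helper names (`iceberg_select`, `iceberg_snapshots`) and the table name `demo.db.events` are hypothetical and not part of this repo. Running the queries would need a Spark session configured with an Iceberg catalog, which is assumed here and not shown.

```python
from datetime import datetime
from typing import Optional


def iceberg_select(table: str, *, as_of: Optional[datetime] = None,
                   snapshot_id: Optional[int] = None) -> str:
    """Build a Spark SQL SELECT for an Iceberg table (hypothetical helper).

    With no extra arguments this reads the current table state; as_of or
    snapshot_id add Iceberg's time-travel clauses for Spark SQL.
    """
    if as_of is not None and snapshot_id is not None:
        raise ValueError("pass either as_of or snapshot_id, not both")
    query = f"SELECT * FROM {table}"
    if as_of is not None:
        # Read the table as it was at a point in time.
        query += f" TIMESTAMP AS OF '{as_of:%Y-%m-%d %H:%M:%S}'"
    elif snapshot_id is not None:
        # Read one specific snapshot by its id.
        query += f" VERSION AS OF {snapshot_id}"
    return query


def iceberg_snapshots(table: str) -> str:
    """Query the snapshots metadata table, which lists the table's snapshots."""
    return f"SELECT * FROM {table}.snapshots"


if __name__ == "__main__":
    table = "demo.db.events"  # hypothetical catalog.database.table name
    print(iceberg_select(table))
    print(iceberg_select(table, as_of=datetime(2024, 1, 1)))
    print(iceberg_snapshots(table))
    # With a configured session, each string could be passed to spark.sql(...).
```

Keeping the query text separate from the Spark session makes the strings easy to inspect before anything is run against a real catalog.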

Before and After

https://medium.com/airbnb-engineering/upgrading-data-warehouse-infrastructure-at-airbnb-a4e18f09b6d5
