GithubHelp home page GithubHelp logo

Feature Enhancement: Enable Presto Server to Transmit Catalog Name for Enhanced Functionality in the Metastore Layer about presto HOT 2 OPEN

AnuragKDwivedi avatar AnuragKDwivedi commented on June 27, 2024
Feature Enhancement: Enable Presto Server to Transmit Catalog Name for Enhanced Functionality in the Metastore Layer

from presto.

Comments (2)

tdcmeehan avatar tdcmeehan commented on June 27, 2024

Can you more concretely describe how the metastore would actually use the Presto catalog name? Is this for some sort of fork of the Hive metastore?

from presto.

AnuragKDwivedi avatar AnuragKDwivedi commented on June 27, 2024

Lakehouses have 3 part names for table objects.
<catalog-name>.<schema-name>.<table-name>.
Extending this concept to other objects <catalog-name>.<schema-name>.<object-name>.
Currently metastores like HMS are limited to store only a two part name <schema-name>.<table-name> based on the assumption that it will store metadata about only one catalog. There are limitations to this design:
This limitation forces SQL engines like Presto & Spark to store the 3rd part of the name <catalog-name> external to the metastore which is not ideal place for metadata.
This limitation of HMS also introduces another problem that each SQL engine that accesses an object like an iceberg table can name the catalog differently, which will makes it difficult to identify a table object uniquely across different engines.
This also makes HMS single tenant (one catalog per metastore).
Drawing an analogy to traditional databases world, a single instance can hold multiple databases with schemas and table objects dbname.schemaname.tablename
To solve the above limitation, to make Presto engines work with multi catalog metastores that support HMS-like protocol, there needs to be a way to pass the catalog to the external metastore which this PR addresses.

Even though it will transmit the catalog name to the traditional HMS metastore, the traditional HMS metastore will ignore it. The catalog name won't be retained in the metastore, ensuring there's no regression or alteration in behavior, thereby maintaining full backward compatibility.

from presto.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.