Skip to content

[ISSUE] Unable to use clusterBy (liquid clustering) with JAVA api #428

Open
@subamdebnath739

Description

@subamdebnath739

Description
Hi Team, we are trying to use liquid clustering for delta table. In pyspark clusterBy api is used but in java this api does not exist. Please help us if there is a different api for delta table liquid clustering with java.

Reproduction
Currently we are using partitionBy api with following code while creating delta table.
session.createDataFrame(new ArrayList<>(), schema)
.write()
.format(delta)
.partitionBy("partition column")
.save(deltaTable_location)
We are trying to move to liquid clustering using clusterBy api inplace of partitionBy.

Expected behavior
Expected behaviour is to create delta table with liquid clustering.

Is it a regression?
No

Debug Logs
The SDK logs helpful debugging information when debug logging is enabled. Set the log level to TRACE (e.g. add log4j.logger.com.databricks=TRACE to the log4j.properties in your project), and include the logs here.

Other Information

  • OS: [e.g. macOS]
  • Version: [e.g. 0.1.0]

Additional context
Add any other context about the problem here.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions