[spark] Add union read for lake-enabled log tables#2956
[spark] Add union read for lake-enabled log tables#2956fresh-borzoni wants to merge 3 commits intoapache:mainfrom
Conversation
|
@fresh-borzoni thank you for the patch, I create an issue to track it #2983. |
I will add spark sql support for union read pk table, #2984 |
|
Thanks @Yohahaha! Just a heads up, this PR already implements batch union read for log tables, so #2983 should be covered once it's merged. Regarding PK table union read (#2984),I was planning to follow up with that as noted in the PR description. |
yeah, you could add "closes #2983" in the PR description so that the corresponding issue can be properly linked and closed, like other PRs. when fluss release a new version, RM can easily collect the features of version scope. |
@fresh-borzoni I was planing to implementing it over the next two weeks. Do you already have a draft PR? |
Yohahaha
left a comment
There was a problem hiding this comment.
left some comments, thank you!
fluss-spark/fluss-spark-common/src/main/scala/org/apache/fluss/spark/read/FlussLakeBatch.scala
Outdated
Show resolved
Hide resolved
fluss-spark/fluss-spark-common/src/main/scala/org/apache/fluss/spark/read/FlussLakeBatch.scala
Show resolved
Hide resolved
...luss-spark-ut/src/test/scala/org/apache/fluss/spark/lake/SparkLakeLogTableReadTestBase.scala
Outdated
Show resolved
Hide resolved
...luss-spark-ut/src/test/scala/org/apache/fluss/spark/lake/SparkLakeLogTableReadTestBase.scala
Outdated
Show resolved
Hide resolved
37d1a7d to
9682f4e
Compare
9682f4e to
281d734
Compare
|
@Yohahaha Ty for the review, |
Summary
closes #2983
Adds batch read for lake-enabled log tables. When a table has datalake enabled, reads combine lake storage (Paimon/Iceberg) with Fluss log tail. Lake and log are planned as separate Spark partition, lake tasks read from lake storage without Fluss connections, log tail tasks reuse the existing reader. Falls back to pure log reads when no snapshot exists. Only enabled in FULL startup mode.
Tests cover both Paimon and Iceberg.
Follow-up PRs