[VL] Allow to reuse local SSD cache on Spark context restart #8891

zhouyuan · 2025-03-04T08:36:55Z

Description

Currently local SSD cache will be discarded after Spark context shutdown. It would better to make it reusable with a new optional config

jackylee-ch · 2025-03-04T08:50:13Z

Curious about the scenarios where reuse would be necessary. In a cluster environment, if a Spark Application is regenerated, the executors will be re-allocated. The cache on the executors may not align, and even if there is cache, the hit rate might not be high.

zhouyuan · 2025-03-04T08:54:42Z

@jackylee-ch Thanks, I was told the soft cache affinity can help to alleviate this issue, but I'm trying to find more resource verify
https://github.com/apache/incubator-gluten/blob/main/shims/common/src/main/scala/org/apache/gluten/config/GlutenConfig.scala#L691

jackylee-ch · 2025-03-04T09:46:32Z

I was told the soft cache affinity can help to alleviate this issue, but I'm trying to find more resource verify

AFAIK, the cache involved in duplicateReading will become invalid after Spark restarts, unless the cache can be reused and the executor rescheduling issue is resolved. Nevertheless, the pr is still helpful for cache reuse within the same application.

FelixYBW · 2025-03-04T18:25:44Z

@jackylee-ch It's more for benchmark testing actually. So the warm run in second test can get 100% hit.
You are right in production the local ssd cache may be used in the same query but hard to in second query.

zhouyuan · 2025-03-05T08:58:16Z

For single instance based tests it should be still useful.

zhouyuan added the enhancement New feature or request label Mar 4, 2025

zhouyuan linked a pull request Mar 4, 2025 that will close this issue

[GLUTEN-8891][VL] Allow to reuse local SSD cache on Spark context restart #8892

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[VL] Allow to reuse local SSD cache on Spark context restart #8891

[VL] Allow to reuse local SSD cache on Spark context restart #8891

zhouyuan commented Mar 4, 2025

jackylee-ch commented Mar 4, 2025

zhouyuan commented Mar 4, 2025

jackylee-ch commented Mar 4, 2025

FelixYBW commented Mar 4, 2025

zhouyuan commented Mar 5, 2025

[VL] Allow to reuse local SSD cache on Spark context restart #8891

[VL] Allow to reuse local SSD cache on Spark context restart #8891

Comments

zhouyuan commented Mar 4, 2025

Description

jackylee-ch commented Mar 4, 2025

zhouyuan commented Mar 4, 2025

jackylee-ch commented Mar 4, 2025

FelixYBW commented Mar 4, 2025

zhouyuan commented Mar 5, 2025