
Spark: Avoid closing deserialized copies of shared resources like FileIO #12868


Open · wants to merge 1 commit into main from fix-close-on-executor

Conversation

@xiaoxuandev (Contributor) commented Apr 22, 2025

This change prevents close() from being called on FileIO during cleanup of Spark's broadcast variable, i.e., when memoryStore.remove(blockId) is invoked. Closing a deserialized FileIO can unintentionally shut down shared resources, such as a shared connection pool, when S3FileIO is backed by the Apache HttpClient: calling close() triggers shutdown of the HttpClientConnectionManager, leading to request failures in other instances that are still in use.
Fixes: #12858, #12046
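The ownership problem the description raises can be illustrated with a small sketch (class and method names here are hypothetical, not Iceberg's actual implementation): a transient flag is true only on the original instance that owns the shared client, so a copy that round-trips through Java serialization, as Spark's broadcast mechanism does, comes back with the flag reset and can skip the real shutdown in close().

```java
import java.io.*;

// Hypothetical sketch: SharedClientHolder stands in for a FileIO-like
// resource whose deserialized copies must not tear down the shared pool.
class SharedClientHolder implements Serializable, Closeable {
    private static final long serialVersionUID = 1L;

    // transient: not written to the stream, so it is false (the default)
    // on every deserialized copy.
    private transient boolean ownsClient;

    SharedClientHolder() {
        this.ownsClient = true; // the original, driver-side instance owns the client
    }

    boolean ownsClient() {
        return ownsClient;
    }

    @Override
    public void close() {
        if (ownsClient) {
            // Only the owning instance would shut down the connection pool here;
            // executor-side copies fall through and leave the pool alone.
        }
    }
}

public class Demo {
    public static void main(String[] args) throws Exception {
        SharedClientHolder original = new SharedClientHolder();

        // Round-trip through Java serialization, as broadcast does.
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (ObjectOutputStream out = new ObjectOutputStream(bytes)) {
            out.writeObject(original);
        }
        SharedClientHolder copy;
        try (ObjectInputStream in =
                 new ObjectInputStream(new ByteArrayInputStream(bytes.toByteArray()))) {
            copy = (SharedClientHolder) in.readObject();
        }

        System.out.println(original.ownsClient()); // true
        System.out.println(copy.ownsClient());     // false: transient flag reset
    }
}
```

The same effect can also be achieved with a readObject/readResolve hook; the transient default is simply the least code.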

@xiaoxuandev force-pushed the fix-close-on-executor branch from 035f3b8 to 4daa941 on April 22, 2025 at 19:15
@bk-mz left a comment

👍

Comment on lines -68 to +69:

```diff
-    LOG.info("Releasing resources");
-    io().close();
+    LOG.info("Executor-side cleanup: closing deserialized table resources");
```
Contributor

Is there a way to ensure that the io, and hence the pool, is eventually closed?

Contributor (Author)

Thanks for the review! Based on code inspection and some local debugging, Iceberg doesn't explicitly call close() on most FileIO instances (i.e., the regular, non-deserialized ones). The exception is S3FileIO, which overrides finalize() to invoke close() during garbage collection, and this applies to deserialized copies as well.
Given that the underlying connection pool may be shared, we might want to consider removing finalize() from S3FileIO to avoid unintended side effects during garbage collection. cc: @rdblue @aokolnychyi

Side note: finalize() was deprecated in Java 9 due to potential performance issues, deadlocks, and unpredictable behavior during GC.
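If finalize() were removed, the JDK's recommended replacement is java.lang.ref.Cleaner. The sketch below (class names are illustrative, not code from this PR) shows the pattern: close() releases the resource deterministically, and the registered cleanup action only runs as a GC-time fallback.

```java
import java.lang.ref.Cleaner;

// Sketch only: Cleaner-based cleanup instead of overriding finalize().
class PooledClient implements AutoCloseable {
    private static final Cleaner CLEANER = Cleaner.create();

    // The cleanup action lives in a static nested class so it cannot hold a
    // reference back to PooledClient, which would prevent collection.
    static final class PoolState implements Runnable {
        volatile boolean open = true;

        @Override
        public void run() {
            // Stand-in for shutting down the shared connection pool.
            open = false;
        }
    }

    private final PoolState state = new PoolState();
    private final Cleaner.Cleanable cleanable = CLEANER.register(this, state);

    boolean isOpen() {
        return state.open;
    }

    @Override
    public void close() {
        // Deterministic release; Cleanable.clean() runs the action at most once,
        // so a later GC-triggered invocation is a no-op.
        cleanable.clean();
    }
}

public class CleanerDemo {
    public static void main(String[] args) {
        PooledClient client = new PooledClient();
        System.out.println(client.isOpen()); // true
        client.close();
        System.out.println(client.isOpen()); // false
    }
}
```

Note that a Cleaner alone would not fix this issue: a deserialized copy registered with a Cleaner would still release the shared pool when collected, so the ownership check from this change remains necessary.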

@mgmarino
Thanks, @xiaoxuandev. This looks to work similarly to the PR that I opened to fix this issue (#12129), but I was unable to come up with tests there. Would love to get this in.

Successfully merging this pull request may close these issues.

[SparkMicroBatchStream] Executors prematurely close I/O client during Spark broadcast cleanup