[bug fix] set finish before notifying load in LanceArrowWriter setFinished method #1985

xx789633 · 2025-11-16T04:34:26Z

Purpose

Linked issue: close #xxx

This commit is copied from lance-format/lance-spark#92.

I found that the Lance writer intermittently hangs. After some troubleshooting, I traced this to the LanceArrowWriter.setFinished method.

The original code appears to have a bug: it sets the finished flag only after notifying loadNextBatch, which can cause loadNextBatch to hang.

Root Cause

The ideal flow should be:

(thread 1) loadToken.release
(thread 1) finished = true
(thread 2) loadNextBatch
(thread 2) finished is true and count is 0, so it returns false

However, there is a chance it becomes:

(thread 1) loadToken.release
(thread 2) loadNextBatch
(thread 2) finished is still false, so it returns true and goes back to waiting
(thread 1) finished = true

If the second scenario occurs, thread 2 hangs indefinitely because no further notification will arrive. jstack shows stacks hanging in LanceDataWriter.commit.
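The ordering that avoids this can be sketched as follows. This is a simplified, hypothetical model, not the real LanceArrowWriter: the class name `HandoffSketch`, the one-row batches, and the omission of writer back-pressure are all assumptions made for illustration. Only the names `loadToken`, `finished`, `count`, `setFinished`, and `loadNextBatch` come from the description above.

```java
import java.util.concurrent.Semaphore;
import java.util.concurrent.atomic.AtomicInteger;

/** Hypothetical stand-in for the producer/consumer handoff; not the real LanceArrowWriter. */
class HandoffSketch {
    private final Semaphore loadToken = new Semaphore(0);
    private final AtomicInteger count = new AtomicInteger();
    private volatile boolean finished;

    /** Producer: buffer one row and wake the consumer (assumes a batch size of 1). */
    void write() {
        count.incrementAndGet();
        loadToken.release();
    }

    /** Producer: publish the flag first, then wake the consumer. */
    void setFinished() {
        finished = true;       // visible to any thread that acquires the permit below
        loadToken.release();
    }

    /** Consumer: true if a batch is available, false once the input is exhausted. */
    boolean loadNextBatch() {
        if (finished && count.get() == 0) {
            return false;
        }
        loadToken.acquireUninterruptibly();
        if (!finished) {
            count.set(0);
            return true;
        }
        // Finished: report a last batch only if rows are still buffered.
        return count.getAndSet(0) > 0;
    }

    public static void main(String[] args) throws InterruptedException {
        HandoffSketch s = new HandoffSketch();
        Thread consumer = new Thread(() -> {
            while (s.loadNextBatch()) {
                // a real consumer would flush the batch here
            }
        });
        consumer.start();
        for (int i = 0; i < 3; i++) {
            s.write();
        }
        s.setFinished();
        consumer.join();
        System.out.println("consumer exited");
    }
}
```

Because the flag is written before the permit is released, a consumer woken by the final permit always observes `finished == true`. With the two statements in `setFinished` swapped, the consumer could consume that permit, still read `false`, and then block on a permit that never arrives.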

Brief change log

Tests

API and Format

Documentation

Copilot

Pull Request Overview

This PR fixes a critical race condition bug in LanceArrowWriter.setFinished() that could cause loadNextBatch() to hang indefinitely. The fix reorders operations to set the finished flag before releasing the semaphore, ensuring proper thread synchronization.

Key Changes:

Reordered finished = true to execute before loadToken.release() in setFinished() method to prevent race condition
Added explicit false initialization to the finished field

Copilot · 2025-11-18T02:32:40Z

...ake/fluss-lake-lance/src/main/java/org/apache/fluss/lake/lance/tiering/LanceArrowWriter.java

    private final int batchSize;

-    private volatile boolean finished;
+    private volatile boolean finished = false;


[nitpick] The explicit initialization = false is redundant in Java. Boolean instance fields are automatically initialized to false by default. While this doesn't cause any issues, it's unnecessary and can be removed for cleaner code.

Suggested change

-    private volatile boolean finished = false;
+    private volatile boolean finished;
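The default-initialization point in the nitpick can be checked with a minimal sketch. The class and field names here are hypothetical and chosen only for illustration.

```java
/** Hypothetical demo class: both fields start out false. */
class DefaultInitSketch {
    private volatile boolean implicitDefault;          // no initializer, defaults to false
    private volatile boolean explicitDefault = false;  // same initial value, stated explicitly

    boolean implicitValue() { return implicitDefault; }

    boolean explicitValue() { return explicitDefault; }

    public static void main(String[] args) {
        DefaultInitSketch s = new DefaultInitSketch();
        // Both reads yield false, so the comparison prints true.
        System.out.println(s.implicitValue() == s.explicitValue());
    }
}
```

Either spelling behaves the same at runtime, so the choice is purely stylistic.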

luoyuxia

+1

fix: set finish before notifying load in LanceArrowWriter setFinished method

181b358

luoyuxia requested a review from Copilot November 18, 2025 02:24

Copilot started reviewing on behalf of luoyuxia November 18, 2025 02:25

Copilot finished reviewing on behalf of luoyuxia November 18, 2025 02:26

Copilot AI reviewed Nov 18, 2025

luoyuxia approved these changes Nov 18, 2025

luoyuxia merged commit 64971b1 into apache:main Nov 18, 2025
14 of 15 checks passed
