Skip to content

[FEATURE] Allow cancelling of active long running tasks: experiment execution, judgment rating generation #265

@martin-gaievski

Description

@martin-gaievski

Is your feature request related to a problem?

If user needs to interrupt active task, it's not possible in SWR, only option is either to wait task completion or restarting the cluster. Both are not great options and definitely not good user experience

What solution would you like?

Any type of control that allows me to interrupt such active task. On a backend we can interrupt all active threads and cancel all queued threads. Partial results can be deleted as part of the cancellation process as they are probably not needed in such scenario.

What alternatives have you considered?

For local cluster I can just restart it. Simple deletion of the task from UI doesn't do anything as work has been scheduled as async long running task with multiple working threads.

Do you have any additional context?

I bumped into such problem with LLM generated ratings and hybrid optimizer experiment. Example scenario:

  • I started rating generation with LLM, for large dataset and number of ratings this is long running process
  • after some time I started hybrid optimizer experiment that uses those LLM ratings. This is possible because document record for judgment ratings has been created right away. Note that actual rating generation is running and all ratingS are empty/0.0
  • At this point I realize hybrid optimizer experiment results are not useful because of incomplete ratings, all metric will be 0.0

Problem: I cannot cancel hybrid optimizer experiment even if I know this is throw away work. I had to wait 15mins or restart cluster.

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type

    Projects

    Status

    🆕 New

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions