Skip to content

Job control to continue, restart, cancel distributed jobs #47

@agocorona

Description

@agocorona

A jobcontrol primitive that can get Cloud computations as parameter and let the user decide what to do when an non-handled exception happens: either continue after the problem is fixed, using the log facility, restart anew or cancel the job. That control will work regardless of the node where currently it is executing.

For example A program would invoke some distributed facility that is not running/not installed. The user can retry and if it continue failing he can check if it is down, has a failure or is not installed. He can stop the computation, install it and resume execution without re-execting possibly heavy tasks already done at that point.

Messages to the user will appear in the console of the node that initiated the cloud computation and will be managed with console primitives like option and input

This comes from cloudshell

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions