DockerIM should use operation mechanism properly #498

0405ysj · 2025-11-10T02:21:55Z

Context: b/453876231

k311093 · 2025-11-10T04:42:40Z

pkg/app/instances/docker.go

+		return nil, fmt.Errorf("operation not found for %q", name)
+	}
+	entry := val.(*operationEntry)
+	ctx, cancel := context.WithTimeout(context.TODO(), 3*time.Minute)


Can context.TODO() be context.Background() ?

k311093 · 2025-11-10T04:45:21Z

pkg/app/instances/docker.go

+			return entry.(*operationEntry).op
+		}
+	}
+	panic("Reached newOperationRetryLimit")


Shoud this be panic? looks like it stops service if newOperation is failed, how about just logging?

I think it's okay to be panic, as retry count increments when uuid conflict happens. I believe it's very close to 0% with large retry couny, even less than sha256 hash conflict.

Leave a comment explaining that the chances of hitting this are practically zero and not worth it of changing the API to return an error and that in the unlikely scenario this triggers a panic is better than running forever or returning an invalid/nil operation.

ser-io · 2025-11-10T16:35:06Z

pkg/client/client_test.go

+		case "POST /operations/deletingbar/:wait":
+			writeOK(w, apiv1.HostInstance{Name: "bar"})
+		case "POST /operations/deletingbaz/:wait":
+			writeOK(w, apiv1.HostInstance{Name: "baz"})


waiting for a delete host operation should return an empty response: https://github.com/google/cloud-android-orchestration/blob/main/pkg/app/instances/gce.go#L344. Please fix the DockerIM to return an empty response first.

ser-io · 2025-11-10T16:36:18Z

pkg/client/client.go

+				return
+			}
+			ins := &apiv1.HostInstance{}
+			if err := c.waitForOperation(&op, ins); err != nil {


waiting for a delete host operation should return an empty response: https://github.com/google/cloud-android-orchestration/blob/main/pkg/app/instances/gce.go#L344.

create a new helper: waitForOperation() where you don't have to pass a response argument, after fixing the DockerIM implementation.

ser-io · 2025-11-10T16:48:24Z

pkg/app/instances/docker.go

-	mutexes sync.Map
+	Config     Config
+	Client     *client.Client
+	mutexes    sync.Map


The CO IM is designed to be stateless.

Why do you need to add this new state to the Docker IM implementation? What are the actual operations you need this for? You should be relying on Docker Engine to query whether a container was created, deleted and so on, to track such operations.

It was motivated from taking long(> 1min) times to reply REST API, such as POST /hosts and DELETE /hosts.

The CO IM is designed to be stateless.

Then this PR should be definitely my fault.. How should I get such information? I think these circumstances are repeated, and I wish to retrieve how we manage those before working on it.

0405ysj added 3 commits November 10, 2025 10:50

DeleteHosts should wait for the operation result

7237685

DockerIM should use operation mechanism properly

ab60bf1

Ensure docker volume deletion on DockerIM of CO

4cfeb36

0405ysj force-pushed the operation branch from 17c964d to 4cfeb36 Compare November 10, 2025 03:45

0405ysj marked this pull request as ready for review November 10, 2025 03:53

0405ysj requested review from Databean, adelva1984, jemoreira, jmacnak, rmuthiah and ser-io as code owners November 10, 2025 03:53

0405ysj requested review from ikicha and k311093 and removed request for Databean, adelva1984, jmacnak and rmuthiah November 10, 2025 03:53

k311093 approved these changes Nov 10, 2025

View reviewed changes

ser-io requested changes Nov 10, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

DockerIM should use operation mechanism properly #498

DockerIM should use operation mechanism properly #498

Uh oh!

0405ysj commented Nov 10, 2025

Uh oh!

k311093 Nov 10, 2025

Uh oh!

k311093 Nov 10, 2025

Uh oh!

0405ysj Nov 10, 2025

Uh oh!

jemoreira Nov 10, 2025

Uh oh!

ser-io Nov 10, 2025

Uh oh!

ser-io Nov 10, 2025

Uh oh!

ser-io Nov 10, 2025 •

edited

Loading

Uh oh!

0405ysj Nov 12, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

DockerIM should use operation mechanism properly #498

Are you sure you want to change the base?

DockerIM should use operation mechanism properly #498

Uh oh!

Conversation

0405ysj commented Nov 10, 2025

Uh oh!

k311093 Nov 10, 2025

Choose a reason for hiding this comment

Uh oh!

k311093 Nov 10, 2025

Choose a reason for hiding this comment

Uh oh!

0405ysj Nov 10, 2025

Choose a reason for hiding this comment

Uh oh!

jemoreira Nov 10, 2025

Choose a reason for hiding this comment

Uh oh!

ser-io Nov 10, 2025

Choose a reason for hiding this comment

Uh oh!

ser-io Nov 10, 2025

Choose a reason for hiding this comment

Uh oh!

ser-io Nov 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

0405ysj Nov 12, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

ser-io Nov 10, 2025 •

edited

Loading