-
Notifications
You must be signed in to change notification settings - Fork 4.7k
start running the access tokens and huge services test #17671
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: master
Are you sure you want to change the base?
Conversation
|
Scenario is shared by both AWS and GCP. |
|
I can scope the change to just GCE, but let's benchmark the presubmits. /test presubmit-kops-aws-scale-amazonvpc-using-cl2 |
I don't have anything against enabling those, just would be nice to be all on the same page. |
|
/retest |
|
/test presubmit-kops-aws-scale-amazonvpc-using-cl2 |
|
Something isn't right, the test suite shrunk |
|
/test presubmit-kops-aws-scale-amazonvpc-using-cl2 |
1 similar comment
|
/test presubmit-kops-aws-scale-amazonvpc-using-cl2 |
1933886 to
cab9990
Compare
|
/test presubmit-kops-gce-small-scale-ipalias-using-cl2 |
cab9990 to
82d80c9
Compare
|
/test presubmit-kops-gce-small-scale-ipalias-using-cl2 |
82d80c9 to
cb4e4e6
Compare
|
/test presubmit-kops-gce-small-scale-ipalias-using-cl2 |
|
/test presubmit-kops-aws-small-scale-using-cl2 |
|
/test presubmit-kops-aws-small-scale-amazonvpc-using-cl2 |
cb4e4e6 to
70506ac
Compare
08f6cf3 to
480a3c0
Compare
3756f25 to
f5468c8
Compare
f5468c8 to
1f31040
Compare
| - args: | ||
| - --v=2 | ||
| - --conf=/etc/kubernetes/kops-controller/config/config.yaml | ||
| - --log_file=/var/log/kops-controller.log |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
found this
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
A blast from the past :-)
|
/test presubmit-kops-gce-scale-ipalias-using-cl2 |
|
@upodroid, looks like the scale jobs are failing due to latency. Should we also test DRA the config in this pre-submit job? https://prow.k8s.io/view/gs/kubernetes-ci-logs/pr-logs/pull/kops/17671/presubmit-kops-gce-scale-ipalias-using-cl2/1980346550060060672 |
|
We don't have a DRA presubmit against this repo. Also, I need this merged first, then we can soak it and then address the other performance issues. |
| d.GCPProject = resource.Name | ||
| klog.V(1).Infof("Got project %s from boskos", d.GCPProject) | ||
|
|
||
| if os.Getenv("SCALE_SCENARIO") == "performance" { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Not required after kubernetes/perf-tests#3653
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
|
/test presubmit-kops-gce-scale-ipalias-using-cl2 |
04f620d to
6ebf4a0
Compare
|
/test presubmit-kops-gce-scale-ipalias-using-cl2 |
| } | ||
|
|
||
| b.addLogRotate(c, "docker", "/var/log/docker.log", logRotateOptions{}) | ||
| b.addLogRotate(c, "kops-controller", "/var/log/kops-controller.log", logRotateOptions{}) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think kops-controller is "just a pod" and doesn't write logs here. But it is relatively important, so I think we could make the case for doing so...
| d.GCPProject = resource.Name | ||
| klog.V(1).Infof("Got project %s from boskos", d.GCPProject) | ||
|
|
||
| if os.Getenv("SCALE_SCENARIO") == "performance" { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
| if version > "1.29" { | ||
| // Requires https://github.com/kubernetes/kops/pull/16128 | ||
| args = append(args, "--set", `spec.containerd.configAdditions=plugins."io.containerd.grpc.v1.cri".containerd.runtimes.test-handler.runtime_type=io.containerd.runc.v2`) | ||
| "--set", `spec.containerd.configAdditions=plugins."io.containerd.grpc.v1.cri".containerd.runtimes.test-handler.runtime_type=io.containerd.runc.v2`, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This looks like a cleanup for handling of old kube versions that we no longer test? (I.e. could be split into its own PR, though that would be a very short PR!)
|
|
||
| exec.InheritOutput(cmd) | ||
| err = cmd.Run() | ||
| err := cmd.Run() |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
👍
I like if err := cmd.Run(); err != nil { because then you don't influence the next single-error return. But this is an improvement so 👍
| - args: | ||
| - --v=2 | ||
| - --conf=/etc/kubernetes/kops-controller/config/config.yaml | ||
| - --log_file=/var/log/kops-controller.log |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
A blast from the past :-)
| requests: | ||
| cpu: 50m | ||
| memory: 50Mi | ||
| securityContext: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think there's a case that kops-controller is important so should have a log in /var/log, but the components that are there today are the ones that we can't get with kubectl logs (when they fail). Is there e.g. a perf issue with getting kops-controller logs from kubectl logs (or some other reason)? Because now we will have to deal with the scanners flagging this one as running as root...
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There isn't a performance issue and assuming the control plane is bootstrapped, we can see the logs from kops toolbox dump.
Happy to remove it if it's too tricky
| @@ -0,0 +1 @@ | |||
| defaultBaseImage: gcr.io/distroless/static-debian12 | |||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
👍 to introducing this file so we can more easily configure some of these ko options, for example I realized we aren't passing the Version properly to ko
| if err != nil { | ||
| return err | ||
| } | ||
| b.AddFirewallRulesTasks(c, "ssh-external-to-master", &gcetasks.FirewallRule{ |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
So this hits the "main code path" of kOps. Can we SSH through the bastion? Or if not, can we somehow make this not change the configuration for "everyone else" - e.g. with a feature flag or by adding something in the cluster or instancegroup? (The feature flag is normally easiest)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This specific change won't be in this PR, just need the patch to get the scale tests passing.
It is being worked in a different PR
6ebf4a0 to
69116e7
Compare
69116e7 to
11f922b
Compare
|
/test presubmit-kops-gce-scale-ipalias-using-cl2 |
|
@upodroid: The following tests failed, say
Full PR test history. Your PR dashboard. Please help us cut down on flakes by linking to an open issue when you hit one in your PR. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here. |
|
/test presubmit-kops-gce-scale-ipalias-using-cl2 |
We need to run the access tokens and hugepage services perf tests for scale jobs to achieve parity with the current GCE 5k test.
https://github.com/kubernetes/test-infra/blob/7c5dc4ca148503409199df1945856845730b94bb/config/jobs/kubernetes/sig-scalability/sig-scalability-release-blocking-jobs.yaml#L162