-
Notifications
You must be signed in to change notification settings - Fork 312
ADR about security-responder #9006
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: master
Are you sure you want to change the base?
Conversation
|
This looks good to me 👍 |
3ec4545 to
e0892fd
Compare
|
Ok for me I would add that there should be a possibility in Rancher / Rancher Prime deployment context to setup for the data to be collected by Rancher (with Rancher acting as "proxy" to needed data) |
|
@pgonin We'll have EIO stand up another RMS/Prime gathering similar data about systems under management is a very different discussion, let's have that elsewhere please. |
|
Agreed, it should be a second step. To be discussed together with Rancher actually |
docs/adrs/010-security-responder.md
Outdated
| - nodeCount | ||
| - serverNodeCount | ||
| - agentNodeCount |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nit: nodeCount = serverNodeCount + agentNodeCount. A separate field seems unnecessary?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Agreed.
I think the intent is to differentiate between pure control plane nodes and those that run workloads. For example, SUSE's pricing model excludes pure control plane nodes.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
How is that determined? There is no such thing as a "pure" control-plane node as far as Kubernetes or RKE2 are concerned. Are you saying that customers don't have to pay for server nodes (cp/etcd/cp+etcd) if they put some taints on them to prevent workload scheduling? What taints, exactly?
brandond
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
a few nits, and a comment on deployment opt-out and configuration
e0892fd to
df0de20
Compare
docs/adrs/010-security-responder.md
Outdated
| } | ||
| ``` | ||
|
|
||
| The `clusteruuid` is needed to differentiate between different deployments (the UUID of `kube-system`). It is completely random and does not expose privacy considerations. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Probably not worth addressing here, but I have seen some users that build images with RKE2 already installed and started. This is a bad idea for several reasons, but it happens. When this is done, the clusters will have the same root CAs - and the same kube-system namespace UID. This will cause them to look like the same cluster to anything using that UID as a cluster UID. I don't know that there's a good way to work around this, but it is something to be aware of.
df0de20 to
0c35abc
Compare
docs/adrs/010-security-responder.md
Outdated
|
|
||
| ```yaml | ||
| # /etc/rancher/rke2/config.yaml | ||
| security-responder-enabled: true # default |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
| security-responder-enabled: true # default | |
| security-responder: true # default |
Having "enabled" in the name is redundant for a bool flag. We don't follow that style or any of our other bool flags like embedded-registry or supervisor-metrics.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
yeah I already requested replacement of this with a normal --disable flag, since it'll be packaged as a helm chart. no special flag necessary. That bit of review seems to have been dropped though.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Sorry about that! 🤦
docs/adrs/010-security-responder.md
Outdated
| } | ||
| ``` | ||
|
|
||
| The `clusteruuid` is needed to differentiate between different deployments (the UUID of `kube-system`). It is completely random and does not expose privacy considerations. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We might consider hashing the clusteruuid to ensure another level of obfuscation for collected metrics.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Let me add it as an option. I think we can discuss about this in the real PR
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yeah, I'm kinda meh on that. I don't see what hashing an already random number would add, other than the appearance of obfuscation.
docs/adrs/010-security-responder.md
Outdated
|
|
||
| ```yaml | ||
| # /etc/rancher/rke2/config.yaml | ||
| security-responder-enabled: true # default |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
| security-responder-enabled: true # default | |
| disable: | |
| - rke2-security-responder |
brandond
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nit on how to disable
Signed-off-by: manuelbuil <[email protected]>
Signed-off-by: manuelbuil <[email protected]>
0c35abc to
bcc461d
Compare
Could you check again Brad? Thanks |
brandond
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
lgtm
Proposed Changes
Types of Changes
Verification
Testing
Linked Issues
User-Facing Change
Further Comments