Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat(fluent-bit): set system-node-critical as a default priorityClass #70

Merged

Conversation

m00lecule
Copy link
Contributor

Issues:
#54

In Kubernetes there is a good practice to specify a system-node-critical priority class for monitoring agents running as daemonsets.

ref: https://kubernetes.io/docs/tasks/administer-cluster/guaranteed-scheduling-critical-addon-pods/

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

@m00lecule m00lecule force-pushed the feat-priority-class-support branch 3 times, most recently from 25fd497 to c3ed667 Compare July 25, 2024 19:13
@m00lecule m00lecule force-pushed the feat-priority-class-support branch from c3ed667 to b4cef35 Compare July 25, 2024 19:13
@m00lecule
Copy link
Contributor Author

@lisguo may I ask for a review?

@m00lecule m00lecule changed the title feat(fluent-bit): set system-node-critical as default priorityClass feat(fluent-bit): set system-node-critical as a default priorityClass Jul 25, 2024
@m00lecule m00lecule force-pushed the feat-priority-class-support branch from cd671cc to e54cc5a Compare August 12, 2024 20:35
@m00lecule m00lecule force-pushed the feat-priority-class-support branch from e54cc5a to 043f598 Compare August 12, 2024 20:36
@m00lecule
Copy link
Contributor Author

@lisguo May I ask for a review again? The Integration tests are failing due to Error: Credentials could not be loaded, please check your action inputs: Could not load credentials from any providers which doesn't seem to be connected with my changes.

@m00lecule m00lecule requested a review from lisguo September 9, 2024 19:14
@markandersontrocme
Copy link

@m00lecule we should also add the priorityClass to the cloudwatch-agent? https://github.com/aws-observability/helm-charts/blob/main/charts/amazon-cloudwatch-observability/templates/linux/cloudwatch-agent-daemonset.yaml#L23

@m00lecule
Copy link
Contributor Author

@m00lecule we should also add the priorityClass to the cloudwatch-agent? https://github.com/aws-observability/helm-charts/blob/main/charts/amazon-cloudwatch-observability/templates/linux/cloudwatch-agent-daemonset.yaml#L23

@markandersontrocme I believe so, the general principle is to ensure that monitoring agents are running on all EKS nodes

@markandersontrocme
Copy link

@markandersontrocme I believe so, the general principle is to ensure that monitoring agents are running on all EKS nodes

I fee like it would make sense to add that to this PR 🙏

@milldr
Copy link

milldr commented Dec 11, 2024

@m00lecule thanks for opening this PR! I'm hitting this same issue for both fluent-bit and cloudwatch-agent. Is there anything we can do to push this through?

@voidlily
Copy link

voidlily commented Dec 11, 2024

I've been manually working around this by setting the priorityClassName on the deployments/daemonsets, but that obviously doesn't scale well for version updates or deploying to multiple clusters. This PR would be useful to me as well (also adding system-cluster-critical, possibly system-node-critical if it's a daemonset) for the cloudwatch agent.

See also the containers-roadmap ticket I linked earlier in this thread

@m00lecule
Copy link
Contributor Author

m00lecule commented Dec 12, 2024

@milldr @voidlily lets keep fingers crossed together that somebody from AWS (@lisguo) will review proposed changes.

@m00lecule
Copy link
Contributor Author

m00lecule commented Dec 12, 2024

@milldr @markandersontrocme priorityClassName support to cloudwatch-agent daemonset has been introduced here 9e60aaa

@sky333999 sky333999 merged commit dc0552a into aws-observability:main Dec 17, 2024
0 of 3 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

6 participants