Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

the object has been modified; please apply your changes to the latest version and try again #607

Open
gl-001 opened this issue Nov 27, 2023 · 8 comments

Comments

@gl-001
Copy link

gl-001 commented Nov 27, 2023

version 0.4.0
anyone occurred this problem?
i add some log and found doUpdateJobStatus function will raise "the object has been modified; please apply your changes to the latest version and try again"
thx

@tenzen-y
Copy link
Member

"the object has been modified; please apply your changes to the latest version and try again"

This is a well-known client-side apply issue. However, this error doesn't raise any bugs.

/close

Copy link

@tenzen-y: Closing this issue.

In response to this:

"the object has been modified; please apply your changes to the latest version and try again"

This is a well-known client-side apply issue. However, this error doesn't raise any bugs.

/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@gl-001
Copy link
Author

gl-001 commented Nov 28, 2023

this error will lead the job status Not Credible, which job will long time is running, but the pods was succeed. If there is a job after the mpi job in a pipeline, then the job will not be processed after waiting a long time. So are there any methods to solve this problem? @tenzen-y

Completion Time: 2023-11-27T03:34:09Z
Conditions:
Last Transition Time: 2023-11-27T03:31:
Last Update Time: 2023-11-27T03:31:20Z
Message: MPIJob a5qvbedvqod1-mpijob is created.
Reason: MPIJobCreated
Status: True
Type: Created
Last Transition Time: 2023-11-27T03:34:09Z.
Last Update Time: 2023-11-27T03:34:09Z
Message: Job has reached the specified backoff limit
Reason: BackoffLimitExceeded
Status: True
Type: Failed
Last Transition Time: 2023-11-27T03:34:09Z
Last Update Time: 2023-11-27T03:34:09Z
Message: MPIJob a5qvbedvqod1-mpijob is running.
Reason: MPIJobRunning
Status: True
Type: Running [will live a long time]
Replica Statuses:
Launcher:
Failed: 1
Worker:
Start Time: 2023-11-27T03:31:20Z   

@tenzen-y
Copy link
Member

/reopen

@google-oss-prow google-oss-prow bot reopened this Dec 27, 2023
Copy link

@tenzen-y: Reopened this issue.

In response to this:

/reopen

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@tenzen-y
Copy link
Member

/kind support

Copy link

@tenzen-y: The label(s) kind/support cannot be applied, because the repository doesn't have them.

In response to this:

/kind support

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

@tenzen-y
Copy link
Member

this error will lead the job status Not Credible, which job will long time is running, but the pods was succeed. If there is a job after the mpi job in a pipeline, then the job will not be processed after waiting a long time. So are there any methods to solve this problem? @tenzen-y

Completion Time: 2023-11-27T03:34:09Z
Conditions:
Last Transition Time: 2023-11-27T03:31:
Last Update Time: 2023-11-27T03:31:20Z
Message: MPIJob a5qvbedvqod1-mpijob is created.
Reason: MPIJobCreated
Status: True
Type: Created
Last Transition Time: 2023-11-27T03:34:09Z.
Last Update Time: 2023-11-27T03:34:09Z
Message: Job has reached the specified backoff limit
Reason: BackoffLimitExceeded
Status: True
Type: Failed
Last Transition Time: 2023-11-27T03:34:09Z
Last Update Time: 2023-11-27T03:34:09Z
Message: MPIJob a5qvbedvqod1-mpijob is running.
Reason: MPIJobRunning
Status: True
Type: Running [will live a long time]
Replica Statuses:
Launcher:
Failed: 1
Worker:
Start Time: 2023-11-27T03:31:20Z   

@gl-001 Sorry for the late response. IIUC, if the update process fails, the controller will retry to update MPIJob.
Can you share the mpi-operator logs with us?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants