[PM-1942] Support transfers for a job on the HOST OS instead of from within the container #2055

Open
mayani opened this issue Dec 14, 2024 · 0 comments
Assignees
Labels
affects-5.0.7; affects-master (Current Trunk Version); fix-5.1.0; fix-master (Current Trunk Version); major (Major loss of function.); Planner: Containers; Planner: Transfer Module (Refers to Transfer Refiners, Second and First Level Staging, Transfer implementations in Pegasus Cod); sync-from-jira (Synced from Jira)

Comments


mayani commented Dec 14, 2024

As part of #1435, containerized jobs in Pegasus pull in their data from within the container when the container starts. This requires the Pegasus worker package to be deployed inside the container. Although the worker package does get deployed inside the container at runtime, certain Python dependencies still need to be fulfilled in the container build file. This approach was selected to allow users to use transfer tools whose dependencies are not fulfilled on the host OS.

However, there are also legitimate cases for supporting data transfers on the host OS for containerized jobs. One example is ML applications using TensorFlow etc. that have ntasks set to > 1.
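
To make the two modes concrete, here is a rough sketch of how the step ordering for a single containerized job might differ. This is illustrative only and not Pegasus code: the function `plan_job_steps`, the `transfers_on_host` flag, and the step descriptions are all hypothetical, and the handling of stage-out and of the mounted job directory are assumptions that this issue does not spell out.

```python
from typing import List


def plan_job_steps(transfers_on_host: bool) -> List[str]:
    """Return an ordered list of steps for one containerized job.

    Hypothetical illustration; the names and steps are assumptions,
    not the actual Pegasus job wrapper.
    """
    if transfers_on_host:
        # Requested behavior: transfers run on the host OS, so the
        # container image does not need the transfer tool dependencies.
        return [
            "host: stage in input data",
            "host: start container with the job directory mounted",  # assumption
            "container: run user task",
            "host: stop container",
            "host: stage out output data",  # assumption
        ]
    # Current behavior (from #1435): data is pulled in from within the
    # container, which needs the worker package and its Python dependencies.
    return [
        "host: start container",
        "container: deploy worker package",
        "container: stage in input data",
        "container: run user task",
        "container: stage out output data",  # assumption
        "host: stop container",
    ]


if __name__ == "__main__":
    for mode in (False, True):
        print(f"transfers_on_host={mode}")
        for step in plan_job_steps(mode):
            print("  " + step)
```

In the host-side ordering, nothing inside the container depends on the worker package, which is the point of the request.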

Reporter: @vahi
Watchers:
@vahi

mayani changed the title from "PM-1942" to "[PM-1942] Support transfers for a job on the HOST OS instead of from within the container" on Dec 14, 2024
mayani added the "Planner: Transfer Module" and "affects-master" labels on Dec 14, 2024
mayani added the "affects-5.0.7", "fix-master", "fix-5.1.0", and "major" labels on Dec 14, 2024