You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
2024-12-25 07:37:06.206 | 3619804 | INFO | balsam.site.launcher.mpi_mode:120] Job Acquisition: 1 empty nodes; 1.0 aggregate free nodes; requested up to 990 jobs [node packing allowed: True]; Acquired 1 jobs.
2024-12-25 07:37:06.208 | 3619804 | WARNING | balsam.site.launcher.mpi_mode:158] Insufficient resources to place Job 40674082 nacl-melt. Stashing for later launch.
2024-12-25 07:37:07.211 | 3619804 | WARNING | balsam.site.launcher.mpi_mode:158] Insufficient resources to place Job 40674082 nacl-melt. Stashing for later launch.
2024-12-25 07:38:07.208 | 3619804 | WARNING | balsam.site.launcher.mpi_mode:158] Insufficient resources to place Job 40674082 nacl-melt. Stashing for later launch.
2024-12-25 07:38:07.208 | 3619804 | INFO | balsam.site.launcher.mpi_mode:74] Exceeded 60 sec TTL: shutting down because nothing to do
Jobs created through CLI do not get picked up by the balsam scheduler as shown below.
Here is the mpi log file output,
"""
2024-12-25 07:37:04.931 | 3619804 | INFO | balsam:114] Configured logging on x3005c0s1b1n0
2024-12-25 07:37:04.932 | 3619804 | INFO | balsam.platform.compute_node.alcf_polaris_node:51] x3005c0s1b1n0.hsn.cm.polaris.alcf.anl.gov detected GPU IDs: [3, 2, 1, 0]
2024-12-25 07:37:06.206 | 3619804 | INFO | balsam.site.launcher.mpi_mode:120] Job Acquisition: 1 empty nodes; 1.0 aggregate free nodes; requested up to 990 jobs [node packing allowed: True]; Acquired 1 jobs.
2024-12-25 07:37:06.208 | 3619804 | WARNING | balsam.site.launcher.mpi_mode:158] Insufficient resources to place Job 40674082 nacl-melt. Stashing for later launch.
2024-12-25 07:37:07.211 | 3619804 | WARNING | balsam.site.launcher.mpi_mode:158] Insufficient resources to place Job 40674082 nacl-melt. Stashing for later launch.
2024-12-25 07:38:07.208 | 3619804 | WARNING | balsam.site.launcher.mpi_mode:158] Insufficient resources to place Job 40674082 nacl-melt. Stashing for later launch.
2024-12-25 07:38:07.208 | 3619804 | INFO | balsam.site.launcher.mpi_mode:74] Exceeded 60 sec TTL: shutting down because nothing to do
2024-12-25 07:38:07.208 | 3619804 | INFO | balsam.site.launcher.mpi_mode:227] Launcher starting shutdown sequence
2024-12-25 07:38:07.208 | 3619804 | INFO | balsam.site.job_source:212] Signal: JobSource cancelling tick thread and deleting API Session
2024-12-25 07:38:12.437 | 3619804 | INFO | balsam.site.job_source:214] JobSource exit graceful
2024-12-25 07:38:12.437 | 3619804 | INFO | balsam.site.launcher.mpi_mode:229] Timing out active runs
2024-12-25 07:38:13.231 | 3619808 | INFO | balsam.site.status_updater:50] Signal: break out of StatusUpdater main loop
2024-12-25 07:38:13.232 | 3619808 | INFO | balsam.site.status_updater:52] StatusUpdater thread finished.
"""
However, job created thorugh python api in the same site and for the same app runs just fine. Is this a bug?
The text was updated successfully, but these errors were encountered: