Regarding the maximum number of parallel sequences #13

Open
Tianci-Wen opened this issue Dec 19, 2024 · 2 comments

Comments

@Tianci-Wen

Hello! Thank you for open-sourcing this excellent work! Do you train multiple sequences in parallel on a single RTX 4090, or just one sequence at a time?

I noticed that GPU utilization for each sequence is relatively low, so I run 5–6 sequences simultaneously. With that many running, however, CPU utilization becomes quite high. Could this affect the final accuracy? The accuracy I currently achieve is slightly lower than the results reported in the paper.

In addition, when running 6 sequences in parallel, the results for some scenes are missing outputs such as chkpnt30000.pth, chkpnt40000.pth, and point_cloud; they were never saved. Do you know why this might be happening?

@Tianci-Wen
Author

[image attachment]

@juno181
Owner

juno181 commented Dec 31, 2024

Hi, the high CPU utilization comes from loading images just in time to reduce memory usage, which creates a CPU bottleneck when many sequences run at once. I only ran one job per GPU, and I don't recommend running multiple jobs on a single GPU: it can consume more than 10GB of GPU memory and get the job killed. I think the results are missing because those jobs were killed by OOM before they could save anything.

The high CPU utilization is unlikely to affect the final accuracy. Separately, I've seen some differences in the current preprocessing results and am working to reduce them.
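
For anyone who wants to follow the one-job-per-GPU advice and also catch runs that died before saving, here is a minimal sketch. It is not part of this repository: the function names are made up for illustration, the training command is whatever you normally launch, and the assumption that each sequence writes `chkpnt30000.pth`, `chkpnt40000.pth`, and `point_cloud` directly under its own output directory is inferred from the file names mentioned in this thread and may not match the actual layout.

```python
import subprocess
import sys
from pathlib import Path

# Entry names taken from this thread. That they sit directly under each
# sequence's output directory is an assumption, not confirmed by the repo.
EXPECTED_OUTPUTS = ("chkpnt30000.pth", "chkpnt40000.pth", "point_cloud")


def missing_outputs(output_dir, expected=EXPECTED_OUTPUTS):
    """Return the expected files or directories that are absent from output_dir."""
    root = Path(output_dir)
    return [name for name in expected if not (root / name).exists()]


def run_one_at_a_time(jobs):
    """Run (command, output_dir) pairs sequentially so only one job uses the GPU.

    Returns a dict mapping each output_dir (as a string) to a tuple of
    (return_code, missing_entries).
    """
    report = {}
    for command, output_dir in jobs:
        # subprocess.run blocks until the process exits, so the next job
        # does not start while this one still holds GPU memory.
        result = subprocess.run(command)
        report[str(output_dir)] = (result.returncode, missing_outputs(output_dir))
    return report


def print_report(report):
    """Print one line per sequence, flagging abnormal exits and missing outputs."""
    for output_dir, (code, missing) in report.items():
        status = "ok" if code == 0 and not missing else "CHECK"
        print(f"[{status}] {output_dir}: exit code {code}, missing {missing}")
```

To use it, pass a list of `(command, output_dir)` pairs, where `command` is the argument list for your usual training invocation, then call `print_report` on the result. A nonzero exit code together with missing checkpoints suggests the job ended abnormally (for example, by running out of memory) rather than finishing without saving. On POSIX systems, a negative code means the process was terminated by a signal.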
