Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix device selection using CUDA_VISIBLE_DEVICES #6530

Merged

Conversation

tohtana
Copy link
Contributor

@tohtana tohtana commented Sep 12, 2024

This PR addresses #5818.
Instead of contiguous numbers based on the device count, this PR uses device indices in --include.

@tohtana tohtana requested a review from awan-10 as a code owner September 12, 2024 23:33
@tjruwase tjruwase requested review from tjruwase and GuanhuaWang and removed request for awan-10 September 17, 2024 16:48
@tjruwase tjruwase requested a review from loadams as a code owner October 7, 2024 21:40
@loadams
Copy link
Collaborator

loadams commented Oct 8, 2024

We might need to check the docs for what we say about CUDA_VISIBLE_DEVICES on the deepspeed.ai website to see if we need to update that with these changes?

@tohtana tohtana added this pull request to the merge queue Oct 8, 2024
@tohtana
Copy link
Contributor Author

tohtana commented Oct 8, 2024

@loadams I believe this PR aligns the launcher's behavior with the description in the document.

Merged via the queue into deepspeedai:master with commit 5cbbff4 Oct 8, 2024
11 checks passed
@rlanday
Copy link

rlanday commented Dec 1, 2024

The Getting Started guide says:

https://www.deepspeed.ai/getting-started/

Also note that CUDA_VISIBLE_DEVICES can’t be used with DeepSpeed to control which devices should be used.

Is this correct?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants