Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Unpin once transformers latest is fixed #7088

Open
wants to merge 4 commits into
base: master
Choose a base branch
from

Conversation

loadams
Copy link
Collaborator

@loadams loadams commented Feb 27, 2025

Reverts #7085 which pinned the transformers version due to likely breaking changes from huggingface/transformers#36335

Error:

RuntimeError: Failed to import transformers.models.vision_encoder_decoder.modeling_vision_encoder_decoder because of the following error (look up to see its traceback)
FAILED unit/runtime/zero/test_zero_nesting_init.py::TestNestedParallelInit::test_nested_parallel_init - RuntimeError: Error(s) in loading state_dict for VisionEncoderDecoderModel:

Testing fixes in huggingface/transformers#36455

@loadams loadams requested a review from tjruwase February 27, 2025 16:49
Signed-off-by: Logan Adams <[email protected]>
@loadams
Copy link
Collaborator Author

loadams commented Feb 27, 2025

nv-a6000 test appears to be failing because transformers calls from torch.distributed.tensor import DTensor which doesn't exist prior to torch 2.5. This can be resolved by updating the docker image running on the test node to 24.12.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant