Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[codegen][gpu] Removing zero slice check from gpu_apply_tiling_level #20120

Merged
merged 1 commit into from
Feb 28, 2025

Conversation

jerryyin
Copy link
Member

@jerryyin jerryyin commented Feb 27, 2025

This PR flipped the switch for allow-zero-slice option, which means zero slice will be tolerated for igemm that requires a padding. The dynamic if condition check is unnecessary because we compute the K loop bound in a reasonable fashion and zero slice situation shouldn't happen. The motivation of the PR is performance benefit to remove an unnecessary conditional from padded gemm global loading.

@jerryyin
Copy link
Member Author

MI300 machine is failing with hipErrorInvalidDevice, aligned with the tip of main branch CI.

I'll leave this PR a bit longer till merge. Hopefully wait till when CI on MI300 machine is fixed to get a round of model level regression coverage there.

@jerryyin jerryyin merged commit aa26710 into main Feb 28, 2025
46 checks passed
@jerryyin jerryyin deleted the users/zyin/remove-zero-slice-check branch February 28, 2025 14:11
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants