Skip to content

Actions: ROCm/triton

AMD Perf Kernel Integration Tests

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
573 workflow runs
573 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Change grouping calculation in gemm.py
AMD Perf Kernel Integration Tests #562: Pull request #732 opened by azaidy
February 24, 2025 18:55 26m 17s ali/gemm_grouping
February 24, 2025 18:55 26m 17s
Tianxing/rope latent attention
AMD Perf Kernel Integration Tests #561: Pull request #731 synchronize by Chi-Chu319
February 24, 2025 08:54 1h 26m 44s tianxing/rope-latent-attention
February 24, 2025 08:54 1h 26m 44s
Tianxing/rope latent attention
AMD Perf Kernel Integration Tests #560: Pull request #731 synchronize by Chi-Chu319
February 24, 2025 08:53 1m 21s tianxing/rope-latent-attention
February 24, 2025 08:53 1m 21s
Tianxing/rope latent attention
AMD Perf Kernel Integration Tests #559: Pull request #731 synchronize by Chi-Chu319
February 24, 2025 08:52 1m 57s tianxing/rope-latent-attention
February 24, 2025 08:52 1m 57s
Tianxing/rope latent attention
AMD Perf Kernel Integration Tests #558: Pull request #731 synchronize by Chi-Chu319
February 24, 2025 08:48 3m 30s tianxing/rope-latent-attention
February 24, 2025 08:48 3m 30s
Tianxing/rope latent attention
AMD Perf Kernel Integration Tests #557: Pull request #731 synchronize by Chi-Chu319
February 24, 2025 08:39 9m 11s tianxing/rope-latent-attention
February 24, 2025 08:39 9m 11s
Tianxing/rope latent attention
AMD Perf Kernel Integration Tests #556: Pull request #731 opened by Chi-Chu319
February 24, 2025 08:29 10m 27s tianxing/rope-latent-attention
February 24, 2025 08:29 10m 27s
Clean up GEMM kernel
AMD Perf Kernel Integration Tests #555: Pull request #730 opened by vgokhale
February 21, 2025 22:58 1h 24m 21s vinayak/gemm
February 21, 2025 22:58 1h 24m 21s
Added compiler hints to enable buffer loads
AMD Perf Kernel Integration Tests #554: Pull request #729 opened by azaidy
February 21, 2025 17:38 1h 30m 49s alizaidy/buffer_load
February 21, 2025 17:38 1h 30m 49s
[FA] Add tl.assume to flash_attention.py
AMD Perf Kernel Integration Tests #553: Pull request #728 synchronize by jungpark-mlir
February 21, 2025 17:26 1h 25m 55s fa-assume
February 21, 2025 17:26 1h 25m 55s
RMSNorm backward kernel implementaton
AMD Perf Kernel Integration Tests #552: Pull request #709 reopened by xiaohuguo2023
February 21, 2025 16:52 1h 30m 42s rmsnorm_bwd
February 21, 2025 16:52 1h 30m 42s
[FA] Add tl.assume to flash_attention.py
AMD Perf Kernel Integration Tests #551: Pull request #728 opened by jungpark-mlir
February 21, 2025 15:44 1h 30m 0s fa-assume
February 21, 2025 15:44 1h 30m 0s
RMSNorm backward kernel implementaton
AMD Perf Kernel Integration Tests #550: Pull request #709 synchronize by xiaohuguo2023
February 21, 2025 12:47 1h 33m 27s rmsnorm_bwd
February 21, 2025 12:47 1h 33m 27s
RMSNorm backward kernel implementaton
AMD Perf Kernel Integration Tests #549: Pull request #709 synchronize by xiaohuguo2023
February 20, 2025 23:23 1h 25m 57s rmsnorm_bwd
February 20, 2025 23:23 1h 25m 57s
RMSNorm backward kernel implementaton
AMD Perf Kernel Integration Tests #548: Pull request #709 synchronize by xiaohuguo2023
February 20, 2025 17:41 1h 32m 23s rmsnorm_bwd
February 20, 2025 17:41 1h 32m 23s
Fused moe gemm + silu activation kernel
AMD Perf Kernel Integration Tests #547: Pull request #710 synchronize by Chi-Chu319
February 19, 2025 09:53 1h 27m 30s tianxing/fused-moe-single-gemm
February 19, 2025 09:53 1h 27m 30s
Fix pid remapping logic when GRID_MN cannot divide NUM_XCDS
AMD Perf Kernel Integration Tests #546: Pull request #722 synchronize by zhanglx13
February 16, 2025 04:55 1h 35m 8s fix_pid_remapping
February 16, 2025 04:55 1h 35m 8s
Fix pid remapping logic when GRID_MN cannot divide NUM_XCDS
AMD Perf Kernel Integration Tests #545: Pull request #722 opened by zhanglx13
February 16, 2025 04:50 4m 52s fix_pid_remapping
February 16, 2025 04:50 4m 52s
Add int4 quantization support to MoE
AMD Perf Kernel Integration Tests #544: Pull request #715 synchronize by rahulbatra85
February 13, 2025 20:58 1h 26m 31s moe_int4
February 13, 2025 20:58 1h 26m 31s
Add int4 quantization support to MoE
AMD Perf Kernel Integration Tests #543: Pull request #715 synchronize by rahulbatra85
February 13, 2025 18:42 1h 30m 40s moe_int4
February 13, 2025 18:42 1h 30m 40s
Add workaround for pytorch device selection issue
AMD Perf Kernel Integration Tests #542: Pull request #711 synchronize by AlexAUT
February 6, 2025 14:03 1h 27m 10s tune_gemm_pytorch_fix
February 6, 2025 14:03 1h 27m 10s
Attempt to unify perf kernel and AOTriton's kernel.
AMD Perf Kernel Integration Tests #541: Pull request #716 synchronize by xinyazhang
February 5, 2025 19:29 1h 29m 20s xinyazhang/union-fa-naming_convention
February 5, 2025 19:29 1h 29m 20s
Multihead latent attention
AMD Perf Kernel Integration Tests #540: Pull request #712 synchronize by Chi-Chu319
February 5, 2025 08:36 1h 36m 10s jukorhon/latent-attention
February 5, 2025 08:36 1h 36m 10s
[AMD][RFC][DO-NOT-MERGE] Named-barrier using LDS
AMD Perf Kernel Integration Tests #539: Pull request #720 opened by karthik-man
February 4, 2025 23:12 Action required karthik-man:km-named-barrier
February 4, 2025 23:12 Action required
Add int4 quantization support to MoE
AMD Perf Kernel Integration Tests #538: Pull request #715 synchronize by rahulbatra85
February 4, 2025 20:16 3h 59m 22s moe_int4
February 4, 2025 20:16 3h 59m 22s