Skip to content

Actions: vllm-project/vllm

Add label on auto-merge enabled

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
1,806 workflow runs
1,806 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Fix benchmark_moe.py tuning for CUDA devices
Add label on auto-merge enabled #1806: Pull request #14164 auto_merge_enabled by ywang96
March 4, 2025 05:00 10s
March 4, 2025 05:00 10s
add cutlass support for blackwell fp8 gemm
Add label on auto-merge enabled #1805: Pull request #13798 auto_merge_enabled by tlrmchlsmth
March 4, 2025 03:13 10s
March 4, 2025 03:13 10s
[core] moe fp8 block quant tuning support
Add label on auto-merge enabled #1804: Pull request #14068 auto_merge_enabled by mgoin
March 4, 2025 01:04 13s
March 4, 2025 01:04 13s
[TPU][Profiler] Support start_profile/stop_profile in TPU worker
Add label on auto-merge enabled #1803: Pull request #13988 auto_merge_enabled by mgoin
March 3, 2025 23:02 13s
March 3, 2025 23:02 13s
[Docs] Add GPTQModel
Add label on auto-merge enabled #1802: Pull request #14056 auto_merge_enabled by mgoin
March 3, 2025 21:20 13s
March 3, 2025 21:20 13s
[v1] Add comments to the new ragged paged attention Pallas kernel
Add label on auto-merge enabled #1801: Pull request #14155 auto_merge_enabled by mgoin
March 3, 2025 21:09 13s
March 3, 2025 21:09 13s
[WIP][[V1][Metrics] Implement max_num_generation_tokens, request_params_n, and request_params_max_tokens metrics
Add label on auto-merge enabled #1800: Pull request #14055 auto_merge_enabled by robertgshaw2-redhat
March 3, 2025 17:43 16s
March 3, 2025 17:43 16s
[V0][Metrics] Remove unimplemented vllm:tokens_total
Add label on auto-merge enabled #1799: Pull request #14134 auto_merge_enabled by robertgshaw2-redhat
March 3, 2025 17:22 11s
March 3, 2025 17:22 11s
[v1][Metrics] Add design doc
Add label on auto-merge enabled #1798: Pull request #12745 auto_merge_enabled by robertgshaw2-redhat
March 3, 2025 17:17 14s
March 3, 2025 17:17 14s
[V0][Metrics] Deprecate some KV/prefix cache metrics
Add label on auto-merge enabled #1797: Pull request #14136 auto_merge_enabled by robertgshaw2-redhat
March 3, 2025 17:04 15s
March 3, 2025 17:04 15s
[V0][Metrics] Deprecate some questionable request time metrics
Add label on auto-merge enabled #1796: Pull request #14135 auto_merge_enabled by robertgshaw2-redhat
March 3, 2025 17:03 12s
March 3, 2025 17:03 12s
Improve the docs for TransformersModel
Add label on auto-merge enabled #1795: Pull request #14147 auto_merge_enabled by DarkLight1337
March 3, 2025 16:24 12s
March 3, 2025 16:24 12s
Fix head_dim not existing in all model configs (Transformers backend)
Add label on auto-merge enabled #1794: Pull request #14141 auto_merge_enabled by jeejeelee
March 3, 2025 15:54 11s
March 3, 2025 15:54 11s
[Bugfix][CI] ALiBi test case in xformers multi_query_kv_attention
Add label on auto-merge enabled #1793: Pull request #11301 auto_merge_enabled by DarkLight1337
March 3, 2025 11:42 14s
March 3, 2025 11:42 14s
[Misc][Platform] Move use allgather to platform
Add label on auto-merge enabled #1792: Pull request #14010 auto_merge_enabled by youkaichao
March 3, 2025 05:13 12s
March 3, 2025 05:13 12s
[Bugfix] Fix gptq_marlin for deepseek-v3
Add label on auto-merge enabled #1791: Pull request #13750 auto_merge_enabled by mgoin
March 3, 2025 02:51 11s
March 3, 2025 02:51 11s
add cutlass support for blackwell fp8 gemm
Add label on auto-merge enabled #1790: Pull request #13798 auto_merge_enabled by tlrmchlsmth
March 2, 2025 20:12 14s
March 2, 2025 20:12 14s
[Bugfix] Explicitly include "omp.h" for MacOS to avoid installation failure
Add label on auto-merge enabled #1789: Pull request #14051 auto_merge_enabled by hmellor
March 2, 2025 15:33 10s
March 2, 2025 15:33 10s
Update deprecated Python 3.8 typing
Add label on auto-merge enabled #1788: Pull request #13971 auto_merge_enabled by DarkLight1337
March 2, 2025 15:09 13s
March 2, 2025 15:09 13s
[Doc] Source building add clone step
Add label on auto-merge enabled #1787: Pull request #14086 auto_merge_enabled by Isotr0py
March 2, 2025 10:32 12s
March 2, 2025 10:32 12s
[v1] Add __repr__ to KVCacheBlock to avoid recursive print
Add label on auto-merge enabled #1786: Pull request #14081 auto_merge_enabled by comaniac
March 1, 2025 19:27 12s
March 1, 2025 19:27 12s
[v1][Bugfix] Only cache blocks that are not in the prefix cache
Add label on auto-merge enabled #1785: Pull request #14073 auto_merge_enabled by comaniac
March 1, 2025 06:35 11s
March 1, 2025 06:35 11s
[Model] Add support for GraniteMoeShared models
Add label on auto-merge enabled #1784: Pull request #13313 auto_merge_enabled by DarkLight1337
March 1, 2025 06:27 11s
March 1, 2025 06:27 11s
[Documentation] Add more deployment guide for Kubernetes deployment
Add label on auto-merge enabled #1783: Pull request #13841 auto_merge_enabled by KuntaiDu
March 1, 2025 06:05 11s
March 1, 2025 06:05 11s
[Docs] Add pipeline_parallel_size to optimization docs
Add label on auto-merge enabled #1782: Pull request #14059 auto_merge_enabled by DarkLight1337
March 1, 2025 05:01 11s
March 1, 2025 05:01 11s