Actions: IBM/unitxt

Test HELM Integration

4,668 workflow runs

surfacing a problem in current LoadHF
Test HELM Integration #4684: Pull request #1599 synchronize by dafnapension
February 23, 2025 18:45 12m 25s touch_the_loaded_dataset
small typos in loaders and in profiler
Test HELM Integration #4683: Pull request #1600 synchronize by dafnapension
February 23, 2025 18:43 2m 50s small_typos_in_loaders
no main-memory cache to loaders
Test HELM Integration #4682: Pull request #1624 synchronize by dafnapension
February 23, 2025 17:58 2m 20s no_loader_cache
Add provider specific args and allow using unrecognized model names (…
Test HELM Integration #4681: Commit 20a99df pushed by elronbandel
February 23, 2025 16:06 2m 59s main
no main-memory cache to loaders
Test HELM Integration #4680: Pull request #1624 synchronize by dafnapension
February 23, 2025 14:17 2m 31s no_loader_cache
Start implementing assessment for unitxt assistant
Test HELM Integration #4679: Pull request #1625 opened by eladven
February 23, 2025 14:06 2m 23s assistant_assessment
no main-memory cache to loaders
Test HELM Integration #4678: Pull request #1624 synchronize by dafnapension
February 23, 2025 10:38 2m 17s no_loader_cache
no main-memory cache to loaders
Test HELM Integration #4677: Pull request #1624 synchronize by dafnapension
February 23, 2025 09:53 2m 0s no_loader_cache
Enable offline mode for Hugging Face by using local pre-downloaded metr…
Test HELM Integration #4675: Commit d9d9a9d pushed by elronbandel
February 23, 2025 09:27 2m 8s main
Enable offline mode for Hugging Face by using local pre-downloaded metrics, datasets and models
Test HELM Integration #4674: Pull request #1603 synchronize by elronbandel
February 23, 2025 08:25 2m 49s local-cache
Example for evaluating system message leakage
Test HELM Integration #4673: Pull request #1609 synchronize by elronbandel
February 23, 2025 08:22 6m 8s system-leakage
Enable offline mode for Hugging Face by using local pre-downloaded metrics, datasets and models
Test HELM Integration #4672: Pull request #1603 synchronize by elronbandel
February 23, 2025 08:00 7m 48s local-cache
Add correctness_based_on_ground_truth criteria (#1623)
Test HELM Integration #4671: Commit f0531dc pushed by elronbandel
February 23, 2025 07:54 10m 9s main
Add correctness_based_on_ground_truth criteria
Test HELM Integration #4670: Pull request #1623 synchronize by elronbandel
February 23, 2025 07:54 2m 31s correcteness-criteria
Fix Azure OpenAI based LLM judges (#1619)
Test HELM Integration #4669: Commit 4d8047d pushed by elronbandel
February 23, 2025 07:53 59s main
no main-memory cache to loaders
Test HELM Integration #4668: Pull request #1624 synchronize by dafnapension
February 22, 2025 13:47 2m 45s no_loader_cache
no main-memory cache to loaders
Test HELM Integration #4667: Pull request #1624 synchronize by dafnapension
February 22, 2025 12:15 2m 3s no_loader_cache
no main-memory cache to loaders
Test HELM Integration #4666: Pull request #1624 opened by dafnapension
February 22, 2025 12:05 2m 31s no_loader_cache
Add correctness_based_on_ground_truth criteria
Test HELM Integration #4665: Pull request #1623 opened by martinscooper
February 21, 2025 20:19 2m 12s correcteness-criteria
in situ
Test HELM Integration #4664: Pull request #1620 synchronize by dafnapension
February 21, 2025 20:11 2m 13s in_situ
Add prediction variable name customization to LLM as Judge
Test HELM Integration #4663: Pull request #1622 synchronize by martinscooper
February 21, 2025 17:31 2m 54s llm-judge-response-name
Add prediction variable name customization to LLM as Judge
Test HELM Integration #4662: Pull request #1622 synchronize by martinscooper
February 21, 2025 17:30 43s llm-judge-response-name
Add prediction variable name customization to LLM as Judge
Test HELM Integration #4661: Pull request #1622 synchronize by martinscooper
February 21, 2025 16:58 2m 59s llm-judge-response-name
Add prediction variable name customization to LLM as Judge
Test HELM Integration #4660: Pull request #1622 opened by martinscooper
February 21, 2025 16:56 1m 40s llm-judge-response-name