Skip to content

Actions: IBM/unitxt

Test Catalog Consistency

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
5,014 workflow runs
5,014 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

surfacing a problem in current LoadHF
Test Catalog Consistency #5030: Pull request #1599 synchronize by dafnapension
February 23, 2025 18:45 3m 22s touch_the_loaded_dataset
February 23, 2025 18:45 3m 22s
small typos in loaders and in profiler
Test Catalog Consistency #5029: Pull request #1600 synchronize by dafnapension
February 23, 2025 18:43 12m 50s small_typos_in_loaders
February 23, 2025 18:43 12m 50s
no main-memory cache to loaders
Test Catalog Consistency #5028: Pull request #1624 synchronize by dafnapension
February 23, 2025 17:58 3m 44s no_loader_cache
February 23, 2025 17:58 3m 44s
Add provider specific args and allow using unrecognized model names (…
Test Catalog Consistency #5027: Commit 20a99df pushed by elronbandel
February 23, 2025 16:06 3m 36s main
February 23, 2025 16:06 3m 36s
no main-memory cache to loaders
Test Catalog Consistency #5026: Pull request #1624 synchronize by dafnapension
February 23, 2025 14:17 5m 33s no_loader_cache
February 23, 2025 14:17 5m 33s
Start implementing assesment for unitxt assitant
Test Catalog Consistency #5025: Pull request #1625 opened by eladven
February 23, 2025 14:06 3m 43s assistant_assessment
February 23, 2025 14:06 3m 43s
no main-memory cache to loaders
Test Catalog Consistency #5024: Pull request #1624 synchronize by dafnapension
February 23, 2025 10:38 3m 43s no_loader_cache
February 23, 2025 10:38 3m 43s
no main-memory cache to loaders
Test Catalog Consistency #5023: Pull request #1624 synchronize by dafnapension
February 23, 2025 09:53 3m 19s no_loader_cache
February 23, 2025 09:53 3m 19s
Enable offline mode for hugginface by using local pre-downloaded metr…
Test Catalog Consistency #5021: Commit d9d9a9d pushed by elronbandel
February 23, 2025 09:27 3m 33s main
February 23, 2025 09:27 3m 33s
Enable offline mode for hugginface by using local pre-downloaded metrics, datasets and models
Test Catalog Consistency #5020: Pull request #1603 synchronize by elronbandel
February 23, 2025 08:25 3m 33s local-cache
February 23, 2025 08:25 3m 33s
Example for evaluating system message leakage
Test Catalog Consistency #5019: Pull request #1609 synchronize by elronbandel
February 23, 2025 08:22 3m 23s system-leakage
February 23, 2025 08:22 3m 23s
Enable offline mode for hugginface by using local pre-downloaded metrics, datasets and models
Test Catalog Consistency #5018: Pull request #1603 synchronize by elronbandel
February 23, 2025 08:00 18m 52s local-cache
February 23, 2025 08:00 18m 52s
Add correctness_based_on_ground_truth criteria (#1623)
Test Catalog Consistency #5017: Commit f0531dc pushed by elronbandel
February 23, 2025 07:54 3m 36s main
February 23, 2025 07:54 3m 36s
Add correctness_based_on_ground_truth criteria
Test Catalog Consistency #5016: Pull request #1623 synchronize by elronbandel
February 23, 2025 07:54 3m 41s correcteness-criteria
February 23, 2025 07:54 3m 41s
Fix Azure OpenAI based LLM judges (#1619)
Test Catalog Consistency #5015: Commit 4d8047d pushed by elronbandel
February 23, 2025 07:53 56s main
February 23, 2025 07:53 56s
no main-memory cache to loaders
Test Catalog Consistency #5014: Pull request #1624 synchronize by dafnapension
February 22, 2025 13:47 3m 20s no_loader_cache
February 22, 2025 13:47 3m 20s
no main-memory cache to loaders
Test Catalog Consistency #5013: Pull request #1624 synchronize by dafnapension
February 22, 2025 12:15 3m 42s no_loader_cache
February 22, 2025 12:15 3m 42s
no main-memory cache to loaders
Test Catalog Consistency #5012: Pull request #1624 opened by dafnapension
February 22, 2025 12:05 3m 37s no_loader_cache
February 22, 2025 12:05 3m 37s
Add correctness_based_on_ground_truth criteria
Test Catalog Consistency #5011: Pull request #1623 opened by martinscooper
February 21, 2025 20:19 4m 6s correcteness-criteria
February 21, 2025 20:19 4m 6s
in situ
Test Catalog Consistency #5010: Pull request #1620 synchronize by dafnapension
February 21, 2025 20:11 3m 34s in_situ
February 21, 2025 20:11 3m 34s
Add prediction variable name customization to LLM as Judge
Test Catalog Consistency #5009: Pull request #1622 synchronize by martinscooper
February 21, 2025 17:31 3m 51s llm-judge-response-name
February 21, 2025 17:31 3m 51s
Add prediction variable name customization to LLM as Judge
Test Catalog Consistency #5008: Pull request #1622 synchronize by martinscooper
February 21, 2025 17:30 41s llm-judge-response-name
February 21, 2025 17:30 41s
Add prediction variable name customization to LLM as Judge
Test Catalog Consistency #5007: Pull request #1622 synchronize by martinscooper
February 21, 2025 16:58 3m 36s llm-judge-response-name
February 21, 2025 16:58 3m 36s
Add prediction variable name customization to LLM as Judge
Test Catalog Consistency #5006: Pull request #1622 opened by martinscooper
February 21, 2025 16:56 1m 40s llm-judge-response-name
February 21, 2025 16:56 1m 40s