Skip to content

Actions: tatsu-lab/alpaca_eval

alpaca_eval unit tests

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
607 workflow runs
607 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[BUG] tool_calls (#429)
alpaca_eval unit tests #812: Commit 30d94f5 pushed by YannDubs
December 27, 2024 21:44 4m 20s main
December 27, 2024 21:44 4m 20s
[BUG] tool_calls
alpaca_eval unit tests #811: Pull request #429 opened by YannDubs
December 27, 2024 21:42 4m 26s yann/fix_tools
December 27, 2024 21:42 4m 26s
Add TOA to AlpacaEval (#428)
alpaca_eval unit tests #810: Commit 2898342 pushed by YannDubs
December 27, 2024 20:20 4m 23s main
December 27, 2024 20:20 4m 23s
Add TOA to AlpacaEval
alpaca_eval unit tests #809: Pull request #428 synchronize by YannDubs
December 27, 2024 20:20 4m 14s oceanypt:main
December 27, 2024 20:20 4m 14s
Add FuseChat-3.0 models to AlpacaEval (#426)
alpaca_eval unit tests #808: Commit 8bb6e57 pushed by YannDubs
December 27, 2024 20:16 4m 33s main
December 27, 2024 20:16 4m 33s
Add TOA to AlpacaEval
alpaca_eval unit tests #807: Pull request #428 opened by oceanypt
December 26, 2024 13:31 4m 35s oceanypt:main
December 26, 2024 13:31 4m 35s
Add FuseChat-3.0 models to AlpacaEval
alpaca_eval unit tests #806: Pull request #426 opened by yangzy39
December 16, 2024 07:01 4m 45s yangzy39:main
December 16, 2024 07:01 4m 45s
Add FuseChat-Llama-3.1-8B-Instruct, FuseChat-Gemma-2-9B-Instruct and …
alpaca_eval unit tests #805: Pull request #424 opened by yangzy39
December 15, 2024 06:46 4m 33s yangzy39:main
December 15, 2024 06:46 4m 33s
add example for Llama3 vllm server (#404)
alpaca_eval unit tests #804: Commit 0b4af76 pushed by YannDubs
November 11, 2024 07:17 4m 45s main
November 11, 2024 07:17 4m 45s
add example for Llama3 vllm server
alpaca_eval unit tests #803: Pull request #404 reopened by YannDubs
November 11, 2024 07:17 9m 1s cameron-chen:evaluator-vllm-server
November 11, 2024 07:17 9m 1s
Add Llama-3-Instruct-8B-RainbowPO to AlpacaEval (#416)
alpaca_eval unit tests #802: Commit 6976988 pushed by YannDubs
November 11, 2024 07:16 7m 42s main
November 11, 2024 07:16 7m 42s
Add Llama-3-Instruct-8B-RainbowPO to AlpacaEval
alpaca_eval unit tests #801: Pull request #416 synchronize by hanyang1999
October 31, 2024 17:25 5m 14s hanyang1999:main
October 31, 2024 17:25 5m 14s
Add Llama-3-Instruct-8B-RainbowPO to AlpacaEval
alpaca_eval unit tests #800: Pull request #416 opened by hanyang1999
October 31, 2024 04:36 4m 11s hanyang1999:main
October 31, 2024 04:36 4m 11s
Add NullModel to AlpacaEval (#414)
alpaca_eval unit tests #799: Commit 3c6ae8f pushed by YannDubs
October 23, 2024 17:00 4m 11s main
October 23, 2024 17:00 4m 11s
Add NullModel to AlpacaEval
alpaca_eval unit tests #798: Pull request #414 synchronize by YannDubs
October 23, 2024 17:00 4m 26s xszheng2020:main
October 23, 2024 17:00 4m 26s
Add GPO-Llama-3-8B-Instruct-GPM-2B and SPPO-Llama-3-8B-Instruct-GPM-2…
alpaca_eval unit tests #797: Commit 9d8e91d pushed by YannDubs
October 19, 2024 21:48 4m 20s main
October 19, 2024 21:48 4m 20s
Add NullModel to AlpacaEval
alpaca_eval unit tests #796: Pull request #414 opened by xszheng2020
October 15, 2024 20:39 4m 26s xszheng2020:main
October 15, 2024 20:39 4m 26s
add example for Llama3 vllm server
alpaca_eval unit tests #795: Pull request #404 synchronize by cameron-chen
October 13, 2024 14:24 7m 37s cameron-chen:evaluator-vllm-server
October 13, 2024 14:24 7m 37s
Add GPO-Llama-3-8B-Instruct-GPM-2B and SPPO-Llama-3-8B-Instruct-GPM-2…
alpaca_eval unit tests #794: Pull request #413 opened by xukp20
October 10, 2024 13:50 4m 26s general-preference:main
October 10, 2024 13:50 4m 26s
add Self-taught-llama3.1-70B-dpo as a evaluator (#412)
alpaca_eval unit tests #793: Commit d96bcbd pushed by YannDubs
September 26, 2024 15:37 4m 13s main
September 26, 2024 15:37 4m 13s
Fix the float number & Add SelfMoA_gemma-2-9b-it-SimPO, SelfMoA_gemm…
alpaca_eval unit tests #792: Commit b759c8d pushed by YannDubs
September 25, 2024 21:13 4m 37s main
September 25, 2024 21:13 4m 37s
Add SelfMoA_gemma-2-9b-it-SimPO, SelfMoA_gemma-2-9b-it-WPO-HB to AlpacaEval
alpaca_eval unit tests #791: Pull request #411 synchronize by wenzhe-li
September 25, 2024 19:37 4m 20s wenzhe-li:main
September 25, 2024 19:37 4m 20s
add Self-taught-llama3.1-70B-dpo as a evaluator
alpaca_eval unit tests #790: Pull request #412 opened by tianlu-wang
September 25, 2024 18:12 4m 40s tianlu-wang:add_self_taught_evaluator
September 25, 2024 18:12 4m 40s
Add SelfMoA_gemma-2-9b-it-SimPO, SelfMoA_gemma-2-9b-it-WPO-HB to AlpacaEval
alpaca_eval unit tests #789: Pull request #411 opened by wenzhe-li
September 24, 2024 03:26 4m 35s wenzhe-li:main
September 24, 2024 03:26 4m 35s
Updated HF Link in model_configs for Llama-3-8B-Instruct-SkillMix (#409)
alpaca_eval unit tests #788: Commit f8a7bf9 pushed by YannDubs
September 20, 2024 13:43 7m 45s main
September 20, 2024 13:43 7m 45s