Actions: tatsu-lab/alpaca_eval

alpaca_eval unit tests

605 workflow runs

minor
alpaca_eval unit tests #282: Commit 642cd5e pushed by YannDubs
November 28, 2023 03:35 3m 3s main
show img in readme (#178)
alpaca_eval unit tests #281: Commit 94cd8b6 pushed by YannDubs
November 27, 2023 23:03 3m 14s main
show img in readme
alpaca_eval unit tests #280: Pull request #178 synchronize by YannDubs
November 27, 2023 23:03 2m 53s yann/verified_img
show img in readme
alpaca_eval unit tests #279: Pull request #178 synchronize by YannDubs
November 27, 2023 23:02 2m 54s yann/verified_img
show img in readme
alpaca_eval unit tests #278: Pull request #178 synchronize by YannDubs
November 27, 2023 23:01 2m 54s yann/verified_img
show img in readme
alpaca_eval unit tests #277: Pull request #178 synchronize by YannDubs
November 27, 2023 23:00 2m 52s yann/verified_img
show img in readme
alpaca_eval unit tests #276: Pull request #178 opened by YannDubs
November 27, 2023 22:57 2m 50s yann/verified_img
feat: add way to verify results (#177)
alpaca_eval unit tests #275: Commit 5d66c75 pushed by YannDubs
November 27, 2023 22:54 3m 9s main
feat: add way to verify results
alpaca_eval unit tests #274: Pull request #177 synchronize by YannDubs
November 27, 2023 22:54 2m 52s yann/readme_verified
feat: add way to verify results
alpaca_eval unit tests #273: Pull request #177 synchronize by YannDubs
November 27, 2023 22:53 2m 54s yann/readme_verified
feat: add way to verify results
alpaca_eval unit tests #272: Pull request #177 opened by YannDubs
November 27, 2023 22:45 2m 59s yann/readme_verified
Add 01-ai/Yi-34B-Chat to AlpacaEval (#175)
alpaca_eval unit tests #271: Commit 330cf69 pushed by YannDubs
November 26, 2023 20:43 2m 57s main
fix: ensure that people use the correct baseline
alpaca_eval unit tests #270: Commit 588772c pushed by YannDubs
November 26, 2023 20:38 2m 57s main
Add MiniChat-1.5-3B to AlpacaEval and Fix MiniChat-3B (#176)
alpaca_eval unit tests #269: Commit b226e30 pushed by YannDubs
November 26, 2023 20:29 3m 10s main
Add MiniChat-1.5-3B to AlpacaEval and Fix MiniChat-3B
alpaca_eval unit tests #268: Pull request #176 opened by GeneZC
November 26, 2023 03:31 2m 53s GeneZC:minichat-3b-fix
Fix the results of MiniChat-3B
alpaca_eval unit tests #267: Pull request #173 synchronize by GeneZC
November 26, 2023 03:16 2m 47s GeneZC:minichat-3b-fix
Add 01-ai/Yi-34B-Chat to AlpacaEval
alpaca_eval unit tests #266: Pull request #175 opened by HyperdriveHustle
November 26, 2023 03:13 3m 2s main
Merge pull request #174 from Muennighoff/patch-1
alpaca_eval unit tests #265: Commit cd18dd4 pushed by rtaori
November 26, 2023 00:23 3m 28s main
Fix mssg check
alpaca_eval unit tests #264: Pull request #174 opened by Muennighoff
November 25, 2023 23:39 3m 21s Muennighoff:patch-1
Fix the results of MiniChat-3B
alpaca_eval unit tests #263: Pull request #173 opened by GeneZC
November 25, 2023 16:16 3m 37s GeneZC:minichat-3b-fix
[BUG] non chat openai models
alpaca_eval unit tests #262: Commit dd5ac0b pushed by YannDubs
November 24, 2023 22:49 3m 13s main
Add Tulu 2 models to AlpacaEval (#171)
alpaca_eval unit tests #261: Commit 90506bf pushed by YannDubs
November 18, 2023 23:31 2m 53s main
Add Tulu 2 models to AlpacaEval
alpaca_eval unit tests #260: Pull request #171 opened by hamishivi
November 18, 2023 23:24 2m 55s hamishivi:add-tulu-2
update cohere as evaluator
alpaca_eval unit tests #259: Commit 3388c5a pushed by YannDubs
November 18, 2023 20:34 3m 22s main
feat: verify all the cohere models (#170)
alpaca_eval unit tests #258: Commit af92219 pushed by YannDubs
November 18, 2023 20:26 3m 12s main