Update nightly llama benchmarking tests #754

aviator19941 · 2025-01-04T11:13:57Z

Updates nightly llama benchmarking tests to benchmark input token lengths of 128 and 2048 for llama 8b, 70b, and 405b.
Switch IREE compile flag from --iree-hal-target-backends to --iree-hal-target-device

TODO: Add 405b decode benchmark calls to 405b fp16 tests when decode is fixed

Signed-off-by: aviator19941 <avinash.sharma@amd.com>

ScottTodd · 2025-01-06T23:37:24Z

.github/workflows/ci-llama-large-tests.yaml

@@ -7,6 +7,7 @@
 name: Llama Benchmarking Tests

 on:
+  pull_request:


This is still running after almost 4 hours: https://github.com/nod-ai/shark-ai/actions/runs/12639047256/job/35216569084 , resulting in a bit of a queue for the llama-mi300x-1 runner: https://github.com/nod-ai/shark-ai/actions?query=is%3Aqueued. Is that expected?

aviator19941 added 4 commits January 3, 2025 20:31

Fix 8b nightly tests

50eb2e6

Signed-off-by: aviator19941 <avinash.sharma@amd.com>

Update 70b test

a3a1a39

Signed-off-by: aviator19941 <avinash.sharma@amd.com>

Update 70b tests

db7d79e

Signed-off-by: aviator19941 <avinash.sharma@amd.com>

Update 405b 128 and 2048 tests

2abe6ad

Signed-off-by: aviator19941 <avinash.sharma@amd.com>

aviator19941 requested a review from archana-ramalingam January 4, 2025 11:13

archana-ramalingam and others added 7 commits January 6, 2025 09:47

Merge branch 'main' into fix_sharded_llama_tests

e68ee2f

Test benchmark nightly

afc8cff

Add missing iree_hal_target_device flags

69997e6

Use iree_hal_target_device flag to compile

4849c76

Add --iree-hal-target-device fixture

1a3c4cb

Use --iree-hal-target-device flag in perplexity tests

d747d89

Merge branch 'main' into fix_sharded_llama_tests

33815da

archana-ramalingam requested a review from IanNod January 6, 2025 20:04

ScottTodd reviewed Jan 6, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update nightly llama benchmarking tests #754

Update nightly llama benchmarking tests #754

aviator19941 commented Jan 4, 2025 •

edited by archana-ramalingam

Loading

ScottTodd Jan 6, 2025

Update nightly llama benchmarking tests #754

Are you sure you want to change the base?

Update nightly llama benchmarking tests #754

Conversation

aviator19941 commented Jan 4, 2025 • edited by archana-ramalingam Loading

ScottTodd Jan 6, 2025

Choose a reason for hiding this comment

aviator19941 commented Jan 4, 2025 •

edited by archana-ramalingam

Loading