Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Don't do a loadgen release from dev branch, add python3.12,13 to loadgen test, exclude power-checker from auto format #1994

Merged
merged 134 commits into from
Jan 7, 2025
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
134 commits
Select commit Hold shift + click to select a range
d8f0d7e
Update generate_final_report.py
arjunsuresh Oct 31, 2024
390a8cb
Merge branch 'master' into dev
arjunsuresh Nov 7, 2024
6b1a0f8
Fix sdxl (#1911)
arjunsuresh Nov 7, 2024
a4ba51f
Fixes for filtering invalid results
arjunsuresh Nov 7, 2024
7097ef5
Merge branch 'master' into dev
arjunsuresh Nov 7, 2024
190ee41
Merge 7097ef540bfa0286c65c81fbfdcb300e6d54f770 into d3c01ed3de6618a8d…
arjunsuresh Nov 7, 2024
451b310
[Automated Commit] Format Codebase
arjunsuresh Nov 7, 2024
4c109ea
Update preprocess_submission.py
arjunsuresh Nov 7, 2024
40c1fe0
Added an option to pass in sample_ids.txt for SDXL accuracy check
arjunsuresh Nov 7, 2024
2a61df9
Merge 40c1fe0c28364b243b5944b3569000611ddf2b7d into d3c01ed3de6618a8d…
arjunsuresh Nov 7, 2024
89a2ffe
[Automated Commit] Format Codebase
arjunsuresh Nov 7, 2024
69ffdc0
Update accuracy_coco.py
arjunsuresh Nov 7, 2024
76b703b
Merge 69ffdc0aa783f9127af612a7de57c6329703c1dc into d3c01ed3de6618a8d…
arjunsuresh Nov 7, 2024
d1d642e
[Automated Commit] Format Codebase
arjunsuresh Nov 7, 2024
8d3b8ab
Fix typo
arjunsuresh Nov 7, 2024
b09b1ef
Not use default for sample_ids.txt
arjunsuresh Nov 8, 2024
857494f
Merge branch 'master' into dev
arjunsuresh Nov 12, 2024
df5049d
Update requirements.txt (#1907)
arjunsuresh Nov 14, 2024
a7e8c8a
Fix preprocess_sudbmission for a bug
arjunsuresh Nov 15, 2024
213c239
Fix conflict
arjunsuresh Nov 15, 2024
8915a90
Update submission_checker.py | Removed TEST05
arjunsuresh Nov 16, 2024
36d5b74
Merge branch 'master' into dev
arjunsuresh Nov 16, 2024
da9e6bb
Fix to SDXL accuracy output
arjunsuresh Nov 21, 2024
db7fb38
Merge branch 'master' into dev
arjunsuresh Nov 21, 2024
13cd2e0
Added exists checks for rmtree in preprocess_submission script
arjunsuresh Nov 21, 2024
0abd369
Merge 13cd2e0abc98a36601e125f60dc96bc13738dc62 into 35c7e1f0dbbd1a63b…
arjunsuresh Nov 21, 2024
db5e737
[Automated Commit] Format Codebase
arjunsuresh Nov 21, 2024
dbecd2b
Delete .github/workflows/format.yml
arjunsuresh Nov 21, 2024
011a0d1
Delete .github/scripts directory
arjunsuresh Nov 21, 2024
d04ae63
Update build_wheels.yml | Added src distribution
arjunsuresh Nov 22, 2024
1923b01
Update VERSION.txt
arjunsuresh Nov 22, 2024
dc1acd3
Update build_wheels.yml
arjunsuresh Nov 22, 2024
dac78f5
Update VERSION.txt
arjunsuresh Nov 22, 2024
5443e07
Update pyproject.toml
arjunsuresh Nov 22, 2024
cae5eb2
Increment version to 4.1.26
arjunsuresh Nov 22, 2024
3b1031f
Update MANIFEST.in
arjunsuresh Nov 22, 2024
15581e2
Increment version to 4.1.27
arjunsuresh Nov 22, 2024
0511f95
Update pyproject.toml
arjunsuresh Nov 22, 2024
03911ac
Increment version to 4.1.28
arjunsuresh Nov 22, 2024
2418e81
Update build_wheels.yml
arjunsuresh Nov 22, 2024
6e1bcfd
Update VERSION.txt
arjunsuresh Nov 22, 2024
62949d0
Update accuracy_coco.py
arjunsuresh Nov 24, 2024
5314c5c
Making sdxl run thread safe
arjunsuresh Nov 30, 2024
130413b
Merge branch 'master' into dev
arjunsuresh Dec 3, 2024
3b90f6f
Merge branch 'master' into dev
arjunsuresh Dec 3, 2024
509f555
Create format.yml | Run format on push instead of PR
arjunsuresh Dec 4, 2024
6137dbf
Merge master
arjunsuresh Dec 8, 2024
17dc9aa
Update backend_pytorch.py | Fix lock usage
arjunsuresh Dec 8, 2024
dc7be47
Upgrade loadgen version to 5.0 (#1962)
nvzhihanj Dec 9, 2024
d64509d
Fix loadgen build for version numbers having "0" (#1967)
arjunsuresh Dec 9, 2024
b149bec
Increment version to 5.0.1
pgmpablo157321 Dec 9, 2024
0a9b2e4
Fix Dockerfile for 405B (#1960)
nvzhihanj Dec 10, 2024
47422d1
Add llama3 metrics + remove llama3-99.9 (#1973)
pgmpablo157321 Dec 11, 2024
070ef4d
Fix submission checker for v5.0 rgat (#1974)
arjunsuresh Dec 12, 2024
2da8aee
Fix test05 seeds missing error for v5.0 submission checker (#1976)
arjunsuresh Dec 12, 2024
b4d72fb
Fix llama3-405B docker workflow and performance sample count (#1978)
nvzhihanj Dec 13, 2024
a2c8a36
Increment version to 5.0.2
mrmhodak Dec 13, 2024
a988a32
Fix submission generation for v5.0 (#1981)
arjunsuresh Dec 13, 2024
e24ba21
More minor fixes for llama3.1-405b (#1983)
nvzhihanj Dec 17, 2024
659e563
Remove unused rgat files (#1961)
pgmpablo157321 Dec 18, 2024
f4c1f9a
Update docker GPU, avoid long build time (#1966)
pgmpablo157321 Dec 18, 2024
ea5153f
Require equal issue mode for R-GAT (#1968)
pgmpablo157321 Dec 18, 2024
281189a
Increment version to 5.0.3
mrmhodak Dec 18, 2024
ab05233
Docs update for r-gat (#1969)
arjunsuresh Dec 18, 2024
a65114b
[Automated Commit] Format Codebase
mlcommons-bot Dec 18, 2024
99a2015
Update automated run command section - R-GAT (#1970)
anandhu-eng Dec 18, 2024
aeb415e
Unify llama3 names to llama3.1-405b (#1982)
pgmpablo157321 Dec 19, 2024
8a319f0
Increment version to 5.0.4
mrmhodak Dec 19, 2024
2d4360f
Create test-rgat.yml (#1984)
arjunsuresh Dec 19, 2024
1a51a95
Update compliance test table (#1987)
pgmpablo157321 Dec 19, 2024
9309ef7
Create benchmark-checklist.md for r-gat (#1985)
arjunsuresh Dec 19, 2024
d9f1b6f
Increment version to 5.0.5
arjunsuresh Dec 20, 2024
3b5497c
Don't do a loadgen release from dev branch
arjunsuresh Dec 20, 2024
81983e5
Added python3.12, 3.13 to loadgen test
arjunsuresh Dec 20, 2024
ff2e54b
Update format.yml | Don't format power_checker being synced from powe…
arjunsuresh Dec 20, 2024
5f7c4ba
Merge branch 'master' into dev
arjunsuresh Dec 20, 2024
e9354d8
Update index.md | Update accuracy for r-gat
arjunsuresh Dec 20, 2024
76047ff
Update benchmark-checklist.md for r-gat
arjunsuresh Dec 20, 2024
45bb01b
Update CM commands in R-GAT README.md
arjunsuresh Dec 20, 2024
293d4ea
Update README.md
arjunsuresh Dec 20, 2024
c3408dd
Merge branch 'master' into dev
arjunsuresh Dec 21, 2024
1416bbd
Create reset-branch.yml
arjunsuresh Dec 21, 2024
b2ce7c8
Create auto-update-dev.yml
arjunsuresh Dec 22, 2024
1162c25
Tested and fixed SDXL README (#1997)
sahilavaran Dec 23, 2024
b91d6f2
Update preprocess_submission.py
arjunsuresh Dec 23, 2024
d6faf23
Update README.md
sahilavaran Dec 25, 2024
5c44891
Update README.md | added the outdirname in the CM command
sahilavaran Dec 25, 2024
d679f14
Update README.md | added the outdirname in the CM Command
sahilavaran Dec 25, 2024
c6e149d
Merge branch 'mlcommons:dev' into dev
sahilavaran Dec 25, 2024
390d934
include cm commands - accuracy and calibration
anandhu-eng Dec 23, 2024
9340b79
Update README.md
anandhu-eng Dec 23, 2024
5da0abe
Update README.md | added the outdirname in the CM command
sahilavaran Dec 25, 2024
9a32069
Merge branch 'mlcommons:dev' into dev
sahilavaran Dec 26, 2024
ff6d245
Update README.md| added outdirname in the CM command
sahilavaran Dec 30, 2024
69ff400
Support audit.conf with static mlperf.conf
arjunsuresh Dec 30, 2024
e11f6cd
Support audit.conf with static mlperf.conf
arjunsuresh Dec 30, 2024
cb18921
[Automated Commit] Format Codebase
mlcommons-bot Dec 30, 2024
a5c1552
Update test_settings_internal.cc | Fix conf_type usage
arjunsuresh Dec 30, 2024
d4f3f2d
Update test_settings_internal.cc
arjunsuresh Dec 30, 2024
ea2ba9c
Fixes to submission checker
arjunsuresh Dec 30, 2024
3d5688a
[Automated Commit] Format Codebase
mlcommons-bot Dec 30, 2024
3f28e99
Merge branch 'dev' into fix_compliance_runs
arjunsuresh Dec 30, 2024
16019e6
[Automated Commit] Format Codebase
mlcommons-bot Dec 30, 2024
58b36e5
Merge pull request #2004 from mlcommons/fix_compliance_runs
arjunsuresh Dec 30, 2024
b7b6790
Merge pull request #2001 from sahilavaran/dev
arjunsuresh Dec 31, 2024
308147a
Update submission_checker.py | Fix rgat performance_sample_count
arjunsuresh Dec 31, 2024
5d5da3c
Update evaluate-accuracy.py | Fixes #2008
arjunsuresh Jan 1, 2025
1146e20
Merge branch 'master' into dev
arjunsuresh Jan 2, 2025
4ba40e2
Update index.md
arjunsuresh Jan 6, 2025
a893166
Update index.md
arjunsuresh Jan 6, 2025
194aeda
Update index.md
arjunsuresh Jan 6, 2025
bab97ff
Update submission generation steps (WIP)
arjunsuresh Jan 6, 2025
318110c
add submission generation graphs for local sync and through github re…
anandhu-eng Jan 6, 2025
41721df
Update index.md
arjunsuresh Jan 6, 2025
a8cdcaf
Update index.md
arjunsuresh Jan 6, 2025
6eda9f1
Update index.md
arjunsuresh Jan 6, 2025
36db7cc
Update index.md
arjunsuresh Jan 6, 2025
6430b79
Update index.md
arjunsuresh Jan 6, 2025
8425da6
Update index.md
arjunsuresh Jan 6, 2025
6b24ac0
Fixes to submission generation docs
arjunsuresh Jan 6, 2025
1120df8
Fixes to submission generation docs
arjunsuresh Jan 6, 2025
e3ea5ef
Added link to the expected results folder structure
arjunsuresh Jan 7, 2025
2c67b24
add docs for llama3 + inference version upgrade (#2020)
anandhu-eng Jan 7, 2025
c0ed0a8
Update CM run commands for llama3_1-405b (#2019)
anandhu-eng Jan 7, 2025
17960a5
Fixes the submission README
arjunsuresh Jan 7, 2025
51af492
Update README.md
arjunsuresh Jan 7, 2025
cd25f8e
Create test-submission-generation.yml
arjunsuresh Jan 7, 2025
9bcdbee
Update test-submission-generation.yml
arjunsuresh Jan 7, 2025
961efb7
Clean invalid model results in preprocess_submission script
arjunsuresh Jan 7, 2025
ab2fc44
[Automated Commit] Format Codebase
mlcommons-bot Jan 7, 2025
d98eb37
Fixes the submission README
arjunsuresh Jan 7, 2025
37e94c6
Update README.md
arjunsuresh Jan 7, 2025
27827dc
Update README.md
arjunsuresh Jan 7, 2025
a56a251
Update test-submission-generation.yml
arjunsuresh Jan 7, 2025
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
34 changes: 34 additions & 0 deletions .github/workflows/auto-update-dev.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,34 @@
name: Auto-Update Dev Branch from Master

on:
push:
branches:
- master # Trigger workflow on commits to 'dev' branch

jobs:
update-main:
runs-on: ubuntu-latest
permissions:
contents: write # Required to push to protected branches

steps:
- name: Checkout Main Branch
uses: actions/checkout@v4
with:
ref: dev
fetch-depth: 0
ssh-key: ${{ secrets.DEPLOY_KEY }}

- name: Configure Git User
run: |
git config user.name "github-actions"
git config user.email "github-actions@github.com"

- name: Merge auto-update into dev
run: |
git fetch origin master:master
git merge --no-ff master -m "Auto-merge updates from master branch"

- name: Push Changes to Main
run: |
git push origin dev
1 change: 0 additions & 1 deletion .github/workflows/build_wheels.yml
Original file line number Diff line number Diff line change
Expand Up @@ -7,7 +7,6 @@ on:
branches:
- master
- loadgen-release
- dev
paths:
- loadgen/**

Expand Down
2 changes: 1 addition & 1 deletion .github/workflows/format.yml
Original file line number Diff line number Diff line change
Expand Up @@ -38,7 +38,7 @@ jobs:
for FILE in $(git diff --name-only $filter | grep -E '.*\.py$')
do
# Check if the file still exists in the working tree
if [ -f "$FILE" ]; then
if [ -f "$FILE" ] && [ "$FILE" != "tools/submission/power/power_checker.py" ]; then
autopep8 --in-place -a "$FILE"
git add "$FILE"
fi
Expand Down
42 changes: 42 additions & 0 deletions .github/workflows/reset-branch.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,42 @@
name: Reset Current Branch to Upstream After Squash Merge

on:
workflow_dispatch:
inputs:
branch:
description: 'Branch to reset (leave blank for current branch)'
required: false
default: 'dev'

jobs:
reset-branch:
runs-on: ubuntu-latest

steps:
- name: Checkout Repository
uses: actions/checkout@v4
with:
fetch-depth: 0

- name: Detect Current Branch
if: ${{ inputs.branch == '' }}
run: echo "branch=$(git rev-parse --abbrev-ref HEAD)" >> $GITHUB_ENV

- name: Use Input Branch
if: ${{ inputs.branch != '' }}
run: echo "branch=${{ inputs.branch }}" >> $GITHUB_ENV

- name: Add Upstream Remote
run: |
git remote add upstream https://github.com/mlcommons/inference.git
git fetch upstream
- name: Reset Branch to Upstream
run: |
git checkout ${{ env.branch }}
git reset --hard upstream/${{ env.branch }}
if: success()

- name: Force Push to Origin
run: |
git push origin ${{ env.branch }} --force-with-lease
if: success()
2 changes: 1 addition & 1 deletion .github/workflows/test-loadgen.yml
Original file line number Diff line number Diff line change
Expand Up @@ -21,7 +21,7 @@ jobs:
strategy:
fail-fast: false
matrix:
python-version: ["3.7", "3.8", "3.9", "3.10", "3.11"]
python-version: ["3.8", "3.9", "3.10", "3.11", "3.12", "3.13"]

steps:
- uses: actions/checkout@v3
Expand Down
52 changes: 52 additions & 0 deletions .github/workflows/test-submission-generation.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,52 @@
# This workflow will test the submission generation using MLPerf Automation

name: CM based Submission Generation

on:
pull_request:
branches: [ "master", "dev" ]
paths:
- '.github/workflows/test-submission-generation.yml'
- '**'
- '!**.md'
jobs:
submission_generation:
runs-on: ${{ matrix.os }}
strategy:
fail-fast: false
matrix:
os: [ubuntu-latest, windows-latest, macos-latest]
python-version: [ "3.12" ]
division: ["closed", "open", "closed-open"]
category: ["datacenter", "edge"]
case: ["closed"]
action: ["run", "docker"]
exclude:
- os: macos-latest
- os: windows-latest
- category: "edge"

steps:
- uses: actions/checkout@v4
- name: Set up Python ${{ matrix.python-version }}
uses: actions/setup-python@v3
with:
python-version: ${{ matrix.python-version }}
- name: Install dependencies
run: |
pip install cm4mlops
- name: Pull repo where test cases are uploaded
run: |
git clone -b submission-generation-examples https://github.com/mlcommons/inference.git submission_generation_examples
- name: Run Submission Generation - ${{ matrix.case }} ${{ matrix.action }} ${{ matrix.category }} ${{ matrix.division }}
continue-on-error: true
run: |
if [ "${{ matrix.case }}" == "closed" ]; then
description="Test submission - contains closed edge and datacenter"
elif [ "${{ matrix.case }}" == "closed-power" ]; then
description="Test submission - contains closed-power edge and datacenter results"
fi
# Dynamically set the log group to simulate a dynamic step name
echo "::group::$description"
cm ${{ matrix.action }} script --tags=generate,inference,submission --adr.compiler.tags=gcc --version=v5.0 --clean --preprocess_submission=yes --submission_base_dir=mysubmissions --results_dir=$PWD/submission_generation_tests/${{ matrix.case }}/ --run-checker --submitter=MLCommons --tar=yes --division=${{ matrix.division }} --env.CM_DETERMINE_MEMORY_CONFIGURATION=yes --quiet
cm ${{ matrix.action }} script --tags=run,submission,checker --submitter_id_off=mysubmitter_id --tar=yes --submission_dir=mysubmissions/submissions --submission_tar_file=mysubmission.tar.gz
41 changes: 41 additions & 0 deletions docs/benchmarks/language/get-llama3_1-405b-data.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,41 @@
---
hide:
- toc
---

# Text Summarization using LLAMA3.1-405b

## Dataset

The benchmark implementation run command will automatically download the validation and calibration datasets and do the necessary preprocessing. In case you want to download only the datasets, you can use the below commands.

=== "Validation"

### Get Validation Dataset
```
cm run script --tags=get,dataset,mlperf,inference,llama3,_validation --outdirname=<path to download> -j
```

=== "Calibration"

### Get Calibration Dataset
```
cm run script --tags=get,dataset,mlperf,inference,llama3,_calibration --outdirname=<path to download> -j
```

## Model
The benchmark implementation run command will automatically download the required model and do the necessary conversions. In case you want to only download the official model, you can use the below commands.

Get the Official MLPerf LLAMA3.1-405b Model

=== "Pytorch"

### Pytorch
```
cm run script --tags=get,ml-model,llama3 --outdirname=<path to download> --hf_token=<huggingface access token> -j
```

!!! tip

Downloading llama3.1-405B model from Hugging Face will require an [**access token**](https://huggingface.co/settings/tokens) which could be generated for your account. Additionally, ensure that your account has access to the [llama3.1-405B](https://huggingface.co/meta-llama/Llama-3.1-405B-Instruct) model.

13 changes: 13 additions & 0 deletions docs/benchmarks/language/llama3_1-405b.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,13 @@
---
hide:
- toc
---

# Text Summarization using LLAMA3_1-405b

=== "MLCommons-Python"
## MLPerf Reference Implementation in Python

{{ mlperf_inference_implementation_readme (4, "llama3_1-405b-99", "reference", devices=["CPU","CUDA"]) }}

{{ mlperf_inference_implementation_readme (4, "llama3_1-405b-99.9", "reference", devices=["CPU","CUDA"]) }}
2 changes: 1 addition & 1 deletion docs/index.md
Original file line number Diff line number Diff line change
Expand Up @@ -163,7 +163,7 @@ The currently valid [MLPerf Inference Benchmarks](index_gh.md) as of MLPerf infe
- **Dataset Size**: 788,379
- **QSL Size**: 788,379
- **Number of Parameters**:
- **Reference Model Accuracy**: ACC = ?
- **Reference Model Accuracy**: ACC = 72.86%
- **Server Scenario Latency Constraint**: N/A
- **Equal Issue mode**: True
- **High accuracy variant**: No
Expand Down
Loading
Loading