Actions: TJ-Solergibert/nanotron

Showing runs from all workflows
60 workflow runs

Llama3.2 conversion updated
Secret Leaks #57: Commit 1e31cb9 pushed by TJ-Solergibert
November 28, 2024 22:53 14s llama3.2_conversion
ready
Secret Leaks #56: Commit bd81b67 pushed by TJ-Solergibert
November 26, 2024 11:46 16s fix_resume_pp
Merge branch 'main' into fix_resume_pp
Secret Leaks #55: Commit d5f656a pushed by TJ-Solergibert
November 26, 2024 11:43 21s fix_resume_pp
not
Secret Leaks #54: Commit a7ca23b pushed by TJ-Solergibert
November 26, 2024 11:35 13s fix_resume_pp
try
Secret Leaks #53: Commit 27abd7c pushed by TJ-Solergibert
November 26, 2024 11:25 17s fix_resume_pp
not
Secret Leaks #52: Commit a7ca23b pushed by TJ-Solergibert
November 26, 2024 11:19 17s fix_resume_pp
Load properly
Secret Leaks #51: Commit 1981af2 pushed by TJ-Solergibert
November 26, 2024 11:14 22s fix_resume_pp
LR Schedule same name as optimizer
Secret Leaks #50: Commit 51bd072 pushed by TJ-Solergibert
November 26, 2024 11:11 19s fix_resume_pp
Bringing liger kernels back
Secret Leaks #49: Commit df3ef9d pushed by TJ-Solergibert
September 26, 2024 07:28 18s document_xattention
No more NaN losses
Secret Leaks #48: Commit 3969aa2 pushed by TJ-Solergibert
September 26, 2024 07:18 17s document_xattention
read datasets locally
Secret Leaks #46: Commit f3bf21d pushed by TJ-Solergibert
September 17, 2024 08:36 18s document_xattention
Fixed metadata issue
Secret Leaks #45: Commit 81fdb3a pushed by TJ-Solergibert
September 16, 2024 15:47 22s document_xattention
Only load model parameters on SFT
Secret Leaks #44: Commit cd81111 pushed by TJ-Solergibert
September 16, 2024 15:30 17s document_xattention
Merge branch 'main' into document_xattention
Secret Leaks #43: Commit ed51183 pushed by TJ-Solergibert
September 16, 2024 15:15 20s document_xattention
September 16, 2024 12:38 21s
Compatibility with llama.py checkpoints
Secret Leaks #41: Commit a7051d1 pushed by TJ-Solergibert
September 16, 2024 09:22 20s document_xattention
Added EP==0
Secret Leaks #40: Commit ef835e8 pushed by TJ-Solergibert
September 6, 2024 12:06 18s fix_resume_pp
Fix pp naming
Secret Leaks #39: Commit 4d61489 pushed by TJ-Solergibert
September 6, 2024 10:27 20s fix_resume_pp
Fix eval check
Secret Leaks #38: Commit 1969526 pushed by TJ-Solergibert
September 4, 2024 16:56 24s validation
Optional validation
Secret Leaks #37: Commit 8e6f8ab pushed by TJ-Solergibert
August 27, 2024 17:33 17s validation
Adding liger kernels and modifyng conversion scripts
Secret Leaks #36: Commit a185c50 pushed by TJ-Solergibert
August 26, 2024 14:57 23s document_xattention
Lets move to todi
Secret Leaks #35: Commit 5157392 pushed by TJ-Solergibert
August 26, 2024 07:58 16s document_xattention
Little hack to fix first length
Secret Leaks #34: Commit efd168f pushed by TJ-Solergibert
August 22, 2024 20:02 17s document_xattention
first commit
Secret Leaks #33: Commit 71122d3 pushed by TJ-Solergibert
August 22, 2024 16:56 22s document_xattention