-
Notifications
You must be signed in to change notification settings - Fork 4.3k
Issues: hpcaitech/ColossalAI
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
[FEATURE]: Consider integrating the pretraining process of the llama3-405B model?
enhancement
New feature or request
#6170
opened Dec 25, 2024 by
JiadiLee
[BUG]: RuntimeError: mat1 and mat2 must have the same dtype, but got Float and BFloat16
bug
Something isn't working
#6169
opened Dec 25, 2024 by
balcklive
1 task done
[BUG]: Size Mismatch Issue When Loading Model Checkpoints Trained with Tensor Parallel if Something isn't working
vocab_size % tp_size != 0
bug
#6167
opened Dec 24, 2024 by
Lemon-412
1 task done
[BUG]: Gemini saved an additional portion of the weights while using tie_word_embeddings=True
bug
Something isn't working
#6160
opened Dec 13, 2024 by
ericxsun
1 task done
[FEATURE]: Lora/QLora in GeminiPlugin and TorchFSDP
enhancement
New feature or request
#6138
opened Nov 16, 2024 by
ericxsun
[FEATURE]: support google/gemma-2-2b for Tensor Parallelism
enhancement
New feature or request
#6120
opened Nov 9, 2024 by
jing-4369
2
[BUG]: ColossalAI Inference example response empty result without error
bug
Something isn't working
#6112
opened Nov 4, 2024 by
GuangyaoZhang
1 task done
[BUG]: why duplicate PID appears on rank 0
bug
Something isn't working
#6111
opened Nov 3, 2024 by
ericxsun
1 task done
[BUG]: Llama3.1 HybridParallelPlugin train failed when pp_size>1
bug
Something isn't working
#6110
opened Nov 2, 2024 by
cingtiye
1 task done
[PROPOSAL]: FP8 with block-wise amax
enhancement
New feature or request
#6105
opened Oct 28, 2024 by
Edenzzzz
1 task
[FEATURE]: Windows wheel needed
enhancement
New feature or request
#6103
opened Oct 27, 2024 by
nitinmukesh
[BUG]: weird stuck while training
bug
Something isn't working
#6095
opened Oct 19, 2024 by
ericxsun
1 task done
[BUG]: Got nan during backward with zero2
bug
Something isn't working
#6091
opened Oct 16, 2024 by
flymin
1 task done
[BUG]: Unable to train on H20 machine
bug
Something isn't working
#6079
opened Oct 6, 2024 by
kaixinbear
1 task done
[DOC]: 环境安装失败
documentation
Improvements or additions to documentation
#6066
opened Sep 21, 2024 by
eccct
[FEATURE]: Is it Possible to integrate Liger-Kernel?
enhancement
New feature or request
#6047
opened Sep 6, 2024 by
ericxsun
[BUG]: remove Something isn't working
.github/workflows/submodule.yml
bug
#6039
opened Aug 28, 2024 by
BoxiangW
1 task done
[FEATURE]: Support Zerobubble pipeline
enhancement
New feature or request
#6037
opened Aug 28, 2024 by
duanjunwen
[BUG]: errror Colossalai 0.4.0/0.4.2 /usr/bin/supervisord
bug
Something isn't working
#6032
opened Aug 23, 2024 by
Storm0921
1 task done
[BUG]: AttributeError: 'GeminiDDP' object has no attribute 'module'
bug
Something isn't working
#6021
opened Aug 20, 2024 by
dheerj188
1 task done
[BUG]: Torch compile causes multi-process to hang with python 3.9
bug
Something isn't working
#5987
opened Aug 10, 2024 by
Edenzzzz
1 task done
[FEATURE]: How to skip a custom node from generating strategies in colossal-auto?
enhancement
New feature or request
#5983
opened Aug 8, 2024 by
robotsp
Previous Next
ProTip!
Exclude everything labeled
bug
with -label:bug.