Issues: linkedin/Liger-Kernel
[RFC] Liger FlexChunkLoss: Alignment and Distillation loss
#371
opened Nov 8, 2024 by
shivam15s
Open
21
Issues list
Dtype Mismatch in torch.addmm within ops/fused_linear_cross_entropy.py in AMP training
#501
opened Dec 26, 2024 by
DandinPower
Extending Liger-Kernel Optimizations to Encoder Models Like BERT
#500
opened Dec 26, 2024 by
pengzhangzhi
error when run sh run_qwen.sh
good first issue
Good for newcomers
#487
opened Dec 18, 2024 by
CharlesJhonson
Potential Optimization for Preference Training with Prefix Sharing
#476
opened Dec 13, 2024 by
austin362667
ValueError: Pointer argument (at 0) cannot be accessed from Triton (cpu tensor?)
#401
opened Nov 20, 2024 by
shivam15s
[RFC] Liger FlexChunkLoss: Alignment and Distillation loss
#371
opened Nov 8, 2024 by
shivam15s
5 of 12 tasks
Possible support for weighted average loss calculation in FusedLinearCrossEntropy kernel
#338
opened Nov 1, 2024 by
ChenlongDeng
Training LLaVA with the Liger kernel results in degraded performance.
#319
opened Oct 22, 2024 by
y-rok
In-place operations in triton kernel might result in incorrect gradient calculations
bug
Something isn't working
#272
opened Sep 26, 2024 by
Tcc0403