Skip to content

Commit

Permalink
Fixing rocm gptq by using triton code too (renamed cuda into triton). (
Browse files Browse the repository at this point in the history
  • Loading branch information
Narsil authored Oct 25, 2024
1 parent 43df056 commit cece863
Show file tree
Hide file tree
Showing 2 changed files with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions server/text_generation_server/layers/gptq/__init__.py
Original file line number Diff line number Diff line change
Expand Up @@ -10,8 +10,8 @@

if SYSTEM == "ipex":
from .ipex import QuantLinear
elif SYSTEM == "cuda":
from .cuda import QuantLinear
elif SYSTEM in {"cuda", "rocm"}:
from .triton import QuantLinear


@dataclass
Expand Down

0 comments on commit cece863

Please sign in to comment.