Support for Deepspeed-MII worker #2994
tungsontran started this conversation in General
Deepspeed-MII (https://github.com/microsoft/DeepSpeed/tree/master/blogs/deepspeed-fastgen) claims that its inference throughput is much higher than vLLM's. Will the Factory dev team consider adopting it soon?
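
For reference, the linked DeepSpeed-FastGen blog demonstrates a non-persistent pipeline API along the lines of the sketch below. The model name is only an example, and the exact call signatures may vary across MII releases:

```python
# Minimal sketch of the DeepSpeed-MII (FastGen) pipeline API, following
# the linked blog post. Model name and parameters are illustrative only.
import mii

# Load a HuggingFace model into an MII inference pipeline.
pipe = mii.pipeline("mistralai/Mistral-7B-v0.1")

# Generate completions for a batch of prompts.
response = pipe(["DeepSpeed is", "Seattle is"], max_new_tokens=64)
print(response)
```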