Fabre

#1095237 ITP: vllm -- A high-throughput and memory-efficient inference and serving engine for LLMs #1095237

Package:: wnpp

Source:: wnpp

Submitter:: M. Zhou

Date:: 2025-11-29 16:56:44 UTC

Severity:: normal

#1095237#5

Date:: 2025-02-05 18:08:01 UTC

From:

To:

* Package name    : vllm
  Version         : 0.7.1
  Upstream Contact:
* URL             :
* License         : Apache-2.0
  Programming Lang: Python
  Description     : A high-throughput and memory-efficient inference and serving engine for LLMs

I think this is one of the most important applications in the reverse
dependency tree of pytorch package. vllm has a very large tree of dependencies
and many of them are missing. I'm just setting vllm as a long term goal.

Alternatives are ollama and llama.cpp.

Everything, including vllm's necessary dependencies will be maintained by
debian deep learning team.

Thank you for using reportbug

#1095237 ITP: vllm -- A high-throughput and memory-efficient inference and serving engine for LLMs #1095237

Just Reply to ...

Reply to submitter ...

Send control command (Silently)

Set Architecture Tags (Silently)