Support soft_prompt or inputs_embeds? #267

Open
@jessiewiswjc

Description

Does triton-inference-server support multimodal models such as BLIP-2, as in the TensorRT-LLM example at https://github.com/NVIDIA/TensorRT-LLM/tree/main/examples/blip2?

Labels

question, triaged
