0.0.18: llava-v1.5 support? #446

Answered by turboderp
Katehuuh asked this question in Q&A

Yes, in a limited sense. You still need the vision tower from Transformers to produce the image feature embeddings, but now there's an interface for mixing the embeddings into the prompt. See #399 for more.

It will hopefully become a full set of features at some point, maybe even dropping the Transformers dependency. Just different priorities right now.
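
For readers unfamiliar with the idea, here is a rough, library-agnostic sketch of what "mixing the embeddings into the prompt" means in general. It does not use ExLlamaV2's or Transformers' actual interfaces. Every name in it (`IMAGE_PLACEHOLDER_ID`, `splice_image_embeddings`, the toy embedding table) is hypothetical, and the image features are assumed to have already been produced by a vision tower and projected to the language model's hidden size.

```python
from typing import List, Sequence

Vector = List[float]

# Hypothetical marker id standing in for "the image goes here".
# This is an assumption for illustration, not a real tokenizer value.
IMAGE_PLACEHOLDER_ID = -1


def splice_image_embeddings(
    token_ids: Sequence[int],
    embedding_table: Sequence[Vector],
    image_features: Sequence[Vector],
) -> List[Vector]:
    """Build the input embedding sequence for a prompt that contains one image.

    Text tokens are looked up in the embedding table; the single placeholder
    position is replaced by the rows of `image_features`.
    """
    hidden_size = len(embedding_table[0])
    for row in image_features:
        if len(row) != hidden_size:
            raise ValueError("image features must match the model hidden size")

    mixed: List[Vector] = []
    placeholder_seen = False
    for token_id in token_ids:
        if token_id == IMAGE_PLACEHOLDER_ID:
            if placeholder_seen:
                raise ValueError("this sketch supports only one image placeholder")
            mixed.extend(list(row) for row in image_features)
            placeholder_seen = True
        else:
            mixed.append(list(embedding_table[token_id]))

    if not placeholder_seen:
        raise ValueError("prompt contains no image placeholder")
    return mixed


# Toy data: a 3-token vocabulary with hidden size 2, and 2 image feature rows.
toy_table = [[0.0, 0.0], [1.0, 1.0], [2.0, 2.0]]
toy_image = [[9.0, 9.0], [8.0, 8.0]]
mixed = splice_image_embeddings([1, IMAGE_PLACEHOLDER_ID, 2], toy_table, toy_image)
```

The resulting sequence is longer than the token list, because one placeholder expands into as many positions as there are image feature rows; the model then runs on these embeddings instead of on token ids.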

Answer selected by Katehuuh