Enable uploading multiple images in demo.py #232

ifsheldon · 2023-05-17T12:11:31Z

I made a few changes to enable uploading multiple images. This should close #180.

The changes include:

- Changed the logic of the upload button, image window, and text input
- Added a new UI component, a gallery, to show uploaded images

One remaining problem is that multiple images cannot be uploaded all at once. I suspect the following line may cause issues.

MiniGPT-4/minigpt4/conversation/conversation.py

Line 133 in 22d8888

conv.messages[-1][1] = ' '.join([conv.messages[-1][1], text])

So for now, each uploaded image must be followed by a text input before another image can be uploaded.
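To make the upload-then-text flow concrete, here is a minimal sketch of how a text input could be merged into a preceding image turn, following the `' '.join(...)` line quoted above. The `upload_image` and `ask` helpers, the `"Human"` role name, and the plain-list message store are hypothetical simplifications for illustration, not the actual MiniGPT-4 classes; the condition guarding the merge is also an assumption.

```python
# Hypothetical, simplified stand-in for the conversation state.
# Each message is a [role, text] pair, as in the quoted line.
IMAGE_PLACEHOLDER = "<Img><ImageHere></Img>"


def upload_image(messages):
    """Append a user turn that contains only an image placeholder."""
    messages.append(["Human", IMAGE_PLACEHOLDER])


def ask(messages, text):
    """Attach text to the last turn if it is a bare image placeholder.

    Assumption: the merge only happens when the previous turn is a user
    turn ending in '</Img>'; otherwise the text becomes a new turn.
    """
    last_is_image = (
        bool(messages)
        and messages[-1][0] == "Human"
        and messages[-1][1].endswith("</Img>")
    )
    if last_is_image:
        # Same operation as the quoted line in conversation.py
        messages[-1][1] = " ".join([messages[-1][1], text])
    else:
        messages.append(["Human", text])


messages = []
upload_image(messages)
ask(messages, "Describe this image.")
upload_image(messages)
ask(messages, "Now compare it with the first one.")

for role, text in messages:
    print(f"{role}: {text}")
```

In this sketch each image placeholder ends up paired with the text that was typed right after it, which is the one-image-then-one-text pattern described above.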

LFavano · 2023-05-22T10:42:05Z

I previously tried to feed multiple images manually using chat.upload_img(), but when asking the model to describe the different pictures uploaded it would still only consider one. In case you already tested this, is this also the case with your code?

ifsheldon · 2023-05-22T15:44:57Z

> I previously tried to feed multiple images manually using chat.upload_img(), but when asking the model to describe the different pictures uploaded it would still only consider one. In case you already tested this, is this also the case with your code?

I don't have this issue. Everything works fine. The quoted code is exactly the source of your issue. You can try my branch.

LFavano · 2023-05-23T11:24:40Z

> > I previously tried to feed multiple images manually using chat.upload_img(), but when asking the model to describe the different pictures uploaded it would still only consider one. In case you already tested this, is this also the case with your code?
>
> I don't have this issue. Everything works fine. The quoted code is exactly the source of your issue. You can try my branch.

I tried your branch but am still having the issue. Here are my steps:

1. Upload the first image.
2. Use the prompt "Please describe the first image provided, a second image is coming after" and a couple of other variations. The description provided here is good.
3. Upload a second image.
4. Use the prompt "Please describe both the images provided". The description provided is accurate but is only about the second image and ignores the first one.

From this point on, asking "Please describe the [first/second] image" only gets me really weird descriptions that seem to mix up the two images.

If you can get the model to describe both images at the same time (or do any reasoning on multiple images at once) maybe you can share the prompt you used.

EDIT: I realized that the outcome is a bit random. Sometimes the descriptions get mixed up and other times they don't, but I can't manage to reliably do accurate reasoning on multiple images.

ifsheldon · 2023-05-23T15:34:11Z

@LFavano Yeah, the IQ of MiniGPT-4 can fluctuate, especially for small models. I guess your prompt is also a bit misleading. The image embeddings are actually appended to the prompt, so the total embedding sequence the model sees is <embedding of "Please describe the first image provided, a second image is coming after"> + <embedding of the first image>. That is why MiniGPT-4 sometimes gives confusing output: it gets confused as well.
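As a rough illustration of the ordering described here, the sketch below splices image embeddings into a text sequence wherever an `<ImageHere>` placeholder appears. It assumes a placeholder-based scheme; `build_context`, `toy_embed_text`, and the string "embeddings" are hypothetical stand-ins, and the real model works on tensors rather than lists of strings.

```python
def build_context(prompt, image_embeddings, embed_text):
    """Interleave text-segment embeddings with image embeddings.

    Assumption: the prompt holds one '<ImageHere>' placeholder per image,
    and each image is spliced in at its placeholder position.
    """
    segments = prompt.split("<ImageHere>")
    if len(segments) != len(image_embeddings) + 1:
        raise ValueError("number of placeholders must match number of images")

    context = []
    for segment, image in zip(segments[:-1], image_embeddings):
        context.extend(embed_text(segment))  # text before the placeholder
        context.extend(image)                # the image's embedding vectors
    context.extend(embed_text(segments[-1]))  # trailing text, if any
    return context


def toy_embed_text(text):
    """Toy 'embedding': one labelled token per whitespace-separated word."""
    return [f"txt:{word}" for word in text.split()]


prompt = (
    "Describe the first image, a second is coming <ImageHere> "
    "Describe both images <ImageHere>"
)
images = [["img1:a", "img1:b"], ["img2:a", "img2:b"]]
context = build_context(prompt, images, toy_embed_text)

for item in context:
    print(item)
```

Under these assumptions, the phrase "a second is coming" sits directly before the first image, and the second image only appears at the very end of the sequence, so the model sees a single flat sequence with nothing that marks which text refers to which image.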

arshadshk · 2023-11-11T07:30:03Z

When an image is uploaded, it gets converted to an embedding and then concatenated with the token embeddings. In the case of multiple images, is anyone here aware of a model that takes in multiple such image embeddings simultaneously and concatenates them in the same sequence, i.e., a kind of one-pass inference?

ifsheldon added 2 commits May 17, 2023 17:40

remove unnecessary input and output

7fc7710

modify the code so to enable demo to accept multiple image input

5b0a99e
