How to get the image a man rides a horse? #2

hujunchao · 2023-06-19T13:26:59Z

I try this project. It's amzing and interesting.
But now, I meet a question. It's hard for me to get a good image by the text "a man rides a horse".
Can you give me some advice?
Thank you!

TonyLianLong · 2023-06-19T19:11:28Z

Some initial attempts (you can improve by trying more options and seeds)

You may wonder why the man's face is weird. This is a known artifact of stable diffusion on small objects that is out of our scope to fix. Generating a man with a larger proportion of face to image may help.

hujunchao · 2023-06-20T13:20:19Z

Thank you for your reply!

hujunchao · 2023-06-20T13:27:53Z

When two objects do not interact, it is easy to use layout to get perfect image. But when two objects interact, it may be hard to use layout to get good image. How to show the action between objects? For example, a man and a horse may be easy. A man rides a horse may be difficult. A man is chasing a horse may be more difficult.

TonyLianLong · 2023-06-20T14:26:13Z

Good question! This is why the space allows specifying a prompt for overall generation. Without it, you use a default prompt and don't get object interaction (SD will try to guess the object interaction, so it could also guess a man standing close to a horse on the specified location). With it, you get the object interaction (e.g., a man riding the horse, then SD knows the man is supposed to ride the horse, as shown in the generation above).

However, adding more fine-grained control to object interactions is a very useful future direction. This paper specifies the idea of "text->intermediate representation->image". You are encouraged to extend to more representations (e.g., scene graph or LLM-generated SVG that captures more information).

Examples:
Same config, overall prompt: A man standing nearby a horse (I didn't play around the hyperparam)

Same config, overall prompt: A man riding a horse

hujunchao changed the title ~~How to get the image a man ride a horse?~~ How to get the image a man rides a horse? Jun 19, 2023

TonyLianLong mentioned this issue Jun 28, 2024

Does it only generate bboxes that are not overlapped? #18

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to get the image a man rides a horse? #2

How to get the image a man rides a horse? #2

hujunchao commented Jun 19, 2023 •

edited

Loading

TonyLianLong commented Jun 19, 2023 •

edited

Loading

hujunchao commented Jun 20, 2023

hujunchao commented Jun 20, 2023

TonyLianLong commented Jun 20, 2023 •

edited

Loading

How to get the image a man rides a horse? #2

How to get the image a man rides a horse? #2

Comments

hujunchao commented Jun 19, 2023 • edited Loading

TonyLianLong commented Jun 19, 2023 • edited Loading

hujunchao commented Jun 20, 2023

hujunchao commented Jun 20, 2023

TonyLianLong commented Jun 20, 2023 • edited Loading

hujunchao commented Jun 19, 2023 •

edited

Loading

TonyLianLong commented Jun 19, 2023 •

edited

Loading

TonyLianLong commented Jun 20, 2023 •

edited

Loading