Google’s new Nano Banana Pro is very good for image generation, I gave it a prompt that I figured was quite complicated and might not work and it got almost everything right.
Prompt:
[picture of me] This is me, can you draw a five-panel comic of me in a science fantasy setting. I should have a band of hovering multicolored gems h
overing around my wrists (nothing physically connecting them, they’re hovering in air) as well as two futuristic drones floating around my head. One of them, Whisper, is specialized for reconnaissance and the other, Thunder, for combat. The shade of my clothes is similar to the picture and I’m wearing a cloak.
Panel 1: I’m standing on a mountain cliff, looking at a village below. I say “Ah, finally a place to rest. Whisper, go check out the locals.” Whisper says “acknowledged” and is seen flying toward the village.
Panel 2: The village as seen through Whisper’s cameras. We can see that there is something wrong with the villagers; they have electronic collars around their necks and have distressed expressions. Red text points at the collar and reads “class-3 body control device”.
Panel 3: I am seen sitting on the cliff, looking at the drone’s camera data on my tablet. I say “Entropy take me! Those collars override any signals sent from the brain to the body! Whisper, trace the source of the control signal; Thunder, assault the source!” Thunder is seen flying toward the village as well, saying “initiating attack sequence”.
Panel 4: Whisper is shown following the control signal to a transmitter in the middle of the village. A caption reads “Whisper rapidly located the source of the signal...”
Panel 5: Thunder is shown blowing up the transmitter. The caption reads ”...which Thunder then eliminated. But who had enslaved the villagers in the first place?”
Result:
I do have some points of improvement but these are minor:
The drones were said to be “futuristic”, but Whisper in particular looks just like a small version of a fighter jet, nothing particularly futuristic.
Also Whisper looks to have a cockpit which doesn’t make much sense. I also don’t see its cameras anywhere on its hull—though maybe you could say that the cameras are housed inside the thing that looks like the cockpit...
In the third panel, Thunder looks to be flying away from the village rather than toward it, but maybe it made a loop back in the air.
First points would have been easy to fix by also giving it a reference image for the drones.
In the first panel, you and the drones are turned right, as if this were the direction where the village is, but it’s actually deeper/further in the scene. Same with the third panel, but less so.
Also, the village looks very different in the first and the third panel.
True! In fairness, the first point is reasonably common for human-drawn scenes like this as well. If you want to show both the village and the main character’s face, you need to have both of them facing the “camera”, and then it ends up looking like this.
Google’s new Nano Banana Pro is very good for image generation, I gave it a prompt that I figured was quite complicated and might not work and it got almost everything right.
Prompt:
Result:
I do have some points of improvement but these are minor:
The drones were said to be “futuristic”, but Whisper in particular looks just like a small version of a fighter jet, nothing particularly futuristic.
Also Whisper looks to have a cockpit which doesn’t make much sense. I also don’t see its cameras anywhere on its hull—though maybe you could say that the cameras are housed inside the thing that looks like the cockpit...
In the third panel, Thunder looks to be flying away from the village rather than toward it, but maybe it made a loop back in the air.
First points would have been easy to fix by also giving it a reference image for the drones.
In the first panel, you and the drones are turned right, as if this were the direction where the village is, but it’s actually deeper/further in the scene. Same with the third panel, but less so.
Also, the village looks very different in the first and the third panel.
True! In fairness, the first point is reasonably common for human-drawn scenes like this as well. If you want to show both the village and the main character’s face, you need to have both of them facing the “camera”, and then it ends up looking like this.