AI primer: creating consistent characters

in Splinterlands2 days ago

image.png

TLDR for those who want to cut right to the chase:

  1. Use Nano Banana
  2. Use it through a different tool like Dzine or Higgsfield
  3. Do LOTS of generations (don't give up, it'll come out like you want in the end)
  4. Come up with a storyboard to decide all the frames / angles you need before getting started

Making consistent characters is the first step to any AI-generated video that requires any kind of characters.

For my latest series (aka "Meet the Praetorians" or MTP) I'm doing videos in the style of LADBible Stories or the Soft White Underbelly interviews, which feature a specific character who gets interviewed for the entire duration of the video. Since there are multiple different camera angles, it's important to create a consistent character for the narrative to stay believable throughout.

In this article I'll share how you can use Google's Nano Banana image generation model to create consistent characters. Nano Banana is available from within Google Gemini and the allowed number of image-generation credits vary based on your Google One plan. You can also get access to Nano Banana in most of the AI tool aggregators like Dzine, Higgsfield, OpenArt, and others.

I have found, in fact, that using Nano Banana directly in Google Gemini is sometimes worse than using it through other tools. For example, it'll often reject a prompt to make alternates of an image because it says it's a "recognizable person" (even though in my cases they're all AI-generated!), while when using Dzine I've never had this problem.

My reference for this article is Hodlor. The reference image below is from a video generated using Google Veo 3.

image.png

Before getting started, it's important to have a rough storyboard so you can determine what angles / frames you need to create.

Once I have my storyboard, and using the reference image, I asked Nano Banana to create a profile image of the character with the people in the background removed:

image.png

I wanted his skin to be a bit more realistic and textured:

image.png

Next, I zoom out a few levels:

image.png

Sitting.png

While I'm at it, I get a view of the empty bench to use as my starting frame for the "walk in" video:

0-Empty bench.png

Angles are next. Now another issue I found is with background consistency - if you want to have a character stay consistent AND the background stay consistent when the camera MOVES, that's something Nano Banana may struggle with.

Take this example, where I've asked Nano Banana to make a three quarters angle view:

image.png

You can see that while it succeeds at moving the character, it's clear the street in the background hasn't moved, so it won't be believable if used in the same video as my profile shot. I struggled quite a bit getting Nano Banana via Gemini to correctly shift the background, so I tried on Dzine and was able to get it working (though it took many attempts). Another (big) advantage of using these other tools is you can generally submit multiple concurrent jobs simultaneously, which is hard to do in the Gemini interface. In fact, I'd say that's really the best advice I can give on these - try a lot of times to get it right. If you hope

Here are a few examples (done in Dzine) where we can see that the houses on the side of the street are at a more believable angle, so while it's not perfect it's more believable with the shift in the background.

Untitled-20251006-194348-0352-undefinedx.png

Untitled-20251008-121307-1465-2x.png

With my various angles ready to go, I'm now working on getting the audio files through ElevenLabs, then lip sync via Dzine, and finally stitch things together in Davinci Resolve. Stay tuned for the Hodlor episode.

image.png

Check out the teaser here!


The art on this page was create using Nano Banana AI

Sort:  

Thank you for this! Really valuable advice, I haven't played around with the AI aggregators, usually just Gemini and Chat GPT.

Where do you get your news and info on the new AI tools? I have a hard time keeping up-to-date on all the new options.

Here and there from X, but tbh there too much noise, so I just pick and choose things that sound interesting and then deep dive on them, or research a specific thing I'm trying to do and then figure out what tool or tools I should try to accomplish that specific goal.

Everyone who's screaming alpha about AI these days is also just trying to get attention. It seems every second post is like "BLA changes EVERYTHING" and it tries to get everyone's attention to BLA. But 95% of the time it's a wild exaggeration you should ignore. Other times (like with Nano Banana) it's real, but then it sustains so you know it's real...

Anyways I'd say just have a goal and do research, and then stick to a tool or process you're comfortable with instead of jumping around too much, otherwise it's easy to get distracted and lost.

Good suggestions, thanks for the advice! 🙏

Boom , thats a nice teaser...

Thanks ;)

This post has been supported by @Splinterboost with a 15% upvote! Delagate HP to Splinterboost to Earn Daily HIVE rewards for supporting the @Splinterlands community!

Delegate HP | Join Discord


This post has been supported by @fallen.angels guild!
Delegate Tokens and HP to Fallen Angels to earn weekly rewards!
Delegate | Join to the guild