Another AI ad

Again, this is not a tutorial, just a note about how cool this stuff is. I’ve already made an AI ad (see my Pea Pea Soda ad) using paid AIs. This time everything was done exclusively with local models running in ComfyUI, with the exception of the YouTube thumbnail. You can run all of this on your local machine, provided you have enough VRAM, but for simplicity and speed I used their exorbitantly expensive cloud RTX 6000 Pro, using only basic templates. I found that, most of the time, tidy and clear workflows work far better than huge, complex messes of spaghetti code.

The idea is to meme about stuff. This time the product is a perfume made from my piss.

The face is mine: I fed my LinkedIn profile pic to Flux 9B to recreate a character that looks like me. “Let us make Muu? in our image, in our likeness”, Genesis 1:26. We are playing god here, creating something that looks like me but is not quite me. Considering my first deepfake took about two days of GPU training with DeepFaceLab, having a model that reaches roughly 95% similarity in mere seconds is awesome.

Why the LinkedIn pic? Because it’s already online and, while it’s a single image, it’s surely in all the current models’ datasets. I know it sounds bleak, but we can’t escape that.

For the video part I used LTX 2.3, always feeding it the Flux image as the starting frame and creating 4- to 6-second videos. LTX prefers the prompt to be a whole epic, but sometimes reusing the Flux prompt with the requested motion and audio added was the right choice to get a decent video.
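To make the prompt-reuse trick concrete, here is a minimal sketch in Python. It is not part of my ComfyUI workflow and not an LTX API: the function name `build_video_prompt` and the example prompt texts are made up for illustration. It only shows the idea of taking the still-image prompt and appending motion and audio directions to it.

```python
def build_video_prompt(image_prompt: str, motion: str, audio: str) -> str:
    """Append motion and audio directions to the prompt that produced the still.

    Hypothetical helper: it just joins the non-empty parts into sentences.
    """
    parts = [
        image_prompt.strip().rstrip("."),
        motion.strip().rstrip("."),
        audio.strip().rstrip("."),
    ]
    # Skip empty parts so a missing audio direction does not leave a stray period.
    return ". ".join(p for p in parts if p) + "."


# Example texts are invented, not the prompts used for the ad.
video_prompt = build_video_prompt(
    image_prompt="A man in a suit holds a perfume bottle in a studio",
    motion="He slowly raises the bottle toward the camera",
    audio="Audio: quiet room tone, no music, no speech",
)
print(video_prompt)
```

The resulting string is what you would paste into the text prompt of the image-to-video template, with the Flux image as the first frame.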

The audio part was a bit hit and miss. The last clip, where the beautiful actor is talking, comes straight from LTX’s generation. In most clips I had to remove the random music, background noise, or speech it added unprompted and for no reason. For the car engine I used Sound-AI-SFX on HuggingFace. The music was made with Ace Step v1.5.
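I removed the unwanted audio in the video editor, but the same thing can be scripted. The sketch below is an alternative, not what I actually did: it assumes `ffmpeg` is installed and on your PATH, and the helper names and file names are hypothetical. It copies the video stream untouched (`-c:v copy`) and drops every audio track (`-an`).

```python
import subprocess
from pathlib import Path


def strip_audio_command(src: Path, dst: Path) -> list[str]:
    """Build an ffmpeg command that keeps the video stream and drops all audio."""
    return [
        "ffmpeg",
        "-y",            # overwrite the output file if it already exists
        "-i", str(src),  # input clip
        "-c:v", "copy",  # copy the video stream without re-encoding
        "-an",           # drop every audio stream
        str(dst),
    ]


def strip_audio(src: Path, dst: Path) -> None:
    """Run the command; raises CalledProcessError if ffmpeg fails."""
    subprocess.run(strip_audio_command(src, dst), check=True)


# Usage (file names are placeholders):
# strip_audio(Path("clip_03.mp4"), Path("clip_03_silent.mp4"))
```

Since the video stream is copied rather than re-encoded, this is fast and does not degrade the generated clip.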

The video was roughly put together with the OpenShot video editor. I say roughly because I only used it to stitch the clips and to add or remove audio tracks: no equalization, no color grading. It’s a meme, not a professional shot, and I don’t have the skills to make it look like professional work.

For the idiot YouTuber thumbnail I found a template that used Google’s Nano Banana 2, the only paid tool in the bunch. It wasn’t really necessary for the thumbnail, but it was worth a try.

By Andrea Giorgio "Muu?" Cerioli

Italian dad, developer, designer, and maker who loves everything in technology: AI, Machine Learning, LLMs, mechanics, electronics, IT, 3D printing. Employed for 10 years in the Pharmaceutical API Manufacturing Industry.