Proof of concept for myself :) (I'm sure somebody has already done it)
big continuity errors that would be easy to fix but its expensive to run more generations and uploading more reference images makes it even more expensive
Also, some of the speech bubbles look different because I went back and had them added without providing reference images for how they should look like lol