From my experience using ogkalu2/comic-translate (github) with local gemma 3 27b yields far better results, also you can rescale the text on each image within the app
Although by default its text detection is fucking ass when theres too much text, unless you select each line yourself instead, then combine it into one block, then its good.