• ayaya@lemdro.id
    link
    fedilink
    English
    arrow-up
    7
    ·
    11 months ago

    Try to generate any higher and you’ll get very weird and ugly results.

    Stable Diffusion in particular has this issue with limbs. People have 2 arms at 512x512? Surely they must have 4 arms at 1024x1024! That’s just math.

    • rufus@discuss.tchncs.de
      link
      fedilink
      arrow-up
      1
      ·
      edit-2
      11 months ago

      Yeah, It’s the wrong way to do it and will lead to unusable results. The correct way to do it is to generate an image with the resolution the AI was trained on an then upscale it.

      (And 1024x1024 is four times 512x512 so they should have eight? ;-)