• Usernameblankface@lemmy.world
    8 months ago

    You did this? I saw some examples in the “show me your best hearse if it were a ___” post, and I’m flabbergasted at the computer-based witchcraft it’s pulling off over there.

    What image generator is it using? Is it using ChatGPT to expand the prompts?

    • Rikudou_Sage@lemmings.world
      8 months ago

      It’s using DALL-E. Version 3 also sometimes expands the prompt (IMO it’s using GPT in the background to do so, but I haven’t checked). The prompt is only printed below the image if it has been changed. I’ve noticed that if the prompt is already long, it doesn’t get changed, but I haven’t really confirmed anything. You can try it yourself: just tag the bot and it will respond with the image.
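
The "only print the prompt if it changed" behaviour could look roughly like the sketch below. This is not the bot's actual code. `format_reply` and the "Prompt used:" label are made up for illustration, and it assumes the image API hands back the rewritten prompt alongside the image (as far as I know, DALL-E 3 responses carry it in a `revised_prompt` field).

```python
from typing import Optional


def format_reply(image_url: str, original_prompt: str,
                 revised_prompt: Optional[str]) -> str:
    """Build the reply text: the image, plus the prompt only if it was rewritten."""
    parts = [image_url]
    # Hypothetical rule: ignore whitespace-only differences and missing values.
    if revised_prompt and revised_prompt.strip() != original_prompt.strip():
        parts.append(f"Prompt used: {revised_prompt.strip()}")
    return "\n\n".join(parts)
```

With this rule a prompt that comes back unchanged produces a reply containing only the image.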

      It also falls back to DALL-E version 2 to lower costs, and there’s a daily limit (per-user and global).
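
For anyone curious how per-user and global daily limits plus a cheaper fallback can fit together, here is a minimal sketch. It is not the bot's real implementation. `DailyLimiter`, `pick_model`, and `v3_budget` are hypothetical names, and the rule that the fallback kicks in after a fixed number of version 3 images per day is an assumption; the actual trigger isn't described here.

```python
from collections import Counter
from datetime import date
from typing import Optional


class DailyLimiter:
    """Counts images per user and overall, resetting when the day changes."""

    def __init__(self, per_user: int, global_total: int, v3_budget: int):
        self.per_user = per_user          # max images per user per day
        self.global_total = global_total  # max images for everyone per day
        self.v3_budget = v3_budget        # assumed: daily images served by DALL-E 3
        self.day: Optional[date] = None
        self.user_counts: Counter = Counter()
        self.total = 0

    def pick_model(self, user: str, today: Optional[date] = None) -> Optional[str]:
        """Return the model to use, or None if a daily limit has been hit."""
        today = today or date.today()
        if today != self.day:  # new day: reset all counters
            self.day = today
            self.user_counts.clear()
            self.total = 0
        if self.total >= self.global_total:
            return None
        if self.user_counts[user] >= self.per_user:
            return None
        # Assumed fallback rule: cheaper model once the DALL-E 3 budget is spent.
        model = "dall-e-3" if self.total < self.v3_budget else "dall-e-2"
        self.user_counts[user] += 1
        self.total += 1
        return model
```

Since the global cap bounds the number of images per day, the worst-case daily cost is known in advance.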

      • Usernameblankface@lemmy.world
        8 months ago

        Very interesting. So all the requests from the bot get sent through one account? Seems like it would get expensive fast as people find out that this option exists.

        • Rikudou_Sage@lemmings.world
          8 months ago

          Yeah, but there’s rate limiting in place that makes sure I won’t pay more than I’m comfortable with. I’m thinking of making it possible for users to supply their own API key.
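
The user-supplied key idea could be sketched like this. Nothing here exists yet. `resolve_api_key`, `user_keys`, and `shared_quota_ok` are hypothetical, and the choice to let people with their own key skip the shared limits is an assumption about how it might work.

```python
from typing import Callable, Dict, Optional


def resolve_api_key(user: str,
                    user_keys: Dict[str, str],
                    shared_key: str,
                    shared_quota_ok: Callable[[str], bool]) -> Optional[str]:
    """Pick which API key pays for this request, or None to refuse it."""
    own_key = user_keys.get(user)
    if own_key:
        # Assumed: users who bring their own key are billed on their own account,
        # so the shared rate limits do not apply to them.
        return own_key
    # Everyone else uses the shared key, but only while the quota allows it.
    return shared_key if shared_quota_ok(user) else None
```

Here `shared_quota_ok` stands in for whatever rate-limit check the bot already runs before spending from the shared account.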