Tech experts are starting to doubt that ChatGPT and A.I. ‘hallucinations’ will ever go away: ‘This isn’t fixable’::Experts are starting to doubt it, and even OpenAI CEO Sam Altman is a bit stumped.

  • Serdan
    link
    fedilink
    English
    arrow-up
    5
    arrow-down
    10
    ·
    edit-2
    11 months ago

    GPT can write and edit code that works. It simply can’t be true that it’s solely doing language patterns with no semantic understanding.

    To fix your analogy: the Spanish speaker will happily sing along. They may notice the occasional odd turn of phrase, but the song as a whole is perfectly understandable.

    Edit: GPT can literally write songs that make sense. Even in Spanish. A metaphor aiming to elucidate a deficiency probably shouldn’t use an example that the system is actually quite proficient at.

    • Dark Arc@lemmy.world
      link
      fedilink
      English
      arrow-up
      8
      arrow-down
      2
      ·
      11 months ago

      Sure it can, “print hello world in C++”

      #include 
      
      int main() {
        std::cout << "hello world\n";
        return 0;
      }
      

      “print d ft just rd go t in C++”

      #include 
      
      int main() {
        std::cout << "d ft just rd go t\n";
        return 0;
      }
      

      The latter is a “novel program” it’s never seen before, but it’s possible because it’s seen a pattern of “print X” and the X goes over here. That doesn’t mean it understands what it just did, it’s just got millions (?) of patterns it’s been trained on.

      • Serdan
        link
        fedilink
        English
        arrow-up
        1
        ·
        11 months ago

        A human would give you the same solution for the same reason. No dev would deeply ponder the meaning of “cout” if told to print something. It’s so simple it’s almost muscle memory.

        Hell, there are probably non-NN autocomplete systems that could successfully do that.

        GPT can do more than that though. You can have a conversation with it about what you’re trying to achieve and what the requirements should be and then you can tell it to write code on the basis of that natural language conversation. You can then discuss the code with it, make suggestions or ask for them.

        People who claim that it’s “just” looking up the answer in its training data seriously have no idea what they’re talking about.

    • tryptaminev 🇵🇸 🇺🇦 🇪🇺@feddit.de
      link
      fedilink
      English
      arrow-up
      9
      arrow-down
      6
      ·
      11 months ago

      Because it can look up code for this specific problem in its enormous training data? It doesnt need to understand the concepts behind it as long as the problem is specific enough to have been solved already.

      • Serdan
        link
        fedilink
        English
        arrow-up
        4
        arrow-down
        1
        ·
        11 months ago

        I can tell GPT to do a specific thing in a given context and it will do so intelligently. I can then provide additional context that implicitly changes the requirements and GPT will pick up on that and make the specific changes needed.

        It can do this even if I’m trying to solve a novel problem.

        • cryball@sopuli.xyz
          link
          fedilink
          English
          arrow-up
          2
          arrow-down
          1
          ·
          11 months ago

          But the naysayers will argue that your problem is not novel and a solution can be trivially deduced from the training data. Right?

          I really dislike the simplified word predictor explanation that is given for how LLM’s work. It makes it seem like the thing is a lookup table, while ignoring the nuances of what makes it work so well.

          • ioen
            link
            fedilink
            English
            arrow-up
            2
            arrow-down
            1
            ·
            11 months ago

            deleted by creator

            • Serdan
              link
              fedilink
              English
              arrow-up
              1
              ·
              11 months ago

              I.e. 99.9% of the work done by devs in the real work.

      • SirGolan@lemmy.sdf.org
        link
        fedilink
        English
        arrow-up
        4
        arrow-down
        1
        ·
        11 months ago

        If that were true, it shouldn’t hallucinate about anything that was in its training data. LLMs don’t work that way. There was a recent post with a nice simple description of how they work, but I’m not finding it. If you’re interested, there’s plenty of videos and articles describing how they work.

      • Mirodir@lemmy.fmhy.net
        link
        fedilink
        English
        arrow-up
        5
        arrow-down
        2
        ·
        11 months ago

        It doesn’t have the ability to just look up anything from its training data, that stuff is encoded in its parameters. Still, the input has to be encoded in a way that causes the correct “chain reaction” of excited/not excited neurons.

        Beyond that, it’s not just a carbon copy from what was in the training either because you can tell it what variable names to use, which order to do things in, change some details, etc. If it was simply a lookup that wouldn’t be possible. The training made it able to generalize what it learned to some extent.

        • tryptaminev 🇵🇸 🇺🇦 🇪🇺@feddit.de
          link
          fedilink
          English
          arrow-up
          6
          arrow-down
          1
          ·
          11 months ago

          Yes, but it doesnt do so because it understands what a variable is, it does so because it has statistics as to where variables belong most likely.

          In a way it is like the guy that won the french scrabble championship without speaking a single word of french, by learning the words in the dictionary.

          • Serdan
            link
            fedilink
            English
            arrow-up
            1
            ·
            11 months ago

            It’d be like the scrabble guy if he could hold a conversation in French.

            Why do people keep making analogies that are very blatantly not analogous?