• intensely_human
      link
      fedilink
      English
      arrow-up
      1
      arrow-down
      1
      ·
      6 months ago

      Okay so assuming the camera’s output can be represented as a series of bits, and that the arms’ input can be represented as another series of bits, you have successfully identified a text processing task.

      Your assertion then, is that this is a task outside the ability of an LLM to succeed at?

      How do you know that an LLM-steered robot cannot perform that task? Has that been tried?