• SynopsisTantilize
    English
    arrow-up
    5
    ·
    1 day ago

    4o codes with 50 First Dates-style memory. And it takes things so literally sometimes that it’s silly and laughable.

    • TropicalDingdong@lemmy.world
      English
      arrow-up
      3
      arrow-down
      6
      ·
      1 day ago

      Dude it’s just kinda fucking bad. Like legitimately, the first weekend I had access to 3.5 I took on the challenge of coding this complex YouTube network analysis. No problem. Like, no code, just explanation. But none of the recent ones (anything with rails) seems to have that sharpness, where it was basically right. Even on basic tasks it takes an almost worst-case approach.

      • froztbyte@awful.systems
        English
        arrow-up
        8
        ·
        21 hours ago

        this isn’t autoplag fan club, and honestly if “guard rails” are the reason you think this shit is problematic it’s definitely not the place for you to be posting

      • SynopsisTantilize
        English
        arrow-up
        2
        arrow-down
        4
        ·
        17 hours ago

        I’m not sure about the downvotes. But I agree it’s just gotten worse for specific tasks. I’ve only had it shut down a task once, and that was me trying to get it to do something stupid.

        For work-related tasks I do enjoy using it.

        • froztbyte@awful.systems
          English
          arrow-up
          1
          ·
          48 minutes ago

          “I’m not sure about the downvotes” sigh, how many times do we have to see this stupid refrain

          imagine reflecting on feedback. or do promptfans need mirror-finished text inputs to pretend to do that thinking for them too?