• BlackEco@lemmy.blackeco.comOP
    link
    fedilink
    arrow-up
    26
    ·
    edit-2
    3 months ago

    Seems like not a bias by Al models themselves, rather a reflection of the source material.

    That’s what is usually meant by AI bias: a bias in the material used to train the model that reflects in its behavior

    • Lucy :3@feddit.org
      link
      fedilink
      arrow-up
      19
      arrow-down
      1
      ·
      3 months ago

      But why is it even mentioned then? It’s FUCKING OBVIOUS. It’s like saying “AIs are biased towards english and neglect latin” or smth ffs

      • BlackEco@lemmy.blackeco.comOP
        link
        fedilink
        arrow-up
        32
        ·
        3 months ago

        I feel like not everyone is conscious of these biases and we need to raise the awareness and try preventing for example HR people from buying AI-based screening software that has a strong bias that is not disclosed by their vendors (because why would you advertise that?)

        • Bitrot@lemmy.sdf.org
          link
          fedilink
          English
          arrow-up
          20
          ·
          3 months ago

          I was confused how a resume or application would be largely affected, but the article points out that software is often used to look over social media now as part of hiring (which is awful).

          The bias when it determined guilt or considered consequences for a crime is concerning as more law enforcement agencies integrate black box algorithms into investigative work.

      • n2burns@lemmy.ca
        link
        fedilink
        arrow-up
        11
        ·
        3 months ago

        Great comparison, a dialect used by millions of people to a dead language. It really shows how much you care about the people who speak that dialect…

        • Lucy :3@feddit.org
          link
          fedilink
          arrow-up
          4
          arrow-down
          1
          ·
          3 months ago

          AIs are trained on what is written in the Internet. Latin is not spoken, it’s written. But even then, it’s rarely used. African american is a dialect, it’s only present in speech.

          • MostlyBlindGamer@rblind.com
            link
            fedilink
            English
            arrow-up
            9
            ·
            3 months ago

            You need to get out more. I totally get that you would think that’s the case, but only if you’re not exploring parts of the internet outside your bubble. It’s absolutely written.

          • curbstickle@lemmy.dbzer0.com
            link
            fedilink
            arrow-up
            4
            ·
            3 months ago

            There are actually quite a few books written in AAVE…the earliest I’m aware of is their eyes were watching god, from the 1930s. The Color Purple, Beloved, The Sellout, the books of Chester Himes…

      • Gaywallet (they/it)@beehaw.org
        link
        fedilink
        arrow-up
        10
        ·
        3 months ago

        It’s FUCKING OBVIOUS

        What is obvious to you is not always obvious to others. There are already countless examples of AI being used to do things like sort through applicants for jobs, who gets audited for child protective services, and who can get a visa for a country.

        But it’s also more insidious than that, because the far reaching implications of this bias often cannot be predicted. For example, excluding all gender data from training ended up making sexism worse in this real world example of financial lending assisted by AI and the same was true for apple’s credit card and we even have full-blown articles showing how the removal of data can actually reinforce bias indicating that it’s not just what material is used to train the model but what data is not used or explicitly removed.

        This is so much more complicated than “this is obvious” and there’s a lot of signs pointing towards the need for regulation around AI and ML models being used in places it really matters, such as decision making, until we understand it a lot better.