New accessibility feature coming to Firefox, an “AI powered” alt-text generator.


"Starting in Firefox 130, we will automatically generate an alt text and let the user validate it. So every time an image is added, we get an array of pixels we pass to the ML engine and a few seconds after, we get a string corresponding to a description of this image (see the code).

Our alt text generator is far from perfect, but we want to take an iterative approach and improve it in the open.

We are currently working on improving the image-to-text datasets and model with what we’ve described in this blog post…"

  • grrgyle@slrpnk.net
    5 months ago

    I imagine it’s a 2D array? So width would be captured by uhh like a[N].len.

    It could be I’m misunderstanding you, because I’m not sure what you mean by:

    linear data with varying sectioning content

    • Kissaki@beehaw.org
      5 months ago

      Looking at Wikipedia on arrays, I think I’m just not used to array as terminology for multi-dimensional data structures. TIL
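
To make the two readings of "array of pixels" in the comments above concrete, here is a minimal sketch. It assumes the pixels arrive the way the web platform's `ImageData` exposes them: one flat RGBA byte buffer with the width and height stored alongside. Whether Firefox's alt-text code passes pixels in exactly this shape is not confirmed by the quoted post, and `FlatImage`, `pixelOffset`, and `toRows` are hypothetical names made up for illustration.

```typescript
// Hypothetical types and helpers for illustration; not taken from Firefox's source.
type FlatImage = { width: number; height: number; data: Uint8ClampedArray };

// Offset of the first channel (R) of pixel (x, y) in a flat RGBA buffer.
// The width is needed here because the buffer itself has no row boundaries.
function pixelOffset(img: FlatImage, x: number, y: number): number {
  return (y * img.width + x) * 4;
}

// Regroup the flat buffer into rows of [r, g, b, a] pixels,
// i.e. the nested "2D array" reading where rows[y].length is the width.
function toRows(img: FlatImage): number[][][] {
  const rows: number[][][] = [];
  for (let y = 0; y < img.height; y++) {
    const row: number[][] = [];
    for (let x = 0; x < img.width; x++) {
      const i = pixelOffset(img, x, y);
      row.push(Array.from(img.data.slice(i, i + 4)));
    }
    rows.push(row);
  }
  return rows;
}

// A 2x2 image: red, green on the first row; blue, white on the second.
const img: FlatImage = {
  width: 2,
  height: 2,
  data: new Uint8ClampedArray([
    255, 0, 0, 255,   0, 255, 0, 255,
    0, 0, 255, 255,   255, 255, 255, 255,
  ]),
};
const rows = toRows(img);
```

With the flat layout, the width has to travel next to the data; with the nested layout, it can be read off the length of any row, which is roughly what `a[N].len` was getting at.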