• @[email protected]
    40 · 7 months ago

    Oh, it’s worse than that.

    Google’s “AI” results feed you things from 10-year-old Reddit posts that are subtle (but sometimes not so subtle) bullshit.

    Whatever they’re using to curate training data is evidently pretty awful at detecting shitposts.

    • Diplomjodler
      8 · 7 months ago

      Those underpaid Indians probably aren’t very good at picking up irony, even if they give a shit.

    • @[email protected]
      4 · 7 months ago

      Most of the curation or fine-tuning is done in low-income African countries, so this is hardly surprising. They’re cheap labour, but you can’t expect them to reliably detect sarcasm or notice mistakes in specialized fields. They basically give a thumbs up whenever the AI sounds convincing. Of course, that includes instances where it’s confidently wrong, and that appears to be most of the time with this model.

    • @[email protected]
      2 · 7 months ago

      It’s not a training data issue; look up Retrieval-Augmented Generation (RAG). It’s basically serving up stuff from the web and taking it as gospel.
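
For anyone who hasn't looked RAG up yet, here is a rough toy sketch of the idea, not how Google or anyone else actually implements it. Everything in it is made up for illustration: the `tokenize`, `retrieve`, and `build_prompt` helpers are hypothetical names, the two "web pages" are invented, and a real system would use a search index or embeddings rather than word overlap. The point is only that whatever gets retrieved is pasted into the prompt as context, with no check on whether it was a joke.

```python
import re


def tokenize(text):
    """Lowercase the text and return its set of alphanumeric words."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def retrieve(query, documents, k=1):
    """Return the k documents sharing the most words with the query.

    Assumption: word overlap stands in for a real search/ranking step.
    """
    query_words = tokenize(query)
    ranked = sorted(
        documents,
        key=lambda doc: len(query_words & tokenize(doc)),
        reverse=True,
    )
    return ranked[:k]


def build_prompt(query, documents, k=1):
    """Paste the retrieved text into the prompt, unvetted."""
    context = "\n".join(retrieve(query, documents, k))
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"


# Invented example pages; the first one is a deliberate joke post.
docs = [
    "Cheese sticks to pizza better if you let the sauce simmer for 40 hours.",
    "Bread flour has more protein than all-purpose flour.",
]

prompt = build_prompt("How do I make cheese stick to pizza?", docs)
print(prompt)
```

The joke post wins the retrieval step purely because it shares words with the question, and the model is then told to answer from it. Nothing in this loop involves training data at all.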