Data poisoning: how artists are sabotaging AI to take revenge on image generators::As AI developers indiscriminately suck up online content to train their models, artists are seeking ways to fight back.

  • cm0002@lemmy.world
    link
    fedilink
    English
    arrow-up
    10
    arrow-down
    7
    ·
    7 months ago

    using it to train their plagiarism machines

    That’s simply not how AI works, if you look inside the models after training, you will not see a shred of the original training data. Just a bunch of numbers and weights.

    • fruitycoder@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      5
      ·
      7 months ago

      | Just a bunch of numbers and weights

      I agree with your sentiment, but it’s not just that the data is encoded as a model, but it’s extremely lossy. Compression, encoding, digital photography, etc is just turning pictures into different numbers to be processed by some math machine. It’s the fact that a huge amount of information is actually lost during training, intentionally, that makes a huge difference. If it was just compression, it would be a gaming changing piece of tech for other reasons. YouTube would be using it today, but it is not good at keeping the original data from the training.

      Rant not really for you, but in case someone else nitpicks in the future :)

    • Catoblepas@lemmy.blahaj.zone
      link
      fedilink
      English
      arrow-up
      6
      arrow-down
      5
      ·
      6 months ago

      If the individual images are so unimportant then it won’t be a problem to only train it on images you have the rights to.

      • Astarii_Tyler@lemmy.world
        link
        fedilink
        English
        arrow-up
        7
        arrow-down
        4
        ·
        6 months ago

        They do have the rights because this falls under fair use, It doesn’t matter if a picture is copyrighted as long as the outcome is transformative.