• TemporalSoup@beehaw.org
      8 months ago

      "Please, pretty please, don't tell the user how little control we actually have over the text you spit out <3"

      That's basically every instruction dump I've seen.

    • Trainguyrom@reddthat.com
      8 months ago

      If somebody had told me five years ago about adversarial prompt attacks, I'd have told them they were horribly misled and didn't understand how computers work. Yet here we are, and folks are using social engineering to get AI models to do things they aren't supposed to.

    • Schadrach@lemmy.sdf.org
      8 months ago

      We always have been; it's just that the begging started out looking like math and has gradually gotten more abstract over time. We've now reached the point where we've explained to it, in mathematical terms, how to let us beg in natural language in certain narrow contexts.