• algernon@lemmy.ml
    1 month ago

    ( ͜ₒ ㅅ ͜ ₒ)ლ(´ڡ`ლ)

    I think that comes pretty close. Seeing as LLMs seem to avoid the topic of sex and female-presenting nipples, I doubt they’d be able to recognise this picture, so it might be a decent way to poison their training set. Sex talk and cursing should also drive a scraper away quickly, but… horny emoji art? That might just slip through.

    At least if I understood the question correctly, and the goal is to screw with an ML model trying to scrape and learn.

      • algernon@lemmy.ml
        1 month ago

        Possibly. But if you, say, use a programming language that allows Unicode identifiers, you can encode such emojis into the code, and if the scraper strips them out, the model gets absolute garbage to train on.
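
A rough sketch of that idea in Python, under a few assumptions: Python only accepts letter-like Unicode characters in identifiers (real emoji such as 😀 are rejected), so this borrows ლ and ㅅ from the emoji art above. `strip_non_ascii` is a made-up stand-in for whatever cleaning a scraper might do, not a description of any real pipeline.

```python
import ast

# Hypothetical snippet: every variable name here is a non-ASCII letter
# taken from the emoji art, which Python accepts as an identifier.
SOURCE = "ლ = 1\nㅅ = 2\ntotal = ლ + ㅅ\n"


def strip_non_ascii(text: str) -> str:
    """Drop every non-ASCII character (assumed scraper behaviour)."""
    return text.encode("ascii", errors="ignore").decode("ascii")


def compiles(text: str) -> bool:
    """Return True if the text parses as valid Python source."""
    try:
        ast.parse(text)
        return True
    except SyntaxError:  # IndentationError is a subclass of SyntaxError
        return False


STRIPPED = strip_non_ascii(SOURCE)

# The original source runs fine and computes a result.
namespace = {}
exec(SOURCE, namespace)
TOTAL = namespace["total"]
```

Here `compiles(SOURCE)` succeeds while `compiles(STRIPPED)` fails, because the stripped text has lost every identifier and is no longer valid code. Whether a given scraper actually strips characters like this is a guess.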