I figured out how to remove most of the safeguards from some AI models. I don’t feel comfortable sharing that information with anyone. I have come across a few layers of obfuscation to make this type of alteration more difficult to find and sort out. This caused me to realize, a lot of you are likely faced with similar dilemmas of responsibility, gatekeeping, and manipulating others for ethical reasons. How do you feel about this?

  • MajorHavoc@programming.dev
    link
    fedilink
    arrow-up
    0
    ·
    edit-2
    3 months ago

    I figured out how to remove most of the safeguards from some AI models.

    Nice.

    How do you feel about this?

    It’s another kind of power. I try to use mine responsibly, but also to give myself a break when I don’t meet my own standards.

    Some good advice I got once was that it’s impossible to “un-say” something, so it pays to think twice before speaking.

    If your gut is telling you to pause, listen to it. Wait to move forward until you feel better about it.

    As someone else pointed out, responsible disclosure is an option.

    You also have the option to just quietly enjoy a better copy of the AI than others have.

    If you decide to publish your discoveries, be aware that others will judge you for how you go about it. For me that means the two options are responsibly, or anonymously.