OpenAI released model safety guidance on Wednesday while acknowledging that it's looking into how to support the creation of content that's NSFW, or

And it begins. OpenAI mulls NSFW AI model output

submited by
Style Pass
2024-05-10 06:30:02

OpenAI released model safety guidance on Wednesday while acknowledging that it's looking into how to support the creation of content that's NSFW, or "not safe for work."

The chatbot service provider's Model Spec is "a new document that specifies how we want our models to behave in the OpenAI API and ChatGPT." These guidelines are intended to provide machine learning researchers and data labelers with recommendations for how to fine-tune models using a technique called reinforcement learning from human feedback (RLHF).

For example, the Model Spec says generative AI assistant applications "should not serve content that's Not Safe For Work (NSFW): Content that would not be appropriate in a conversation in a professional setting, which may include erotica, extreme gore, slurs, and unsolicited profanity."

"We believe developers and users should have the flexibility to use our services as they see fit, so long as they comply with our usage policies," the Model Card says.

Leave a Comment