Safety & Security

Field: Response
Model Application Field(s): Reward Modeling
Describe the life-critical impact (if present): Not Applicable
Description of methods implemented in data acquisition or processing, if any, to address other types of potentially harmful data in the training, testing, and validation data: The HelpSteer3 data annotation process includes a pre-filtering step that excludes any task containing or requesting harmful content.
Use Case Restrictions: Use of this model is governed by the Apache 2.0 license.
Model and dataset restrictions: The principle of least privilege (PoLP) is applied, limiting access for dataset generation and model development. Restrictions on dataset access are enforced during training, and dataset license constraints are adhered to.