| Model Application Field(s): |
Reward Modeling |
| Describe the life critical impact (if present). |
Not Applicable |
| Description of methods implemented in data acquisition or processing, if any, to address other types of potentially harmful data in the training, testing, and validation data: |
The HelpSteer3 data annotation process includes a pre-filtering step excluding any task containing or requesting harmful content. |
| Use Case Restrictions: |
Use of this model is governed by the Apache 2.0 license. |
| Model and dataset restrictions: |
The Principle of least privilege (PoLP) is applied limiting access for dataset generation and model development. Restrictions enforce dataset access during training, and dataset license constraints adhered to. |