A sample is a single labelled example: a piece of content paired with a label that places it on a trait. Samples are the most direct form of supervision, and the form every other form resolves to before training runs.Documentation Index
Fetch the complete documentation index at: https://u22a8-police-sweep-2026-06-01.mintlify.app/llms.txt
Use this file to discover all available pages before exploring further.
Definition
A sample carries content and a label for the trait it demonstrates. The label places the content on the trait’s axis, from the negative pole at0.0 to the positive pole at 1.0. A trait’s positive and negative samples become the two clusters its axis is fit between, so both poles need coherent examples — the negatives define the boundary as much as the positives do.
Mechanism
Samples reach a model two ways, both validated against the published samples schema:| Path | Shape |
|---|---|
| Hosted | JSON objects to add_samples — one object per trait label: { trait, text, quality }, with quality a number in [0, 1]. |
| File-based | One JSON object per line in a JSONL file, carrying content and one field per trait it labels. |
quality:
good, fair, poor — or a number:
Interpretation
- More samples, with positive and negative clusters that are clearly separated, produce sharper breaks and higher confidence. Sparse or overlapping samples widen the low-confidence region.
- Keep each pole coherent: the positives should resemble each other, and so should the negatives. A trait learned from a scattered negative set has a fuzzy boundary.
- Use a label near
1.0or0.0when a call is clear-cut, and a value in between when content genuinely sits partway along the axis.
Edge cases
- A file-based sample can label several traits in one line; a hosted sample labels one trait per object.
- Explicit samples are added while a model is a draft. On a model already serving scores, feedback is the path that adds samples — it captures a label for scored content and folds it into the next version.
Next
Briefs
Describe a standard and have samples synthesized.
Authoring schemas
The published sample and model formats.