Buckets:
2.76 MB
13 files
Updated about 17 hours ago
| Name | Size | Uploaded | Xet hash |
|---|---|---|---|
| data | 2 items | | |
| figures | 8 items | | |
| tables | 3 items | | |
| README.md | 615 Bytes | | 20b1b13f |
dolma3-bin-characterization
Bin-characterization analysis artifacts (RQ4 figures, statistics) for the Dolma3 6T corpus across the 24x24 topic x format grid.
Provenance
This bucket was renamed on 2026-05-24 as part of the TrackStar typo fix and SOC-number cleanup.
| Field | Value |
|---|---|
| Previous name | HCAI-Lab/soc14-rq4-bin-characterization |
| SOC ticket(s) | SOC-14 |
| Renamed | 2026-05-24 |
See docs/data_home/inventory.json for the full inventory, including the old_names field on each entry.
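To resolve a reference that still uses the previous name, the inventory can be searched on its old_names field. The following is a minimal sketch, not the project's own tooling. It assumes that inventory.json is a JSON array of entry objects, that old_names holds a list of strings, and that each entry has a name key. Only the file path and the old_names field come from this README; the name key, the helper functions, and the sample entry are hypothetical.

```python
import json
from pathlib import Path


def load_entries(path="docs/data_home/inventory.json"):
    # Assumption: the file is a JSON array of entry objects.
    return json.loads(Path(path).read_text(encoding="utf-8"))


def find_by_old_name(entries, old_name):
    """Return every entry whose old_names list contains old_name."""
    return [e for e in entries if old_name in e.get("old_names", [])]


# Hypothetical entry shaped like the assumed schema, for illustration only.
sample = [
    {
        "name": "dolma3-bin-characterization",  # "name" key is an assumption
        "old_names": ["HCAI-Lab/soc14-rq4-bin-characterization"],
    }
]

matches = find_by_old_name(sample, "HCAI-Lab/soc14-rq4-bin-characterization")
print([e["name"] for e in matches])
```

Against the real file, the same lookup would be find_by_old_name(load_entries(), "HCAI-Lab/soc14-rq4-bin-characterization"), provided the actual schema matches the assumptions above.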
- Total size: 2.76 MB
- Files: 13
- Last updated: May 24
- Pre-warmed CDN: US, EU