
2.76 MB
13 files
Updated about 17 hours ago
Name        Size
data
figures
tables
README.md   615 Bytes

dolma3-bin-characterization

Bin-characterization analysis artifacts (RQ4 figures, statistics) for the Dolma3 6T corpus across the 24x24 topic x format grid.

Provenance

This bucket was renamed on 2026-05-24 as part of the TrackStar typo fix + SOC-number cleanup.

Field           Value
Previous name   HCAI-Lab/soc14-rq4-bin-characterization
SOC ticket(s)   SOC-14
Renamed         2026-05-24

See docs/data_home/inventory.json for the full inventory including the old_names field on each entry.
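
As a rough illustration of how the old_names field could be used to resolve a previous bucket name, here is a minimal Python sketch. The inventory schema is not shown in this README, so the layout below is an assumption: a JSON list of entry objects, each with a hypothetical `name` key alongside the `old_names` list. The helper names `load_inventory` and `find_by_old_name` are likewise made up for this example.

```python
import json
from pathlib import Path


def load_inventory(path="docs/data_home/inventory.json"):
    """Load the inventory file.

    Assumption: the file holds a JSON list of entry objects.
    """
    return json.loads(Path(path).read_text(encoding="utf-8"))


def find_by_old_name(entries, old_name):
    """Return the entries whose `old_names` list contains `old_name`."""
    return [e for e in entries if old_name in e.get("old_names", [])]


# Hypothetical in-memory sample mirroring the assumed schema;
# the real inventory.json may be structured differently.
sample = [
    {
        "name": "dolma3-bin-characterization",
        "old_names": ["HCAI-Lab/soc14-rq4-bin-characterization"],
    },
    {"name": "some-other-bucket", "old_names": []},
]

matches = find_by_old_name(sample, "HCAI-Lab/soc14-rq4-bin-characterization")
print([e["name"] for e in matches])  # ['dolma3-bin-characterization']
```

To run this against the real file, pass `load_inventory()` in place of `sample`, and adjust the key names if the actual schema differs.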
