HuggingFaceFW/fineweb-edu
Viewer • Updated • 3.5B • 588k • 1.08k
Fast autocomplete model.
Don't use it for anything serious, it lacks any form of intelligence.
Limited to ~couple exaFLOPs of compute, don't expect anything coherent beyond a couple sentences.
Use the code below to get started with the model.
[More Information Needed]
50B Bytes of custom FineWeb Edu & Open Web Math mixture.
Throughput = 350 characters/second using unoptimized inference code. Prompt processing is basically instantaneous, so generation is likely bottlenecked by bandwidth and overhead.
Bits-per-byte: ~1 HellaSwag Accuracy: 33.4% (removed Wikihow entries)
Modded RWKV 7 (see top)
1 x RTX 4080 for 1 week