A multilingual dataset for NER covering 91 langauges and 25 scripts
Jonas Golde
whoisjones
AI & ML interests
Data-efficient transfer learning
Recent Activity
updated a dataset about 1 month ago
whoisjones/sudoku authored a paper about 2 months ago
Hierarchical Text Classification with LLM-Refined Taxonomies updated a dataset about 2 months ago
whoisjones/mazeOrganizations
models 12
whoisjones/otter-bi-mmbert
Token Classification • 0.5B • Updated • 482
whoisjones/otter-bi-rembert
Updated • 1
whoisjones/otter-ce-rembert
Updated
whoisjones/otter-ce-mmbert
Updated
whoisjones/finerweb-multilabel-classifier-xlmr-4o
Text Classification • 0.3B • Updated • 7
whoisjones/finerweb-binary-classifier-xlmr-4o
Text Classification • 0.3B • Updated • 4
whoisjones/finerweb-binary-classifier-xlmr-gemma3
Text Classification • 0.3B • Updated • 2
whoisjones/finerweb-multilabel-classifier-xlmr-gemma3
Text Classification • 0.3B • Updated • 1
whoisjones/finerweb-binary-classifier-mdeberta-gemma3
Text Classification • 0.3B • Updated • 2
whoisjones/finerweb-binary-classifier-mdeberta-4o
Text Classification • 0.3B • Updated • 1
datasets 28
whoisjones/sudoku
Viewer • Updated • 1.42M • 20
whoisjones/maze
Viewer • Updated • 9k • 14
whoisjones/multinerd
Viewer • Updated • 1.67M • 91
whoisjones/masakhaner
Viewer • Updated • 153k • 35 • 1
whoisjones/uner
Viewer • Updated • 66.8k • 13
whoisjones/fiNERweb
Viewer • Updated • 3.98M • 1.08k • 7
whoisjones/fiNERweb-x
Updated • 71
whoisjones/fiNERweb-x-multi
Updated • 382
whoisjones/fiNERweb-gemma-x-multi
Updated • 29
whoisjones/fiNERweb-4o-x-multi
Updated • 32