smam commited on
Commit
ecbee68
·
verified ·
1 Parent(s): b6df3f6

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +30 -0
README.md ADDED
@@ -0,0 +1,30 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Pyannote Segmentation Model - Bengali/Multilingual
2
+
3
+ Fine-tuned version of `pyannote/segmentation-3.0` for Bengali and multilingual speaker diarization.
4
+
5
+ ## Training Data
6
+ - **DISPLACE24**: 67 recordings (Dev + Eval)
7
+ - **DISPLACE26**: 125 recordings (Hindi)
8
+ - **Synthetic Bengali V4**: 300 synthetic recordings (1-30 speakers)
9
+ - **Total**: 492 recordings
10
+
11
+ ## Performance
12
+ - **Best Validation Accuracy**: 76.67%
13
+ - **Training Epochs**: 18
14
+
15
+ ## Training Details
16
+ - Heavy on-the-fly augmentation (noise, volume variation)
17
+ - OneCycleLR scheduler with warmup
18
+ - Label smoothing (0.1)
19
+ - Gradient clipping
20
+
21
+ ## Usage
22
+
23
+ ```python
24
+ from pyannote.audio import Model
25
+
26
+ model = Model.from_pretrained("smam/pyannote-segmentation-bengali-multilingual")
27
+ ```
28
+
29
+ ## Citation
30
+ Fine-tuned as part of DLSPRINT26 Bengali Speaker Diarization Challenge.