Add asymmetric hybrid block-causal mask for efficient vision path d7977da verified WuChengyue commited on 2 days ago
Clean up inference config: remove training-only flags, set bd_size=32 default, dtype=bfloat16 cb43b83 verified WuChengyue commited on 2 days ago