What is IndicBERT-HPA?

A multilingual orthopedic clinical text classification framework built on IndicBERT, incorporating an orthopedic adapter head to handle mixed scripts and domain-specific terminology across English, Hindi, and Punjabi.

Why does this research matter?

IndicBERT-HPA achieves a mean Macro-F1 of 0.8792 under natural clinical distributions, significantly outperforming zero-shot LLMs, providing highly reliable AI-assisted decision support for low-resource multilingual healthcare settings.

What should we watch next?

The selective verification layer achieved 84.4% accuracy at 72.3% coverage, offering a critical safety mechanism for managing ethical and legal risks in clinical AI deployments.

多語種骨科決策支持：語言感知適配與驗證引導的延遲機制

本文針對低資源醫療場景下多語種骨科臨床文本分類的可靠性挑戰，提出了一種名為 IndicBERT-HPA 的語言感知適配框架。該模型在 IndicBERT 基礎上引入骨科適配器頭，以支援英語、印地語和旁遮普語的混合腳本及專業術語處理。研究對比了多語言 Transformer、DistilBERT、零樣本大型語言模型及該領域自適配編碼器。實驗表明，零樣本 LLM 在封閉集分類中表現不佳且存在語言不穩定性，而 IndicBERT-HPA 在自然臨床分佈下取得最佳性能，平均 Macro-F1 達 0.8792，Macro-AUROC 為 0.894。此外，研究實現了結合置信度閾值與證據一致性檢查的選擇性驗證層，在 72.3% 的覆蓋率下實現了 84.4% 的選擇性準確率，顯著優於全接受基準，為多語種臨床決策支援提供了高可靠性的延遲機制。

Sources

arXiv