高级检索

诊断和提高迁移学习模型鲁棒性的可视分析方法

A Visual Method for Diagnosing and Mitigating the Robustness of Transfer Learning Models

  • 摘要: 虽然迁移学习可以使开发人员根据复杂的预训练模型(教师模型)构建符合目标任务的自定义模型(学生模型), 但是迁移学习中的学生模型可能会继承教师模型中的缺陷, 而模型鲁棒性是作为衡量模型缺陷继承的重要指标之一. 在迁移学习领域中, 通常会运用缺陷缓解或学生模型和教师模型联合训练的方法, 达到减少继承教师模型的缺陷知识目的. 因此, 文中提出一种用于探索迁移学习过程中模型鲁棒性变化情况的可视分析方法, 并构建了相应的原型系统——TLMRVis. 该方法首先计算了学生模型的鲁棒性能指标; 其次在数据实例层面展示模型各类别的表现性能; 然后在实例特征层面通过模型抽象化方式去揭示教师模型和学生模型之间继承的重用知识; 最后结合模型切片方法改善模型的缺陷继承用以提高模型鲁棒性. 同时, TLMRVis 系统不仅结合多种可视化方法展示多种学生模型和教师模型之间的异同点, 而且通过引入缺陷缓解技术来查看和诊断教师模型和学生模型的性能变化和底层预测行为机制. 2 个案例的实验结果表明, TLMRVis 系统可以帮助用户分析迁移学习中模型的鲁棒性、模型继承的缺陷知识和模型缺陷改善后的性能变化.

     

    Abstract: Although transfer learning enables developers to build a custom model (student model) that meets the target task based on a complex pre-trained model (teacher model), the student model in transfer learning may inherit the defects in the teacher model, and model robustness is one of the important indicators to measure the inheritance of model defects. In the field of transfer learning, the method of defect mitigation or joint training of student model and teacher model is usually used to reduce the defect knowledge inherited from teacher model. Therefore, this paper proposes a visual analysis method to explore the changes of model robustness in the process of transfer learning, and constructs the corresponding prototype system TLMRVis. Firstly, the robustness index of the student model is calculated. Secondly, the performance of each type of model is shown at the data instance level. Then, at the level of case features, the reuse knowledge inherited between teacher model and student model is revealed through model abstraction. Finally, the model slicing method is used to improve the defect inheritance of the model to improve the robustness of the model. At the same time, TLMRVis system not only displays the similarities and differences between various student models and teacher models with various visualization methods, but also introduces defect mitigation technology to view and diagnose the performance changes of teacher models and student models and the underlying predictive behavior mechanism. The experimental results of two cases show that, TLMRVis system can help users analyze the robustness of the model in transfer learning, the inherited defect knowledge of the model, and the performance changes after the improvement of the model defect.

     

/

返回文章
返回