高级检索

药症方关联的中医药古籍交互可视分析方法

Visual Analysis of Medication-Symptom-Prescription Association in Traditional Chinese Medicine Ancient Texts

  • 摘要: 中医药古代典籍承载了中医基础理论、 药理知识和实践经验, 具有很高的研究价值. 传统的中医药古籍信息提取、过滤与简单的可视化方法未能充分地挖掘中医药理论知识内容及其相关性. 针对该问题, 与领域专家紧密合作, 提出药症方关联的中医药古籍交互可视分析方法. 基于《四库全书》语料预训练 BERT 模型, 设计了一个中医药知识图谱构建与处理方法; 基于中药的“君臣佐使”原理, 提供了药方主症相关性计算方法; 采用创新的知识图谱药方布局, 支持从中医整体观和辨证论治理论, 以药、症、方 3 个角度探查中医理论知识的关联性. 与传统方法相比, 该方法能更好地帮助专家从中医药古籍数据中进行高效率地探索、理解和推断. 通过对比实验, 所提 BERTsiku-BiLSTM-CRF 模型在命名实体识别任务上的精确率、召回率及 F1 值分别达到 90.57%, 93.53%, 91.99%; 所提 BERTsiku-PCNN 模型在关系抽取任务上的精确率、召回率及 F1 值分别为 93.29%, 75.14%, 80.40%, 结果均优于其他参比模型,证明了所提模型的有效性. 通过 2 个《本草纲目》应用案例的研究, 验证了交互可视分析系统的实用性, 并在访谈中获得了领域专家的积极反馈.

     

    Abstract: Ancient Chinese medical texts embody the foundational theories, pharmacological knowledge, and practical experiences of traditional Chinese medicine (TCM), possessing significant research value. Traditional methods of information extraction, filtering, and simple visualization of TCM ancient texts have failed to fully exploit the content and relevance of TCM theoretical knowledge. To address this issue, in close collaboration with domain experts, an interactive visual analysis method for exploring the correlation among herbs, symptoms, and prescriptions in ancient TCM texts is proposed. Leveraging a BERT model pre-trained on the corpus of the “Siku Quanshu”, a method for constructing and processing a TCM knowledge graph is designed. Based on the principle of “Jun-Chen-Zuo-Shi” in Chinese herbal medicine, a method for calculating the relevance between main symptoms and medicinal prescriptions is provided. An innovative layout of the knowledge graph for medicinal prescriptions is employed to support exploration of the correlation between TCM theoretical knowledge from the perspectives of herbs, symptoms, and prescriptions, rooted in the holistic view and differential treatment theory of TCM. Compared to traditional methods, this approach can better assist experts in efficiently exploring, understanding, and inferring from data in ancient TCM texts. Through comparative experiments, the proposed BERTsiku-BiLSTM-CRF model achieves precision, recall, and F1 values of 90.57%, 93.53%, and 91.99%, respectively, for named entity recognition tasks, while the proposed BERTsiku-PCNN model achieves precision, recall, and F1 values of 93.29%, 75.14%, and 80.40%, respectively, for relation extraction tasks, outperforming other benchmark models, thus validating the effectiveness of the proposed models. Two case studies on the application of “Compendium of Materia Medica” demonstrate the practicality of the interactive visual analysis system and receive positive feedback from domain experts during interviews.

     

/

返回文章
返回