高级检索

基于图神经网络的手绘表格单元格类型识别算法

GNN-based Recognition of Handwritten Table Cell Types

  • 摘要: 当前,用户采用移动设备、智能白板等触屏设备绘制草图表格中,对其结构类型进行自动分析理解的需求日益迫切。针对现有笔式交互研究多集中于版面分析与框线识别,手绘表格单元格类型识别的研究面临挑战的问题,提出一种基于图神经网络的手绘表格单元格类型识别算法。首先采用时空图注意力网络对草图笔画进行二分类,识别出单元格笔画块;然后对这些笔画块之间的时空关系进行建模,通过赋予不同节点差异化的注意力权重,有效地捕捉表格结构中的时间与空间关联,提升单元格类型识别的准确性。在用户手绘数据集和IAMonDo公开数据集上的实验结果表明,与基线算法相比,所提算法在用户手绘数据集上的总体识别准确率提升14.20个百分点,在IAMonDo数据集上提升13.08个百分点,验证了其有效性。此外,还构建一个包含字符级细粒度标注的草图表格数据集,为草图表格的识别与理解提供了数据支持。

     

    Abstract: Currently, there is an increasingly urgent need for the automatic analysis and understanding of the structur-al types of sketch-based tables drawn by users on touchscreen devices such as mobile devices and smart whiteboards. In response to the challenges in cell type recognition for hand-drawn tables—given that ex-isting pen-based interaction research has primarily focused on layout analysis and line recognition—this paper proposes a hand-drawn table cell type recognition algorithm based on graph neural networks. First, a spatiotemporal graph attention network is employed to classify sketch strokes into two categories, thereby identifying cell stroke blocks. Then, the spatiotemporal relationships among these stroke blocks are mod-eled. By assigning differentiated attention weights to different nodes, the algorithm effectively captures the temporal and spatial correlations within the table structure, thereby improving the accuracy of cell type recognition. Experimental results on both a user hand-drawn dataset and the public IAMonDo dataset show that, compared with baseline algorithms, the proposed algorithm improves the overall recognition accuracy by 14.20 percentage points on the user hand-drawn dataset and by 13.08 percentage points on the IAMon-Do dataset, demonstrating its effectiveness. Additionally, a sketch table dataset with character-level fi-ne-grained annotations is constructed, providing data support for the recognition and understanding of sketch-based tables.

     

/

返回文章
返回