Citation: Ouyang Yiming, He Min, Liang Huaguo, Wang Xiumin, Chang Hao. A Fault-Tolerant Architecture Design of Fault-Aware RVOQ in Three-Dimensional Network-on-Chip[J]. Journal of Computer-Aided Design & Computer Graphics, 2015, 27(1): 192-200.

A Fault-Tolerant Architecture Design of Fault-Aware RVOQ in Three-Dimensional Network-on-Chip

  • Abstract: To address the reliability and network congestion problems caused by faults in the input buffers and crossbar of a router, this article proposes a fault-aware RVOQ fault-tolerant architecture. First, a redundant virtual channel is added at each input port to tolerate input buffer faults; fault information feedback and an arbitration algorithm ensure that data select a valid path for transmission. Second, the crossbar architecture is modified by adding multiplexers (multi-way selection switches) and corresponding control modules: input data give priority to the local data link and, when a fault occurs, take a redundant path instead. Experimental results show that, with 3 faults under a uniform traffic pattern, the proposed scheme reduces average network latency by 11% to 53.1% compared with existing methods. When the network has multiple faults and is heavily loaded, the scheme still maintains high system reliability and transmission performance.
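The two selection rules described in the abstract can be illustrated with a minimal Python sketch. The abstract does not specify the arbitration algorithm, the number of virtual channels, or any signal names, so everything here is an assumption made for illustration: the class `InputPort`, the function `select_crossbar_path`, the per-channel fault flags, and the convention that the redundant virtual channel is the last entry in the list are all hypothetical and are not the authors' design.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class InputPort:
    """Hypothetical model of one router input port.

    vc_faulty holds one fault flag per virtual channel. By assumption,
    the last entry represents the redundant (spare) virtual channel.
    """
    vc_faulty: List[bool]

    def select_vc(self) -> Optional[int]:
        """Return the index of the first fault-free virtual channel.

        Regular channels come first in the list, so the spare is used
        only when every regular channel is flagged faulty. Returns None
        when no usable channel remains.
        """
        for index, faulty in enumerate(self.vc_faulty):
            if not faulty:
                return index
        return None


def select_crossbar_path(local_ok: bool, redundant_ok: bool) -> Optional[str]:
    """Prefer the local data link; fall back to the redundant path.

    Returns "local", "redundant", or None when both paths are faulty.
    """
    if local_ok:
        return "local"
    if redundant_ok:
        return "redundant"
    return None


if __name__ == "__main__":
    # Two regular channels are faulty, so the spare (index 2) is chosen.
    port = InputPort(vc_faulty=[True, True, False])
    print(port.select_vc())
    # The local crossbar link is faulty, so the redundant path is chosen.
    print(select_crossbar_path(local_ok=False, redundant_ok=True))
```

The sketch only captures the priority order stated in the abstract (valid channel first, local link before redundant path); a hardware implementation would realize these choices with fault-feedback signals and multiplexer control logic rather than software.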

     
