基于通道增益的可变比特率点云压缩
Variable Rate Compression of Point Cloud Based on Channel Gain
-
摘要: 针对现有基于深度学习的点云压缩方法需要训练多个网络,耗费大量的时间和空间资源的缺陷,提出一种基于通道增益的可变比特率点云压缩方法.首先在网络的编码端利用层次化结构,通过每个层级提取点云特征和应用偏移注意力机制,有效地捕捉输入点云的关键特征信息;然后引入增益单元评估和缩放各个隐向量通道的重要度,消除向量通道间的信息冗余,仅需训练单个网络即可实现可变比特率压缩;为了预测特征向量的概率分布,对特征向量进行超先验编码,构造高斯熵模型,通过熵编码进一步降低编码量;最后在解码端采用子点卷积进行上采样重构原始点云,避免顶点的局部聚集,提高点云的重构质量.实验结果表明,在ShapeNet数据集上,以率失真曲线的BD-rate作为性能评价指标,与VRR和Draco方法相比,平均比特率分别降低48.66%和63.56%;压缩性能得到了显著的提升.Abstract: Existing point cloud compression methods based on deep learning require training multiple networks, which consumes a large amount of time and space resources. To address this issue, a variable rate point cloud compression network model based on channel gain is proposed. First, the encoding end of the network uses a hierarchical structure to extract point cloud features and apply offset attention mechanisms at each level, effectively capturing the key feature information of the input point cloud. Secondly, gain units are introduced to evaluate and scale the channel of each hidden vector, eliminating information redundancy between vector channels, and requiring only one network to achieve variable rate compression. Then, to predict the probability distribution of feature vectors, a hyperprior encoder is performed on the feature vectors to construct a Gaussian entropy model, further reducing the coding volume through entropy coding. Finally, the decoder uses sub-point convolutions for upsampling to reconstruct the original point cloud, avoiding local point aggregation and improving the reconstruction quality. For the ShapeNet dataset, with BD-rate on the rate-distortion curve as the performance evaluation index, our method reduces the average bitrate by 48.66% and 63.56% compared with VRR and Draco methods, respectively. The experimental results show that the compression performance has been significantly improved compared to current methods.