基于张量分解的红外机场监控视频压缩

Tensor Decomposition based Infrared Video Compression for Airport Surveillance Scenario

  • 摘要: 与传统视频相比,大景深红外机场监控视频的背景和前景区域往往变化较平缓,纹理信息较少,因此其时域和空域的相关性更强。虽然可用现有视频编码工具对红外监控视频进行压缩,但只能利用局部的视频信息去除冗余,且编码复杂度较高。该文从数据的全局角度出发,将整个红外监控视频视为一个三阶张量,然后采用张量分解近似表示以降低数据量,最后将分解得到的系数矩阵进行量化进一步提升压缩效率。同时,采用张量的CP分解和Tucker分解,分析对比了两种张量分解方法的压缩性能。实验结果表明,相较于HEVC,对于视频存在大量平缓区域的情况,Tucker分解可取得77.7%的BD-rate节省,但对于内容复杂和运动较大的情况,CP分解和Tucker分解的BD-PSNR下降超10 dB;但CP分解和Tucker分解的编码时间仅为HEVC的0.57%和2.25%。

     

    Abstract: Infrared video in airport surveillance scenario with a large depth of filed consists of smooth background and foreground areas with less textures, compared with traditional video. Therefore, the temporal and spatial correlations of Infrared video is larger. Although commonly used video codecs can be utilized to compress Infrared video, only a little local information is used to reduce the redundancy with high computation cost. In view of this, this paper considers the whole Infrared video as a tensor from global angle and then use tensor decomposition approach to reduce the data volume. The resulting factor matrix is then quantized. Moreover, this paper analyzed the compression performance of CP decomposition and Tucker decomposition. Experimental results show that, compared with HEVC, Tucker decomposition can achieve about 77.7% BD-rate savings while the video contains lots of smooth areas. If not, CP decomposition and Tucker decomposition would BD-PSNR lose about 10 dB. However, the encoding time of CP decomposition and Tucker decomposition is just about 0.57% and 2.25% compared with HEVC, respectively.

     

/

返回文章
返回