基于张量分解的红外机场监控视频压缩技术

Technology of Tensor Decomposition based Infrared Video Compression for Airport Surveillance Scenario

  • 摘要: 与传统视频相比,大景深红外机场监控视频的背景和前景区域往往变化较平缓,纹理信息较少,因此其时域和空域的相关性更强。虽然可用现有视频编码工具对红外监控视频进行压缩,但只能利用局部的视频信息去除冗余,且编码复杂度较高。该文从数据的全局角度出发,将整个红外监控视频视为一个三阶张量,然后采用张量分解近似表示以降低数据量,最后将分解得到的系数矩阵进行量化进一步提升压缩效率。同时,采用张量的CP分解和Tucker分解,分析对比了两种张量分解方法的压缩性能。实验结果表明,相较于HEVC,对于视频存在大量平缓区域的情况,Tucker分解可取得77.7%的BD-rate节省,但对于内容复杂和运动较大的情况,CP分解和Tucker分解的BD-PSNR下降超10 dB;CP分解和Tucker分解的编码时间仅为HEVC的0.57%和2.25%。

     

    Abstract: Compared with traditional videos, infrared videos in airport surveillance scenario with a large depth of filed consist of smooth background and foreground areas with less textures, thus processing stronger temporal and spatial correlations. Although commonly used video coding tools can be utilized to compress infrared videos, they only utilize local information to reduce the redundancy, resulting in high computational complexity. Based on a global perspective of the data, this paper considers the whole infrared surveillance video as a third-order tensor, then adopts tensor decomposition for approximate representation to reduce the data volume, and finally quantizes the resulting factor matrices to further improve compression efficiency. Meanwhile, CP decomposition and Tucker decomposition are adopted to analyze and compare the compression performance of these two tensor decomposition methods. Experimental results show that, compared with HEVC, Tucker decomposition can achieve about 77.7% BD-rate savings for videos containing lots of smooth areas. For videos with complex content and significant motion, however, both CP decomposition and Tucker decomposition suffer from BD-PSNR lose of over 10 dB. Nevertheless, the encoding time of CP decomposition and Tucker decomposition is just about 0.57% and 2.25% of that of HEVC, respectively.

     

/

返回文章
返回