SSIM-Based CTU-Level Joint Optimal Bit Allocation and Rate Distortion Optimization
Abstract
Structural similarity (SSIM)-based distortion DSSIM is more consistent with human perception than the traditional mean squared error DMSE. To achieve better video quality, many studies on optimal bit allocation (OBA) and rate-distortion optimization (RDO) used DSSIM as the distortion metric. However, many of them failed to optimize OBA and RDO jointly based on SSIM, thus causing a non-optimal R-DSSIM performance. This problem is due to the lack of an accurate R-DSSIM model that can be used uniformly in both OBA and RDO. To solve this problem, we propose a DSSIM-DMSE model first. Based on this model, the complex R-DSSIM cost in RDO can be calculated as simpler R-DMSE cost with a new SSIM-related Lagrange multiplier. This not only reduces the computation burden of SSIM-based RDO, but also enables the R-DSSIM model to be uniformly used in OBA and RDO. Moreover, with the new SSIM-related Lagrange multiplier in hand, the joint relationship of R-DSSIM-λSSIM (the negative derivative of R-DSSIM) can be built, based on which the R-DSSIM model parameters can be calculated accurately. With accurate and unified R-DSSIM model, SSIM-based OBA and SSIM-based RDO are unified together in our scheme, called SOSR. Compared with the HEVC reference encoder HM16.20, SOSR saves 4%, 10%, and 14% bitrate under the same SSIM in all-intra, hierarchical and non-hierarchical low-delay-B configurations, which is superior to other state-of-the-art schemes.