Intra-Frame Error Concealment Scheme using 3D Reversible Data Hiding in Mobile Cloud Environment

04/03/2018 ∙ by Yanli Chen, et al. ∙ NetEase, Inc 0

Data in mobile cloud environment are mainly transmitted via wireless noisy channels, which may result in transmission errors with a high probability due to its unreliable connectivity. For video transmission, unreliable connectivity may cause significant degradation of the content. Improving or keeping video quality over lossy channel is therefore a very important research topic. Error concealment with data hiding (ECDH) is an effective way to conceal the errors introduced by channels. It can reduce error propagation between neighbor blocks/frames comparing with the methods exploiting temporal/spatial correlations. The existing video ECDH methods often embed the motion vectors (MVs) into the specific locations. Nevertheless, specific embedding locations cannot resist against random errors. To compensate the unreliable connectivity in mobile cloud environment, in this paper, we present a video ECDH scheme using 3D reversible data hiding (RDH), in which each MV is repeated multiple times, and the repeated MVs are embedded into different macroblocks (MBs) randomly. Though the multiple embedding requires more embedding space, satisfactory trade-off between the introduced distortion and the reconstructed video quality can be achieved by tuning the repeating times of the MVs. For random embedding, the lost probability of the MVs decreases rapidly, resulting in better error concealment performance. Experimental results show that the PSNR values gain about 5dB at least comparing with the existing ECDH methods. Meanwhile, the proposed method improves the video quality significantly.



There are no comments yet.


page 4

page 8

page 9

This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.

I Introduction

With the rapid growth of information and communication technology, multimedia is becoming the popular format in the Internet. However, the storage space, computer resource, and bandwidth limit the development of multimedia communication. Cloud computing moves services, computation, and data to location-transparent centralized facilities or providers to solve these problems [1].

The cloud services are becoming more and more popular, especially in the computing and storage aspects. Now, many multimedia companies have cloud-based platforms or shift their storage and computing service to the third parties, e.g., Youtube, Dailymotion, Tencent, and iqiyi. And the services provided by cloud can be used without high cost and complex architecture. In this case, with the development of terminal equipment, mobile cloud becomes a popular environment. However, for the unreliable connectivity of mobile cloud environment, it creates new problems for users. Mobile cloud architecture may bring out transmission degradation, especially transmission errors in mobile cloud [2].

Currently, video quality is becoming more attractive, especially for the medical video,surveillance video. To guarantee better video services, mobile cloud provides efficient platform for computing, storage and transmission. Since there is massive redundancy in the video sequence, they always need to be compressed before storing and transmitting. The popular video compression standard such as MPEG-2/4, H.264/AVC, and H.265 demands either parallel or distributed processing platform [2]. In the case of cloud computing, many Video Service Providers (VSPs) rent out the distribution architecture from Cloud Service Providers (CSPs) [3].

The highly compressed videos are sensitive to the transmission errors. In order to ensure the quality, the video transmission encounters many challenges [2]: 1) Since both the low and high speed networks are existing in the Internet, it may cause data buffering at different locations during data transmission, then out of order delivery and packet loss or drops may occur; 2) Video transmission is based on connectionless protocols, and it cannot provide lossless data transmission.

For the video compression standard, there are two basic modes, inter-frame and intra-frame. Usually, for a frame, macroblocks (MBs) are the processing units under intra mode, and a processing unit may be a frame itself or a MB under inter mode. That is to say the video quality is mainly based on frame/MB quality. To improve the video quality, error resilience and error concealment are used as popular technologies. The error resilience techniques are used to recover the errors at the encoder side, and the error concealment techniques are used to conceal errors at the decoder side especially for the problem of the packet lost.

There are two methods for error concealment at the decoder side, data hiding-based method and the spatially/temporally correlation-based (data interpolation) method

[1, 4]. The data interpolation method, which works well [5, 6] in the case of the successfully received data, can be used to reconstruct the lost data. For data hiding-based method, the performance depends on that whether the marked data can reflect the frames characteristics. Generally, the mark data consists of motion vectors (MVs) or residual which described the motion features.

Data hiding is a popular technique used in the field of multimedia security such as copyright protection, content authentication and secret communication [7]. Because the data hiding has the advantage that the mark data can be carried in a seamless way, it has been employed to conceal video errors produced by the transmission. Adsumilli et al.[8] proposed an error concealment technique in which the 2-level discrete wavelet transforms (DWT) approximation coefficients are converted to halftone image which is embedded into original frame. Yilmaz et al. [9] presented an error concealment method using edge orientation information as marked data. Ref. [4] compressed the residuals of the neighbor frames by compressed sensing (CS) as marked data at the encoder side, and reconstructed the original residual by CS to conceal the channel errors at decoder side.

In general, the more information about frame is embedded, the better error concealment performance. However, embedding more information means to degrade the quality of marked video. For the highly correlation between neighbor frames, the MVs of every MB are hidden to conceal the corresponding error MBs [10, 11]. In Yao et al. [10], the MVs of the MBs belonging to the region of interest (ROI), which are shared in a frame group, were embedded into the background region within the same frame. In Chen et al. [11], the MVs of every MB are embedded into the quantized discrete cosine transform (QDCT) coefficients of neighboring MBs, and at the decoder side, the extracted MVs are used to find the matching MBs. However, it modified the DCT coefficients which may lead to video quality degrade.

To strike balance between the marked video quality and error concealment performance, reversible data hiding (RDH) [12, 13] can recover the cover losslessly. Chung et al. [14] presented a video error concealment scheme using RDH. Each MV is embedded into QDCT coefficients with zero values of the corresponding MB by a circular embedding scheme. Xu et al. [15] proposed a RDH-based intra-frame error concealment method, in which it contains MV data pre-processing and the selection of embedding region. Later, Xu et al. [7] raised a two dimensional RDH-based intra-frame error concealment scheme, and it is mainly used to reduce the distortions caused by RDH.

However, their performance needs to be improved for two reasons. Firstly, for the random of the channel state, errors are occurred randomly. So the error concealment performance in [14, 15, 16] cannot benefit from changeless embedding location. Secondly, for smooth video sequences, embedding MVs of the MBs sized is a valuable selection to gain better error concealment performance and reduce the amount of marked data. However, to enhance the robustness of data hiding, the marked data may be produced by repeating MVs multiple times, and even more MVs with smaller block size are needed to be embedded into the frames. In order to solve these problems, an intra-frame error concealment scheme using 3D RDH is proposed. Since the mobile cloud-based services are contributed on the distributed transmitted system [2], and almost all the client ends are connected to the network with wireless, the video can be transmitted effectively in mobile cloud environment in the case of limited bandwidth.

In this paper, to combine with the popular compression standard, H.264 is selected as the implementation platform. The rest of this paper is organized as follows. In Section 2, the motivation and novelty of proposed scheme is described. The main idea of 3D RDH is illustrated in Section3. And then, the proposed 3D RDH based error concealment method is presented in Section 4. In Section 5, experimental results are introduced. At last, the paper is concluded in Section 6.

Ii Motivation and Novelty

For a video transmitted through mobile cloud environment, the received video quality is mainly decided by the channel state in mobile cloud environment. Since the unreliable connectivity is existing in mobile cloud, for the video signal, error concealment methods are needed to improve the visual video quality. Meanwhile, there are much redundancy between the neighbor frames in a video. To raise video communication efficiency, the video signal is compressed before transmitting, and only the residuals between two frames/MBs are transmitted. So, some of error concealment methods are based on these temporally/spatially correlation. However, if the reference frames/MBs are corrupted, the errors may be propagated in the current frame/MB. Error concealment scheme using data hiding (ECDH) can conceal the errors from the extracted data, and its performance usually depends on that whether the mark data can reflect the frame/MB feature well. Then, a video ECDH is proposed in this paper to conceal the errors caused by the unreliable connectivity.

Secondly, at the decoder side, since the data hiding is realized by embedding data into QDCT coefficients. In other words, there are difference between the marked QDCT coefficients and the original QDCT coefficients. Also, these difference will produce high distortions in the decoded frames, and higher quantization steps make higher distortions. In order to avoid these distortions, a reversible data hiding method that the carrier can be recovered losslessly after data extraction is used in the proposed ECDH scheme.

Thirdly, for a MB, there is only one MV( include horizontal and vertical components, it is the same in the rest of this paper) that is hidden in the frame. If the MV is embedded into a corrupted MB, the MB cannot be concealed. To raise the probability of success concealment, every MV is repeated multiple times before being embedded, and these MV, belonging to a specified MB, are embedded into MBs in a frame separately and randomly. Then, if a MB is corrupted, only one MV is enough to reconstruct the MB. Also, a MB and its MV may be lost at the same time with lowest probability. However, MVs repeating makes the amount of marked data increasing. A 3D RDH is proposed to satisfy the demands of the embedding capacity-distortion performance, and then the video quality is improved. This also the novelty of this paper.

Iii Proposed 3D RDH Scheme

The methods in [14, 15] focus on using one-dimensional (1D) coefficient histogram for RDH. The 1D RDH is usually defined as [7]


where denotes the cardinal number of a set. Specifically, by considering every two adjacent residual coefficients together, 2D RDH can be defined as [7]


For 2D RDH, more than two bits may be embedded into a pair of residual coefficients [20]. Furthermore, bins ’11’ can be embedded by modifying only one coefficients [7, 20]. Obviously, the distortion is lower than that in 1D RDH before. However, they cannot satisfy the demand of the capacity-distortion performance. For 3D RDH method, every three adjacent residual coefficients are considered as a coefficient triple, it can be defined as


For a coefficient triple, the transform space is more than that in 2D RDH. In 2D RDH, the two adjacent residual coefficients can be modified to four different pairs. However, in 3D RDH, a coefficient triple can be modified to eight different possible triples, and it can provide more capacity.

Generally, the amount of the mark data is constant. However, most of the MVs have zero or near-zero values [7]. For 1D RDH [14, 15], residual coefficients do not need to be expanded [7] in this occasion, but the histogram shifting operation is still to be carried out for the non-zero coefficients. For 2D RDH, beside the coordinate (0, 0), half coefficients in a coefficients pair are needed to be modified to embed one/two bits. In [7, 20], at most two bits are embedded into a pair, and only one coefficient expands 1. So, both the 1D RDH and 2D RDH cannot meet the demands of the capacity-distortion performance well. In order to solve this problem, a 3D RDH with high capacity and low distortion is proposed and it is shown in Fig.1.

Recently, a 3D RDH scheme is proposed in [21] to decrease the distortion. In [21] at most two bits are embedded into a coefficient triple, and it is not very suitable for our application scenarios. Our intention is to embed more bits and shift histogram in a less distorted direction as much as possible.

Fig. 1: Illustration of the proposed 3D reversible data hiding

Iv Proposed error concealment scheme

For the intra-frame error concealment, it mainly focus on the corrupted block. And for the inter-frame error concealment, it focus on the whole frame errors, such as delete or lost a frame, insert a frame, mis-order frames, and so on. In this scheme, we pay our attentions on utilizing 3D RDH and MV characteristics, which can reflect the most matching blocks, to conceal the corrupted blocks in a frame. The proposed 3D RDH in Section 3 is used to increase the embedding capacity and decrease the embedding distortions. The MVs are used as the secret data that are embedded in a frame, and also help the receiver to find the most matching blocks to conceal the corrupted blocks. If the sender want to communicate with the receiver, he/she will transport a secret key to the receiver, which is used to make the same scrambled MVs for the sender and receiver.

Iv-a Mark Data generation

(1) Mark data

To provide high quality video, a high capacity RDH-based error concealment method is needed. All the schemes in [7, 14, 15] selected the MVs of MBs as the marked data, and they can satisfy the demand of lower activity video sequence. To improve the robustness, the marked data is repeated multiple times, and more information may be embedded into the video sequence.

For a MB with size , if the search range is 15, the total number of information bits to be embedded into a MB is [7]:


where denotes the ceiling function. In this paper, to enhance the robustness of the marked data, at the encoder side, all the MVs in a frame are repeated multiple times as marked data before being embedded. Assuming every MV is repeated times, then the total number of information bits to be embedded into one MB is:


Significantly, with the increasing, a large capacity RDH scheme is needed, and the embedding capacity is controllable by adjusting .

(2) Embedding locations

In [7], [14] and [15], the MV of every MB is embedded into the neighbor MB. However, in actuality, in mobile cloud environment, the errors may be spread multiple neighbor blocks in a higher probability, the stationary embedding locations cannot resist the random channel errors.

In [7, 10, 11, 14, 15, 16], every neighbor MBs are considered as a MB group, MVs are embedded into the neighbor MBs circularly. There are horizontal, vertical and diagonal patterns, and only vertical pattern can recover two MBs [15]. However, for mobile cloud environment, the errors are occurred randomly, and the neighbor MBs may be corrupted in the same time. To keep consistent with the random channel, all the repeated MVs in a frame are scrambled randomly, then every MVs are considered as a group and embedded into MBs sequentially. In other words, the MVs belonging to the same MB may be distributed in different MBs randomly. And for a lost MB, it is enough to conceal the error that only one of the repeated MVs is embedded into the non-corrupted MB. Meanwhile, The host MBs and their all repeated MVs are lost at the same time with a much lower probability. Also, the repeating frequency can be adjusted according to the demands of the video application.

In order to illustrate the performance of random embedding MVs further, assuming there are N blocks in a frame, and the PLR is . For a specified MB denoted as B, it is lost with probability . Since the repeated MVs are scrambled randomly, the repeated MVs belonging to the specified MB may be embedded into MBs. If we do not consider the influence from other MVs in the embedding process, the MVs are embedded into MBs with probability . Furtherly, in the embedding process, the worst condition is that there is only one location to embed the MVs, then . If the host MBs are corrupted at the same time, MB B cannot be concealed correctly. This case occurs with probability , and it is also the probability that a MB cannot be concealed correctly. Conversely, for a lost MB, even if only one of its MVs is embedded in the normal MB, MB B can be concealed correctly. So a lost MB can be concealed with the probability . Obviously, under the normal channel, , the relationship between and is described as


For the random embedding location, the relationship between host MBs and the embedding locations of their MVs is changeable. Fig.2 and Fig.3 shows the mapping of host MBs and the embedding locations of their MVs in the residual coefficients frame in video hall with and , and it shows the neighbor MBs in the left(the original host MBs) are scrambled dispersedly in the right residual coefficients frame (scrambled embedding locations).

Fig. 2: The mapping of host MB and scrambled MVs with

Fig. 3: The mapping of host MB and scrambled MVs with

In order to show the influence on concealment performance significantly, in this paper, all the corrupted MBs that cannot be concealed correctly are set to be zeros and all the concealment experimental results in this paper are implemented under this case. In the other words, if the MBs and their MVs are lost at the same time, the decoded MBs are substituted by black blocks. To reveal the impact from random embedding on the concealment performance, Fig.4 shows the comparison of concealed frames PSNR between the random embedding and changeless embedding with PLR 0.1, 0.3 and , it reveals that random embedding can achieve a better performance.

At the decoder side, for a MB, there would be multiple same MVs for a specified MB, and they may be extracted from different MBs. So, if a MB is corrupted, it can be concealed in a high probability.

(a) PLR=0.1
(b) PLR=0.3
Fig. 4: PSNR of concealed video sequences foreman with different PLR: (a) PLR=0.1 ;( b) PLR=0.3

Fig. 5: The H.264 Encoding framework

Iv-B Data Hiding

Fig.5 shows the H.264 encoding procedure [18, 19]. Considering the video compressing process, the marked data is embedded into the quantized DCT coefficients, and the mid-frequency and high-frequency coefficients are used to carry the MVs. For different repeating frequency requires different embedding capacity, the AC component is reserved for embedding. Since the smallest encoding unit is block, all the AC components of QDCT coefficients are considered as the cover. After zigzag scanned, every three adjacent coefficients are grouped to be a triple. After that, the mid-frequency and high-frequency coefficients are divided into triples in order. The modifications on these triples can be actualized based on the proposed 3D RDH in Section 3. Every embedding operation is actualized by modifying one or two elements in a triple with . The original coefficients triple , the marked coefficients triple and the recovered coefficient triple are denoted as .

The embedding procedure is as follows.

(1) If , the marked coefficient triple is determined as follows


where denotes the to-be-embedded bits. In this case, three bits are embedded by modifying one element in a triple at most with the probability 0.75.

(2) If , the marked coefficient triple is determined as follows


where denotes the symbol of .In this case, at least one bit is embedded with modifying one of three coefficients. Comparing with the 3D RDH [21], the capacity-distortion performance is improved.

(3) If , the marked coefficient triple is determined as follows


In this case, more than two bits are embedded by expanding 1. However, in the 3D RDH [21], only one bit can be embedded with the same distortion.

(4) If , the marked coefficient triple is determined as follows


In this case, one or two bit is embedded with modifying one coefficient. For the large quantization step, most of the AC coefficients are zeros. This case is occurred with low probability infrequently.

(5) If and , the coefficient triple is shifted as follows.


(6) If and , the coefficient triple is shifted as follows.


In the case 5) and 6), all the shifting are realized by modifying two coefficients in a triple.

Iv-C Data Extraction and Original Video Recovery

The original video can be recovered by extracting the hidden MVs according to the proposed 3D RDH in this paper, and error MBs can be concealed by replacing the matching MBs in reference frame. The data extraction process is actualized before de-quantization, so the host MBs can be decoded without any extra distortion when there are no transmitting errors. Once the transmitting errors occur, the error concealment process is realized after the data extraction.

Assuming there are no errors in the transmission, data extraction procedure can be described as follows

(1) If the marked coefficients triple the hidden bits can be extracted as


After the bits are extracted, the recovered coefficients triple recovers to (0,0,0).

(2) If the marked coefficients triple , the hidden bits are extracted as follows.


After the bits are extracted, the host coefficient triples are recovered as


(3) If the marked coefficients triple, the hidden bits are extracted as


After the bits are extracted, the host coefficient triples are recovered as


(4) If the marked coefficients triple , the hidden bits are extracted as


After the bits are extracted, the host coefficient triples are recovered as


(5) If the marked coefficient triples belong to shifting triples, they are recovered by inverse-processing of shifting.

Unfortunately, if there are transmission errors, after all the bits are extracted, the MVs are obtained by inverse-scrambling the extracted MVs. The error MBs are concealed by replacing those using matching blocks.

V Experimental Result

The proposed scheme is applied in the H.264, and the front 30 frames of six standard sequences in QCIF format () are used to test the performance of the proposed scheme. Every 10 frames are considered as a GOP. In order to reveal the advantage of the proposed scheme, recent works in [7] and [15] are used to be compared with the proposed scheme.

V-a Embedding Distortion

In this scheme, 3D RDH is used to embed the MVs into the quantized DCT residual coefficients. The goal of proposed RDH scheme is to improve embedding capacity and introduce low embedding distortion. However, since MVs are embedded into the host residual coefficients, it would cause difference between the host coefficients and marked coefficients. For the RDH algorithm, the host coefficients can be recovered completely from the embedding distortions if the video sequences are transmitted without any channel errors.

(1) ECDR

Absolute embedding capacity and distortion ratio (ECDR) can be defined simply as


where denotes the absolute embedding capacity, denotes the absolute distortion, and both of them are calculated with the full embedding situation. In order to analyze the proposed 3D RDH performance better, the residual coefficients teiples are divided into multiple sets. and denote the original coefficient triples sets of the 1st, 2nd, 3rd ,4rd case in Section 4, is defined in Eq.(25), denotes the original coefficient triples set of 5th and 6th case. The ECDR of the proposed 3D RDH is evaluated by comparing recent 3D RDH in [21]. Meanwhile, the embedding capacity of the proposed 3D RDH, and that in [21] are denoted as , and the distortions in these schemes are denoted as .


where denotes the elements number in set. The embedding capacity and distortion difference are calculated as


Eq. (25) indicates that the embedding capacity in proposed scheme is much more than that in the scheme [21] and Eq. (26) indicates that the distortion difference in the two schemes is decided by the elements number in and . However, for video compressed standard, the number of the elements in these sets is mainly decided by quantization step. With the quantization step decreasing, coefficients with zero values decrease, and also the distortion difference may decrease. Generally, to guarantee the quality of the decoded sequence, the quantization step cannot be a number which make all the AC components are zeros. So the ECDR in the proposed scheme is more than that in the scheme [21] in great probability. Let and denote the probability of the case and , then the relation of probability is described as


Assuming and denote the ECDR in the proposed scheme and in [21]. According to Eq.(25) and Eq.(26), is greater than in a higher probability, and it can be described as follows.


To illustrate the ECDR of the two 3D RDH scheme, emebedding capacity(EC) and the corresponding distortion of the 2nd frame in video sequence , , and are calculated in Table I. It shows that with the quantization step (QP) increasing, the embedding capacity is increased too, and the distortion is decreased for the decreased shifting operations with lower QP. It also shows the proposed 3DRDH can provide more embedding space. Fig.6 shows the ECDR comparison of the ECDR in proposed 3D RDH is nearly two times as that in [21].

        QP 18 20 22 24 26 28 30
               EC   proposed 15802 16981 17939 18732 19444 19994 20484
                        [21] 12792 13807 14633 15295 15905 16358 16762
foreman      D    proposed 7329 7140 7023 6892 6859 6817 6795
                        [21] 7051 7024 6819 6693 6510 6451 6309
               EC   proposed 16694 17893 18771 19822 20225 10577 20875
                        [21] 13560 14595 15336 16204 16537 16828 17072
  hall         D    proposed 6920 6816 6758 6787 6797 6816 6817
                        [21] 7372 7048 6795 6528 6383 6280 6186
               EC   proposed 12586 14107 15405 16762 17900 18760 19624
coastguard              [21] 10034 11350 12459 13625 14591 15320 16041
               D    proposed 7996 7539 7190 7002 6857 6807 6758
                        [21] 7480 7472 7411 7195 7043 6798 6589
              EC   proposed 18781 19342 19730 20398 20679 20925 21254
  grandma                [21] 15301 15784 16114 16684 16909 17121 17396
               D    proposed 6748 6747 6721 6763 6786 6814 6852
                         [21] 6871 6738 6557 6361 6276 6179 6100
TABLE I: the embedding capacity and distortion comparison with [21]
(a) coastguard
(b) foreman
(c) hall
(d) grandma
Fig. 6: The EDAR of proposed 3D RDH comparison with [21]

(2) PSNRs of marked frames

Generally, at the decoder side, the quality of decoded frames without any other manipulations is the limitation of the concealed frames. If the peak signal to noise ratio (PSNR) values of marked frames are close to the limitations, the data hiding would have few influences on the marked video quality. However, the repeating frequency is also a key factor that influences the quality of the marked frames. Higher repeating frequency means more data is embedded into the frames. To indicate the concealment performance with different repeating frequencies, the cases with repeating frequencies and are compared with the limitation case. Fig.7 shows the PSNR values comparison of marked frames with the decoded frames in the proposed method, and it shows the PSNR values with is near the limitation.

(a) grandma
(b) akiyo
(c) hall
(d) foreman
(e) silent
(f) coastguard
Fig. 7: he PSNR values comparison of marked frame with the limitation

V-B Performance of Error Concealment

(1) Error concealment performance comparison

The error concealment performance is one of the most important features for this scheme.In cloud environment, data are stored in distributed way and may be from different servers. The packet lost rates (PLRs) can reflect all the channel states of the whole transition procedure. To simulate the transmission errors, different random intra-frame PLR are applied to each intra-frame, such as 0.05, 0.10, 0.15, 0.20 and 0.30. For every PLR, the performance is measured in terms of PSNR between original video sequence and concealed video sequence. Since [7] indicated the concealment performance in it is better than that in [15], so the concealment performance comparison is taken with [7]. The PSNR comparisons of concealed frames are showed in Fig.8 and both the cases and are considered. According to Fig.8, the error concealment performance is better than that in [7] significantly. Especially, the corrupted frames are concealed completely with . The reason is that for the scheme in [7], the cover MB for each MV is not random as the channel errors. In contrary, the embedding location of each MB is random in our proposed error concealment scheme, and also it reveal that Eq. (6) is correct.

(a) grandma
(b) akiyo
(c) hall
(d) foreman
(e) silent
(f) coastguard
Fig. 8: PSNR comparison of concealed frames

Fig.9, Fig.10, and Fig.11 shows the visual quality of original frame, error frame, concealed frame in coastguard , grandma and hall sequences with PLR and respectively. In this paper, all the errors caused by channel are occurred randomly and the corrupted frames are shown in Fig.9 (b), Fig.10 (b) and Fig.11 (b). The visual quality of the concealed frames is shown in Fig.9 (c), Fig.10 (c) and Fig.11(c).

(a) original
(b) error
(c) conceaed
Fig. 9: Visual quality comparison for the coastguard sequence with PLR 0.2 and
(a) original
(b) error
(c) conceaed
Fig. 10: Visual quality comparison for the grandma sequence with PLR 0.2 and
(a) original
(b) error
(c) concealed
Fig. 11: Visual quality comparison for the coastguard sequence with PLR 0.2 and

Furtherly, the visual quality of concealment performance is tested with . Fig.12 (a), Fig.13 (a) and Fig.14 (a) show the error frames, and their original frames are the same as the Fig.9, Fig.10 and Fig.11. Fig.12 (b), Fig.13 (b) and Fig.14 (b) show the concealed frames with , the concealed frame is similar in visual to the original frames.

(a) error
(b) conceaed
Fig. 12: Visual quality comparison for the coastguard sequence with PLR 0.2 and
(a) original
(b) error
Fig. 13: Visual quality comparison for the grandma sequence with PLR 0.2 and
(a) original
(b) error
Fig. 14: Visual quality comparison for the hall sequence with PLR 0.2 and

(2) MSE

PSNR denotes the peak SNR, it cannot reveal the distortion overall. Mean square error (MSE) is usually explored to capture the errors of every element, and it can reflect the comprehensive errors in a frame. For an error concealment scheme using RDH, the distortion may be generated both in the procedure of embedding and concealment. Without loss of generality, the original and the concealed frame with size are denoted as and , and the pixels in them are denoted as , and . The concealment distortion is defined as


For the scheme in this paper, the quality of the concealed frames is controllable by adjusting the repeating frequency. However, the concealment performance is mainly decided by the error concealment method. As we can see, the PSNR values and the visual quality can illustrate the concealment performance, but they cannot reveal the detail distortions in a frame. The MSE is the summation of every distortion, also it can amplify the great distortion and reduce the low distortion. To explain the distortion of the concealment frames, Fig.15 shows the MSE of concealed frames with different repeating frequencies and different videos. It shows MSE performance of almost all the video sequences will achieve a steady values with the repeating frequencies increasing. Also, for the test video foreman , hall , and coastguard , the MSE in the proposed scheme is better than that in [7]. In addition, based on the extracted marked data, the concealment performance in this paper is mainly decided by the random embedding locations, the concealment performance may be different every time. And the Section 5.1 has explained the errors can be concealed with greater probability.

(a) additivedisconceal
(b) foreman
(c) hall
(d) coastguard
Fig. 15: MSE of concealed frames with different PLR

Vi Conclusion

Reliable transmission of compressed video under mobile cloud environment is a challenging problem. In this paper, a 3D RDH-based video error concealment technique is proposed for H.264 codec. At the encode side, every three adjacent AC coefficients are counted to be a triple elements, and the data is embedded in the triples by shifting the triples. The proposed 3D RDH is utilized to embed the MVs into quantized DCT coefficients randomly. At decoder side, MVs are extracted and recovered to conceal the corrupted MBs. Meanwhile, the marked data is controllable by adjusting the MV repeating frequency, and both the concealment and embedding performance can be adjusted correspondingly. The experimental results show that the scheme is benefit from the repeated and random embedding locations, and also the proposed high capacity 3D RDH. By using the proposed 3D RDH method, rich space are utilized to embed more marked data. How to take advantage of the 3D space characteristic to exploit a more suitable histogram modifying should be researched in the future. Also, the error concealment scheme should be improved to the direction of few embedding bits and better concealment performance for the mobile cloud environment.


This work is supported by the National Natural Science Foundation of China (NSFC) under the grant No. U1536110.


  • [1] T. Hobfeld, R. Schatz, M. Varela, C. Timmerer, Challenges of QoE management for cloud applications, IEEE Commun. Mag.,    vol. 50, no. 4, pp. 28-36, Apr. 2012.
  • [2] M. Usman, X. He, K. Lam, M. Xu, S. Bokhari, and J. Chen, Frame interpolation for cloud-based mobile video streaming, IEEE Transactions on Multimedia,    vol. 18, no. 5, pp. 831 C839, 2016.
  • [3] A. Alasaad, K. Shafiee, H. Behairy, V. Leung, Innovative schemes for resource allocation in the cloud for media streaming applications, IEEE Trans. Parallel Distrib. Syst.,    vol. 26, no. 04, pp. 1021-1033, Apr. 2015.
  • [4] Yanli Chen, Hongxia Wang , Hanzhou Wu, A video error concealment method using data hiding based on compressed sensing over lossy channel, doi:10.1007/s11235-017-0393-1,to be published.
  • [5] J. Liu, G.T. Zhai, X.K. Yang, B. Yang, L. Chen, Spatial error concealment with an adaptive linear predictor, IEEE Trans. Circuits Syst. Video Technol.,    vol.25,no.3, pp. 353-366, 2015.
  • [6] Yanling Xu, Yuanhua Zhou, Adaptive Temporal Error Concealment Scheme for H.264/AVC Video Decoder, IEEE Transactions on Consumer Electronics,    vol.54, no.4, pp.1846-1851, 2008.
  • [7] Dawen Xu, Rangding Wang, Two-dimensional reversible data hiding-based approach for intra-frame error concealment in H.264/AVC, Signal Processing: Image Communication,    vol.47, pp. 369-379, 2016.
  • [8] Chowdary B. Adsumilli, Myl ne C. Q. Farias,Sanjit K. Mitra, and Marco Carli, A Robust Error Concealment Technique Using Data Hiding for Image and Video Transmission Over Lossy Channels, IEEE Trans. Circuits Syst. Video Technol.,,    vol. 15, no. 11, pp.1394-1406, 2005.
  • [9] A. Y?lmaz, A.A. Alatan, Error detection and concealment for video transmission using information hiding, Signal Process.: Image Commun.,    vol.23 no.4, pp. 298-312, 2008.
  • [10] Yuanzhi Yao,Weiming Zhang,Nenghai Yu, Adaptive Video Error Concealment Using Reversible Data Hiding, in Proc. MINES, 2012, pp.658-661.
  • [11] S.Y. Chen, H. Leung, A temporal approach for improving intra-frame concealment performance in H.264/AVC, IEEE Trans. Circuits Syst. Video Technol.,    vol.19 , no.3, pp. 422-426, 2009.
  • [12] Z. Ni, Y.Q. Shi, N. Ansari, and W. Su, Reversible Data Hiding, In Proc. ISCS, Bangkok, Thailand, vol. 2, pp. 912-915, 2003.
  • [13] Z. Ni, Y. Q. Shi, N. Ansari, and W. Su, Reversible data hiding, IEEE Trans. Circuits Syst. Video Technol.,    vol. 16, no. 8, pp. 354 C362, 2006.
  • [14] K.L. Chung, Y.H. Huang, P.C. Chang, H.-Y.M. Liao, Reversible data hiding-based approach for intra-frame error concealment in H.264/AVC, IEEE Trans. Circuits. Syst. Video Technol.,    vol.20, no.11, pp.1643 C1647, 2010.
  • [15] D.W. Xu, R.D. Wang, Y.Q. Shi, An improved reversible data hiding-based approach for intra-frame error concealment in H.264/AVC, J. Vis. Commun. Image Represent.    vol.25, no.2, pp. 410 C422, 2014.
  • [16] K. Lin, J. Song, J. Luo, W. Ji, M. Hossain, and A. Ghoneim, GVT: Green video transmission in the mobile cloud networks, IEEE Trans. Circuits Syst. Video Technol., to be published, doi: 10.1109/TCSVT.2016.2539618.2016.
  • [17] Tew Y, Wong K. An Overview of Information Hiding in H.264/AVC Compressed Video, IEEE Trans. Circuits Syst. Video Technol.,    vol.24, pp.305 C 319,2014.
  • [18] A A Muhit, M R Pickering, M R Frater and J F Arnold, Video Coding using Elastic Motion Model and Larger Blocks, IEEE Trans. Circuits Syst. Video Technol.,    vol. 20, no. 5, pp. 661-672, 2010
  • [19] A A Muhit, M R Pickering, M R Frater and J F Arnold, Video Coding using Geometry Partitioning and an Elastic Motion Model, Visual Communication and Image Representation, vol.23,no.1,pp.31-41,2012.
  • [20] Juan Zhao , Zhi-Tang Li , Bing Feng, A novel two-dimensional histogram modification for reversible data embedding into stereo H.264 video, Multimedia Tools and Applications,    vol.75 on.10, pp.5959-5980, May 2016.
  • [21] Juan Zhao, Zhi-Tang Li, three-dimensional histogram shifting for reversible data hiding, Multimedia System(2016),doi:10.1007/s00530-016-0529-2,to be publiched. .