Automated approach for splicing detection using first digit probability distribution features

Mire, Archana V.; Dhok, Sanjay B.; Mistry, N. J.; Porey, Prakash D.

doi:10.1186/s13640-018-0257-y

Research
Open access
Published: 05 March 2018

Automated approach for splicing detection using first digit probability distribution features

Archana V. Mire¹,
Sanjay B. Dhok²,
N. J. Mistry¹ &
…
Prakash D. Porey²

EURASIP Journal on Image and Video Processing volume 2018, Article number: 18 (2018) Cite this article

1976 Accesses
6 Citations
Metrics details

Abstract

Digital image tampering operations destroy inbuilt fingerprints and create own new fingerprint in the tampered region. Considering the Internet speed and storage space, most of the images are circulated in the JPEG format. In a single compressed JPEG image, the first digits of DCT coefficients follow a logarithmic distribution. This distribution is not followed by DCT coefficients of DCT grid aligned double compressed images. In a tampered image, the major portion of the original JPEG image is aligned double JPEG compressed. Hence, untampered region does not follow this logarithmic distribution. Due to the nonalignment of DCT compression grids, tampered region still follows this logarithmic distribution. Many tampering localization techniques have investigated this fingerprint, but the majority of them uses SVM classifier, specifically trained for the respective primary and secondary compression qualities of the test images. The efficiency of these classifiers is dependent on the knowledge of tampered image compression history. Hence, these approaches are not fully automated. In this paper, we have investigated a method, which does not require prior compression quality knowledge. Our experimental analysis shows that the addition of Gaussian noise can make the probability distribution of an aligned double compressed image similar to a nonaligned double compressed image. We divided the test image and its Gaussian version into sub-images and clustered them using K-means clustering algorithm. The application of K-means clustering algorithm does not require compression quality knowledge. This makes our approach more practical as compared to the other first digit probability distribution-based algorithms. The proposed algorithm gives compatible performance with the other approaches, based on different JPEG fingerprints.

1 Introduction

With the sophisticated image editing tools, digital images can be easily tampered with the great professional quality. This creates a big dilemma in the authenticity of the digital images. An image tampered and distributed through such tools may cause adverse effect on the society. Passive digital image forensic techniques investigate such digital images in the absence of any embedded security information. There exist various statistics such as CFA interpolation, resampling artifacts, motion blur, lightning intensity, reflections, edges, and JPEG fingerprint, which are consistent in the untampered images [1, 2]. Recently in [3], the authors provided a comprehensive survey of different forgery detection techniques such as copy-move forgery, splicing, resampling, and image retouching. Mostly, they covered pixel-based techniques, as these techniques do not require any a priori information about the type of tampering. In [4], the authors extracted sensor pattern noise from various images and clustered it using pairwise correlations among them. Thus, they clustered images captured by the same camera into the same cluster. Each SPN was treated as a random variable, and a Markov random field approach was employed to iteratively assign a class label to each SPN. However, they have validated their approach only on the gray images. In [5], the authors segmented image into small patches and computed noise variance of each patch using kurtosis concentration-based pixel-level noise estimation method. Later, suspicious region was identified by searching those conjunct patches, which are out of the linear constraint. In [6], the authors applied Gabor Wavelets and Local Phase Quantization to extract texture features at different scales and orientations to train SVM classifier. They claimed a comparable performance with much reduced feature dimensions. Being default image distribution format, JPEG fingerprint has emerged as one of the most important fingerprints. To hide the visual traces of image tampering, rotation and scaling are often applied to the tampered region. These basic tampering operations can be located in the double compressed images using JPEG fingerprints [7]. Most of the JPEG fingerprint-based forgery detection techniques locate aligned double JPEG (ADJPEG) compression artifacts [7,8,9,10,11,12].

It is difficult to create a good tampering by aligning 8 × 8 DCT compression grids of the tampered region. At least some portion of the tampered region undergoes nonaligned double JPEG (NADJPEG) compression, where double compression artifacts are missing. Figure 1 shows an example of tampering, where star segment from the source image (a) (compressed at qualityQ₁) is copied and pasted into the destination image (b) (compressed at qualityQ₂) to create composite double compressed image (c) (compressed at qualityQ₃). Grids in Fig. 1 represent 8 × 8 DCT grid (shown with dark edges) used for compression. Star shape copied from source image (a) shows significant shifting within 8 × 8 grids in figure (c). Hence, the pasted star shape undergoes NADJPEG compression, while smiley shape and all remaining components of background image undergo ADJPEG compression. A tampered region is identified as a region, where aligned double JPEG compression artifacts are missing.

The author in [8] used the difference between test image and its recompressed versions for locating ADJPEG compressed regions. He found the difference minimum, at the primary compression quality as well as at the secondary compression quality, called as ghost effect. In [9], the authors investigated the periodicity in the histogram of double quantized DCT coefficients in ADJPEG compressed images. In [10], the authors used these periodicities and expectation maximization algorithm to generate the probability of each 8 × 8 block being DJPEG/NADJPEG compressed. In [13], the authors plotted the histogram of DCT coefficients inside the 8 × 8 blocks and at the edges of blocks. They showed that both the histograms overlap with each other for uncompressed images and show significant difference for the compressed images. In [14], the authors called this histogram difference as a block artificial characteristic matrix (BACM). They have detected NADJPEG compressed images by investigating symmetry of this matrix. DCT coefficients of single compressed images follow a generalized Benford’s model; aligned double compressed images do not follow this model [15]. This model was further investigated in [11, 12, 16] for tampering localization. The authors in [11, 12, 16] used first digit probability distribution (FDPD) of single compressed images and their double compressed counterparts for training the SVM classifier. Thus, such investigation needs primary compression quality of the test image, without which accurate forensic investigation is not possible. In [12], the authors showed that the probability distribution of the first digits “2,” “5,” and “7” is sufficient for forensic investigation. In [17], the authors combined the moments of characteristic function features with the FDPD features. They enhanced localization by training an SVM classifier with 436-D vector. Factor histogram of DCT coefficient shows double maxima in the ADJPEG compressed images [18]. This double maximum present in the factor histogram can be used to locate tampering present in the double compressed images [7]. In [19], the authors developed neuro-fuzzy inference system by combining features retrieved from discrete wavelet transformation (DWT) decompression and edge images based on gray level co-occurrence. In [20], the authors used CNNs for aligned and nonaligned double JPEG compression detection. In particular, they explored the capability of CNNs to capture DJPEG artifacts directly from images. Their forgery detection and localization were based on the computation and analysis of a correlation matrix calculated by recompressing the given (possibly tampered) image at different quality factors and then comparing the recompressed versions with the given image. Our proposed scheme also captures DJPEG artifacts directly from the image. However, our scheme needs only single recompression, whereas algorithm in [20] requires multiple recompression. In this paper, we have explored the forensic application of FDPD, when compression history is not available.

The paper is organized as follows. In Section 2, we have discussed the FDPD of single and double compressed images. Section 3 investigates the possibility of blind NADJPEG compressed FDPD estimation. Based on our empirical analysis, we have proposed FDPD-based K-means clustering algorithm in Section 4. Since the performance of the proposed algorithm is compared to [7, 10,11,12], Section 5 shortly discusses these approaches, its limitations, and how our proposed scheme overcomes these limitations. Experimental setup and performance analysis are discussed in Section 6. Finally, the paper concludes with the future work in Section 7.

2 FDPD-based SVM classifier

As per Benford’s law [21], in a set of naturally generated data, the probability distribution of the first digits d(d = 1, 2..9) follows the logarithmic nature as shown in Eq. (1).

$$ p(d)={\log}_{10}\left(1+1/d\right)\kern0.5em d=1,2,...\mathrm{9} $$

(1)

Where p(d) stands for the probability of first digit d.

DCT coefficients of an uncompressed image follow this law, and DCT coefficients of single compressed images can be fitted with a new logarithmic model using Eq. (2). This modified model is called as generalized Benford first digit probability distribution (GBFDPD) model. ADJPEG compressed images show significant divergence from this model [15].

$$ p(d)=N{\log}_{10}\left(1+1/\left(s+{d}^q\right)\right)\kern0.5em d=1,2,\mathrm{3..9} $$

(2)

Where N is a normalization factor, which makes p(d) a probability distribution, s and q are the model parameters specific to the compression quality of an image.

Usually, secondary compression grids do not overlap with the primary compression grids of the tampered region. This leads to the logarithmic FDPD of the tampered region [15]. Hence, FDPD-based tampering localization techniques train SVM classifier using FDPD of single compressed images and their aligned double compressed counterparts [11, 12]. These techniques divide the test image into sub-images and classify each of the sub-images using earlier trained SVM classifier. As these classifiers are trained using images at a specific compression quality, this will give the best performance for the test images compressed with the compression quality of the training images. As the compression qualities of the test images starts deviating, the performance of the classifiers goes on decreasing. Whenever a new test image arrives for tampering localization, compression quality on the scale of (0–100) is unknown. Although it is possible to guess the compression quality visually, it is not sufficient to apply the respective SVM classifier. Hence, practically, it is difficult to use the current FDPD-based forensic investigation techniques.

3 Blind estimation of FDPD of NADJPEG

As discussed in Section 2, most of the FDPD-based algorithms use SVM classifier, which needs prior training with FDPD of single compressed images and double compressed images. FDPD of single compressed images serves as a feature of tampered region and FDPD of aligned double compress region serves as a feature of the untampered region. These classifiers perform well on tampered images with the same compression history. The major problem while using these classifiers is the knowledge of the primary and secondary compression qualities of the test image. If SVM classifier trained with different compression history than the test image is applied to test the image, the performance severely degrades. Although there exist some work to identify primary quantization steps [8, 18, 22], it is not sufficient to assess the exact primary compression quality. If these images are custom quantized, even primary quantization step computation is difficult [8].

We have used K-means clustering algorithm to eliminate this compression quality prerequisite. As K-means clustering algorithm does not require prior training, we do not need FDPD features of single compressed images. For a given test image, tampered region is NADJPEG compressed and follows FDPD, while untampered region being ADJPEG compressed does not follow it. Thus, an image is divided into clusters following FDPD and those not following FDPD, using K-means clustering algorithm. An image is divided into B × B overlapping sub-images, and FDPD for the first 20 AC frequencies is computed. K-means clustering algorithm uses intercluster and intracluster distances for clustering features into different classes. For a set of FDPD of m blocks (x₁, x₂, .…x_m), each distribution consists of probability of the first digits 1 to 9 from the first 20 AC frequencies. These 180-D m observations are partitioned into two sets S = {S₁, S₂} to minimize the within-cluster sum of square distances. In other words, the objective is to find S, as shown in Eq. (3).

$$ \underset{s}{\arg \min}\sum \limits_{i=1}^k\sum \limits_{x\in {s}_i}{\left\Vert x-{\mu}_i\right\Vert}^2 $$

(3)

Where μ_i is the mean of points in S_i and k = 2.

The K-means clustering algorithm uses an iterative refinement technique. Initially, dataset is randomly partitioned into two classes, and the initial means are computed for both the classes. In the next phase, for an initial set of means $ {\mu}_1^1,{\mu}_2^1 $, the algorithm proceeds by alternating between assignment step and update step. During assignment step, each observation x_p is assigned to the cluster whose mean has the least squared Euclidean distance as shown in Eq. (4).

$$ {S}_i^{(n)}=\left\{{x}_p:{\left\Vert {x}_p-{\mu}_i^{(n)}\right\Vert}^2\le {\left\Vert {x}_p-{\mu}_j^{(n)}\right\Vert}^2{\forall}_j,1\le j\le 2\right\} $$

(4)

Where n is the iteration number and p is the sample number.

In the update state, the new means are updated as per the centroids of the observation in the new cluster, as shown in Eq. (5).

$$ {\mu}_i^{\left(n+1\right)}=\frac{1}{\left|{S}_i^{(n)}\right|}\sum \limits_{x_j\in {S}_i^{(t)}}{x}_j $$

(5)

The algorithm converges when the assignment no longer changes. Thus, this approach does not require knowledge of prior compression history and assign samples to different classes iteratively. Generally, the size of the tampered region is very small as compared to the untampered region. For proper clustering, the number of features should be sufficiently large. Hence, initially, algorithm was not able to cluster these features. As the size of untampered feature set was sufficiently large, we have investigated the possibility of increasing the size of a tampered feature set.

3.1 Impact of Gaussian noise on FDPD

JPEG compression involves 8 × 8 block DCT transform. Resultant DCT distribution modeled as a Gaussian distribution, which follows generalized Benford’s model [15]. JPEG compression uses 8 × 8 quantization steps for quantizing each of the 8 × 8 DCT coefficients. When uncompressed image gets single compressed, its DCT coefficients c₀ are quantized with step q₁. After neglecting quantization and rounding error, the resultant dequantized coefficient c₁ is shown in Eq. (6).

$$ {c}_1=\left[{c}_0/{q}_1\right]\times {q}_1 $$

(6)

The operator [.] represents the rounding operation, which maps the set of unquantized DCT coefficients to the same value of dequantized coefficient c₁. Due to the property of rounding operation, the probability distribution of DCT coefficients at a specific frequency in 8 × 8 blocks can be shown in Eq. (7).

$$ {p}_1\left({c}_1;{q}_1\right)={\sum}_{c_0={q}_1\left({c}_1-1/2\right)}^{q_1\left({c}_1+1/2\right)}{p}_0\left({c}_0\right) $$

(7)

During secondary compression, these coefficients c₁ are quantized with step q₂ and the resultant dequantized coefficients c₂ are shown in Eq. (8).

$$ {c}_2={}\left[\left[{c}_0/{q}_1\right]\times {q}_1/{q}_2\right]\times {q}_2 $$

(8)

Due to DCT grid aligned double JPEG compression, all DCT coefficients present at a specific frequency in 8 × 8 blocks are quantized with the same set of quantization steps q₁ ∈ Q₁ and q₂ ∈ Q₂ respectively, where Q₁ and Q₂ represents the primary and secondary quantization tables. The resultant probability distribution of dequantized coefficients can be shown using Eq. (9).

$$ {p}_2\left({c}_2;{q}_1,{q}_2\right)={\sum}_{c_1={q}_2\left({c}_2-1/2\right)}^{q_2\left({c}_2+1/2\right)}{p}_1\left({c}_1\right) $$

(9)

When q₂ < q₁, the set of unquantized DCT coefficients c₀ map to the same value of the secondary dequantized DCT coefficient c₂. This increases the numbers of certain DCT coefficients, while certain DCT coefficients are completely removed from the ADJPEG compresses image. In [9], the authors proved this periodicity with a set of one-dimensional data. Since the logarithmic FDPD followed by naturally generated random data, periodic aligned double compressed DCT coefficients show divergence with it. When an image undergoes nonaligned double compression, there is no fixed relationship between quantization steps q₁ and q₂. Hence, periodic quantization artifacts are not introduced in the secondary dequantized DCT coefficients [13]. Random quantization artifacts maintain randomness in the DCT coefficient distribution, and logarithmic FDPD is maintained in these coefficients.

Gaussian noise is introduced in the natural digital images due to sensor noise and poor illumination. Although natural images are non-Gaussian, the distribution of DCT coefficients can be well fitted with a generalized Gaussian distribution [23]. In a single compressed image, DCT coefficients at each of the frequencies present in 8 × 8 blocks are assumed as 64 independent identically distributed random variables. As per central limit theory, under normal conditions, the sum of many random variables will have an approximately Gaussian distribution. In [24], the authors used this Gaussian distribution for forensic investigation. Hence, we have added zero mean Gaussian noise to the aligned double compressed image and recompressed it at quality factor 100. At this compression quality, all the quantization steps are “one.” As quantization step is “one,” no quantization occurs and rounding noise does not get introduced in an image. The DCT and inverse DCT operations are applied at the stage of compression and decompression. Hence, the resultant DCT coefficients still follow Gaussian distribution and obey logarithmic FDPD.

3.2 Empirical analysis of Gaussian noise on FDPD

The possibility of data modeling Benford’s law can be verified using tests such as mantissa test, chi-square test, and the geometric distribution. As these tests also analyze the actual data values, there is no formal mathematical proof, which will confirm Benford’s model [25]. The logarithmic nature of FDPD is verified empirically by actually plotting FDPD of the data [15, 16]. Hence, to verify the above effect, we have added zero mean Gaussian noise to the images from UCID database [26]. This database consists of 1338 uncompressed images in TIFF format. Each of the uncompressed images was single compressed, ADJPEG and NADJPEG compressed at various compression qualities. Figure 2 shows the average FDPD of all images in each category. Due to the space constraint, it is not possible to show FDPD at all qualities, but we found the same impact at other qualities. This shows that, as per discussion in [11, 12, 15, 16], FDPD of ADJPEG images is non-logarithmic. After adding Gaussian noise, FDPD becomes logarithmic. Thus, it is verified that the addition of Gaussian noise makes FDPD of ADJEPG compressed images logarithmic. This distribution can be used to increase the sample features of the NADJPEG compressed region. Thus, features of both the classes become sufficiently large and clustering algorithm can be applied.

4 Proposed approach

As per the earlier discussion, the proposed localization algorithm creates a noisy version of the test image by adding zero mean Gaussian noise to it. Test image and noisy image are divided into B × B overlapping sub-images, and FDPD for the first 20 AC frequencies is computed. Thus, the features of both the classes become sufficiently large for clustering. We have applied K-means clustering algorithm on these features to cluster them in two different classes as shown below.

If test image I is of size M × N and block size considered is B, then algorithm will have (M − B + 1)(N − B + 1) iterations for generating FDPD features of all the blocks. All other FDPD-based algorithms will also require the same number of iterations to generate these features. To cluster n number of d dimensional samples into k number of clusters, K-means clustering algorithm requires time complexity O(n^dk + 1). Hence, the proposed algorithm has O([(M − B + 1)(N − B + 1)]^2d + 1) time complexity. Since we are considering nine FDPD of the first 20 frequencies, all the samples are 180 dimensional, and accordingly, time complexity of the proposed algorithm is O([(M − B + 1)(N − B + 1)]³⁶¹).

5 Method of [7, 10,11,12]

Like most of the JPEG artifact-based forensic techniques, algorithms in [7, 10,11,12] also use DCT coefficients at a respective AC frequency. In [7], the primary quantization table was computed using factor histogram. As discussed earlier, aligned double quantization with step q₁ followed by q₂ maps the set of primary DCT coefficients c₁ to the same secondary coefficients c₂. The set of coefficients c₁ mapping to the same values ofc₂ can be computed using quantization step q₂ and coefficient c₂. The histogram of the factors of this set is called as factor histogram [18]. It has maximum frequency up to step q₂ as well as at a step q₁. Thus, the primary quantization steps can be detected. Similar to other approaches, [7] also has investigated DCT grid aligned blocks. Tampered region was assumed as NADJPEG compressed, while untampered region was assumed as ADJPEG compressed. Each block was categorized as tampered/untampered depending on the second maxima in the block factor histogram. Ideally, the second maxima in factor histogram should be available at primary quantization step. However, it may not necessarily be absent at nearby quantization step. In such cases, the computed primary quantization step will be wrong and the performance of the algorithm may degrade. As our proposed algorithm is not computing any specific quantization step and using distribution of DCT coefficients, it does not suffer with little changes in DCT coefficients.

Although [10] uses probability distribution, it is not FDPD. They used DCT coefficient periodicity in ADJPEG compressed images. The FDPD is followed by NADJPEG compressed image, while DCT coefficient periodicity is followed by ADJPEG compressed images. Thus, this approach is completely different from techniques investigating FDPD. They used expectation maximization algorithm to compute posterior probability map for each 8 × 8 block being aligned/nonaligned double compressed. Both the approaches [7, 10] computed primary quantization steps, but [7] does not use sample NADJPEG distribution. In [10], the authors showed that if double compressed image is recompressed by aligning the DCT grid with the primary compression grid, histogram of the resultant DCT coefficients shows higher magnitude. Hence, they shifted DCT grid of the test image at different position in 8 × 8 block and computed probable grid shift. This primary position of DCT grid was further used to measure quantization errors. Shifting DCT grid at different positions in 8 × 8 block means one has to try 64 positions of recompression grids and analyze these 64 DCT coefficient histograms. This increases the computational complexity of the algorithm. Authors has used parallel processing approach to reduce this time. Neither the proposed approach nor [7, 12] needs to compute these errors. In our proposed approach, single recompression is enough to get the statistics of nonaligned double compressed region from the aligned double compressed image, after adding Gaussian noise to it.

Forensic approach in [11, 12] has investigated FDPD, but its use has a practical limitation. If the tampered test image has the primary compression quality Q₁ and secondary compression qualityQ₂, an SVM classifier needs to be trained using single compressed images of quality Q₁ and their double compressed counterparts at quality Q₂. In real life, one rarely knows the primary compression history of the test image. They trained SVM classifier using FDPD of DCT coefficients at the first 20 AC frequencies. They also had a forensic investigation by dividing an image into DCT grid aligned sub-images. Each sub-image was individually classified as tampered or untampered using FDPD. The authors in [12] used FDPD of digits 2, 5, and 7 while in [11], all nine FDPD were used. As [11] uses all nine FDPD (1,2...9), [12] cannot perform better than [11]. As our proposed algorithm is not using SVM classifier, it does not require classifier training, which is very specific to the primary and secondary compression qualities of test image. Since the test image is recompressed using the secondary quantization table present in the test image itself, the primary compression history at all is not required. Thus, our proposed algorithm can work even on an image having customized quantization table. This is not possible with [11, 12], as it requires primary quantization table while training SVM classifier.

6 Results and discussion

The proposed algorithm is implemented with MATLAB (R2011a), 64-bit version, and P. Sallee MATLAB JPEG toolbox [27] was used for reading DCT coefficients. Like the other standard experimental setups [10,11,12, 16], we have also created random tampered images using uncompressed TIFF images from UCID database [26]. Each image was compressed at the primary compression quality Q₁, and random 120 × 120 DCT grid nonaligned source image blocks were copied to create tampered images. While tampering an image, the pasted block borders are aligned with the DCT grids of the destination image. Resultant images were compressed at a secondary compression quality Q₂(Q₂ > Q₁). For investigation, each of the test images were divided into 40 × 40 non-overlapping sub-images. Thus, we are aware that sub-images are actually tampered (NADJPEG)/untampered (ADJPEG). The output of the classifier can be directly compared to this.

Since [11, 12] needs special setup, we have trained various quality-specific SVM classifiers using single compressed and aligned double compressed images at different compression qualities. As [11] is expected to perform better than [12], practical limitations are shown only for [11]. For this, we have tested each of the test images by applying SVM classifiers trained for different compression qualities. The results of [12] are plotted by applying SVM classifier trained with exact compression quality. Since [7, 10] and proposed approach do not require a compression history and prior training, no special setup was needed.

For each of the test sub-images, the output of the classifier was compared with its actual class to compute misclassification error rate as shown in Eq. (10).

$$ \mathrm{miserr}=\sum \limits_{i=1}^NT(i)\ne C(i)/N $$

(10)

Where T(i) represents the original class of the sub-image i, C(i) represents the class assigned by supervised/unsupervised classifier and N represents the total number of sub-images in the test case. Table 1 shows the misclassification error rate generated by different algorithms. The first column of this table mentions the primary and secondary compression qualities of the test images. Remaining columns show the observed misclassification error rate for each of the algorithms. As [11] was investigated by applying different trained SVM classifiers, columns 6 to 14 show the respective misclassification error rate. Top of these columns mentions the compression history of the training images. When training and test images had a same compression history, misclassification rate is marked in bold.

Table 1 Misclassification error rate for randomly tampered images

Full size table

Figure 3 shows a graphical representation of these misclassification error rates. As [11] was tested by applying different SVM classifiers with different training history, we have plotted its best misclassification error rate (represented in bold, when training and test image compression qualities are same), as well as an average misclassification error rate (average of each row for [11] in Table 1). Since [12] does not perform better than [11] even when compression qualities of training and test images are same, we have not shown its average misclassification error rate. From Fig. 3, we can conclude that [11, 12] gives the best performance, but as test image compression qualities divert from training qualities, the performance degrades. All the remaining approaches, including our proposed algorithm, perform differently at different compression qualities. At the qualities 60–65, 60–70, and 80–85, our proposed algorithm gives the best performance. At quality 60–75, [7] gives the best performance. In between 60–80 and 70–85, [10] performs best. Hence, there is not a single method, which may perform better at all compression qualities. Different compression artifacts have different strength at different compression qualities. At lower compression qualities and at lower quality differences, most of the algorithms except [11, 12] fail (error rate greater than 0.4). However, our proposed algorithm still gives comparable performance.

We have also tested these algorithms on tampered image database CASIA V.2 [28]. On some of the images, we got very promising results. As the compression history of these images was not available, it was difficult to evaluate the performance of localization algorithms. Figure 4a, b shows the original image and its tampered version from CASIA V.2 [28]. Figure 4c–f shows the tampering localization map generated by applying [7, 10, 11] and proposed algorithm on the tampered image (b). For testing [11], we have applied various SVM classifiers having different training compression history. Figure 4e shows the best possible outcome and respective training compression history for [11]. The output is represented as the probability of each pixel being tampered. The higher value of probability is assigned to NADJPEG tampered pixels (yellow color), and untampered region appears with a lower index value (shown with blue colors). The output map generated by proposed algorithm is comparable with the other algorithms. Algorithm in [10] locates the sufficiently large tampered region, but it is difficult to select a final tampered region. For [10], the yellow color untampered boat also comes out as tampered, along with the tampered segment of the ship. For [11], when we applied several classifiers having a different training compression history, we could get comparable output at a training quality sequence 70, 85. However, it locates very small portion of the actual tampered region.

Figure 5 shows one more localization example. Here, the map generated by [10] is at all not useful for forensic investigation. Algorithm in [7] generates traceable map, but it is not sufficient to make critical decisions. Due to the unavailability of primary compression quality, we could not get comparable output for [11]. Comparatively, our proposed algorithm has generated robust output, sufficient to make a decision.

To evaluate algorithms against realistic tampering, 100 tampered images were blindly created using different source images. Each image was tampered by pasting segment from the same image as well as from the other images using PIXLR online image editing tool [29]. To make tampering convincing, different pre-processing operations such as lightning adjustment, color balance, contrast stretching, up sampling, down sampling, and rotation were applied to the tampered region. Compression history was unknown to us. For [11], each image needs to be investigated by applying classifiers with different training history Q₁ = 60..100, Q₂ = Q₁ + 1..100. Due to this classifier training dependency, we could not compare [11] for these images. For accurate localization, overlapping block processing was very necessary. Hence, we have applied these algorithms with 8 × 8 overlapping. Since tampered region spans across 8 × 8 blocks nonuniformly, the misclassification error rate was computed at pixel level. If any of the 40 × 40 blocks classified as a tampered, all overlapping pixels were classified as tampered, irrespective of the decision made by the other overlapping blocks. Accordingly, in Eq. (8), T(i) represents the original class of the pixel i, C(i)represents the class assigned by the classifier, and N represents the total number of pixels in an image. Table 2 shows the corresponding misclassification error rate. Our proposed approach gives comparable performance with other machine learning/non-machine learning-based approaches.

Table 2 Misclassification error rate for manually tampered images

Full size table

Figure 6 shows one of the examples of such tampering localization. In this tampered image (b), source (a) and destination (a) of tampered region are the same. Segment of water is copied from an image and pasted into the same image to hide the right leg of a man standing in the sea. The copied region was resampled before pasting. Figure 6c shows the ground truth of the tampered region. Figure 6d–g shows the outputs of various tampering localization algorithms. Although [7] works without prerequisite of compression quality knowledge, it did not work for this tampered image. It locates very small portion of the tampered region, along with some other untampered region. The located true tampered region and false tampered region have nearly the same size, which makes it difficult to make any decision. As compression qualities are not available, we have applied various SVM classifiers with different training compression history for [11]. The output for [11] is plotted in (e), which is again of no use to make decisions. The algorithm in [10] locates only the tampered region. However, the tampering probability assigned to the region is very low and not continuous. Our proposed algorithm clearly locates the continuous tampered region. Although there exist some small falsely identified tampered regions, its size is very small. After considering connected components, original tampered region will appear as a largest connected component with high tampering probability. Thus, although misclassification rate of proposed scheme is a little bit more than [7, 10], its output map is more useful than the other algorithms. The proposed algorithm and the algorithm in [10] can be used to cross validate each other.

7 Conclusions

The discrepancy in double compression artifacts is an indicator for most of the tampering operations. We have shown that the DCT coefficients of aligned double compressed image do not follow first digit probability distribution. However, DCT coefficients of single compressed and nonaligned double compressed image follow it. In addition, we have also shown that the additive Gaussian noise in the aligned double compressed images makes resultant first digit probability distribution logarithmic. Hence, proposed algorithm does not need features of single compressed images and is able to work in the absence of prior knowledge of primary compression history of the test image. Thus, its efficiency is not dependent on the pre-trained classifier, specific to the compression history of the test images. Validity of the proposed algorithm has been demonstrated by computing misclassification error rate. The effectiveness of the proposed algorithm is also confirmed by the realistic test images in the absence of image compression history. Performance analysis shows that different compression artifacts perform at different compression qualities. Hence, multiple artifacts need to be fused to devise an algorithm performing at all the compression qualities. Algorithms discussed here are not robust against antiforensic attacks. In the future, we will try to address these issues in our research.

References

A Piva, An overview on image forensics. ISRN Signal Proc 2013, 22 (2013). https://doi.org/10.1155/2013/496701. Article ID 496701
H Farid, Image forgery detection, a survey. IEEE Signal Process. Mag. (2009)
Muhammad Ali Qureshi, Mohamed Deriche, A bibliography of pixel-based blind image forgery detection techniques, In Signal Processing: Image Communication, Volume 39, Part A, 2015, Pages 46–74, , https://doi.org/10.1016/j.image.2015.08.008.
C-T Li, X Lin, A fast source-oriented image clustering method for digital forensics. EURASIP J Image Video Proc 69 (2017) https://doi.org/10.1186/s13640-017-0217-y
H Yao, F Cao, Z Tang, J Wang, T Qiao, Expose Noise Level Inconsistency Incorporating the Inhomogeneity Scoring Strategy, Article in Multimedia Tools and Applications (2017). https://doi.org/10.1007/s11042-017-5206-8
Google Scholar
MM Isaac, M Wilscy, Multiscale local gabor phase quantization for image forgery detection. Multimedia Tools and Applications, 1–22 (2017) https://doi.org/10.1007/s11042-017-5189-5
AV Mire, SB Dhok, NJ Mistry, PD Porey, Factor histogram based forgery localization in double compressed JPEG images. Procedia Comput Sci 54, 690–696 (2015)
Article Google Scholar
H Farid, Exposing digital forgeries from JPEG ghosts, IEEE transactions on information forensics and security. Vol. 4(1), 154–160 (2009)
Google Scholar
Z Lin, J He, X Tang, C-K Tang, Fast, automatic and fine-grained tampered JPEG image detection via DCT coefficient analysis. Pattern Recognition, Vol 42, 2492–2501 (2009)
Article MATH Google Scholar
T Bianchi, A Piva, Image forgery localization via block-grained analysis of JPEG artifacts. IEEE Transactions on Information Forensics Security 7(3), 1003–1017 (2012)
Article Google Scholar
XH Li, YQ Zhao, M Liao, FY Shih, YQ Shi, Detection of the tampered region for JPEG images by using mode-based first digit features. EURASIP Journal on Advances in Signal Processing 190, 2012 (2012)
Google Scholar
I Amerini, R Becarelli, R Caldelli, A Del Mastio, Splicing forgeries localization through the use of first digit features. proceedings of IEEE International Workshop on Information Forensics and Security(WIFS),143-148 (2014). https://doi.org/10.1109/WIFS.2014.7084318
Z Fan, RL de Queiroz, Identification of bitmap compression history: JPEG detection and quantizer estimation. IEEE Transaction on Image Processing, Vol 12(2), 230–235 (2003). https://doi.org/10.1109/TIP.2002.807361
W Luo, Z Qu, J Huang, G Qiu, A novel method for detecting cropped and recompressed image block. IEEE Int Conference Acoustics Speech Signal Proc (2007)
Dongdong Fu, Yun Q. Shi, Wei Su. A Generalized Benford’s Law for JPEG Coefficients and its Applications in Image Forensics, SPIE Conference on Security, Steganography, and Watermarking of Multimedia Contents, 2007
Google Scholar
B Li, YQ Shi, J Huang, Detecting Doubly Compressed JPEG Images by Using Mode Based First Digit Features (IEEE International Workshop on Multimedia Signal Processing, Queensland, 2008), pp. 730–735
Google Scholar
F Zhao, YU Zhenhua, S Li, Detecting double compressed JPEG images by using moment features of mode based DCT histograms. proceedings of 2010 International Conference on Multimedia Technology,1-4 (2010). https://doi.org/10.1109/ICMULT.2010.5631476
J Yang, G Zhu, Detecting Doubly Compressed JPEG Images by Factor Histogram (Proceeding of APSIPA, ASC, 2011)
Google Scholar
H Ghaffari-Hadigheh, GB Sulong, Annual Iranian mathematics conference (Hamedan, 2017)
M Barni, L Bondi, N Bonettini, P Bestagini, A Costanzo, M Maggini, B Tondi, S Tubaro, Aligned and non-aligned double JPEG detection using convolutional neural networks. J. Vis. Commun. Image Represent. 49, 153–163 (2017) https://doi.org/10.1016/j.jvcir.2017.09.003
Article Google Scholar
F Benford, The law of anomalous numbers. Proc. Amer. Phil. Soc. 78, 551–572 (1938)
MATH Google Scholar
J Lukas, J Fridrich, Estimation of primary quantization matrix in double compressed JPEG images. Proc Digital Forensic Res Workshop, 1–17 (2003)
D Zoran, Y Weiss, Scale invariance and noise in natural images. IEEE 12^th Int Conference Comput Vision, 2209–2216 (2009)
R Zhang, X-g YUJ Zhao, JY Liu, Symmetric Alpha Stable Distribution Model Application in Detecting Double JPEG Compression. Proc International Conference on Artificial Intelligence and Software Engineering(AISE2014), 462-467(2014)
M Nigrini, JT Wells, Benford’s Law: Applications for Forensic Accounting, Auditing, and Fraud Detection,Wiley Publication, 19-21(2012) ISBN: 978-1-118-15285-0
G Schaefer, M Stich, UCID—An Uncompressed Colour Image Database (Technical Report, School of Computing and Mathematics, Nottingham Trent University, U.K, 2003)
P. Sallee, Matlab JPEG toolbox 1.4, [online], Available: http://dde.binghamton.edu/download/jpeg_toolbox.zip
CASIA Tampered image detection evaluation database http ://forensics.idealtest.org:8080/ index_v2.htm
PIXLR online image editing tool. https://pixlr.com/editor/

Download references

Acknowledgements

Not applicable.

Funding

This study received funding from PhD contingency grant and fellowship assigned by Sardar Vallabhbhai National Institute of Technology, under MHRD, India.

Availability of data and materials

The datasets analyzed during the current study are as follows: CASIA V.2 tampered image database, http://forensics.idealtest.org/casiav2/; UCID—an uncompressed color image database, http://jasoncantarella.com/downloads/ucid.v2.tar.gz; and self-created database, https://drive.google.com/drive/folders/0B11DqvnKC0SGdElDZ2tjVmJKMFE?usp=sharing.

Author information

Authors and Affiliations

SVNIT, Surat, India
Archana V. Mire & N. J. Mistry
VNIT, Nagpur, India
Sanjay B. Dhok & Prakash D. Porey

Authors

Archana V. Mire
View author publications
You can also search for this author in PubMed Google Scholar
Sanjay B. Dhok
View author publications
You can also search for this author in PubMed Google Scholar
N. J. Mistry
View author publications
You can also search for this author in PubMed Google Scholar
Prakash D. Porey
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

We have proposed a reliable algorithm which automatically locates the spliced region in a double compressed tampered image. The tampered region is assumed as NADJPEG compressed region, and untampered region is assumed as ADJPEG compressed region. We have used first digit probability distribution of DCT coefficients to cluster tampered and untampered region into two different classes. Aiming at overcoming limitations in [11, 12], the proposed algorithm makes the following improvements: (1) it does not require prior compression quality knowledge, (2) it uses K-means clustering algorithm to cluster ADJEG and NADJPEG compressed region, (3) it is shown that if image is recompressed after the addition of Gaussian noise, resultant DCT coefficients follow the first digit probability distribution, and (4) this recompressed version can be used to increase sample NADJPEG compressed features. The algorithm is evaluated by the following standard process of creation of random tampered images from untampered, uncompressed UCID database color images. The algorithm is also evaluated against real-life tampered images from CASIA V.2 database. In addition, algorithm is evaluated against our own dataset, where tampering is visually indistinguishable. To compare algorithm against standard machine learning and non-machine-based algorithms, performance is compared using misclassification error rate. All the authors have read and approved the manuscript.

Corresponding author

Correspondence to Archana V. Mire.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Cite this article

Mire, A.V., Dhok, S.B., Mistry, N.J. et al. Automated approach for splicing detection using first digit probability distribution features. J Image Video Proc. 2018, 18 (2018). https://doi.org/10.1186/s13640-018-0257-y

Download citation

Received: 16 February 2016
Accepted: 22 February 2018
Published: 05 March 2018
DOI: https://doi.org/10.1186/s13640-018-0257-y

Automated approach for splicing detection using first digit probability distribution features

Abstract

1 Introduction

2 FDPD-based SVM classifier

3 Blind estimation of FDPD of NADJPEG

3.1 Impact of Gaussian noise on FDPD

3.2 Empirical analysis of Gaussian noise on FDPD

4 Proposed approach

5 Method of [7, 10,11,12]

6 Results and discussion

7 Conclusions

References

Acknowledgements

Funding

Availability of data and materials

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Publisher’s Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords