Selection of optimal pixel resolution for landslide susceptibility analysis within the Bukit Antarabangsa, Kuala Lumpur, by using image processing and multivariate statistical tools

Quraishi, Iqbal; Hasnat, Abul; Choudhury, J. Paul

doi:10.1186/s13640-017-0169-2

Research
Open access
Published: 03 March 2017

Selection of optimal pixel resolution for landslide susceptibility analysis within the Bukit Antarabangsa, Kuala Lumpur, by using image processing and multivariate statistical tools

Iqbal Quraishi¹,
Abul Hasnat² &
J. Paul Choudhury¹

EURASIP Journal on Image and Video Processing volume 2017, Article number: 21 (2017) Cite this article

1957 Accesses
3 Citations
Metrics details

Abstract

Landslides are considered as one of the natural hazards responsible for casualties, damage of assets, and infrastructures. In many situations, collection of field data from remote places is difficult due to inaccessibility of landslide area. This paper examines landslide susceptibility in the Bukit Antarabangsa, Kuala Lumpur, to ease geographical studies, using image processing and multivariate statistical tools by reviewing the digital images using remote-sensing technique without any physical survey. We considered different pixel resolutions and report the effectiveness of using factor analysis, principal component analysis, linear discriminant analysis, and their hybridization. Eight types of databases for heavy, medium, and no landslide were created. The modeling works were carried out at 2 × 2, 4 × 4, 8 × 8, 16 × 16, 32 × 32, 64 × 64, 128 × 128, and 256 × 256 pixel resolutions. Results indicate 2 × 2 was optimal in both heavy and medium while 8 × 8 found to be ideal for no landslide region. Performance at different pixel resolutions was compared using receiver operating characteristic (ROC) curves, and average success of 87.36% was found. This simple yet robust system holds great potential for saving lives.

1 Introduction

Landslide is an extreme natural phenomenon that takes a heavy toll on human life and property leaving far-reaching consequence not only on economy but also nature and ecosystem of the affected region. Flash flood, long and terrible monsoon, earthquake rock sliding or toppling, soil cave-in, and sudden profusion of snow melting are some of the elementals that precede landslide in a particular area. Again, soil condition alters after earthquake or any other natural transformation in a region and its neighborhood making it more vulnerable to landslide. Landslide susceptibility analysis can be helpful in such case as certain preventive measures can be taken in time to minimize future threat to human life in the best possible way. Tarantino et al. [1] had applied change detection techniques for monitoring landslides in southern Italy. Saha et al. [2] had utilized geographic information system (GIS)-based statistical approach for landslide susceptibility in the Himalayas. Liang et al. [3] had used multi-satellite images and GIS data for statistical analysis of landslide in Taiwan Island. Rau et al. [4] had applied time series satellite images for monitoring and assessment of landslide. Voigt et al. [5] had shown the efficient use of image analysis based on satellite images for landslide mapping. Joyce et al. [6] had also used image processing techniques with manual interpretation for predicting landslide proneness. Rainfall recording data had been taken by Martelloni et al. [7] for the prediction of landslide in a local territory of Emilia Romagna, Italy. Kanungo and Sharma [8] had obtained rainfall threshold values for landslides in and around Chamoli, Joshimath region of the Garhwal Himalayas, India. Cultivated, fallow, and wood land data had been investigated by Biro et al. [9] for landslide assessment. Topological, geological, and environmental parameters are used as parameters for landslide assessment by Akgun [10]. Aerial photographs and field surveys of Cameron Highlands, Malaysia, had been utilized by Pradhan et al. [11] to analyze landslide hazards, and accuracy of 83.45% was established. Pradhan et al. [12] have also used spatial-based statistical models for landslide hazard analysis. Pradhan et al. [13] developed a neuro-fuzzy model using remote-sensing data and GIS in a part of the Cameron Highland areas in Malaysia. GIS and remote-sensing data were used by Lee et al. [14, 15] for the assessment of landslide. Historical data of rainfall and earthquake was taken by Muthu et al. [16] for landslide analysis. Assessment of landslide using lithology, rock weathering, geomorphology, soil type, and depth had been conducted by Champatiray et al. [17]. It is observed that most of the authors have used field data like geological and topological parameters, rainfall, and their likes in addition to the satellite image information of that place for the assessment of landslide susceptibility.

The objective of our research work is to analyze and asses the landslide susceptibility based on the examination of the satellite images of a particular region. It must be borne in mind the image feature values of a landslide-affected area are different from those of the normal one. Examination of these changes in such parameters using image processing and multivariate statistical tools may hold the key. The main task in our research work constitutes the collection of image features; selection of principal features out of 131 image features; formation of eight databases with varying pixel resolution for heavy, medium, and no landslide-affected regions; selection of optimal pixel resolution out of 2 × 2, 4 × 4, 8 × 8, 16 × 16, 32 × 32, 64 × 64, 128 × 128, and 256 × 256 for heavy, medium, and no landslide-affected regions, respectively, based on residual analysis, multivariate statistical tools, and their hybridized ones. Finally, analysis of performance, testing, and validation of the experimental work is carried out.

2 Methods

2.1 Data collection and database creation

A sequence of main processing phases proposed is furnished in Fig. 1. Eight types of databases are created for heavy, medium, and no landslide-affected regions, depending on the condition of soil. Each database consists of 450 images of pixel resolution 2 × 2, 4 × 4, 8 × 8, 16 × 16, 32 × 32, 64 × 64, 128 × 128, and 256 × 256 respectively. The pixel resolution 2 × 2 indicates the length and breadth of the image is 2 pixels. The detailed description of image databases with varying pixel resolution is presented in Table 1.

Table 1 Pixel resolution and their dimensions

Full size table

Different pixel resolutions are considered for feature extraction. Li et al. [18] had estimated fractal dimension (FD) using box-counting approach as an important feature for texture description in an image. Whereas Hawlick et al. [19] reported a survey for various statistical and structural features for image texture classification like autocorrelation function, optical transform, digital transform, textural edgeness, run length, gray tone co-occurrence, structural element, and autoregressive models. Thangavel and Manavalan [20] applied gray level co-occurrence matrix (GLCM) features for image classification of prostate cancer. Variance, kurtosis, skewness, and geometric mean are used as features for fault diagnosis of ball bearing by Vakharia et al. [21]. Fernandez-Lozano et al. [22] used statistical and run length features for texture classification. Mercimek et al. [23] proposed moments as invariant global features of images in pattern recognition. Filho et al. [24] measured lacunarity using Gliding-Box and Differential Box-Counting algorithms to classify textures of urban regions. Fernando et al. [25] suggested a scheme for mining mid-level features based on Frequent Local Histograms for image classification. Lowe [26] used fast nearest-neighbor algorithm followed by a Hough transform for feature extraction. In this study, various statistical characteristics of images, viz., mean of fractal dimension, standard deviation of fractal dimension, lacunarity, mean of ripplet coefficient, standard deviation of ripplet coefficient, mean of autocorrelation coefficient, and standard deviation of autocorrelation coefficient, are used for feature extraction of each image in eight databases. A total of 131 feature values are calculated. The details of features are presented in Table 2.

Table 2 Image features and their description

Full size table

Out of 450 images taken into consideration for each database, 360 images are used to create the training dataset and the remaining 90 images are used for test purpose.

2.2 Application of multivariate statistical tools and hybridization

Multivariate statistical tools, viz., factor analysis, principal component analysis, and linear discriminant analysis are briefly discussed below.

2.2.1 Factor analysis (FA)

FA [27, 28] is a statistical method used to describe variability among observed, correlated variables in terms of a potentially lower number of unobserved variables called factors. Initially, the correlation matrix and the contribution of eigenvectors and eigenvalues are calculated. The eigenvector with the highest eigenvalues is the major component of the dataset [27, 28]. Then the lower important dimensions contributing less than 2.5% are discarded. The square root of the corresponding eigenvalues are taken and multiplied with the square of the eigenvector to get the column-wise contributor for each of the major contributing component. At the last step, the ranges of cumulative values for each database are calculated.

2.2.2 Principal component analysis (PCA)

PCA [29] is used to recognize patterns in data of high dimension. At first, the mean of the dataset is calculated and subtracted from each of the data dimensions. Then, the covariance matrix, eigenvectors, and eigenvalues are calculated. The lower important dimensions contributing less than 2.5% are discarded. The corresponding eigenvalues are multiplied with the square of the eigenvector to get the column-wise contributor for each of the major contributing components. The ranges of cumulative values for each database are calculated.

2.2.3 Linear discriminant analysis (LDA)

LDA [30] maximizes the ratio of between-class variance to the within-class variance in any dataset. The major dissimilarity between LDA and PCA is that, generally, PCA is used for feature classification, whereas LDA is used for data classification. Also, in the case of PCA, when dataset is transformed to another space, position of the original dataset changes, whereas in the case of LDA, the position remains the same. For LDA, initially, the corresponding mean for each component are calculated and subtracted from the original dataset to create a new dataset. Then, the new dataset and its transpose are multiplied to create the modified matrix. Next, contributions of eigenvectors among all the eigenvalues are calculated. The lower important dimensions contributing less than 2.5% are discarded. The corresponding eigenvalues are taken and multiplied with the square of the eigenvectors to get the column-wise contributor for each of the major contributing component. The range of cumulative values for each database is calculated. The hybridized variant of FA, PCA, and LDA are applied also as shown in Table 3.

Table 3 Multivariate statistical tools and their hybridized variants

Full size table

2.3 Residual analysis

Residual analysis is done to assess the performance of the multivariate statistical tools and their hybridized variants while applied on all the databases. The parameters used in this study for performance evaluation are absolute residual (AR), mean of absolute residual (MAR), standard deviation of absolute residual (SDAR), mean residual error (MRE), and mean of mean residual error (MMRE) [31, 32]. The smaller value of these parameters of residual analysis indicates better model. The different parameters are defined using Eqs. 1–5.

$$ \mathrm{A}\mathrm{bsolute}\ \mathrm{R}\mathrm{esidual}, \mathrm{A}\mathrm{R}\left( i, j\right)=\left|\mathrm{actual}\left( i, j\right)-\mathrm{appr}\left( i, j\right)\right| $$

(1)

$$ \mathrm{Mean}\ \mathrm{of}\ \mathrm{Absolute}\ \mathrm{Residual}\ \left(\mathrm{MAR}\right)=\frac{{\displaystyle {\sum}_{i=1}^m}{\displaystyle {\sum}_{j=1}^n}\mathrm{AR}\left( i, j\right)}{m* n} $$

(2)

$$ \mathrm{Standard}\ \mathrm{Deviation}\ \mathrm{of}\ \mathrm{Absolute}\ \mathrm{Residual}\ \left(\mathrm{SDAR}\right)=\sqrt{\frac{{\displaystyle {\sum}_{i=1}^m}{\displaystyle {\sum}_{j=1}^n}{\left(\mathrm{AR}\left( i, j\right)-\mathrm{MAR}\right)}^2}{m* n}} $$

(3)

$$ \mathrm{Mean}\ \mathrm{of}\ \mathrm{R}\mathrm{esidual}\ \mathrm{E}\mathrm{rror},\ \mathrm{M}\mathrm{R}\mathrm{E}\left( i, j\right)=\frac{\left|\mathrm{actual}\left( i, j\right)-\mathrm{appr}\left( i, j\right)\right|}{\mathrm{actual}\left( i, j\right)} $$

(4)

$$ \mathrm{Mean}\ \mathrm{of}\ \mathrm{M}\mathrm{R}\mathrm{E}\left(\mathrm{MMRE}\right)=\frac{{\displaystyle {\sum}_{i=1}^m}{\displaystyle {\sum}_{j=1}^n}\mathrm{MR}\mathrm{E}\left( i, j\right)}{m* n} $$

(5)

Where actual(i, j) and appr(i, j) are (i, j)th element in the true value matrix and observed value matrix having m rows and n columns respectively.

2.4 Validation statistics

True positive (TP) values are the incidents where models have correctly predicted landslide. False positive (FP) or false alarms are the situation where the models predicted landslide but, actually, landslide has not occurred. False negative (FN) or missed alarms are the instances where the models predicted no landslide but, actually, landslide has occurred. True negative (TN) values are the incidents where the model has correctly predicted no landslide. Correct prediction (true positive and true negative) and errors (false negative and false positive) are calculated. These terms are defined by a confusion matrix as given in Table 4.

Table 4 Confusion matrix

Full size table

The validation statistics, viz., efficiency, misclassification rate, positive predictive power, negative predictive power, sensitivity, specificity, false positive rate, and false negative rate, are defined using Eqs. 6–13.

$$ \mathrm{Efficiency} = \frac{\mathrm{TP}+\mathrm{TN}}{\mathrm{TP}+\mathrm{TN}+\mathrm{FP}+\mathrm{FN}} $$

(6)

$$ \mathrm{Misclassification}\ \mathrm{Rate}=\frac{\mathrm{FP}+\mathrm{FN}}{\left(\mathrm{TP}+\mathrm{TN}+\mathrm{FP}+\mathrm{FN}\right)} $$

(7)

$$ \mathrm{Positive}\ \mathrm{Predictive}\ \mathrm{Power}=\frac{\mathrm{TP}}{\mathrm{TP}+\mathrm{FP}} $$

(8)

$$ \mathrm{Negative}\ \mathrm{Predictive}\ \mathrm{Power}=\frac{\mathrm{TN}}{\mathrm{FN}+\mathrm{TN}} $$

(9)

$$ \mathrm{Sensitivity}=\frac{\mathrm{TP}}{\mathrm{TP}+\mathrm{FN}} $$

(10)

$$ \mathrm{Specificity}=\frac{\mathrm{TN}}{\mathrm{FP}+\mathrm{TN}} $$

(11)

$$ \mathrm{False}\ \mathrm{Positive}\ \mathrm{Rate}=\frac{\mathrm{FP}}{\mathrm{FP}+\mathrm{TN}} $$

(12)

$$ \mathrm{False}\ \mathrm{Negative}\ \mathrm{Rate}=\frac{\mathrm{FN}}{\mathrm{TP}+\mathrm{FN}} $$

(13)

Sensitivity, specificity, and receiver operating characteristic (ROC) are used to measure the superiority and trustworthiness of an assessment. Sensitivity assesses the superiority of the assessment by identifying positive landslide. Whereas specificity calculates approximately the possibility of a region without landslide would be acceptably discarded. ROC curve is a graphic arrangement to display the relationship between sensitivity and specificity, and it assists for best model selection by deciding the optimal threshold for the landslide assessment. Positive predictive power is the percentage of regions with a positive assessment which actually have the landslide. Negative predictive power is the percentage of regions with a negative assessment which do not have the landslide. Positive and negative predictive powers are directly related to the prevalence of the landslide in the regions.

3 Results and discussions

3.1 Data collection

A landslide took place at Taman Bukit Mewah, Bukit Antarabangsa, Hulu Kelang, Selangor, Kuala Lumpur, Malaysia, at 3:30 a.m. measuring 109 m in width, 120 m in length, and 15 m in depth. About 101,500 m³ of ground had moved. It had totally obstructed the only communicating road, Jalan Bukit Antarabangsa, with neighborhood. The satellite images after landslide of Bukit Antarabangsa were collected from Ikonos Satellite Image ©Centre for Remote Imaging, Sensing and Processing, National University of Singapore via http://www.crisp.nus.edu.sg [33]. Figure 2 shows the place at Bukit Antarabangsa on the north-eastern side of Kuala Lumpur collected on 9 December 2008, after landslide. It is situated at latitude 3^°9^'58.94^" N and longitude 101^∘45^'33.392^" E. The image shows the source, track, and expanded toe of the landslide. The lower part of the landslide buried a number of houses, damaged other objects, and carried parts of buildings with down slope. Low et al. [34] carried out detailed analysis of Bukit Antarabangsa landslide and concluded prolonged rainfall as the main causal factor.

3.1.1 Residual analysis

The values of MAR, MMRE, and SDAR for the three types of images of heavy landslide prone region, medium landslide prone region, or no landslide prone region are calculated and shown in Figs. 3a–h, 4a–h, and 5(a–h) respectively. It is observed that in all the three cases, the values obtained using factor analysis is better than that of the other multivariate statistical tools or their hybridized variants. So, all the experimental results shown here are calculated using factor analysis only. Best pixel resolution among 2 × 2–256 × 256, the values of MAR is selected, and MMRE and SDAR are calculated using factor analysis only (Tables 5, 6 and 7) for three types of images. In the case of heavy landslide prone region, it is found (Table 5) that out of three parameters of residual analysis, pixel resolution of 2 × 2 gives better result in terms of MAR and SDAR, while pixel resolution of 256 × 256 gives better result in terms of MMRE. Therefore, pixel resolution of 2 × 2 becomes ideal for heavy landslide prone region. In the case of medium landslide prone region (Table 6), pixel resolution of 2 × 2 gives better result in terms of MAR and SDAR, while pixel resolution of 8 × 8 gives better result in terms of MMRE. Therefore, pixel resolution of 2 × 2 is ideal for medium landslide prone region also.

Table 5 Values of MAR, MMRE, and SDAR for heavy landslide prone regions using FA

Full size table

Table 6 Values of MAR, MMRE, and SDAR for medium landslide prone regions using FA

Full size table

Table 7 Values of MAR, MMRE, and SDAR for no landslide prone regions using FA

Full size table

In the case of no landslide prone region (Table 7), the pixel resolution of 8 × 8 gives better result in terms of MMRE and SDAR, while pixel resolution of 2 × 2 gives better result in terms of MAR. Therefore, pixel resolution of 8 × 8 becomes ideal for no landslide prone region. The pixel resolution of 2 × 2 is suitable in heavy and medium landslide prone region, while pixel resolution of 8 × 8 is suitable for no landslide prone region. So, pixel resolution of 2 × 2 can be considered ideal for landslide images (heavy and medium). The variation of cumulative values of 2 × 2 pixel resolution images of heavy, medium, and no landslide affection after application of FA is shown in Fig. 6. It is found that the range of cumulative values for heavy, medium, and no landslide affection are well separated. The blue and red colors show the cumulative values for landslide affected images (heavy and medium), whereas green color shows no landslide affected images. The ranges of cumulative values after application of FA for different types of images with 2 × 2 pixel resolution are shown in Table 8. It is observed from the experimental results that the cumulative values for no landslide regions lies between 43 and 85, whereas the cumulative values for heavy landslide region lies between 174 and 203. It is also found that the cumulative values for medium landslide regions lie between 87 and 161.

Table 8 Range of cumulative values for different types of 2 × 2 pixel resolution images using FA

Full size table

3.1.2 Validation

In the validation step, 90 landslide images for each pixel resolution 2 × 2–256 × 256 are taken and evaluated in terms of true positive (TP), true negative (TN), false positive (FP), and false negative (FN) values (Tables 9 and 10). From Table 11, it is found that the lowest and the highest accuracies are 92.2 and 83.3% respectively. For performance evaluation, sensitivity, specificity, accuracy, and the area under the receiver operator curve (ROC) metric are used. A comparison between accuracy, sensitivity, and specificity is given. The results are plotted in Fig. 7 which depict higher value of accuracy, sensitivity, and specificity. Figure 7 shows better performance of the present study. The performance of the system at different pixel resolutions is evaluated using receiver operating characteristic (ROC) curves [35, 36] as shown in Fig. 8. The highest area under the curve (93%) (Fig. 8) is obtained for 2 × 2 pixel resolution. Validation statistics, viz., efficiency, misclassification rate, positive predictive power, negative predictive power, sensitivity, specificity, false positive rate, and false negative rate, are calculated and plotted as shown in Fig. 9. The optimal efficiency (92.22%), misclassification rate (7.77%), negative predictive power (94.64%), specificity (92.98%), and false positive rate (7%) are found in the case of 2 × 2 pixel resolution in this study. Optimal value of sensitivity (92.30%) and false negative rate (7.69%) are considered in the case of 16 × 16 pixel resolution.

Table 9 Matching and alarm statistics

Full size table

Table 10 Validation results

Full size table

Table 11 Prediction accuracy

Full size table

From the experimental results, it may be concluded that FA performs better compared to the multivariate statistical tools (PCA or LDA) and their hybridized variants. According to Kim [37], PCA or LDA analyzes all of the variance of the set of variables (common variance and unique variance), whereas FA analyzes only common variance (correlation) of the set of variables. The use of PCA and LDA are data reduction by summarizing many variables into a smaller number of components. On the other hand, FA finds a factor model that can reproduce observed correlation; thus, it aimed at explaining the correlation between variables. In hybridized variants of multivariate statistical tools, values arising from PCA or LDA overpower the effect of values arising from FA. So, more deviation of output values is found in hybridized variants of multivariate statistical tools than that of FA.

4 Conclusions

In this paper, multivariate statistical tools were used for reducing number of features in order to boost performance of the system with regard to landslide susceptibility analysis. Eight databases were created, and residual analysis was used to decide the best multivariate statistical tools and their hybridized variants. Factor analysis was performed and found to be superior to others and applied thereby. The performance of the system was evaluated using various statistical parameters such as sensitivity, specificity, accuracy, and ROC. The experimental results revealed that the pixel resolution 2 × 2 based on factor analysis performs best for heavy and medium landslide images, whereas pixel resolution 8 × 8 based on factor analysis came out as optimal for no landslide images. In terms of accuracy also, 2 × 2 pixel resolutions show outstanding performance. The developed system can be effectively used for landslide susceptibility analysis for different landslide-affected regions with only satellite images in hand without any physical survey. The validation statistics also prove the robustness of the system can enable analysis of landslide susceptibility of remote locations and holds great prospect for saving lives of human being under the threat of disaster.

Abbreviations

AR:: Absolute residual
FA:: Factor analysis
FD:: Fractal dimension
FN:: False negative
FP:: False positive
GIS:: Geographic information system
GLCM:: Gray level co-occurrence matrix
LDA:: Linear discriminant analysis
MAR:: Mean of absolute residual
MMRE:: Mean of mean residual error
MRE:: Mean residual error
PCA:: Principal component analysis
ROC:: Receiver operating characteristic
SDAR:: Standard deviation of absolute residual
TN:: True negative
TP:: True positive

References

C Tarantino, P Blonda, G Pasquariello, Application of change detection techniques for monitoring man-induced landslide causal factors, IEEE. 2, pp 1103-1106 (2004). 0-7803-8742-2/04.
AK Saha, RP Gupta, I Sarkar, MK Arora, E Csaplovics, An approach for GIS-based statistical landslide susceptibility zonation—with a case study in the Himalayas. Landslides. 2, 61–69 (2005). doi:10.1007/s10346-004-0039-8
Article Google Scholar
L Liang, K Chen, U Chang, J Lien, Monitoring and statistical analysis of landslides in Taiwan Island using multi satellite images and GIS Data, IEEE. 2, pp 1231-1234 (2007). 1-4244-1212-9/07.
J Rau, L Chen, J Liu, T Wu, Dynamic monitoring and disaster assessment for watershed management using time-series satellite images, IEEE. 45(6), 0196-2892 (2007).
S Voigt, T Kemper, T Riedlinger, R Kiefl, K Scholte, H Mehl, Satellite image analysis for disaster and crisis-management support, IEEE. 45(6), 0196-2892 (2007).
KE Joyce, GD Dellow, PJ Glassey, Assessing image processing techniques for mapping landslides, IGARSS, IEEE. 2, pp 1231-1234 (2008). 978-1-4244-2808-3/08.
G Martelloni, S Segoni, R Fanti, F Catani, Rainfall thresholds for the forecasting of landslide occurrence at regional scale. Landslides. 9, 485–495 (2012). doi:10.1007/a10346-011-0308-2
Article Google Scholar
DP Kanungo, S Sharma, Rainfall thresholds for prediction of shallow landslides around Chamoli-Joshimath region, Garhwal Himalayas, India. Landslides. 11, 629–638 (2014). doi:10.1007/s10346-013-0438-9
Article Google Scholar
K Biro, B Pradhan, M Buchroithner, F Makeschin, Landuse/Land cover change analysis and its impact on soil Properties in the northern part of gadarif region, Sudan, Land Degradation & Development Wiley online library. (2011). doi: 10.1002/ldr.1116.
A Akgun, EA Sezer, HA Nefeslioglu, C Gokceoglu, B Pradhan, An easy-to-use MATLAB program (MamLand) for the assessment of landslide susceptibility using a Mamdani fuzzy algorithm. Computers & Geosciences. (2011). doi:10.1016/j.cageo.2011.04.012.
B Pradhan, S Lee, MF Buchroithner, Use of geospatial data for the development of fuzzy algebraic operators to landslide hazard mapping: a case study in Malaysia. Appl. Geomatic. 1, 3–15 (2009)
Article Google Scholar
B Pradhan, AM Youssef, Manifestation of remote sensing data and GIS on landslide hazard analysis using spatial-based statistical models, Arabian. J. Geosciences. (2009). doi:10.1007/s12517-009-0089-2
B Pradhan, S Lee, Landslide susceptibility assessment and factor effect analysis: back propagation artificial neural networks and comparison with frequency ratio and bivariate logistic regression modeling. Environ. Model. Softw. 25, 747–759 (2010)
Article Google Scholar
S Lee, DG Evangelista, Earthquake induced landslide susceptibility mapping using an artificial neural network. Nat. Hazards. Earth. Syst. Sci. 6, 687–695 (2006)
Article Google Scholar
S Lee, Application and verification of fuzzy algebraic operators to landslide susceptibility mapping. Environ. Geol. 52, 615–623 (2007)
Article Google Scholar
K Muthu, M Petrou, C Tarantino, P Blonda, Landslide possibility mapping using fuzzy approaches. IEEE. Trans. Geosci. Remote. Sens. 46(4), 1253–1265 (2008). doi:10.1109/tgrs.2007.912441
Article Google Scholar
PK Champatiray, S Dimri, RC Lakhera, S Santosh, Fuzzy-based method for landslide hazard assessment in active seismic zone of Himalaya. Landslides. 4, 101–111 (2007)
Article Google Scholar
J Li, Q Du, C Sun, An improved box-counting method for image fractal dimension estimation. Pattern. Recogn. 422, 460–2469 (2009). doi:10.1016/j.patcog
MATH Google Scholar
RM Hawlick, Statistical and structural approaches to texture. Proc. IEEE. 67(5), 786–804 (1979)
Article Google Scholar
R Thangavel, R Manavalan, Soft computing models based feature selection for TRUS prostate cancer image classification. Soft. Comput. 18, 1165–1176 (2014). doi:10.1007/s00500-013-1135-2
Article Google Scholar
V Vakharia, VK Gupta, PK Kankar, A comparison of feature ranking techniques for fault diagnosis of ball bearing. Soft. Comput. 20, 1601–1619 (2016). doi:10.1007/s00500-015-1608-6
Article Google Scholar
C Fernandez-Lozano, JA Seoane, M Gestal, TR Gaunt, J Dorado, C Campbell, Texture classification using feature selection and kernel based technique. Soft. Comput. 19, 2469–2480 (2015). doi:10.1007/s00500-014-1573-5
Article Google Scholar
M Mercimek, K Gulez, TV Mumcu, Real object recognition using moment invariants. Sadhan. 30(6), 765–775 (2005)
Article MATH Google Scholar
MNB Filho, FJA Sobreira, Accuracy of lacunarity algorithms in texture classification of high spatial resolution images from urban areas, The International Archives of the Photogrammetry. Remote Sensing and Spatial Information Sciences. vxxxvii(b3b), pp 417-422 (2008).
B Fernando, E Fromont, T Tuytelaars, Mining mid-level features for image classification. Int. J. Comput. Vis. 108, 186–203 (2014). doi:10.1007/s11263-014-0700-1
Article MathSciNet Google Scholar
DG Lowe, Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 60(2), 91–110 (2004)
Article Google Scholar
AC Rencher, Methods of Multivariate Analysis, 2nd Edn. (John Willey, USA, 2002), pp 102-105.
R Doerffer, D Murphy, Factor analysis and classification of remotely sensed data for monitoring tidal flats. Helgolander. Meeresunters. 43, 275–293 (1989)
Article Google Scholar
C Rodarmel, J Shan, Principal component analysis for hyperspectral image classification. Surveying. Land. Inf. Syst. 62(2), 115–123 (2002)
Google Scholar
RA Johnson, DW Wichern, Applied Multivariate analysis, 6th Edn. (Pearson Prentice Hall, New Jersey, 2007), pp 102-110.
C Yan, Y Zhang, J Xu, F Dai, J Zhang, Q Dai, F Wu, Efficient parallel framework for HEVC motion estimation on many-core processors. IEEE. Trans. Circuits. Syst. Video. Technol. 24(12), 2077–2089 (2014)
Article Google Scholar
C Yan, Y Zhang, J Xu, F Dai, L Li, Q Dai, F Wu, A highly parallel framework for HEVC coding unit partitioning tree decision on many-core processors. IEEE. Signal. Process. Lett. 21(5), 573–576 (2014)
Article Google Scholar
Ikonos Satellite Image, Centre for Remote Imaging, Sensing and Processing, National University of Singapore. http://www.crisp.nus.edu.sg. Accessed 20 Mar 2016.
TH Low, Area based landslide hazard assessment for hillside development, (Ph. D thesis, University of Malaya, 2011)
E Yesilnacar, T Topal, Landslide susceptibility mapping: a comparison of logistic regression and neural networks methods in medium scale study, Hendek region, Turkey. Eng. Geol. 79, 251–266 (2005). doi:10.1016/j.enggeo.2005.02.002
Article Google Scholar
JA Swets, Measuring the accuracy of the diagnostic systems. Science. 240, 1285–1293 (1988). doi:10.1126/science.3287615.
Article MathSciNet MATH Google Scholar
HJ Kim, Common factor analysis vs principal component analysis: choice for symptom cluster research. Asian. Nurs. Res. 2(1), 17–24 (2008)
Article Google Scholar

Download references

Acknowledgements

We acknowledge Ikonos Satellite Image, Centre for Remote Imaging, Sensing and Processing, National University of Singapore, for providing satellite images after the landslide of Bukit Antarabangsa via http://www.crisp.nus.edu.sg.

Funding

We acknowledge TEQIP-II, Government College of Engineering and Textile Technology, Berhampore, West Bengal, India, for providing financial support.

Authors’ contributions

MIQ designed and tested the proposed algorithm and drafted the manuscript. AH tested the algorithm. MIQ carried out the database creation, application of multivariate statistical tools, and their hybridizations along with in-depth residual analysis. JPC participated in the algorithm design. MIQ performed the performance and statistical analysis. All authors read and approved the final manuscript.

Competing interests

The authors declare that they have no competing interests.

Author information

Authors and Affiliations

Department of Information Technology, Kalyani Government Engineering College, Kalyani, Nadia, West Bengal, India
Iqbal Quraishi & J. Paul Choudhury
Department of Computer Science & Engineering, Govt. College of Engineering and Textile Technology, Berhampore, Murshidabad, West Bengal, India
Abul Hasnat

Authors

Iqbal Quraishi
View author publications
You can also search for this author in PubMed Google Scholar
Abul Hasnat
View author publications
You can also search for this author in PubMed Google Scholar
J. Paul Choudhury
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Iqbal Quraishi.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Cite this article

Quraishi, I., Hasnat, A. & Choudhury, J.P. Selection of optimal pixel resolution for landslide susceptibility analysis within the Bukit Antarabangsa, Kuala Lumpur, by using image processing and multivariate statistical tools. J Image Video Proc. 2017, 21 (2017). https://doi.org/10.1186/s13640-017-0169-2

Download citation

Received: 03 September 2016
Accepted: 15 February 2017
Published: 03 March 2017
DOI: https://doi.org/10.1186/s13640-017-0169-2

Selection of optimal pixel resolution for landslide susceptibility analysis within the Bukit Antarabangsa, Kuala Lumpur, by using image processing and multivariate statistical tools

Abstract

1 Introduction

2 Methods

2.1 Data collection and database creation

2.2 Application of multivariate statistical tools and hybridization

2.2.1 Factor analysis (FA)

2.2.2 Principal component analysis (PCA)

2.2.3 Linear discriminant analysis (LDA)

2.3 Residual analysis

2.4 Validation statistics

3 Results and discussions

3.1 Data collection

3.1.1 Residual analysis

3.1.2 Validation

4 Conclusions

Abbreviations

References

Acknowledgements

Funding

Authors’ contributions

Competing interests

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords