Analysis of thermal videos for detection of lie during interrogation

Certain reflexes and physiological activities of the human

action of this system are subconscious.Whenever there is an increased blood flow to an area, temperature of that area increases.When a person lies, he/she is stressed, and the reflex action of the autonomic nervous system is activated, increasing the blood flow to some distinct areas of the face.The areas most affected are the periorbital and forehead area.Due to increased blood flow, the skin's surface temperature in those areas increases [4].
We can identify whether a person is lying based on the changes in physiological parameters.Polygraph is a technology commonly used for lie detection by capturing these changes.Blood pressure, respiratory, cardiovascular, and electrodermal activity are the common parameters used in lie detection [5,6].The polygraph method has limitations because of the dearth of trained experts, and it being an invasive procedure [7,8].Even if the interrogation session is of a small duration, the time taken to process it is very long [9].
The motivating factors to work out non-invasive methods are: • Unpredictable behavior of participants under present contact-based lie-detection method.• Time-taking and cumbersome procedure to ascertain the detection.
• Lack of well-trained experts.
Research has found that there is a measurable change in the behavioral and physiological parameters of a deceiver from a truth-teller during the time of interrogation.These changes are extremely important to distinguish between liars and truth-tellers [10][11][12][13].Some of the most common non-invasive methods use videos, audio, text [14], a fusion of video, audio and text [15], and thermal imaging [16] for the detection of deceit.
In this work, we focus on the use of thermal imaging for the detection of deceit.Using thermal imaging one can easily measure parameters like respiratory rate [17], pulse rate [18], blood flow [12] and blood flow distribution [18] in a non-invasive manner.The facial blood flow pattern is affected when a person is lying or trying to deceive others.This change in blood flow beneath the skin causes change in skin temperature which can be measured by the use of a thermal camera [19][20][21].

Related works
There are two significant aspects of deceit detection: 1. Creation of an appropriate database.2. Proper examination of the thermal signature to differentiate lie and truth.
In most of the cases a mock crime scenario is planned to simulate deceit, which is essential for database creation.Specific biomarkers are used to identify lies and deceit.The work done on the above two points is described in the following paragraphs.
Simulating guilt and lying: Few databases are available in the field of deceit detection based on a mock crime scenario.Examples of such scenarios performed in the past are concealing a banned object [22], stealing money [7,23] or jewelry [24] and attempt to kill a mannequin for stealing [7].The experiment done by Frank and Ekman in 2004 is one of the ideal models among these scenarios [25].In this experiment, they created a mock crime scenario of stealing money.Some of the subjects stole money as per instruction, and the rest acted according to their own decision.After the experiment, all the subjects were interrogated.In the end, reward was given to truth-tellers and punishment to liars.
In the present work, the experiment has been designed uniquely to emulate reallife theft scenarios.Here the participants were neither informed previously about the experiment nor were instructed to steal.So, when some of them stole and later lied about it, they could have experienced a real feeling of guilt inside them.
Umut Sen et al. [26] have also used real-life scenarios for deceit detection, but instead of an experiment, they have used video clips from real court trials.They have tried to detect deceit in real-life data using verbal, acoustic and visual modalities.
Deceit detection using thermal imaging: The most common physiological parameter which can be captured by the thermal camera is temperature variation of skin surface due to change in blood flow rate.Liars experience two types of stress: one is of acute onset, and the other is slow and sustained.In both cases, different facial regions have an increased blood flow rate.In the first case, there is a sudden increase in blood flow in the periorbital area, whereas in the second case, there is a slow increase in the rate in the forehead area [27,28].
Several researchers have carried out experiments on deceit detection using thermal imaging.Rajoub and Zwiglear [3] have found that if subjects have committed a crime and lie, they have increased skin surface temperature around the periorbital area at the time of interrogation.This increase in temperature was marked particularly when they were answering questions pertinent to the crime, but no such change was observed when they faced other questions unrelated to the crime.On the other hand, for innocent persons, no change in temperature was marked [3].The use of statistical and machine learning methods have been highlighted by both Pollina et al. and Gunes and Piccardi for classifying deceit from truthful cases [8,29].It has been reported that methods for deceit detection using thermal imaging have an accuracy of about 87% [13] to 91.7% [8].
In the present study, a low-cost thermal camera has been used.The temperature of two regions of the face, i.e., forehead and periorbital area, have been recorded and subsequently processed using an incremental tracking algorithm.
The technical contributions of this paper are as follows: • In contrast to the previous works, where most of the experiments have been conducted with a mock crime enacted scenario, our experiment is conducted in a natural setting with real-life stealing.• Participants of the present study are selected at the behest of the police department.
They have a past record of habitual stealing.The recording has been carried out in a concealed manner such that the participants are unaware of the experiment.• Later, the facial thermal videos were analyzed to measure the pattern of blood flow in particular areas of the face to differentiate deceit from non-deceit.• In this work, we have implemented an algorithm that tracks the ROI in the face taking care of the movements of the head of the subjects unlike [7] who had car-ried out the analysis of blood flow rate assuming a completely stationary subject for a very short period of time.
The remaining part of the paper has been organized in the following manner.Section 3 discusses the design of the experiment, protocol, and experimental setup.The methodology followed for the detection of deceit is given in Sect. 4. Section 5 discusses the result, Sect.6 discusses the result analysis and the conclusion is given in Sect.7.

Design of experiment
Ethical committee approval has been taken to conduct the experiments in several phases from the administration of the Midnapore medical college and hospital, where the experiment has been conducted.The subjects for the experiment have been selected with due diligence at the behest of the city police.Subjects who have a track record of pick-pocketing and stealing and do not have any mental or physical health disorders have been chosen for the experiment.The design of the experiment was planned in such a way that a real situation of stealing could be created.The subjects were brought under the plea of routine medical checkups to the hospital at different times individually.
They were asked to wait alone in an isolated room where some currency notes had been dropped.This process would entice the subjects to steal.During this waiting, they were served tea and snacks.Here, they wait for about an hour or so.Subsequently, the subjects were invited to a different room where they were subjected to a friendly interaction process.After some time (about 30 min), another trained expert enters the room for a tougher interrogation as done in police custody.The thermal video and audio recordings have been carried out in a concealed manner.

Selection of subjects
The study includes 33 male subjects with an age range of 18 to 40 years.All subjects were explained that they would have a normal health checkup in the hospital, and in this context, they were invited to the hospital on different times and dates.A set of healthrelated basic questionnaires (General Health Questionnaire-GHQ) were given to each subject to rule out any mental illness.At the end of the experiment, a detailed information about the interactive sessions was explained to each individual, and those who gave consent were included in the study.

Ethical approval
The ethical committee of Midnapore medical college and hospital has approved the experiment protocol.A free medical checkup was carried out for subjects.Due compensation was also paid to every subject after the experiment.Before participation in the experiment, written consent was taken from each subject, where it was mentioned that their data would be used for future research work.

Experiment protocol
The subjects were brought individually to the medical school under the pretext of standard health checkups at the behest of the police.The subjects were brought on different times and days to avoid any chance of interaction among themselves.The experiment has been divided into three stages: • Waiting in a room where an opportunity of stealing has been created.
• Friendly interaction by trained experts.
• Interrogation by trained experts.
The experiment was conducted in separate rooms.In the first stage, while waiting in an isolated room, the subject was enticed to pick up currency notes dropped beneath the table .In some cases, a wallet with cash was also kept deliberately beside the chair.After about 45 to 60 min of waiting, friendly interaction was carried out in the second room with the subject.Subsequently, the interrogation took place in the same room.
The flowchart for the experimental procedure is given in Fig. 1.
The act of stealing This part of the experiment happens in the first room.Some currency notes have been left on the table or haphazardly dropped on the floor beneath the table or chairs.This was done to lure the subjects to steal the money.Without exception, it has been observed that this method of luring the subjects has been very successful as, in most cases, the subjects took the cash from the floor or wallet, leaving the wallet.
Friendly interaction In this stage, a friendly interaction for the subjects was arranged in the second room by a trained faculty member.Some basic questions about their health problems have been asked to the subjects.They are also asked about their family history and questions related to their personal, social, and economic background.These questions are oriented in such a way as to make the subject feel at ease.The facial thermal video was recorded in a concealed manner, as shown in Fig. 2.

Fig. 1 The experimental flowchart
In the next stage, the same setup was used to interrogate the subject.This interrogation by another trained expert was used to find out if he had picked up the cash.
The interrogation stage This stage also happens in the second room.The interrogator involved in this research process has been trained accordingly to handle the questionnaires.He has been chosen to be a male person and is unaware whether the subject has stolen or not while in the waiting room.During the process of interrogation, some of the subjects expressed that they had stolen the cash while in the waiting room.Despite repeated questions, some of the subjects did not admit their act.These persons are taken as liars, and those who expressed their act of stealing in front of the interrogator are taken as truth-tellers.Those who did not steal and admit the same are also taken as truth-tellers.As in the previous stage, the thermal video of the subject has been recorded by a concealed camera.In the end, the subject was explained thoroughly about the experiment, and adequate compensation was paid to each one in the third room.
The change in the facial blood-flow pattern in the subjects during the process of interrogation (deviation from the normal) can possibly lead to the differentiation of truth and lie.The first stage of interaction is set up in such a friendly manner that during the process of interrogation, some of them open up themselves.However, some are not as evasive as expected [30].

Experiment setup
A closed, air-conditioned room was chosen for the interrogation session so that there was no disturbance to the process.The experimental setup is shown in Fig. 2. The thermal camera and the voice recorder were kept in a concealed manner.The temperature of the room has been regulated at 22 °C.
Recording devices FLIR One Pro USB C camera was used for the thermal video recording of the interrogation at a frame rate of 2.5034 frames/s and a visual resolution of 19,200 pixels.A Sony voice recorder was used for audio recording at a sampling rate of 44.1 kHz.A box of the medical instrument having a hole for the lens of the camera was used to cover the thermal camera.A newspaper was used to hide the audio recorder.
Illumination setup The room was illuminated with standard roof lights [31].This makes the subjects feel at ease without any special lights.A total of 33 participants were invited for the experiment, out of which the data of five participants had to be excluded because of their excessive head movements during the interview, which caused their faces to go out of the camera view.The records of the rest 28 participants for the experiment are used for analysis.It has been found that 14 subjects either did not steal cash or admitted the act, while the other 14 participants stole the cash but did not admit their acts during interrogation.This data is given in Table 1.

Methodology
The analysis of the thermal videos for the detection of truth and lies has been carried out by annotating and delineating the audio and thermal data.Here, the blood flow rates estimated from the heat maps of various facial regions have been used as the features to differentiate between the truth-tellers and liars.The block diagram of the methodology is given in Fig. 3.As it is observed from the block diagram the first step is the selection of the region of interest (ROI) which comprises the forehead and the periorbital region of the face.Then the second step is the tracking of the ROI using a proper algorithm discussed below.Then the blood flow rate is calculated in the ROI using Eq. 6.The blood flow rate and frame number obtained from the algorithm are the input features to the SVM classifier.Finally, the SVM classifier separates the subjects into two categories, i.e.,

Category No. of participants
The participants who stole and admitted about stealing 6 (all true responses) The participants who stole and did not admit about stealing 14 (all false responses) The participants who did not steal and did not admit about stealing 8 (all true responses) Fig. 3 Block diagram of the deceit detection system truth-tellers and liars.All the methods, i.e., ROI tracking of the forehead and periorbital region of the face, the calculation of the blood flow rate in the ROI, and the use of Support Vector Machine in classifying the truth-tellers and liars, have been described below.
As the thermal resolution of the camera was not very good, we were unable to measure parameters like the respiratory rate and the pulse rate.In this work, we are focusing only on measuring the temperature and blood flow rate in a non-invasive way using the thermal camera.
It is established from previous studies that there is an increase in blood flow rate in the periorbital area due to immediate stress, whereas if the stress builds up progressively in a sustained manner, there is an increase in blood flow rate in the forehead area.Hence independent analysis has been carried out both for the periorbital and forehead area.
The analysis of the blood flow rate involves three parts, namely, 1. Region of interest (ROI) tracking.
2. Estimating the blood flow rate.
3. Separating the lie and truth response using support vector machine (SVM).

Region of interest tracking
The tracking of ROI is an important step towards finding the blood flow rate in a particular region.We tried to implement the automatic tracking of the face using some face tracking algorithms like Kanade-Lucas-Tomasi (KLT).The problem we faced was that this algorithm did not work well with thermal videos.The tracking of the face is not done properly by using this algorithm.Therefore, we used the incremental tracking algorithm, which has been described below.Here a method developed by Asvadi et al. [32] has been used for ROI tracking.The algorithm uses the RGB histogram of the ROI for tracking.It involves the creation of an object model, creation of the confident map, finding of the new centroid and updation of the object model.The obtained ROI in each frame is used to find the average blood flow rate at a particular frame or time.

Creation of the object model
An object model is created using the RGB histogram of the object and the background region.The object or the ROI is selected manually in the first frame as a rectangle.The object and surrounding rectangles are chosen in such a way that the number of pixels in the object region is the same as the number of pixels in the region surrounding the object.This can be done by choosing the width of the surrounding region as W = √ 2 × w and height as H = √ 2 × h as shown in Fig. 4. Here, the w and h are the width and height of the selected object region.W and H are the width and height of the selected background rectangle.The selected object region is inside the solid red rectangle, and the surrounding background region is the area between the red and dashed black rectangles.In this figure, a sample ROI of the face is shown, but the actual ROI which is used in the algorithm are the forehead and periorbital region of the face.
The object model is created by using the 3D joint RGB histogram of the object and background region.A quantized 3D joint RGB histogram is calculated for the regions representing the inner rectangle and the background area.The object model can be found out using the following relation: where H o (s) is the histogram computed within the object rectangle, and H b (s) is the his- togram for the background region.Here, 8 bins have been used in each channel for histogram quantization.So the index s ranges from 1 to 8 3 and 8 3 is the total number of histogram seeds.Here ε is set to 1.

Finding of confident map
The confident map M x i , y i is created from the object model L s from the object region I x i , y i , c j as given below: where (x i , y i ) is the pixel location in the image coordinate and c j is the color channel of image.

Finding the new centroid
This part of the algorithm relies on the fact that the change in the object location will not be ballistic.So the center of the object rectangle is shifted to the centroid of the current confident map.The center of the object rectangle is shifted from the old location (x i , y i ) to new location (x new , y new ) using Eqs. 3 and 4: (1) (2) The ROI is manually selected in the first frame, and the object model (Sect.4.1.1),confident map, and centroid are calculated.For the subsequent frames, the ROI is tracked by finding the confident map, finding the new centroid, and updating the object model based on the detected ROI as given in Sects.4.1.2to 4.1.4.The tracking of the forehead region for a subject in intermittent frames is shown in Fig. 5.
Algorithm 1 Finding the blood flow rate

Blood flow rate
The blood flow rate is calculated for each frame except the first (taken as zero) from the tracked ROI.It is related to the temperature gradient by the relation [7]: where C S = the heat capacity of skin, V s is the blood flow rate at the skin level, T B = 310 K is the blood temperature at the body core, T s is the skin temperature, K c = 0.168 kcal/ m/h/K is the thermal conductivity of skin, d = the depth of core temperature point from skin surface and C is a constant.The average blood flow rate for each frame is computed using Eq. 6.The initial blood flow rate is assumed to be zero.
The consolidated algorithm for the ROI selection and calculation of the blood flow rate is given in Algorithm 1. Finally, we have used two ROI's which consist of the forehead and the periorbital region.The blood flow rate of the forehead and periorbital region are analyzed separately.
The incremental tracking algorithm works well with the thermal videos unlike other algorithms like KLT and Viola Jones.The SVM is a standard generic method for classification, which is applicable to thermal images as well.This paper uses SVM for deceit classification in real-time.We have improvised the incremental tracking algorithm to find out the blood flow rate in the ROIs in all the frames of the thermal videos of the subjects.On the basis of the blood flow rate analysis, SVM has been used to differentiate the liars from the truth-tellers.
We have compared our algorithm with respect to the KLT tracking algorithm.KLT tracking in a particular thermal video of a subject is shown in Fig. 6.It is evident from the figure that KLT cannot be used for tracking the ROI in the face in the thermal videos.Therefore, we have used a different algorithm in our work.This tracks the ROI, taking care of the movements of the face.Therefore we could not compare the final accuracy of our tracking algorithm with respect to other algorithms.

Support vector classification
SVM is an effective machine learning tool proposed by Vapnik for binary classification problems [33].In a two-class classifier, the goal is to construct a hyperplane, as shown in Fig. 7, which separates the data points of each class while maximizing the distance between the two classes from the hyperplane.Mathematically, the hyperplane is represented by the equation: where W is the weight vector and b is the bias.The optimal hyperplane divides the data points(x) into two such that the data points of each class are on two sides of the plane.That is, (6) The output of algorithm 1 for a thermal video of a subject gives the blood flow rate for each frame of the thermal video.Each of these outputs, i.e., blood flow rate and frame number together, is taken as a data point.The algorithm 1 gives a two-dimensional feature vector whose features are blood flow rate and frame number.The blood flow rate and frame number obtained from the algorithm are the input features of the SVM classifier.The blood flow rate of all the subjects starts from zero as the initial value in the first frame is taken as zero.The value of blood flow rate lies between 0 and 0.1 for all the subjects, and they are not normalized.Also, there are 100 frames of the thermal video taken in our analysis.The data points of all subjects are separated into lie and truth cases, and the hyperplane separating the two classes is found out using SVM.

Result
The data include 14 cases of truth and 14 cases of lie, as shown in Table 1.The plot of the blood flow rate of the forehead and periorbital region of the subjects is shown in Fig. 8.The hyperplane separating the truth and lie cases is shown using dotted lines.The plot shown above in the figure consists of 100 frames in the thermal video.These 100 frames of the video are part of the hard interrogation.It can be observed from the graph of the forehead and the periorbital region that, except for the 3 cases, all have been properly separated by the separating hyperplane.The misclassification includes three lies in the forehead region and three lies in the periorbital region.Therefore the classification accuracy obtained is 89.28%.It is observed that there is a difference in the pattern of the rise of blood flow rate for lie and truth cases both in the periorbital and forehead region.For deceit cases, the rise is rapid; whereas, for the truth cases, the rising rate in intensity is slow and smooth.Though the dotted line separates the truth and lie cases, it can be observed that there are some overlapping on the separating line during the initial frames.This is because the blood flow rate is calculated by taking the initial condition as zero.Also, it can be observed that the separation of the truth and lie cases becomes evident as the interrogation progresses.This change of Fig. 6 A particular frame during KLT tracking in a thermal video pattern of blood flow rate for deceit (rapidly increasing slope) vs. non-deceit (moderately increasing slope) cases can be used to differentiate between them.
We have compared the proposed method with previous work.Pavlidis et al. [7] classified the subjects into deceptive and truthful groups by finding the slope products of the blood flow rate curves.If the slope product (in angle) crosses a threshold value then it is classified as deceptive else it is considered as truth.We also implemented this method in our own database and compared the values of recall, precision, F1 score, and accuracy by both methods.The results are provided in Table 2.
It can be observed from the above table that we get 89.28%accuracy value which is much higher than the accuracy obtained by using the method as in [7].
The 28 subjects are arranged into two categories, out of which subject numbers 1 to 14 are in the truth class and subject numbers 15 to 28 are in the deceit class.From the forehead data, we found out that subject numbers 21, 24, and 27 (who were actually liars) were classified in the truth class by the SVM.Similarly, from the periorbital data, subject numbers 17, 21, and 24 (who were actually liars) were classified in the truth class by the SVM.All the subjects who told the truth were classified correctly by the SVM, which resulted in the number of true negatives (TN) being 14 and false positives (FP) being 0. By combining the forehead and the periorbital data using an OR operation (if both outputs are deceit, the final decision is deceit, if both are true, the final decision is true; and if one of them is true, then the final decision is also true), we got true positives as 12 (TP = 12) and false negatives as 2 (FN = 2).This results in a final accuracy of 92.86%, which is more than our previous accuracy of 89.28%.The confusion matrix for the combined forehead data and the periorbital data is shown in Fig. 9.

Discussion
This paper uses a novel method for simulating guilt.The standard binary classifiers such as a linear SVM could classify the thermal responses into truth and lie with 89.28% accuracy, which is greater than that obtained by Pavlidis [7].The better performance in the classification might be due to the following reasons.
Firstly, the experimental protocol used a real-life crime scenario in which a real act of stealing takes place.The participants have not been asked to enact, unlike the previous works.This might be the reason for the evident thermal signature.The experiments have been done unobtrusively so that the behavior of the subjects is not affected.The natural behavior of the subjects was the advantage of the experiment, though at the same time, there was some difficulty, like the excess movement of the head by some subjects due to their natural behavior or stress causing difficulty in recording.Such movement caused difficulty in proper tracking because sometimes the face was going out of camera view.A pilot study was done in 2019 by the authors, taking ten subjects, but due to less number of subjects, such difficulty was not faced at that time which is seen in the present study [34].Secondly, most of the participants have a record of petty thefts, unlike the previous works where subjects are normal people who are asked to enact.Seventy percent of the 28 participants stole during the experiment.The fact that 70 percent of people stole shows that the participants are habitual stealers and make the database special as the behavior of normal people and habitual stealers are different.During interrogation, it could easily be understood that some people have been adamant regarding their act.
The results obtained also corroborate with earlier findings that the blood flow rate increased steeply for the deceit cases while it increased steadily for non-deceit cases.

Conclusion
The proposed method has been successful in simulating guilt, as it is evident from results obtained from the thermal signatures.Here, the blood flow rate at the forehead and periorbital region is found out from the thermal videos.The results show that almost all the responses could be properly segregated.
In the future scope, we would like to include the voice as well as thermal parameters for detecting deceit which would help in overcoming the shortcoming of the movement of people beyond the camera view.

Fig. 4
Fig. 4 Object and background rectangles selected from face where the red box shows a sample ROI

Fig. 7
Fig. 7 Optimal hyperplane separating the two distinct classes

Fig. 8
Fig. 8 Graph showing support vector machine classification of lie and truth

Fig. 9
Fig. 9 Confusion matrix of the combined forehead data and the periorbital data

Table 2
Comparison of recall, precision, accuracy and F1 score in the periorbital and forehead region by using the Pavlidis method of slope product and our own proposed method