TY - GEN
T1 - End-To-End Deep Learning for pNN50 Estimation Using a Spatiotemporal Representation
AU - Ziaratnia, Sayyedjavad
AU - Sripian, Peeraya
AU - Laohakangvalvit, Tipporn
AU - Ohzeki, Kazuo
AU - Sugaya, Midori
N1 - Funding Information:
Acknowledgements. This work was partially supported by JSPS KAKENHI Grant Number JP19K20302.
Publisher Copyright:
© 2021, Springer Nature Switzerland AG.
PY - 2021
Y1 - 2021
N2 - Various industries widely use emotion estimation to evaluate their consumer satisfaction towards their products. Generally, emotion can be estimated based on observable expressions such as facial expression, or unobservable expressions such as biological signals. Although used by many research, the Facial Expression Recognition has a lack of precision for expressions that are very similar to each other or a situation where the shown expression differs from the real subject’s emotion. On the other hand, biological signal indexes such as pNN50 can act as a supportive mechanism to improve emotion estimation from observable expressions such as facial expression recognition method. pNN50 is a reliable index to estimate stress-relax, and it originates from unconscious emotions that cannot be manipulated. In this work, we propose a method for pNN50 estimation from facial video using a Deep Learning model. Transfer learning technique and a pre-trained Image recognition Convolutional Neural Network (CNN) model are employed to estimate pNN50 based on a spatiotemporal map created from a series of frames in a facial video. The model which trained on low, middle, and high pNN50 values, shows an accuracy of about 80%. Therefore, it indicates the potential of our proposed method, and we can expand it to categorize the more detailed level of pNN50 values.
AB - Various industries widely use emotion estimation to evaluate their consumer satisfaction towards their products. Generally, emotion can be estimated based on observable expressions such as facial expression, or unobservable expressions such as biological signals. Although used by many research, the Facial Expression Recognition has a lack of precision for expressions that are very similar to each other or a situation where the shown expression differs from the real subject’s emotion. On the other hand, biological signal indexes such as pNN50 can act as a supportive mechanism to improve emotion estimation from observable expressions such as facial expression recognition method. pNN50 is a reliable index to estimate stress-relax, and it originates from unconscious emotions that cannot be manipulated. In this work, we propose a method for pNN50 estimation from facial video using a Deep Learning model. Transfer learning technique and a pre-trained Image recognition Convolutional Neural Network (CNN) model are employed to estimate pNN50 based on a spatiotemporal map created from a series of frames in a facial video. The model which trained on low, middle, and high pNN50 values, shows an accuracy of about 80%. Therefore, it indicates the potential of our proposed method, and we can expand it to categorize the more detailed level of pNN50 values.
KW - Convolutional Neural Network
KW - Deep Learning
KW - pNN50 Estimation
KW - rPPG
UR - http://www.scopus.com/inward/record.url?scp=85112039113&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85112039113&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-78642-7_79
DO - 10.1007/978-3-030-78642-7_79
M3 - Conference contribution
AN - SCOPUS:85112039113
SN - 9783030786410
T3 - Communications in Computer and Information Science
SP - 588
EP - 593
BT - HCI International 2021 - Posters - 23rd HCI International Conference, HCII 2021, Proceedings
A2 - Stephanidis, Constantine
A2 - Antona, Margherita
A2 - Ntoa, Stavroula
PB - Springer Science and Business Media Deutschland GmbH
T2 - 23rd International Conference on Human-Computer Interaction, HCII 2021
Y2 - 24 July 2021 through 29 July 2021
ER -