TY - GEN
T1 - Announcement Capture System in Real Environments Using Recurrent Neural Network
AU - Nakazawa, Shintaro
AU - Sasaki, Takeshi
N1 - Publisher Copyright:
© 2020 IEEE.
PY - 2020/1
Y1 - 2020/1
N2 - Announcement is useful transmission mean. It is used in various places. Important information such as evacuation guidance in the case of emergency is often transmitted in the announcements. But some people miss it due to various factors. In this paper, we propose an announcement capture system to resolve that problems. The system consists of three steps: Firstly, announcements are detected in real environmental recordings. Secondly, duration of the announcement is extracted from detected sounds. Finally, extracted announcements are output in some form on user's device. For the first step to develop the proposed system, we validated the performance of the classifier to detect announcements. The classifier was trained in some features using BLSTM which is one of the methods of machine learning. In the validation experiment, the performances of each classifiers trained by varying features were compared. As results of the validation, the feature in the human perceptual aspect was effective to identify announcements. In addition to the result, it was considered that there is a possibility to improve the performance of announcement detection using the feature in the acoustic aspect. However, to incorporate the acoustic feature, reviewing the hyperparameters and removing surrounding sounds of the announcement are required.
AB - Announcement is useful transmission mean. It is used in various places. Important information such as evacuation guidance in the case of emergency is often transmitted in the announcements. But some people miss it due to various factors. In this paper, we propose an announcement capture system to resolve that problems. The system consists of three steps: Firstly, announcements are detected in real environmental recordings. Secondly, duration of the announcement is extracted from detected sounds. Finally, extracted announcements are output in some form on user's device. For the first step to develop the proposed system, we validated the performance of the classifier to detect announcements. The classifier was trained in some features using BLSTM which is one of the methods of machine learning. In the validation experiment, the performances of each classifiers trained by varying features were compared. As results of the validation, the feature in the human perceptual aspect was effective to identify announcements. In addition to the result, it was considered that there is a possibility to improve the performance of announcement detection using the feature in the acoustic aspect. However, to incorporate the acoustic feature, reviewing the hyperparameters and removing surrounding sounds of the announcement are required.
UR - http://www.scopus.com/inward/record.url?scp=85082601817&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85082601817&partnerID=8YFLogxK
U2 - 10.1109/SII46433.2020.9025855
DO - 10.1109/SII46433.2020.9025855
M3 - Conference contribution
AN - SCOPUS:85082601817
T3 - Proceedings of the 2020 IEEE/SICE International Symposium on System Integration, SII 2020
SP - 1046
EP - 1051
BT - Proceedings of the 2020 IEEE/SICE International Symposium on System Integration, SII 2020
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2020 IEEE/SICE International Symposium on System Integration, SII 2020
Y2 - 12 January 2020 through 15 January 2020
ER -