TY - GEN
T1 - Sound-To-Sound Translation Using Generative Adversarial Network and Sound U-Net
AU - Kunisada, Yugo
AU - Premachandra, Chinthaka
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - In this paper, we propose a generic learning method for training conditional generative adversarial networks on audio data. This makes it possible to apply the same generic approach as described in this study to problems that previously required completely different loss formulations when learning audio data. This method can be useful for labeling noises with a certain number of identical frequencies, generating speech labels corresponding to each frequency, and generating audio data for noise cancellation. To achieve this, we propose a sound restoration process based on U-Net, called Sound U-net. In this study, we realized a wide applicability of our system, owing to its ease of implementation without a parameter adjustment, as well as a reduction in the training time for audio data. During the experiment, reasonable results were obtained without manually adjusting the loss function.
AB - In this paper, we propose a generic learning method for training conditional generative adversarial networks on audio data. This makes it possible to apply the same generic approach as described in this study to problems that previously required completely different loss formulations when learning audio data. This method can be useful for labeling noises with a certain number of identical frequencies, generating speech labels corresponding to each frequency, and generating audio data for noise cancellation. To achieve this, we propose a sound restoration process based on U-Net, called Sound U-net. In this study, we realized a wide applicability of our system, owing to its ease of implementation without a parameter adjustment, as well as a reduction in the training time for audio data. During the experiment, reasonable results were obtained without manually adjusting the loss function.
KW - Audio Processing
KW - Conditional GAN
KW - Generative Adversarial Networks
KW - Machine Leaning
KW - Sound U-Net
UR - http://www.scopus.com/inward/record.url?scp=85134016886&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85134016886&partnerID=8YFLogxK
U2 - 10.1109/ICIPRob54042.2022.9798737
DO - 10.1109/ICIPRob54042.2022.9798737
M3 - Conference contribution
AN - SCOPUS:85134016886
T3 - 2022 2nd International Conference on Image Processing and Robotics, ICIPRob 2022
BT - 2022 2nd International Conference on Image Processing and Robotics, ICIPRob 2022
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2nd International Conference on Image Processing and Robotics, ICIPRob 2022
Y2 - 12 March 2022 through 13 March 2022
ER -