--output Output path (default: input name + extension) --format jpg or png (default: jpg) --width Output width (default: 1920) --height Output height (default: 1080 ...
Abstract: In this work, we propose CleanMel, a single-channel Mel-spectrogram denoising and dereverberation network for improving both speech quality and automatic speech recognition (ASR) performance ...
Abstract: Vision Transformers have shown tremendous success in numerous computer vision applications; however, they have not been exploited for stress assessment using physiological signals such as ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results