Speechbrain/sepformer-whamr with large files

Is it possible to separate large files with speechbrain/sepformer-whamr?
I’ve used VAD to split up larger files which can then be processed by speechbrain/sepformer-whamr but I’m concerned that there could be inconsistency with speaker separation between each file (split using VAD).

Which VAD did you use?
Anyway, I think this is the best way to work with large audio file.
Maybe you should fix the min audio duration for 4-6 seconds

Hey hi, it’s quite unusual to work with large files and DNN. We have a VAD now in SpeechBrain you can try to couple it.

thanks, i’ve since tried out the VAD library and had much better success.