espnet2.fileio.vad_scp.VADScpReader
class espnet2.fileio.vad_scp.VADScpReader(fname, dtype=<class 'numpy.float32'>)
Bases: Mapping
Reader class for ‘vad.scp’.
Unlike segments, which are expected to describe a whole session, vad.scp works at the utterance level. Its main use in ESPnet is to guide silence trimming for UASR.
Examples
key1 0:1.2000
key2 3.0000:4.5000 7.0000:9.0000
…
>>> reader = VADScpReader('vad.scp')
>>> array = reader['key1']
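To make the file format concrete, here is a minimal standalone sketch that parses vad.scp-style text and uses the resulting intervals to drop non-speech samples. It does not import ESPnet and is not the library's implementation: the helper names `parse_vad_scp` and `keep_speech` are hypothetical, and the sketch assumes that each `start:end` pair is a time span in seconds and that pairs on a line are separated by whitespace.

```python
from typing import Dict, List, Sequence, Tuple

Interval = Tuple[float, float]


def parse_vad_scp(text: str) -> Dict[str, List[Interval]]:
    """Parse vad.scp-style text into {key: [(start, end), ...]}.

    Assumption: each line is "<key> <start>:<end> [<start>:<end> ...]".
    """
    table: Dict[str, List[Interval]] = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        key, *spans = line.split()
        intervals: List[Interval] = []
        for span in spans:
            start, end = span.split(":")
            intervals.append((float(start), float(end)))
        table[key] = intervals
    return table


def keep_speech(
    samples: Sequence[float], intervals: List[Interval], sample_rate: int
) -> List[float]:
    """Concatenate the samples that fall inside the given intervals.

    Assumption: interval boundaries are in seconds.
    """
    kept: List[float] = []
    for start, end in intervals:
        first = round(start * sample_rate)
        last = round(end * sample_rate)
        kept.extend(samples[first:last])
    return kept


vad = parse_vad_scp("key1 0:1.2000\nkey2 3.0000:4.5000 7.0000:9.0000\n")

# Toy "waveform": 10 seconds at 10 samples per second.
waveform = list(range(100))
trimmed = keep_speech(waveform, vad["key2"], sample_rate=10)
```

Here `vad["key2"]` holds two intervals, so `trimmed` contains only the samples from those two spans, joined back to back. A real pipeline would apply the same slicing to the audio array loaded for the matching utterance key.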
keys() → a set-like object providing a view on D's keys