clean_corpus.sh
Less than 1 minute
clean_corpus.sh
Usage: clean_corpus.sh [options] <data-dir> <langs>
e.g.: clean_corpus.sh data/train "en de"
Options:
--maxframes # number of maximum input frame length
--maxchars # number of maximum character length
--utt_extra_files # extra text files for target sequence
--no_feat # set to True for MT recipe