Are there any reported results for training from scratch (without a pre-trained model) on a very small training set (~10-15 hours)? Or have the developers run any similar experiment? I trained a system from scratch on 15 hours of data with an encoder-decoder attention architecture (CRDNN), and the lowest WER I get over 12 epochs is ~99%. I assume the dataset is too small for training to converge, but I am not sure whether that is the cause or whether I am doing something wrong.
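For context, by WER I mean the usual word-level edit distance divided by the number of reference words. A minimal sketch of that metric is below, assuming plain whitespace tokenization; the function name `word_error_rate` is only illustrative and this is not the toolkit's own scorer. A value near 99% means roughly as many word errors as there are reference words.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Illustrative helper (hypothetical name), assuming whitespace tokenization.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # aligning i reference words with an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(
                min(
                    prev[j] + 1,         # reference word missing from hypothesis
                    curr[j - 1] + 1,     # extra word in hypothesis
                    prev[j - 1] + cost,  # substitution or exact match
                )
            )
        prev = curr
    return prev[-1] / len(ref)
```

Running this on a handful of decoded utterances next to their references is a quick way to see whether the model outputs are empty, repeated, or simply unrelated to the audio.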
Any help/comments are highly appreciated.