perf(wizard): check only 1000 audio files for presence
For a large corpus, e.g., our 110k French sentence corpus, checking for the
presence of all audio files takes a long time and is pointless. So check only a
sample of 1000 when there are more than 1000.
Fixes #466