Below, I’ll outline these steps in detail, specifically in the context of 🤗 Datasets and the Whisper automatic speech recognition (ASR) model.

Resampling the Audio Data

Resampling is the process of changing the sampling rate of audio data to match the rate the model expects. Most pretrained speech models are trained on audio at a single fixed sampling rate; for Whisper that is 16 kHz. If your dataset’s sampling rate differs, resample it before extracting features, because audio fed in at the wrong rate appears sped up or slowed down relative to what the model saw in training.
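As a minimal sketch of the resampling step, the snippet below assumes your dataset is a 🤗 Datasets `Dataset` with an audio column named `audio` (the column name is an assumption; adjust it to your data). It relies on `cast_column` with the `Audio` feature, which changes the sampling rate used when examples are decoded. The helper names `resampled_length` and `resample_audio_column` are hypothetical and exist only for illustration.

```python
# Sketch: resample an audio column to the 16 kHz rate Whisper expects.
TARGET_SAMPLING_RATE = 16_000  # Hz


def resampled_length(num_samples: int, orig_sr: int,
                     target_sr: int = TARGET_SAMPLING_RATE) -> int:
    """Approximate sample count after resampling (the duration is preserved)."""
    return round(num_samples * target_sr / orig_sr)


def resample_audio_column(dataset, column: str = "audio",
                          target_sr: int = TARGET_SAMPLING_RATE):
    """Return a dataset whose audio column is decoded at target_sr.

    cast_column only updates the column's feature type; the audio itself
    is resampled on the fly when an example is accessed.
    """
    # Imported here so the pure-Python helper above has no dependencies.
    from datasets import Audio

    return dataset.cast_column(column, Audio(sampling_rate=target_sr))


# A 10-second clip recorded at 48 kHz holds 480,000 samples;
# resampled to 16 kHz it holds about 160,000.
print(resampled_length(480_000, 48_000))
```

With a loaded dataset you would call something like `dataset = resample_audio_column(dataset)`; examples read from the returned dataset are then decoded at 16 kHz without rewriting the files on disk.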