mindspore.dataset.audio

This module supports audio augmentation. It has two parts: transforms and utils. transforms is a high-performance processing module that provides common audio operations. utils provides general-purpose helpers for audio processing.

The API examples for this module assume the following imports:

import mindspore.dataset as ds
import mindspore.dataset.audio.transforms as audio

Common data processing terms are described as follows:

  • TensorOperation, the base class of all data processing operations implemented in C++.

  • AudioTensorOperation, the base class of all audio processing operations. It is a derived class of TensorOperation.

mindspore.dataset.audio.transforms

mindspore.dataset.audio.transforms.AllpassBiquad

Design a two-pole all-pass filter with the given central frequency and bandwidth for an audio waveform.

mindspore.dataset.audio.transforms.AmplitudeToDB

Convert the input audio waveform from the amplitude or power scale to the decibel scale.
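The conversion to decibels can be sketched in plain Python. This is a minimal illustration of the usual formula (10·log10 for power, 20·log10 for magnitude), not the library's implementation; the function name `amplitude_to_db` and its parameters are chosen here for illustration, and the clamping and dynamic-range handling are assumptions.

```python
import math

def amplitude_to_db(values, scale="power", ref_value=1.0, amin=1e-10, top_db=80.0):
    """Convert non-negative power or magnitude values to decibels (illustrative sketch)."""
    # Power values use 10*log10, magnitude values use 20*log10.
    multiplier = 10.0 if scale == "power" else 20.0
    # Clamp each value to amin so that log10 never receives zero.
    db = [multiplier * math.log10(max(v, amin) / ref_value) for v in values]
    # Limit the dynamic range to top_db below the loudest value (assumed behavior).
    floor = max(db) - top_db
    return [max(d, floor) for d in db]

power_db = amplitude_to_db([1.0, 100.0])
magnitude_db = amplitude_to_db([10.0], scale="magnitude")
```

A power ratio of 100 and a magnitude ratio of 10 both correspond to 20 dB, which is why the two scales need different multipliers.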

mindspore.dataset.audio.transforms.Angle

Calculate the angle of a complex number sequence.
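As a rough sketch of what "angle" means here, the phase of each complex number can be computed with `atan2`. The layout of the input as (real, imaginary) pairs is an assumption made for this example, and the helper name `angle` is hypothetical.

```python
import math

def angle(complex_pairs):
    """Return the phase angle in radians of each (real, imaginary) pair."""
    return [math.atan2(imag, real) for real, imag in complex_pairs]

phases = angle([(1.0, 0.0), (0.0, 1.0), (-1.0, 0.0)])
```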

mindspore.dataset.audio.transforms.BandBiquad

Design a two-pole band-pass filter for an audio waveform.

mindspore.dataset.audio.transforms.BandpassBiquad

Design a two-pole Butterworth band-pass filter for an audio waveform.

mindspore.dataset.audio.transforms.BandrejectBiquad

Design a two-pole Butterworth band-reject filter for an audio waveform.

mindspore.dataset.audio.transforms.BassBiquad

Design a bass tone-control effect, also known as a two-pole low-shelf filter, for an audio waveform.

mindspore.dataset.audio.transforms.ComplexNorm

Compute the norm of a complex number sequence.
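The norm of a complex sequence can be illustrated in a few lines. This sketch assumes the input is a list of (real, imaginary) pairs and that an optional exponent is applied to the magnitude; the helper name `complex_norm` and the `power` parameter are used here for illustration only.

```python
import math

def complex_norm(complex_pairs, power=1.0):
    """Return the magnitude of each (real, imaginary) pair, raised to `power`."""
    return [math.hypot(real, imag) ** power for real, imag in complex_pairs]

magnitudes = complex_norm([(3.0, 4.0)])
energies = complex_norm([(3.0, 4.0)], power=2.0)
```

With `power=1.0` the result is the magnitude; with `power=2.0` it is the squared magnitude, often used as a power spectrogram.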

mindspore.dataset.audio.transforms.Contrast

Apply a contrast effect to an audio waveform.

mindspore.dataset.audio.transforms.FrequencyMasking

Apply masking to a spectrogram in the frequency domain.

mindspore.dataset.audio.transforms.LowpassBiquad

Design a two-pole low-pass filter for an audio waveform.
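To make "two-pole low-pass filter" concrete, the following sketch builds biquad coefficients using the widely used Audio EQ Cookbook formulas and applies them sample by sample. It is an assumption that this matches the library's internals; the function names `lowpass_biquad_coeffs` and `biquad` and the default quality factor are hypothetical.

```python
import math

def lowpass_biquad_coeffs(sample_rate, cutoff_freq, q=0.707):
    """Return normalized (b, a) coefficients for a two-pole low-pass biquad."""
    w0 = 2.0 * math.pi * cutoff_freq / sample_rate
    alpha = math.sin(w0) / (2.0 * q)
    cos_w0 = math.cos(w0)
    b = [(1.0 - cos_w0) / 2.0, 1.0 - cos_w0, (1.0 - cos_w0) / 2.0]
    a = [1.0 + alpha, -2.0 * cos_w0, 1.0 - alpha]
    # Normalize so that a[0] == 1.
    return [x / a[0] for x in b], [x / a[0] for x in a]

def biquad(samples, b, a):
    """Filter samples with the difference equation of a normalized biquad."""
    x1 = x2 = y1 = y2 = 0.0
    out = []
    for x0 in samples:
        y0 = b[0] * x0 + b[1] * x1 + b[2] * x2 - a[1] * y1 - a[2] * y2
        out.append(y0)
        x2, x1 = x1, x0
        y2, y1 = y1, y0
    return out

b, a = lowpass_biquad_coeffs(sample_rate=16000, cutoff_freq=1000)
dc_response = biquad([1.0] * 2000, b, a)  # constant (0 Hz) input
nyquist_response = biquad([(-1.0) ** n for n in range(2000)], b, a)  # alternating input
```

A constant input passes through with unit gain once the filter settles, while an input alternating at the Nyquist frequency is suppressed, which is the expected low-pass behavior.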

mindspore.dataset.audio.transforms.TimeMasking

Apply masking to a spectrogram in the time domain.
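TimeMasking and FrequencyMasking differ only in the axis that is masked. The sketch below shows the idea on a spectrogram stored as a list of frequency rows, each holding one value per time frame. The mask position and width are passed explicitly to keep the example deterministic; the helper name `mask_spectrogram` and its parameters are hypothetical and do not reflect the library's signature.

```python
def mask_spectrogram(spec, start, width, axis="time", mask_value=0.0):
    """Return a copy of `spec` with `width` consecutive bins masked along `axis`."""
    masked = [row[:] for row in spec]  # copy so the input is not modified
    if axis == "frequency":
        # Overwrite whole frequency rows.
        for f in range(start, min(start + width, len(masked))):
            masked[f] = [mask_value] * len(masked[f])
    else:
        # Overwrite the same time frames in every frequency row.
        for row in masked:
            for t in range(start, min(start + width, len(row))):
                row[t] = mask_value
    return masked

spec = [[1.0] * 4 for _ in range(3)]  # 3 frequency bins x 4 time frames
time_masked = mask_spectrogram(spec, start=1, width=2, axis="time")
freq_masked = mask_spectrogram(spec, start=0, width=1, axis="frequency")
```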

mindspore.dataset.audio.transforms.TimeStretch

Stretch a Short-Time Fourier Transform (STFT) in time by a given rate without modifying the pitch.

mindspore.dataset.audio.utils

mindspore.dataset.audio.utils.ScaleType

Enumeration of scale types.