Function mindspore::dataset::GTZAN

Function Documentation

inline std::shared_ptr<GTZANDataset> mindspore::dataset::GTZAN(const std::string &dataset_dir, const std::string &usage = "all", const std::shared_ptr<Sampler> &sampler = std::make_shared<RandomSampler>(), const std::shared_ptr<DatasetCache> &cache = nullptr)

Function to create a GTZANDataset.

Note

The generated dataset has three columns [“waveform”, “sample_rate”, “label”].

Parameters
  • dataset_dir[in] Path to the root directory that contains the dataset.

  • usage[in] Part of dataset of GTZAN, can be “train”, “valid”, “test”, or “all” (default = “all”).

  • sampler[in] Shared pointer to a sampler object used to choose samples from the dataset. If sampler is not given, a RandomSampler will be used to randomly iterate the entire dataset (default = RandomSampler()).

  • cache[in] Tensor cache to use (default=nullptr, which means no cache is used).

Returns

Shared pointer to the GTZANDataset.

/* Define dataset path and MindData object */
std::string folder_path = "/path/to/gtzan_dataset_directory";
std::shared_ptr<Dataset> ds =
    GTZANDataset(folder_path, usage = "all", std::make_shared<RandomSampler>(false, 10));

/* Create iterator to read dataset */
std::shared_ptr<Iterator> iter = ds->CreateIterator();
std::unordered_map<std::string, mindspore::MSTensor> row;
iter->GetNextRow(&row);

/* Note: In GTZAN dataset, each data dictionary has keys "waveform", "sample_rate" and "label" */
auto waveform = row["waveform"];