Function mindspore::dataset::IMDB

Function Documentation

inline std::shared_ptr<IMDBDataset> mindspore::dataset::IMDB(const std::string &dataset_dir, const std::string &usage, const std::reference_wrapper<Sampler> &sampler, const std::shared_ptr<DatasetCache> &cache = nullptr)

A source dataset for reading and parsing IMDB dataset.

Note

The generated dataset has two columns [“text”, “label”].

Parameters
  • dataset_dir[in] Path to the root directory that contains the dataset.

  • usage[in] The type of dataset. Acceptable usages include “train”, “test” or “all”.

  • sampler[in] Sampler object used to choose samples from the dataset.

  • cache[in] Tensor cache to use (default=nullptr, which means no cache is used).

Returns

Shared pointer to the IMDBDataset.