mindspore.dataset.vision
This module is to support vision augmentations. It includes two parts: c_transforms and py_transforms. C_transforms is a high performance image augmentation module which is developed with c++ opencv. Py_transforms provide more kinds of image augmentations which are developed with Python PIL.
Common imported modules in corresponding API examples are as follows:
import mindspore.dataset.vision.c_transforms as c_vision
import mindspore.dataset.vision.py_transforms as py_vision
from mindspore.dataset.transforms import c_transforms
mindspore.dataset.vision.c_transforms
| Apply automatic contrast on input image. | |
| Apply a given image transform on a random selection of bounding box regions of a given image. | |
| Crop the input image at the center to the given size. | |
| Change the color space of the image. | |
| Crop the input image at a specific location. | |
| Apply CutMix transformation on input batch of images and labels. | |
| Randomly cut (mask) out a given number of square patches from the input image array. | |
| Decode the input image in RGB mode(default) or BGR mode(deprecated). | |
| Apply histogram equalization on input image. | |
| Blur input image with the specified Gaussian kernel. | |
| Flip the input image horizontally. | |
| Transpose the input image from shape (H, W, C) to shape (C, H, W). | |
| Apply invert on input image in RGB mode. | |
| Apply MixUp transformation on input batch of images and labels. | |
| Normalize the input image with respect to mean and standard deviation. | |
| Normalize the input image with respect to mean and standard deviation then pad an extra channel with value zero. | |
| Pad the image according to padding parameters. | |
| Apply Random affine transformation to the input image. | |
| Adjust the color of the input image by a fixed or random degree. | |
| Randomly adjust the brightness, contrast, saturation, and hue of the input image. | |
| Crop the input image at a random location. | |
| 
 | A combination of Crop, Decode and Resize. | 
| Crop the input image at a random location and adjust bounding boxes accordingly. | |
| Randomly flip the input image horizontally with a given probability. | |
| 
 | Flip the input image horizontally randomly with a given probability and adjust bounding boxes accordingly. | 
| Reduce the number of bits for each color channel to posterize the input image randomly with a given probability. | |
| Resize the input image using a randomly selected interpolation mode. | |
| Crop the input image to a random size and aspect ratio. | |
| 
 | Crop the input image to a random size and aspect ratio and adjust bounding boxes accordingly. | 
| Tensor operation to resize the input image using a randomly selected interpolation mode and adjust bounding boxes accordingly. | |
| Rotate the input image randomly within a specified range of degrees. | |
| Choose a random sub-policy from a policy list to be applied on the input image. | |
| Adjust the sharpness of the input image by a fixed or random degree. | |
| Randomly selects a subrange within the specified threshold range and sets the pixel value within the subrange to (255 - pixel). | |
| Randomly flip the input image vertically with a given probability. | |
| 
 | Flip the input image vertically, randomly with a given probability and adjust bounding boxes accordingly. | 
| Rescale the input image with the given rescale and shift. | |
| Resize the input image to the given size with a given interpolation mode. | |
| Resize the input image to the given size and adjust bounding boxes accordingly. | |
| Rotate the input image by specified degrees. | |
| Slice Tensor to multiple patches in horizontal and vertical directions. | |
| 
 | A combination of Crop, Decode and Resize using the simulation algorithm of Ascend series chip DVPP module. | 
| 
 | Decode and resize JPEG image using the simulation algorithm of Ascend series chip DVPP module. | 
| Perform randomly selected augmentation on input image. | |
| Flip the input image vertically. | 
mindspore.dataset.vision.py_transforms
| Automatically maximize the contrast of the input PIL Image. | |
| Crop the central region of the input PIL Image with the given size. | |
| Randomly apply a given number of square patches of zeros to a location within the input numpy.ndarray image of shape (C, H, W). | |
| Decode the input raw image to PIL Image format in RGB mode. | |
| Apply histogram equalization on the input PIL Image. | |
| Crop the given image into one central crop and four corners. | |
| Convert the input PIL Image to grayscale. | |
| Convert one or more numpy.ndarray images from HSV to RGB. | |
| Transpose the input numpy.ndarray image of shape (H, W, C) to (C, H, W). | |
| Invert the colors of the input PIL Image. | |
| Transform the input numpy.ndarray image with a given square transformation matrix and a mean vector. | |
| Randomly mix up a batch of images together with its labels. | |
| Normalize the input numpy.ndarray image of shape (C, H, W) with the specified mean and standard deviation. | |
| Normalize the input numpy.ndarray image of shape (C, H, W) with the specified mean and standard deviation, then pad an extra channel filled with zeros. | |
| Pad the input image on all sides with the given padding parameters. | |
| Apply random affine transformation to the input PIL Image. | |
| Adjust the color balance of the input PIL Image by a random degree. | |
| Randomly adjust the brightness, contrast, saturation, and hue of the input PIL Image. | |
| Crop the input PIL Image at a random location with the specified size. | |
| Randomly erase the pixels within a random selected rectangle region with a given probability. | |
| Randomly convert the input image into grayscale with a given probability. | |
| Randomly flip the input image horizontally with a given probability. | |
| Randomly apply perspective transformation to the input PIL Image with a given probability. | |
| Randomly crop the image and resize it to a given size. | |
| Rotate the input PIL Image by a random angle. | |
| Adjust the sharpness of the input PIL Image by a random degree. | |
| Randomly flip the input image vertically with a given probability. | |
| Resize the input PIL Image to the given size. | |
| Convert one or more numpy.ndarray images from RGB to HSV. | |
| Crop the given image into one central crop and four corners plus the flipped version of these. | |
| Convert the input decoded numpy.ndarray image to PIL Image. | |
| Convert the input PIL Image or numpy.ndarray of shape (H, W, C) in the range [0, 255] to numpy.ndarray of shape (C, H, W) in the range [0.0, 1.0] with the desired dtype. | |
| Convert the input numpy.ndarray image to the desired dtype. | |
| Uniformly select a number of transformations from a sequence and apply them sequentially and randomly, which means that there is a chance that a chosen transformation will not be applied. | 
mindspore.dataset.vision.utils
| Padding Mode, Border Type. | |
| Data Format of images after batch operation. | |
| Interpolation Modes. | |
| Mode to Slice Tensor into multiple parts. |