C++ API Description for Custom Operators

Overview

The C++ API for MindSpore custom operators is divided into two categories:

API Interfaces: Interfaces marked as [API] are stable public interfaces intended for direct use by users. These interfaces have been thoroughly tested, have clear functionality, and are highly backward compatible.
Experimental Interfaces: Interfaces not marked as [API] are experimental. These interfaces may change or be removed in future versions and should be used with caution.

When developing custom operators, you can include the header files referenced by the following interfaces via #include "ms_extension/api.h", without worrying about the specific location of each interface.

namespace ms

enum TypeId

The TypeId enumeration type is defined in the type_id.h header file and specifies the tensor data types supported in MindSpore, including boolean, integer, floating-point, and complex types.

This interface is also included in the namespace ms namespace and can be accessed via ms::TypeId.

kNumberTypeBegin,       // Start value for the Number type
kNumberTypeBool,        // Boolean type
kNumberTypeInt,         // Default integer type
kNumberTypeInt8,        // 8-bit signed integer
kNumberTypeInt16,       // 16-bit signed integer
kNumberTypeInt32,       // 32-bit signed integer
kNumberTypeInt64,       // 64-bit signed integer
kNumberTypeUInt,        // Default unsigned integer type
kNumberTypeUInt8,       // 8-bit unsigned integer
kNumberTypeUInt16,      // 16-bit unsigned integer
kNumberTypeUInt32,      // 32-bit unsigned integer
kNumberTypeUInt64,      // 64-bit unsigned integer
kNumberTypeFloat,       // Default floating-point type
kNumberTypeFloat16,     // 16-bit half-precision floating-point
kNumberTypeFloat32,     // 32-bit single-precision floating-point
kNumberTypeFloat64,     // 64-bit double-precision floating-point
kNumberTypeBFloat16,    // 16-bit brain floating-point
kNumberTypeDouble,      // Double-precision floating-point (equivalent to kNumberTypeFloat64)
kNumberTypeComplex,     // Default complex number type
kNumberTypeComplex64,   // 64-bit complex number (composed of two 32-bit floating-point numbers)
kNumberTypeComplex128,  // 128-bit complex number (composed of two 64-bit floating-point numbers)
kNumberTypeInt4,        // 4-bit signed integer
kNumberTypeGLUInt,      // OpenGL unsigned integer type
kNumberTypeEnd,         // End value for the Number type

class Tensor

The Tensor class is defined in the tensor.h header file, representing the tensor object in MindSpore. It provides methods for operating on and querying tensor properties.

Constructors

Tensor()
- Description: [API] Constructs an undefined placeholder tensor.
Tensor(TypeId, const ShapeVector &)
```
Tensor(TypeId type_id, const ShapeVector &shape)
```
- Description: [API] Constructs a tensor with the specified data type and shape.
- Parameters:
  - type_id: The data type of the tensor.
  - shape: The shape of the tensor, represented as a vector of integers.
Tensor(const mindspore::ValuePtr &)
```
Tensor(const mindspore::ValuePtr &value)
```
- Description: Constructs a tensor object from the given ValuePtr.
- Parameters:
  - value: A smart pointer to a MindSpore Value object. If the value is nullptr, an undefined tensor is constructed.

Public Methods (Attributes and Configurations)

is_defined()
```
bool is_defined() const
```
- Description: [API] Checks whether the tensor is defined.
- Return Value: Returns true if the tensor is defined, otherwise returns false.
data_type()
```
TypeId data_type() const
```
- Description: [API] Retrieves the data type of the tensor.
- Return Value: The data type of the tensor.
shape()
```
const ShapeVector &shape() const
```
- Description: [API] Retrieves the shape of the tensor.
- Return Value: A reference to the shape of the tensor (ShapeVector, i.e., std::vector<int64_t>).
numel()
```
size_t numel() const
```
- Description: [API] Returns the total number of elements in the tensor.
- Return Value: The total number of elements.
stride()
```
std::vector<int64_t> stride() const
```
- Description: [API] Computes the strides of the tensor.
- Return Value: A vector representing the strides of the tensor for each dimension.
storage_offset()
```
int64_t storage_offset() const
```
- Description: [API] Retrieves the storage offset of the tensor.
- Return Value: The offset from the start of storage (in terms of elements).
is_contiguous()
```
bool is_contiguous() const
```
- Description: [API] Checks whether the tensor is stored contiguously in memory.
- Return Value: Returns true if the tensor is stored contiguously, otherwise returns false.
SetNeedContiguous(bool)
```
void SetNeedContiguous(bool flag) const
```
- Description: [API] Sets whether the tensor requires contiguous storage space.
- Parameters:
  - flag: A boolean value indicating whether the tensor needs contiguous storage.
GetDataPtr()
```
void *GetDataPtr() const
```
- Description: [API] Retrieves a pointer to the tensor data.
- Return Value: A void pointer pointing to the tensor data.
- Note: The returned pointer already includes the offset indicated by the storage_offset() interface.

Public Methods (Internal Processes)

The following methods are not part of the API and are used only in internal module processes. Due to syntax constraints, they are set as public methods but are not recommended for direct use by users.

need_contiguous()
```
bool need_contiguous() const
```
- Description: Checks whether the tensor requires contiguous storage space.
- Return Value: Returns true if the tensor requires contiguous storage, otherwise false.
stub_node()
```
const mindspore::ValuePtr &stub_node() const
```
- Description: Retrieves the stub node associated with the tensor.
- Return Value: A smart pointer to the stub node (ValuePtr).
tensor()
```
const mindspore::tensor::TensorPtr &tensor() const
```
- Description: Retrieves the underlying tensor object.
- Return Value: A smart pointer to the TensorPtr object.
ConvertStubNodeToTensor()
```
void ConvertStubNodeToTensor() const
```
- Description: Converts the stub node into a tensor object.
- Behavior: Ensures that the tensor is fully realized from its stub representation. After the conversion, the stub node is released.

namespace ms::pynative

class PyboostRunner

The PyboostRunner class for PyNative processes is defined in the pyboost_extension.h header file. It provides methods for managing execution, memory allocation, and kernel launching.

PyboostRunner is a subclass of std::enable_shared_from_this and requires the use of the smart pointer std::shared_ptr to manage its objects.

Constructor

PyboostRunner(const std::string &)
```
PyboostRunner(const std::string &op_name)
```
- Description: [API] Constructs a PyboostRunner.
- Parameters:
  - op_name: The name of the operator.

Static Public Methods

static Call(FuncType, Args &&…)
```
template <int OUT_NUM, typename FuncType, typename... Args>
static py::object Call(FuncType func, Args &&... args)
```
- Description: [API] Executes the given function and converts its output to a Python object.
- Template Parameters:
  - OUT_NUM: The number of outputs from the operator, which must match the length of the tensor list returned by func. Currently, scenarios with variable output numbers are not supported.
  - FuncType: The prototype of the operator entry function, which can be automatically recognized from the function arguments.
  - Args: The types of operator input arguments, which can also be automatically recognized from the function arguments. The order of arguments must match the parameter order of func.
- Parameters:
  - func: The function to execute.
  - args: The arguments required to execute the function.
- Return Value: A Python object representing the operator's output.

Public Methods

Run(const std::vector &, const std::vector &)
```
void Run(const std::vector<Tensor> &inputs, const std::vector<Tensor> &outputs)
```
- Description: [API] Runs the operator with the specified inputs and outputs.
- Parameters:
  - inputs: A list of input tensors.
  - outputs: A list of output tensors.
CalcWorkspace()
```
virtual size_t CalcWorkspace()
```
- Description: [API] Calculates the workspace size required by the operator.
- Return Value: The workspace size (in bytes). The default value is 0.
LaunchKernel()
```
virtual void LaunchKernel() = 0;
```
- Description: [API] Launches the kernel function of the operator.
op_name()
```
const std::string &op_name() const
```
- Description: [API] Retrieves the name of the operator associated with the runner.
- Return Value: A string containing the operator's name.
inputs()
```
const std::vector<ms::Tensor> &inputs() const
```
- Description: [API] Retrieves the list of input tensors.
- Return Value: A reference to the list of input tensors.
outputs()
```
const std::vector<ms::Tensor> &outputs() const
```
- Description: [API] Retrieves the list of output tensors.
- Return Value: A reference to the list of output tensors.
stream_id()
```
uint32_t stream_id() const
```
- Description: [API] Retrieves the stream ID associated with the runner.
- Return Value: The stream ID.
stream()
```
void *stream()
```
- Description: [API] Retrieves the stream pointer associated with the runner.
- Return Value: A pointer to the stream.
workspace_ptr()
```
void *workspace_ptr()
```
- Description: [API] Retrieves the workspace pointer of the operator.
- Return Value: A pointer to the workspace memory.

class AtbOpRunner

The AtbOpRunner class is a runner for executing Ascend Transformer Boost (ATB) operators, defined in the atb_common.h header file.

This class inherits from PyboostRunner and encapsulates the process of invoking ATB operators, including initialization, running the ATB operator, managing input/output tensors, memory allocation, and kernel scheduling.

Refer to the tutorial CustomOpBuilder Using AtbOpRunner to Integrate ATB Operators for usage methods.

Constructor

AtbOpRunner
```
using PyboostRunner::PyboostRunner;
```
Constructor inherited from PyboostRunner.

Public Methods

Init(const ParamType&)
```
template <typename ParamType>
void Init(const ParamType &param)
```
- Description: [API] Initializes the ATB operator with the given parameters. This method creates a corresponding atb::Operation instance for the operator via atb::CreateOperation and places it in the cache. Only one atb::Operation instance is created for operators with the same param hash value.
- Parameters:
  - param: Parameters used to configure the ATB operator.
- Note: For the ParamType type passed in, you need to specialize the template <> struct HashOpParam<ParamType>::operator() function in advance.

function RunAtbOp

The interface for executing ATB operators in dynamic graphs, defined in the atb_common.h header file.

template <typename ParamType>
void RunAtbOp(const std::string &op_name, const ParamType &param, const std::vector<Tensor> &inputs,
              const std::vector<Tensor> &outputs)

[API] Executes an ATB operator using the provided parameters, inputs, and outputs. This function is a wrapper around AtbOpRunner.

Parameters:
- op_name: The name of the ATB operator to execute.
- param: Parameters required to initialize the ATB operator.
- inputs: A list of input tensors for the operator.
- outputs: A list of output tensors for the operator.