Function-Related

Q: The WikiText dataset download link is not available.

A: The official download link is currently unavailable. Please follow community Issue #IBV35D for updates.


Q: How Do I Generate a Model Sharding Strategy File?

A: The model sharding strategy file records how model weights are sharded in distributed scenarios and is generally used when slicing weights offline. Set only_save_strategy: True in the network yaml file and start the distributed task as usual; the distributed strategy file is then generated in the output/strategy/ directory. For details, refer to the Tutorial on Slicing and Merging Distributed Weights.
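
After the task finishes, it can be useful to confirm that something was actually written to the strategy directory. The following is a minimal sketch, assuming the output/strategy/ path mentioned above is relative to the current working directory. The helper name list_strategy_files is hypothetical and is not part of MindSpore Transformers, and the sketch makes no assumption about how the generated files are named.

```python
from pathlib import Path


def list_strategy_files(strategy_dir="output/strategy"):
    """Return the sorted names of regular files found in strategy_dir.

    Hypothetical helper, not a MindSpore Transformers API. Returns an
    empty list when the directory does not exist yet.
    """
    root = Path(strategy_dir)
    if not root.is_dir():
        return []
    # Only regular files are reported; subdirectories are skipped.
    return sorted(p.name for p in root.iterdir() if p.is_file())


if __name__ == "__main__":
    files = list_strategy_files()
    if files:
        print("Strategy files found:", ", ".join(files))
    else:
        print("No strategy files found; check only_save_strategy in the yaml file.")
```

An empty result usually suggests that the task ran from a different working directory or that only_save_strategy was not set in the yaml file that was actually loaded.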


Q: What Should I Do When socket.gaierror: [Errno -2] Name or service not known or socket.gaierror: [Errno -3] Temporary failure in name resolution Is Reported While Generating the ranktable File?

A: Starting from MindSpore Transformers r1.2.0, clusters are started uniformly with msrun, and the ranktable startup method is deprecated. Start the task with msrun instead of generating a ranktable file.
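
If the error is seen on an older release that still generates a ranktable file, both messages are raised by Python's socket module when a hostname cannot be resolved. The sketch below is one way to check this, under the assumption that the lookup of the local hostname is what fails; the helper name can_resolve is hypothetical and is not part of MindSpore Transformers.

```python
import socket


def can_resolve(hostname):
    """Return True if hostname resolves to an address, False on socket.gaierror."""
    try:
        socket.getaddrinfo(hostname, None)
        return True
    except socket.gaierror:
        return False


if __name__ == "__main__":
    # Check the name this machine reports for itself.
    name = socket.gethostname()
    print(f"{name!r} resolves: {can_resolve(name)}")
```

If the local hostname does not resolve, the cause is typically in the system's name resolution setup (for example, a missing hosts entry) rather than in the training code.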


© Copyright MindSpore.

Built with Sphinx using a theme provided by Read the Docs.