Product Overview
SLURM Compatible CLI Tool

SLURM Compatible CLI Tool (SCC) provides AI developers with a CLI tool that is compatible with the SLURM syntax, helping AI developers use Slurm or Slurm-like command lines to perform actions on the SenseCore AI Compute Pool such as initiating and managing tasks and viewing node resources without the need of writing yaml files or configuring complex parameters when users are accustomed to minimal changes, so that the migration cost of users is minimized, and researchers can smoothly complete the transition of large-scale cluster resource management systems and fully enjoy the advantages of the optimized industrial-level large-scale computing power management system.

Product Superiority
Retain the characteristics of SLURM commands, take into account the advantages of containerized distribution, and provide the ultimate system compatibility experience.
  • 01SLURM compatible command line
  • 02Enjoy the unique advantages of container
  • 03Require no complex configuration
  • 04Simple and easy to use
SLURM compatible command line
01SLURM compatible command line

Command line compatible with the SLURM syntax
Support users to directly use slurm-native or slurm-like command lines, such as srun, scontrol, scancel, squeue, sinfo, and other commands to initiate and manage jobs

Enjoy the unique advantages of container
02Enjoy the unique advantages of container

Enjoy the unique advantages of container technology
Support user-initiated jobs to enjoy the ease of deployment, environment consistency, and observability brought by the containerized distributed architecture, and significantly improve resource isolation and utilization based on the modern large-scale cluster resource management system

Require no complex configuration
03Require no complex configuration

Quick application without the need of complex configuration
Automatically integrate built-in command line tools in the AI Cloud Lab, enabling users to easily complete installation and update in other application scenarios, and automatically trigger the encapsulation request when the command line interface is used to initiate related requests

Simple and easy to use
04Simple and easy to use

Simple and easy to use
Free users from the need of learning kubectrl or other related command line syntaxes based on the k8s architecture and the need of writing complex parameters in the yaml file, and enable them to complete script writing and request through simple parameter configuration with the Slurm-like command line

SLURM compatible command line
01
SLURM compatible command line

Command line compatible with the SLURM syntax Support users to directly use slurm-native or slurm-like command lines, such as srun, scontrol, scancel, squeue, sinfo, and other commands to initiate and manage jobs

Enjoy the unique advantages of container
02
Enjoy the unique advantages of container

Enjoy the unique advantages of container technology Support user-initiated jobs to enjoy the ease of deployment, environment consistency, and observability brought by the containerized distributed architecture, and significantly improve resource isolation and utilization based on the modern large-scale cluster resource management system

Require no complex configuration
03
Require no complex configuration

Quick application without the need of complex configuration Automatically integrate built-in command line tools in the AI Cloud Lab, enabling users to easily complete installation and update in other application scenarios, and automatically trigger the encapsulation request when the command line interface is used to initiate related requests

Simple and easy to use
04
Simple and easy to use

Simple and easy to use Free users from the need of learning kubectrl or other related command line syntaxes based on the k8s architecture and the need of writing complex parameters in the yaml file, and enable them to complete script writing and request through simple parameter configuration with the Slurm-like command line

01
/
04
Product Features
Parse and encapsulate SLURM-like commands submitted by users, and submit them to the AI Compute Pool.
  • Submit training tasks
    Submit training tasks

    Support users to initiate training tasks of different frameworks based on the Srun command line, configure the required resource specifications, and return the results in a specified way;

  • Manage training task
    Manage training task

    Support users to perform actions on submitted tasks based on scontrol, scancel, and other command lines, for example, view details and cancel a submitted task;

  • View cluster resources
    View cluster resources

    Support users to comprehensively view the available cluster workspace resources and the current user's task queuing condition based on sinfo, squeue, and other command lines.

Application Scenarios
Large-scale cluster resource management and use, and initiation of multi-node multi-GPU distributed trainings.
  • 01Large-scale distributed training
  • 02Large-scale cluster resource management
Large-scale distributed training
Large-scale distributed training
Based on the slurm command line syntax, enable users to configure resource specifications and the number of nodes, initiate distributed training tasks, and parse the tasks based on the returned results.

Build commands on the CLI frontend to verify user options and parameters.

Transform, encapsulate, and build the request body on the CLI backend and send it, and format the output.

Large-scale cluster resource management
Large-scale cluster resource management
Enable users to view the total amount of available cluster resources, usage, and other dimensions, as well as the queuing condition of current tasks, so as to comprehensively judge the utilization load of cluster resources.

Authenticate identity through authentication mechanism to realize resource permission control.

01Large-scale distributed training
02Large-scale cluster resource management
Large-scale distributed training
Large-scale distributed training
Based on the slurm command line syntax, enable users to configure resource specifications and the number of nodes, initiate distributed training tasks, and parse the tasks based on the returned results.

Build commands on the CLI frontend to verify user options and parameters.

Transform, encapsulate, and build the request body on the CLI backend and send it, and format the output.

Large-scale cluster resource management
Large-scale cluster resource management
Enable users to view the total amount of available cluster resources, usage, and other dimensions, as well as the queuing condition of current tasks, so as to comprehensively judge the utilization load of cluster resources.

Authenticate identity through authentication mechanism to realize resource permission control.

Continuously update the whole line of products and insist on sincere communication and win-win cooperation

Help you achieve new breakthroughs in business with professional AI solutions and advanced AI products