inference scale

Allows for the definition of scaling policies for an Inference Endpoint.

Usage

flexai inference scale <inference_endpoint_name> [flags]

Argument	Type	Required	Description
`inference_endpoint_name`	string	Yes	The name of the Inference Endpoint to scale.

Flag	Short	Type	Description
`--help`	`-h`	boolean	Displays this help page.
`--max-replicas`		integer	The maximum number of replicas to scale to.
`--min-replicas`		integer	The minimum number of replicas to maintain.
`--verbose`	`-v`	boolean	Provides more detailed output when scaling an Inference Endpoint.

⌘I