Annotation Based Configuration¶
You can configure aspects of Seldon Core via annotations in the SeldonDeployment resource and also the optional API OAuth Gateway. Please create an issue if you would like some configuration added.
SeldonDeployment Annotations¶
gRPC API Control¶
seldon.io/grpc-max-message-size
: Maximum gRPC message size (bytes)Locations : SeldonDeployment.spec.annotations
Default is MaxInt32
seldon.io/grpc-timeout
: gRPC timeout (msecs)Locations : SeldonDeployment.spec.annotations
Default is no timeout
REST API Control¶
Note
When using REST APIs, timeouts will only apply to each node and not to the
full inference graph.
Therefore, each sub-request for each individual node in the graph will be
able to take up to seldon.io/rest-timeout
milliseconds.
seldon.io/rest-timeout
: REST timeout (msecs)Locations : SeldonDeployment.spec.annotations
Default is no overall timeout but will use GoLang’s default transport settings which include a 30 sec connection timeout.
Service Orchestrator¶
seldon.io/engine-separate-pod
: Use a separate pod for the service orchestratorLocations : SeldonDeployment.metadata.annotations, SeldonDeployment.spec.annotations
Locations : SeldonDeployment.metadata.annotations, SeldonDeployment.spec.annotations
seldon.io/executor-logger-queue-size
: Size of request logging worker queueLocations: SeldonDeployment.metadata.annotations, SeldonDeployment.spec.annotations
seldon.io/executor-logger-write-timeout-ms
: Write timeout for adding to logging work queueLocations: SeldonDeployment.metadata.annotations, SeldonDeployment.spec.annotations
Misc¶
seldon.io/svc-name
: Custom service name for predictor. You will be responsible that it doesn’t clash with any existing service name in the namespace of the deployed SeldonDeployment.Locations : SeldonDeployment.spec.predictors[].annotations