Annotation Based Configuration¶

You can configure aspects of Seldon Core via annotations in the SeldonDeployment resource and also the optional API OAuth Gateway. Please create an issue if you would like some configuration added.

SeldonDeployment Annotations¶

gRPC API Control¶

  • seldon.io/grpc-max-message-size : Maximum gRPC message size (bytes)

  • seldon.io/grpc-timeout : gRPC timeout (msecs)

REST API Control¶

Note

When using REST APIs, timeouts will only apply to each node and not to the full inference graph. Therefore, each sub-request for each individual node in the graph will be able to take up to seldon.io/rest-timeout milliseconds.

  • seldon.io/rest-timeout : REST timeout (msecs)

    • Locations : SeldonDeployment.spec.annotations

    • Default is no overall timeout but will use Go’s default transport settings which include a 30 sec connection timeout.

    • REST timeout example

Service Orchestrator¶

  • seldon.io/engine-separate-pod : Use a separate pod for the service orchestrator

    • Locations : SeldonDeployment.metadata.annotations, SeldonDeployment.spec.annotations

    • Separate svc-orc pod example

    • Locations : SeldonDeployment.metadata.annotations, SeldonDeployment.spec.annotations

  • seldon.io/executor-logger-queue-size : Size of request logging worker queue

    • Locations: SeldonDeployment.metadata.annotations, SeldonDeployment.spec.annotations

  • seldon.io/executor-logger-write-timeout-ms : Write timeout for adding to logging work queue

    • Locations: SeldonDeployment.metadata.annotations, SeldonDeployment.spec.annotations

Misc¶

  • seldon.io/svc-name : Custom service name for predictor. You will be responsible that it doesn’t clash with any existing service name in the namespace of the deployed SeldonDeployment.