Module clients.dai_engine_profile.dai_engine_profile
Functions
from_api_object
def from_api_object(api_object: h2o_engine_manager.gen.model.v1_dai_engine_profile.V1DAIEngineProfile) ‑> DAIEngineProfile
Classes
DAIEngineProfile
class DAIEngineProfile(cpu_constraint: h2o_engine_manager.clients.constraint.profile_constraint_numeric.ProfileConstraintNumeric, gpu_constraint: h2o_engine_manager.clients.constraint.profile_constraint_numeric.ProfileConstraintNumeric, memory_bytes_constraint: h2o_engine_manager.clients.constraint.profile_constraint_numeric.ProfileConstraintNumeric, storage_bytes_constraint: h2o_engine_manager.clients.constraint.profile_constraint_numeric.ProfileConstraintNumeric, max_idle_duration_constraint: h2o_engine_manager.clients.constraint.profile_constraint_duration.ProfileConstraintDuration, max_running_duration_constraint: h2o_engine_manager.clients.constraint.profile_constraint_duration.ProfileConstraintDuration, config_editability: h2o_engine_manager.clients.dai_engine_profile.config_editability.ConfigEditability, name: str = '', display_name: str = '', priority: int = 0, enabled: bool = True, assigned_oidc_roles_enabled: bool = True, assigned_oidc_roles: Optional[List[str]] = None, max_running_engines: Optional[int] = None, max_non_interaction_duration: Optional[str] = None, max_unused_duration: Optional[str] = None, configuration_override: Optional[Dict[str, str]] = None, base_configuration: Optional[Dict[str, str]] = None, yaml_pod_template_spec: str = '', yaml_gpu_tolerations: str = '', triton_enabled: bool = False, create_time: Optional[datetime.datetime] = None, update_time: Optional[datetime.datetime] = None, creator: str = '', updater: str = '', creator_display_name: str = '', updater_display_name: str = '')
DAIEngineProfile represents a set of values that are used for DAIEngine.
DAIEngineProfile represents a set of values that are used for DAIEngine.
Args
cpu_constraint
- Constraint for each DAIEngine's cpu that uses this profile.
gpu_constraint
- Constraint for each DAIEngine's gpu that uses this profile.
memory_bytes_constraint
- Constraint for each DAIEngine's memory_bytes that uses this profile.
storage_bytes_constraint
- Constraint for each DAIEngine's storage_bytes that uses this profile.
max_idle_duration_constraint
- Constraint for each DAIEngine's max_idle_duration that uses this profile.
max_running_duration_constraint
- Constraint for each DAIEngine's max_running_duration that uses this profile.
config_editability
- Specifies the behavior of DAIEngine.config editability when DAIEngine is using this profile.
name
- Resource name. Format "daiEngineProfiles/*".
display_name
- Human-readable name.
priority
- Priority of the DAIEngineProfile. Lower value means higher priority. Priority is NOT a unique value (any two DAIEngineProfiles can have the same priority value).
enabled
- When set to true, the DAIEngineProfile is enabled and can be used in DAIEngine. When set to false, the DAIEngineProfile is disabled and cannot be used in any DAIEngine.
assigned_oidc_roles_enabled
- When set to true, the assigned_oidc_roles field is verified when a user uses this profile.
assigned_oidc_roles
- List of OIDC roles assigned to this DAIEngineProfile. When profile has assigned some OIDC roles and verification of this list is enabled (assigned_oidc_roles_enabled=true), then this profile can be used only by users who have assigned at least one role from this list.
max_running_engines
- Maximum number of DAIEngines per user that can be running when using this DAIEngineProfile.
max_non_interaction_duration
- Max non-interation duration applied on all DAIEngines that use this profile.
max_unused_duration
- Max unused duration applied on all DAIEngines that use this profile.
configuration_override
- configuration_override is applied on top of all other configurations when creating the final configuration that is passed to the DAI server.
base_configuration
- base configuration forms the basis of the final configuration that is passed to the DAI server.
yaml_pod_template_spec
- YAML representation of custom PodTemplateSpec. Definition of PodTemplateSpec: https://kubernetes.io/docs/reference/generated/kubernetes-api/v1.27/#podtemplatespec-v1-coreWhen specified, then it is applied for each DAIEngine that uses this profile. PodTemplateSpec describes what will be applied on top of a regular DriverlessAI pod before it is created. This template is merged into DriverlessAI default pod using StrategicMergePatch method (it overrides the default pod). More info about StrategicMergePatch: https://kubernetes.io/docs/tasks/manage-kubernetes-objects/update-api-object-kubectl-patch/
yaml_gpu_tolerations
- YAML representation of custom GPU Tolerations. Definition of one Toleration: https://kubernetes.io/docs/reference/generated/kubernetes-api/v1.27/#toleration-v1-coreWhen specified, then it is applied for each DAIEngine that uses this profile. GPUTolerations sets DriverlessAI's pod.spec.tolerations in case DAIEngine has GPU > 0. This will override any tolerations defined in yaml_pod_template_spec.PodSpec.Tolerations field.
triton_enabled
- True when DAI built-in Triton inference server is enabled, false when it is disabled.
create_time
- Time when the DAIEngineProfile was created.
update_time
- Time when the DAIEngineProfile was last updated.
creator
- Name of entity that created the DAIEngineProfile.
updater
- Name of entity that last updated the DAIEngineProfile.
creator_display_name
- Human-readable name of entity that created the DAIEngineProfile.
updater_display_name
- Human-readable name of entity that last updated the DAIEngineProfile.
Methods
Feedback
- Submit and view feedback for this page
- Send feedback about AI Engine Manager to cloud-feedback@h2o.ai