12.4 Deploying the Horizontal Pod Autoscaler
The steps below show how to configure and run the Horizontal Pod Autoscaler (HPA) to scale Oracle Unified Directory (OUD), based on the CPU or memory utilization resource metrics.
Assuming the example OUD configuration in Creating Oracle Unified Directory Instances, three OUD servers are started by
default (oud-ds-rs-0
, oud-ds-rs-1
,
oud-ds-rs-2
).
In the following example an HPA resource is created, targeted at the statefulset
oud-ds-rs
. This resource will autoscale OUD servers from a minimum of 3
OUD servers up to 5 OUD servers. Scaling up will occur when the average CPU is consistently
over 70%. Scaling down will occur when the average CPU is consistently below 70%.
- Find the statefulset used by
OUD:
For example:kubectl get statefulset -n <namespace>
The output will look similar to the following:kubectl get statefulset -n oudns
NAME READY AGE oud-ds-rs 3/3 23h
- Navigate to the
$WORKDIR/kubernetes/hpa
and create anautoscalehpa.yaml
file that contains the following:
where:# apiVersion: autoscaling/v2 kind: HorizontalPodAutoscaler metadata: name: oud-sts-hpa namespace: oudns spec: scaleTargetRef: apiVersion: apps/v1 kind: StatefulSet name: oud-ds-rs #statefulset name of oud behavior: scaleDown: stabilizationWindowSeconds: 60 scaleUp: stabilizationWindowSeconds: 60 minReplicas: 3 maxReplicas: 5 metrics: - type: Resource resource: name: cpu target: type: Utilization averageUtilization: 70
oud-ds-rs
is the stateful set returned earlierminReplicas
should match the number of OUD servers started by default.maxReplicas
should be set to the maximum amount of OUD servers that can be started by HPA.
Note:
For setting HPA based on Memory Metrics, update the metrics block with the following content. Please note, Oracle recommends using only CPU or Memory, not both:metrics: - type: Resource resource: name: memory target: type: Utilization averageUtilization: 70
- Run the following command to create the
autoscaler:
The output will look similar to the following:kubectl apply -f autoscalehpa.yaml
horizontalpodautoscaler.autoscaling/oud-sts-hpa created
- Verify the status of the autoscaler by running the
following:
The output will look similar to the following:kubectl get hpa -n oudns
NAME REFERENCE TARGETS MINPODS MAXPODS REPLICAS AGE oud-sts-hpa StatefulSet/oud-ds-rs 5%/70% 3 5 3 33s