Configure an application with compute resource requests, and observe how those requests can either allow or prevent successful scheduling and scaling of its pods.
Outcomes
Observe that memory resource requests allocate cluster node memory.
Explore how adjusting resource requests impacts the number of replicas that can be scheduled on a node.
As the student user on the workstation machine, use the lab command to prepare your system for this exercise.
This command ensures that the following conditions are true:
The reliability-requests project exists.
The resource files are available in the course directory.
The classroom registry has the registry.ocp4.example.com:8443/redhattraining/long-load:v1 container image.
The registry.ocp4.example.com:8443/redhattraining/long-load:v1 container image contains an application with utility endpoints.
These endpoints perform such tasks as crashing the process and toggling the server's health status.
[student@workstation ~]$ lab start reliability-requests
Instructions
As the admin user, deploy the long-load application by applying the long-load-deploy.yaml file in the reliability-requests project.
Log in as the admin user with the redhatocp password.
[student@workstation ~]$ oc login -u admin -p redhatocp \
https://api.ocp4.example.com:6443
Login successful.
...output omitted...
In general, use accounts with the least required privileges to perform a task.
In the classroom environment, this account is the developer user.
However, cluster administrator privileges are required to view the cluster node metrics in this exercise.
View the total memory request allocation for the node.
[student@workstation ~]$ oc describe node master01
...output omitted...
Allocated resources:
  (Total limits may be over 100 percent, i.e., overcommitted.)
  Resource           Requests        Limits
  --------           --------        ------
  cpu                3158m (42%)     980m (13%)
  memory             12667Mi (66%)   1250Mi (6%)
...output omitted...
The command output shows that the pods that are currently running on the node requested a total of 12667 MiB of memory. That value might be slightly different on your system.
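If you only want the resource allocation summary, you can optionally filter the output. A minimal sketch that uses a standard grep option rather than anything provided by the course; adjust the number of trailing lines as needed:
[student@workstation ~]$ oc describe node master01 | grep -A 8 "Allocated resources"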
Projects and objects that remain from previous exercises can cause the memory values in this exercise to differ from the expected results. Delete any unrelated projects before continuing.
If you still experience issues, re-create your classroom environment and try this exercise again.
Select the reliability-requests project.
[student@workstation ~]$ oc project reliability-requests
Now using project "reliability-requests" on server "https://api.ocp4.example.com:6443".
Navigate to the ~/DO180/labs/reliability-requests directory.
Create a deployment, service, and route by using the oc apply command and the long-load-deploy.yaml file.
[student@workstation ~]$ cd DO180/labs/reliability-requests
[student@workstation reliability-requests]$ oc apply -f long-load-deploy.yaml
deployment.apps/long-load created
service/long-load created
route.route.openshift.io/long-load created
Add a resource request to the pod definition and scale the deployment beyond the cluster's capacity.
Modify the long-load-deploy.yaml file by adding a resource request.
The request allocates one gibibyte (1 GiB) to each of the application pods.
spec:
...output omitted...
template:
...output omitted...
spec:
containers:
- image: registry.ocp4.example.com:8443/redhattraining/long-load:v1
resources:
requests:
memory: 1Gi
...output omitted...
Apply the YAML file to modify the deployment with the resource request.
[student@workstation reliability-requests]$ oc apply -f long-load-deploy.yaml
deployment.apps/long-load configured
service/long-load unchanged
route.route.openshift.io/long-load unchanged
Scale the deployment to have 10 replicas.
[student@workstation reliability-requests]$ oc scale deploy/long-load \
  --replicas 10
deployment.apps/long-load scaled
Observe that the cluster cannot schedule all pods on the single node.
The pods with a Pending status cannot be scheduled.
[student@workstation reliability-requests]$ oc get pods
NAME                         READY   STATUS    RESTARTS   AGE
...output omitted...
long-load-86bb4b79f8-44zwd   0/1     Pending   0          58s
...output omitted...
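Optionally, you can list only the unscheduled pods by filtering on the pod phase. A brief sketch that relies on the standard Kubernetes field selector for pod status:
[student@workstation reliability-requests]$ oc get pods --field-selector status.phase=Pending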
Retrieve the cluster event log, and observe that insufficient memory is the cause of the failed scheduling.
[student@workstation reliability-requests]$ oc get events \
  --field-selector reason="FailedScheduling"
...output omitted...
pod/long-load-86bb4b79f8-44zwd   0/1 nodes are available: 1 Insufficient memory.
...output omitted...
Alternatively, view the events for a pending pod to see the reason. In the following command, replace the pod name with one of the pending pods in your classroom.
[student@workstation reliability-requests]$ oc describe \
  pod/long-load-86bb4b79f8-44zwd
...output omitted...
Events:
...output omitted...
  0/1 nodes are available: 1 Insufficient memory.
...output omitted...
Observe that the node's requested memory usage is high.
[student@workstation reliability-requests]$ oc describe node master01
...output omitted...
Allocated resources:
  (Total limits may be over 100 percent, i.e., overcommitted.)
  Resource           Requests        Limits
  --------           --------        ------
  cpu                3158m (42%)     980m (13%)
  memory             18811Mi (99%)   1250Mi (6%)
...output omitted...
The command output shows that the pods from the long-load deployment requested most of the remaining memory from the node.
However, not enough memory is available to accommodate the 10 replicas.
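A quick check of the sample values shows why: the requested memory grew from 12667 Mi to 18811 Mi, a difference of 6144 Mi, which is exactly 6 × 1024 Mi. Only six of the ten 1 GiB replicas could therefore be scheduled, and the remaining four stay in the Pending status. Your values might differ slightly.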
Reduce the requested memory per pod so that the replicas can run on the node.
Manually set the resource request to 250Mi.
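The oc set resources command that follows makes the change imperatively. Equivalently, you could edit long-load-deploy.yaml again and reapply it; a minimal sketch of the corresponding manifest change, assuming the same file structure that you edited earlier:
spec:
...output omitted...
  template:
...output omitted...
    spec:
      containers:
      - image: registry.ocp4.example.com:8443/redhattraining/long-load:v1
        resources:
          requests:
            memory: 250Mi
...output omitted...
Both approaches produce the same pod template; this exercise uses the imperative command.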
[student@workstation reliability-requests]$ oc set resources deploy/long-load \
--requests memory=250Mi
deployment.apps/long-load resource requirements updated
Delete the pods so that they are re-created with the new resource request.
[student@workstation reliability-requests]$ oc delete pod -l app=long-load
pod "long-load-557b4d94f5-29brx" deleted
...output omitted...
Observe that all pods can start with the lowered memory request.
Within a minute, the pods are marked as Ready and in a Running state, with no pods in a Pending status.
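Optionally, instead of re-running oc get pods until the listing settles, you can wait for the readiness condition. A brief sketch that uses the standard oc wait command; the timeout value is an arbitrary choice:
[student@workstation reliability-requests]$ oc wait pod -l app=long-load \
  --for=condition=Ready --timeout=120s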
[student@workstation reliability-requests]$ oc get pods
NAME                         READY   STATUS    RESTARTS   AGE
long-load-557b4d94f5-68hbb   1/1     Running   0          3m14s
long-load-557b4d94f5-bfk7c   1/1     Running   0          3m21s
long-load-557b4d94f5-bnpzh   1/1     Running   0          3m21s
long-load-557b4d94f5-chtv9   1/1     Running   0          3m21s
long-load-557b4d94f5-drg2p   1/1     Running   0          3m14s
long-load-557b4d94f5-hwsz6   1/1     Running   0          3m12s
long-load-557b4d94f5-k5vqj   1/1     Running   0          3m21s
long-load-557b4d94f5-lgstq   1/1     Running   0          3m21s
long-load-557b4d94f5-r8hq4   1/1     Running   0          3m21s
long-load-557b4d94f5-xrg7c   1/1     Running   0          3m21s
Observe that the memory usage of the node is lower.
[student@workstation reliability-requests]$ oc describe node master01
...output omitted...
Allocated resources:
  (Total limits may be over 100 percent, i.e., overcommitted.)
  Resource           Requests        Limits
  --------           --------        ------
  cpu                3158m (42%)     980m (13%)
  memory             15167Mi (80%)   1250Mi (6%)
...output omitted...
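A similar check on the sample values confirms the lower usage: 15167 Mi − 12667 Mi = 2500 Mi, which matches the ten replicas that now request 250 Mi each.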
Return to the /home/student/ directory.
[student@workstation reliability-requests]$ cd /home/student/
[student@workstation ~]$