Page Reader AI | Exam Professional Machine Learning Engineer topic 1 question 256 discussion

Exam Professional Machine Learning Engineer topic 1 question 256 discussion - ExamTopics

AI Summary Hide AI Generated Summary

Problem: Scaling a Machine Learning Model

An online grocery store uses a custom machine learning model deployed on Google Vertex AI to recommend recipes. The model is currently running on a single machine with 8 vCPUs. Anticipating a fourfold increase in traffic during the holiday season, the store needs to scale the model efficiently.

Options and Suggested Solution

Several options are presented to address the increased demand:

Option A: Maintain the current machine type, monitor CPU usage, and add compute nodes if necessary.
Option B: Increase the machine type's vCPUs to 32, monitor CPU usage, and scale further if needed.
Option C: Maintain the current machine type, enable autoscaling based on vCPU usage, monitor CPU usage, and investigate alerts.
Option D: Switch to a GPU-based machine type, enable autoscaling based on GPU usage, monitor GPU usage, and investigate alerts.

The suggested solution is Option C. This approach leverages autoscaling for efficient resource management and avoids the potential over-provisioning of resources in options B and D.

You work for an online grocery store. You recently developed a custom ML model that recommends a recipe when a user arrives at the website. You chose the machine type on the Vertex AI endpoint to optimize costs by using the queries per second (QPS) that the model can serve, and you deployed it on a single machine with 8 vCPUs and no accelerators.

A holiday season is approaching and you anticipate four times more traffic during this time than the typical daily traffic. You need to ensure that the model can scale efficiently to the increased demand. What should you do?

A. 1. Maintain the same machine type on the endpoint. 2. Set up a monitoring job and an alert for CPU usage. 3. If you receive an alert, add a compute node to the endpoint.
B. 1. Change the machine type on the endpoint to have 32 vCPUs. 2. Set up a monitoring job and an alert for CPU usage. 3. If you receive an alert, scale the vCPUs further as needed.
C. 1. Maintain the same machine type on the endpoint Configure the endpoint to enable autoscaling based on vCPU usage. 2. Set up a monitoring job and an alert for CPU usage. 3. If you receive an alert, investigate the cause.
D. 1. Change the machine type on the endpoint to have a GPU. Configure the endpoint to enable autoscaling based on the GPU usage. 2. Set up a monitoring job and an alert for GPU usage. 3. If you receive an alert, investigate the cause.

Show Suggested Answer Hide Answer

Was this article displayed correctly? Not happy with what you see?

See Archived Versions Request Manual Review

Category: Engineering

Tags: MachineLearning GoogleCloud VertexAI Autoscaling ModelDeployment

Tabs Reminder: Tabs piling up in your browser? Set a reminder for them, close them and get notified at the right time.

If you often open multiple tabs and struggle to keep track of them, Tabs Reminder is the solution you need. Tabs Reminder lets you set reminders for tabs so you can close them and get notified about them later. Never lose track of important tabs again with Tabs Reminder!

Try our Chrome extension today!

Add to Chrome

Save As Favorite

Add To Reading List

Share this article with your
friends and colleagues.
Earn points from views and
referrals who sign up.
Learn more

Twitter/X

WhatsApp

Facebook

Save articles to reading lists
and access them on any device