Exam Professional Machine Learning Engineer topic 1 question 256 discussion - ExamTopics


AI Summary Hide AI Generated Summary

Problem: Scaling a Machine Learning Model

An online grocery store uses a custom machine learning model deployed on Google Vertex AI to recommend recipes. The model is currently running on a single machine with 8 vCPUs. Anticipating a fourfold increase in traffic during the holiday season, the store needs to scale the model efficiently.

Options and Suggested Solution

Several options are presented to address the increased demand:

  • Option A: Maintain the current machine type, monitor CPU usage, and add compute nodes if necessary.
  • Option B: Increase the machine type's vCPUs to 32, monitor CPU usage, and scale further if needed.
  • Option C: Maintain the current machine type, enable autoscaling based on vCPU usage, monitor CPU usage, and investigate alerts.
  • Option D: Switch to a GPU-based machine type, enable autoscaling based on GPU usage, monitor GPU usage, and investigate alerts.

The suggested solution is Option C. This approach leverages autoscaling for efficient resource management and avoids the potential over-provisioning of resources in options B and D.

Sign in to unlock more AI features Sign in with Google

You work for an online grocery store. You recently developed a custom ML model that recommends a recipe when a user arrives at the website. You chose the machine type on the Vertex AI endpoint to optimize costs by using the queries per second (QPS) that the model can serve, and you deployed it on a single machine with 8 vCPUs and no accelerators.

A holiday season is approaching and you anticipate four times more traffic during this time than the typical daily traffic. You need to ensure that the model can scale efficiently to the increased demand. What should you do?

  • A. 1. Maintain the same machine type on the endpoint. 2. Set up a monitoring job and an alert for CPU usage. 3. If you receive an alert, add a compute node to the endpoint.
  • B. 1. Change the machine type on the endpoint to have 32 vCPUs. 2. Set up a monitoring job and an alert for CPU usage. 3. If you receive an alert, scale the vCPUs further as needed.
  • C. 1. Maintain the same machine type on the endpoint Configure the endpoint to enable autoscaling based on vCPU usage. 2. Set up a monitoring job and an alert for CPU usage. 3. If you receive an alert, investigate the cause.
  • D. 1. Change the machine type on the endpoint to have a GPU. Configure the endpoint to enable autoscaling based on the GPU usage. 2. Set up a monitoring job and an alert for GPU usage. 3. If you receive an alert, investigate the cause.
Show Suggested Answer Hide Answer
Suggested Answer: C πŸ—³οΈ

Was this article displayed correctly? Not happy with what you see?

Tabs Reminder: Tabs piling up in your browser? Set a reminder for them, close them and get notified at the right time.

Try our Chrome extension today!


Share this article with your
friends and colleagues.
Earn points from views and
referrals who sign up.
Learn more

Facebook

Save articles to reading lists
and access them on any device


Share this article with your
friends and colleagues.
Earn points from views and
referrals who sign up.
Learn more

Facebook

Save articles to reading lists
and access them on any device