Exam Professional Machine Learning Engineer topic 1 question 132 discussion - ExamTopics


A data science team at a bank needs to develop and compare multiple loan default risk prediction models using TensorFlow and Vertex AI on a massive BigQuery dataset, and the optimal data ingestion method is questioned.
AI Summary available β€” skim the key points instantly. Show AI Generated Summary
Show AI Generated Summary

You work on a data science team at a bank and are creating an ML model to predict loan default risk. You have collected and cleaned hundreds of millions of records worth of training data in a BigQuery table, and you now want to develop and compare multiple models on this data using TensorFlow and Vertex AI. You want to minimize any bottlenecks during the data ingestion state while considering scalability. What should you do?

  • A. Use the BigQuery client library to load data into a dataframe, and use tf.data.Dataset.from_tensor_slices() to read it.
  • B. Export data to CSV files in Cloud Storage, and use tf.data.TextLineDataset() to read them.
  • C. Convert the data into TFRecords, and use tf.data.TFRecordDataset() to read them.
  • D. Use TensorFlow I/O’s BigQuery Reader to directly read the data.
Show Suggested Answer Hide Answer
Suggested Answer: D πŸ—³οΈ

Was this article displayed correctly? Not happy with what you see?


Share this article with your
friends and colleagues.

Facebook



Share this article with your
friends and colleagues.

Facebook