You recently developed a wide and deep model in TensorFlow. You generated training datasets using a SQL script that preprocessed raw data in BigQuery by performing instance-level transformations of the data. You need to create a training pipeline to retrain the model on a weekly basis. The trained model will be used to generate daily recommendations. You want to minimize model development and training time. How should you develop the training pipeline?

Question

Accepted Answer

A. Use the Kubeflow Pipelines SDK to implement the pipeline. Use the BigQueryJobOp component. Because the training dataset preprocessing is already handled by a SQL script in BigQuery, using the Kubeflow Pipelines SDK with a `BigQueryJobOp` (or similar BigQuery component) lets the pipeline execute the existing SQL preprocessing directly. This minimizes model development time because the preprocessing logic does not have to be rewritten or adapted for another framework.
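To illustrate why reusing the SQL script is cheap, here is a minimal sketch of the query-job configuration a BigQuery pipeline step might receive. The field names follow the BigQuery REST API's query job configuration. The helper `build_query_job_config`, the SQL text, and the project, dataset, and table names are all hypothetical. Passing this dict to a Kubeflow BigQuery component is an assumption about how the pipeline would be wired, not something the question specifies.

```python
from typing import Any, Dict


def build_query_job_config(
    sql: str, project: str, dataset: str, table: str
) -> Dict[str, Any]:
    """Build a BigQuery query-job configuration that writes `sql` results to a table.

    Hypothetical helper: the existing preprocessing SQL is passed through
    unchanged, so no transformation logic has to be rewritten.
    """
    return {
        "query": sql,
        "useLegacySql": False,  # run the script as standard SQL
        "destinationTable": {
            "projectId": project,
            "datasetId": dataset,
            "tableId": table,
        },
        # Overwrite the training table on each weekly run.
        "writeDisposition": "WRITE_TRUNCATE",
    }


# Hypothetical stand-in for the existing preprocessing script.
PREPROCESS_SQL = (
    "SELECT user_id, LOWER(country) AS country "
    "FROM `my-project.raw.events`"
)

config = build_query_job_config(
    PREPROCESS_SQL, "my-project", "training", "weekly_examples"
)
```

In a real pipeline, the table named in `destinationTable` would then be the input to the training step, and the pipeline itself would be scheduled to run weekly.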

Answer

B. Use the Kubeflow Pipelines SDK to implement the pipeline. Use the DataflowPythonJobOp. Using `DataflowPythonJobOp` would require rewriting the existing SQL preprocessing logic in Python for Dataflow, which increases development time instead of minimizing it.

Answer

C. Use the TensorFlow Extended SDK to implement the pipeline. Use the ExampleGen component. While TensorFlow Extended (TFX) is a robust framework, its `ExampleGen` component alone might not be sufficient to incorporate complex BigQuery SQL preprocessing directly, and adapting the SQL logic to TFX's data transformation components would typically involve more development effort.

Answer

D. Use the TensorFlow Extended SDK to implement the pipeline. Implement the preprocessing. Implementing preprocessing within the TensorFlow Extended (TFX) SDK would require rewriting the existing BigQuery SQL preprocessing logic using `tf.Transform` or similar TFX components, which works against the goal of minimizing development and training time.

