Google Professional-Data-Engineer - Google Professional Data Engineer Exam

Total 387 questions

Question # 61

What are two methods that can be used to denormalize tables in BigQuery?

1) Split table into multiple tables; 2) Use a partitioned table

1) Join tables into one table; 2) Use nested repeated fields

1) Use a partitioned table; 2) Join tables into one table

1) Use nested repeated fields; 2) Use a partitioned table
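As a rough illustration of what "nested repeated fields" means in the options above, the sketch below folds a child table into its parent so that each parent record carries a list of child records. It is plain Python, not BigQuery code, and the `orders` and `line_items` data and the `nest_line_items` helper are hypothetical names invented for this example.

```python
from collections import defaultdict

# Hypothetical normalized data: two "tables" represented as lists of dicts.
orders = [
    {"order_id": 1, "customer": "alice"},
    {"order_id": 2, "customer": "bob"},
]
line_items = [
    {"order_id": 1, "sku": "A-100", "qty": 2},
    {"order_id": 1, "sku": "B-200", "qty": 1},
    {"order_id": 2, "sku": "A-100", "qty": 5},
]


def nest_line_items(orders, line_items):
    """Return one record per order, with its line items as a repeated field."""
    items_by_order = defaultdict(list)
    for item in line_items:
        # Drop the join key from the nested record; the parent already has it.
        items_by_order[item["order_id"]].append(
            {"sku": item["sku"], "qty": item["qty"]}
        )
    return [{**order, "items": items_by_order[order["order_id"]]} for order in orders]


denormalized = nest_line_items(orders, line_items)
for record in denormalized:
    print(record)
```

Each output record is self-contained, so reading an order and its items no longer needs a join at query time.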

Question # 62

Which is the preferred method for avoiding hotspotting in time series data in Bigtable?

Field promotion

Randomization

Salting

Hashing
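To make the "field promotion" option more concrete: the term refers to moving a field out of the column data and into the row key, so that the key no longer begins with a timestamp. The sketch below is plain Python that only models sorted string keys; it does not call the Bigtable client, and `promoted_row_key`, `scan_prefix`, and the device IDs are hypothetical.

```python
def promoted_row_key(device_id: str, timestamp: int) -> str:
    """Build a row key with the device ID promoted in front of the timestamp."""
    # Zero-pad the timestamp so lexicographic order matches numeric order.
    return f"{device_id}#{timestamp:010d}"


def scan_prefix(rows: dict, prefix: str) -> list:
    """Return the sorted keys that start with the given prefix."""
    return [key for key in sorted(rows) if key.startswith(prefix)]


# Hypothetical readings from two devices at three consecutive timestamps.
rows = {}
for ts in (1000, 1001, 1002):
    for device in ("sensor-a", "sensor-b"):
        rows[promoted_row_key(device, ts)] = {"value": ts % 7}

print(scan_prefix(rows, "sensor-a#"))
```

With the device ID first, writes arriving at the same moment are spread across different parts of the key space, and all readings for one device remain contiguous for range scans.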

Question # 63

Which of these statements about exporting data from BigQuery is false?

To export more than 1 GB of data, you need to put a wildcard in the destination filename.

The only supported export destination is Google Cloud Storage.

Data can only be exported in JSON or Avro format.

The only compression option available is GZIP.

Question # 64

From which of these sources can you not load data into BigQuery?

File upload

Google Drive

Google Cloud Storage

Google Cloud SQL

Question # 65

Which of the following is NOT one of the three main types of triggers that Dataflow supports?

Trigger based on element size in bytes

Trigger that is a combination of other triggers

Trigger based on element count

Trigger based on time

Question # 66

Which row keys are likely to cause a disproportionate number of reads and/or writes on a particular node in a Bigtable cluster (select 2 answers)?

A sequential numeric ID

A timestamp followed by a stock symbol

A non-sequential numeric ID

A stock symbol followed by a timestamp
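Because Bigtable stores rows sorted lexicographically by row key, the order of the key components determines where new writes land. The following sketch, under the simplifying assumption that a sorted Python list stands in for the key space, measures how far into that list a new key would be inserted. The symbols, timestamps, and helper names are hypothetical.

```python
import bisect

SYMBOLS = ["AAPL", "GOOG", "MSFT"]
EXISTING_TIMESTAMPS = range(1000, 1010)


def timestamp_first(symbol: str, ts: int) -> str:
    return f"{ts:010d}#{symbol}"


def symbol_first(symbol: str, ts: int) -> str:
    return f"{symbol}#{ts:010d}"


def insert_positions(make_key, new_ts: int) -> list:
    """Relative position (0.0 to 1.0) in the sorted key space of each new key."""
    keys = sorted(make_key(s, t) for s in SYMBOLS for t in EXISTING_TIMESTAMPS)
    return [bisect.bisect_left(keys, make_key(s, new_ts)) / len(keys) for s in SYMBOLS]


# Every new timestamp-first key lands at the very end of the key space.
print(insert_positions(timestamp_first, 1010))
# Symbol-first keys land at the end of each symbol's own range instead.
print(insert_positions(symbol_first, 1010))
```

In this toy model the timestamp-first keys all pile up at the tail of the key space, which is the pattern that concentrates load on a single node, while the symbol-first keys are spread across it.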

Question # 67

Why do you need to split a machine learning dataset into training data and test data?

So you can try two different sets of features

To make sure your model is generalized for more than just the training data

To allow you to create unit tests in your code

So you can use one dataset for a wide model and one for a deep model
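For readers who have not split a dataset before, here is a minimal sketch of a train/test split using only the Python standard library. The `train_test_split` function below is a hypothetical helper written for this example (it is not the scikit-learn function of the same name), and the 80/20 ratio and fixed seed are assumptions.

```python
import random


def train_test_split(examples, test_fraction=0.2, seed=0):
    """Shuffle the examples and hold out a fraction of them for testing."""
    shuffled = list(examples)
    # A seeded generator keeps the split reproducible between runs.
    random.Random(seed).shuffle(shuffled)
    n_test = int(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]


train, test = train_test_split(range(100))
print(len(train), len(test))
```

The held-out examples are never seen during training, so performance on them gives an estimate of how the model behaves on data beyond the training set.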

Question # 68

To run a TensorFlow training job on your own computer using Cloud Machine Learning Engine, what would your command start with?

gcloud ml-engine local train

gcloud ml-engine jobs submit training

gcloud ml-engine jobs submit training local

You can't run a TensorFlow program on your own computer using Cloud ML Engine.

Question # 69

Google Cloud Bigtable indexes a single value in each row. This value is called the _______.

primary key

unique key

row key

master key

Question # 70

Which of the following is NOT true about Dataflow pipelines?

Dataflow pipelines are tied to Dataflow, and cannot be run on any other runner

Dataflow pipelines can consume data from other Google Cloud services

Dataflow pipelines can be programmed in Java

Dataflow pipelines use a unified programming model, so they can work with both streaming and batch data sources
