Google Professional-Data-Engineer - Google Professional Data Engineer Exam
Total 400 questions
You create an important report for your large team in Google Data Studio 360. The report uses Google BigQuery as its data source. You notice that visualizations are not showing data that is less than 1 hour old. What should you do?
You designed a database for patient records as a pilot project to cover a few hundred patients in three clinics. Your design used a single database table to represent all patients and their visits, and you used self-joins to generate reports. The server resource utilization was at 50%. Since then, the scope of the project has expanded. The database must now store 100 times more patientrecords. You can no longer run the reports, because they either take too long or they encounter errors with insufficient compute resources. How should you adjust the database design?
What is the general recommendation when designing your row keys for a Cloud Bigtable schema?
When a Cloud Bigtable node fails, ____ is lost.
Which of these is NOT a way to customize the software on Dataproc cluster instances?
For the best possible performance, what is the recommended zone for your Compute Engine instance and Cloud Bigtable instance?
What are the minimum permissions needed for a service account used with Google Dataproc?
What Dataflow concept determines when a Window's contents should be output based on certain criteria being met?
When you design a Google Cloud Bigtable schema it is recommended that you _________.
Which is the preferred method to use to avoid hotspotting in time series data in Bigtable?
