Pre-Summer Sale Special Limited Time 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: xmas50

NVIDIA NCP-AAI - NVIDIA Agentic AI

Page: 2 / 4
Total 121 questions

An AI Engineer is analyzing a production agentic AI system’s compliance with responsible AI standards.

Which evaluation approaches effectively identify potential safety vulnerabilities and ethical risks in multi-agent workflows? (Choose two.)

A.

Emphasize latency metrics and throughput performance as key evaluation factors for safety vulnerabilities, providing a baseline for operational measures and resource allocation.

B.

Implement comprehensive audit trails using NVIDIA NeMo Guardrails with semantic similarity checks, tracking agent decisions across conversation flows and evaluating policy violations through automated compliance scoring.

C.

Use user feedback as a primary signal for risk identification, emphasizing post-deployment observations and qualitative experience reports alongside operational monitoring.

D.

Deploy multi-layered evaluation combining bias detection metrics (demographic parity, equalized odds) with adversarial testing to probe agent responses for harmful outputs across diverse user populations

You’re employing an LLM to automate the generation of email responses for a customer service team. The generated responses frequently miss the mark, failing to address the customer’s underlying concerns.

What’s the most crucial element to add to the prompt to enhance the quality of the email responses?

A.

Instructing the LLM with a detailed prompt containing instructions on how to format and compose the response in an easy-to-understand structure.

B.

Instructing the LLM to use a simple template for all email replies before generating a response.

C.

Instructing the LLM to “understand the customer’s issue” before generating a response.

D.

Instructing the LLM to provide a response that “is the most helpful” before generating a response.

Which two validation approaches are MOST critical for ensuring agent reliability in production deployments? (Choose two.)

A.

User satisfaction surveys as the primary quality metric

B.

Performance testing during development phases

C.

Structured output validation with Pydantic schemas

D.

Random sampling of agent interactions for manual review

E.

Automated consistency checking across multiple agent runs

Which memory architecture is most appropriate for an agent that must track conversation flow and remember user preferences across multiple interactions?

A.

Implement shared memory using NVSHMEM for short- and long-term context

B.

Single unified memory store with time-based expiration policies

C.

Hierarchical memory with separate short-term and long-term layers

D.

Distributed memory with full replication across all nodes

A large enterprise is preparing to roll out its AI-powered customer support agents worldwide. To maintain high availability and reliability, the operations team must select the best approach for monitoring, updating, and managing all agent instances across different locations.

Which solution most effectively ensures reliable operation and simplified management of large-scale agent deployments?

A.

Establishing centralized monitoring and automated deployment pipelines to oversee agent health, trigger updates, and manage rollbacks across all environments

B.

Allocating a dedicated support team to monitor agent logs and perform manual restarts to ensure human interaction in the data flywheel

C.

Scheduling updates and health checks on an annual basis to minimize service disruptions and ensure agent health, trigger updates, and manage rollbacks across all environments

D.

Provide separate monitoring tools and manual updates at each regional deployment for greater local control of agent health, trigger updates, and manage rollbacks across all environments

A Lead AI Architect at a global financial institution is designing a multi-agent fraud detection system using an agentic AI framework. The system must operate in real time, with distinct agents working collaboratively to monitor and analyze transactional patterns across accounts, retain and share contextual information over time, and escalate suspicious behaviors to a human fraud analyst when needed.

Which architectural approach enables intelligent specialization, shared memory, and inter-agent coordination in a dynamic and evolving threat environment?

A.

Design a modular multi-agent system where individual agents collaborate asynchronously using shared memory and structured messaging.

B.

Design a multi-agent system where individual agents collaborate synchronously using shared memory and structured messaging.

C.

Design a centralized rule-based service that checks all transactions against static fraud indicators and sends alerts when thresholds are exceeded.

D.

Design an agentic workflow where each agent acts independently on isolated data slices with no inter-agent communication to reduce latency and model complexity.

E.

Design monolithic LLM-based agents that handle all fraud detection tasks within a single loop, without modular roles or multi-agent coordination.

An AI architect at a national healthcare provider is maintaining an agentic AI system. The system must monitor model and system performance in real time, raise alerts on failures or anomalies, manage version control and rollback of diagnostic models, and provide transparent insight into agent behavior during patient care workflows.

Which operational approach best supports these requirements using the NVIDIA AI stack?

A.

Containerize each agent in NIM with basic health checks running on cron jobs, and manage version rollback by swapping prebuilt container images.

B.

Optimize all models with TensorRT and use periodic manual log reviews and NVIDIA shell scripts for detecting service anomalies and managing rollback.

C.

Deploy agent models on NVIDIA Triton Inference Server with Prometheus and Grafana for performance alerting, and manage model lifecycle via NGC and the Triton model repository.

D.

Expose agents as stateless NVIDIA API endpoints and monitor activity through application logs, with model versions tracked in a Git-based script repository.

You’re utilizing an LLM to translate complex technical documentation into multiple languages. The translations often lack nuance and fail to capture the original intent.

What’s the most effective strategy for improving the quality of the translations?

A.

Providing the LLM with a glossary of key terms, concepts in all languages and the dataset of previously translated text.

B.

Training the LLM on a dataset of translated texts.

C.

Providing the LLM with guidance to “translate the documents” without additional guidance, so it can use trained knowledge.

D.

Providing the LLM with guidance to translate “with high accuracy” without additional guidance, so it can use trained knowledge.

What is RAG Fusion primarily designed to achieve?

A.

Creating a separate, dedicated database for storing all the retrieved chunks.

B.

Minimizing the need for retrieval, allowing the LLM to generate responses directly from its internal knowledge.

C.

Blending information from multiple retrieved chunks into a single response generated by the LLM.

D.

Automatically translating and integrating all retrieved chunks into a single language.

An AI Engineer at a retail company is developing a customer support AI agent that needs to handle multi-turn conversations while keeping track of customers’ previous queries, preferences, and unresolved issues across multiple sessions.

Which approach is most effective for managing context retention and enabling the agent to respond coherently in real time?

A.

Use a sliding window of recent conversation tokens in memory to track only the last few exchanges.

B.

Retrain the model periodically using historical logs to improve long-term contextual understanding.

C.

Implement a hybrid memory system with vector-based search and key-value storage to retrieve relevant past interactions.

D.

Increase the maximum context window size so the full conversation history is processed each time.