A finance company is developing an AI assistant to help clients plan investments and manage their portfolios. The company identifies several high-risk conversation patterns, such as requests for specific stock recommendations or guaranteed returns, that could lead to regulatory violations if the company cannot implement appropriate controls.
The company must ensure that the AI assistant does not provide inappropriate financial advice, generate content about competitors, or make claims that are not factually grounded in the company's approved financial guidance. The company wants to use Amazon Bedrock Guardrails to implement a solution.
Which combination of steps will meet these requirements? (Choose three.)
A. Add the high-risk conversation patterns to a denied topics guardrail.
B. Configure a content filter guardrail to filter prompts that contain the high-risk conversation patterns.
C. Configure a content filter guardrail to filter prompts that contain competitor names.
D. Add the names of competitors as custom word filters. Set the input and output actions to block.
E. Set a low grounding score threshold.
F. Set a high grounding score threshold.
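The three controls discussed above can be combined in a single guardrail definition. The following is a minimal sketch of such a configuration for the Bedrock `CreateGuardrail` API; the topic definition, competitor name, and threshold value are placeholders chosen for illustration:

```python
import json

# Sketch of a guardrail covering a denied topic for high-risk conversation
# patterns, an exact-match word filter for a competitor name, and a high
# contextual-grounding threshold. All names and values are placeholders.
def build_guardrail_config():
    return {
        "name": "finance-assistant-guardrail",
        "topicPolicyConfig": {
            "topicsConfig": [
                {
                    "name": "SpecificInvestmentAdvice",
                    "definition": (
                        "Requests for specific stock recommendations "
                        "or promises of guaranteed returns."
                    ),
                    "type": "DENY",
                }
            ]
        },
        "wordPolicyConfig": {
            # Custom word filters block matches on both input and output.
            "wordsConfig": [{"text": "ExampleCompetitor"}]
        },
        "contextualGroundingPolicyConfig": {
            # A high threshold (close to 1.0) blocks responses that are not
            # well grounded in the approved source content.
            "filtersConfig": [{"type": "GROUNDING", "threshold": 0.85}]
        },
        "blockedInputMessaging": "This request cannot be processed.",
        "blockedOutputsMessaging": "This response was blocked by policy.",
    }

def create_guardrail(config):
    # Requires AWS credentials; shown for illustration only.
    import boto3
    return boto3.client("bedrock").create_guardrail(**config)

config = build_guardrail_config()
print(json.dumps(config["contextualGroundingPolicyConfig"], indent=2))
```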
A healthcare company is using Amazon Bedrock to build a Retrieval Augmented Generation (RAG) application that helps practitioners make clinical decisions. The application must achieve high accuracy for patient information retrievals, identify hallucinations in generated content, and reduce human review costs.
Which solution will meet these requirements?
A. Use Amazon Comprehend to analyze and classify RAG responses and to extract medical entities and relationships. Use AWS Step Functions to orchestrate automated evaluations. Configure Amazon CloudWatch metrics to track entity recognition confidence scores. Configure CloudWatch to send an alert when accuracy falls below specified thresholds.
B. Implement automated large language model (LLM)-based evaluations that use a specialized model that is fine-tuned for medical content to assess all responses. Deploy AWS Lambda functions to parallelize evaluations. Publish results to Amazon CloudWatch metrics that track relevance and factual accuracy.
C. Configure Amazon CloudWatch Synthetics to generate test queries that have known answers on a regular schedule, and track model success rates. Set up dashboards that compare synthetic test results against expected outcomes.
D. Deploy a hybrid evaluation system that uses an automated LLM-as-a-judge evaluation to initially screen responses and targeted human reviews for edge cases. Use Amazon SageMaker Feature Store to maintain evaluation datasets. Use a built-in Amazon Bedrock evaluation to track retrieval precision and hallucination rates.
A company is developing a customer support application that uses Amazon Bedrock foundation models (FMs) to provide real-time AI assistance to the company's employees. The application must display AI-generated responses character by character as the responses are generated. The application needs to support thousands of concurrent users with minimal latency. The responses typically take 15 to 45 seconds to finish.
Which solution will meet these requirements?
A. Configure an Amazon API Gateway WebSocket API with an AWS Lambda integration. Configure the WebSocket API to invoke the Amazon Bedrock InvokeModelWithResponseStream API and stream partial responses through WebSocket connections.
B. Configure an Amazon API Gateway REST API with an AWS Lambda integration. Configure the REST API to invoke the Amazon Bedrock standard InvokeModel API and implement frontend client-side polling every 100 ms for complete response chunks.
C. Implement direct frontend client connections to Amazon Bedrock by using IAM user credentials and the InvokeModelWithResponseStream API without any intermediate gateway or proxy layer.
D. Configure an Amazon API Gateway HTTP API with an AWS Lambda integration. Configure the HTTP API to cache complete responses in an Amazon DynamoDB table and serve the responses through multiple paginated GET requests to frontend clients.
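The streaming pattern in option A can be sketched as follows. This is an illustrative server-side loop, assuming the WebSocket integration passes a connection ID to the Lambda handler; the event shape follows the Bedrock runtime streaming format, and `post_to_connection` is the API Gateway Management API call used to push data to a WebSocket client:

```python
import json

# Minimal sketch: read events from InvokeModelWithResponseStream and relay
# each partial completion over the caller's WebSocket connection.
def decode_chunk(event):
    # Each stream event carries a JSON payload under chunk.bytes.
    return json.loads(event["chunk"]["bytes"])

def stream_to_websocket(bedrock_runtime, apigw_management, connection_id,
                        model_id, payload):
    # Requires AWS credentials; illustration only.
    response = bedrock_runtime.invoke_model_with_response_stream(
        modelId=model_id, body=json.dumps(payload))
    for event in response["body"]:
        partial = decode_chunk(event)
        apigw_management.post_to_connection(
            ConnectionId=connection_id,
            Data=json.dumps(partial).encode("utf-8"))

# Local demonstration of the chunk decoding with a synthetic event:
sample = {"chunk": {"bytes": json.dumps({"completion": "Hel"}).encode()}}
print(decode_chunk(sample))
```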
A media company must use Amazon Bedrock to implement a robust governance process for AI-generated content. The company needs to manage hundreds of prompt templates. Multiple teams use the templates across multiple AWS Regions to generate content. The solution must provide version control with approval workflows that include notifications for pending reviews. The solution must also provide detailed audit trails that document prompt activities and consistent prompt parameterization to enforce quality standards.
Which solution will meet these requirements?
A. Configure Amazon Bedrock Studio prompt templates. Use Amazon CloudWatch to create dashboards that display prompt usage metrics. Store the approval status of content in Amazon DynamoDB. Use AWS Lambda functions to enforce approvals.
B. Use Amazon Bedrock Prompt Management to implement version control. Configure AWS CloudTrail for audit logging. Use IAM policies to control approval permissions. Create parameterized prompt templates by specifying variables.
C. Use AWS Step Functions to create an approval workflow. Store prompts as documents in Amazon S3. Use tags to implement version control. Use Amazon EventBridge to send notifications.
D. Deploy Amazon SageMaker Canvas with prompt templates that are stored in Amazon S3. Use AWS CloudFormation to implement version control. Use AWS Config to enforce approval policies.
A company is using Amazon Bedrock to design an application to help researchers apply for grants. The application is based on an Amazon Nova Pro foundation model (FM). The application contains four required inputs and must provide responses in a consistent text format. The company wants to receive a notification in Amazon Bedrock if a response contains bullying language. However, the company does not want to block all flagged responses.
The company creates an Amazon Bedrock flow that takes an input prompt and sends it to the Amazon Nova Pro FM. The Amazon Nova Pro FM provides a response.
Which additional steps must the company take to meet these requirements? (Choose two.)
A. Use Amazon Bedrock Prompt Management to specify the required inputs as variables. Select an Amazon Nova Pro FM. Specify the output format for the response. Add the prompt to the prompts node of the flow.
B. Create an Amazon Bedrock guardrail that applies the hate content filter. Set the filter response to block. Add the guardrail to the prompts node of the flow.
C. Create an Amazon Bedrock prompt router. Specify an Amazon Nova Pro FM. Add the required inputs as variables to the input node of the flow. Add the prompt router to the prompts node. Add the output format to the output node.
D. Create an Amazon Bedrock guardrail that applies the insults content filter. Set the filter response to detect. Add the guardrail to the prompts node of the flow.
E. Create an Amazon Bedrock application inference profile that specifies an Amazon Nova Pro FM. Specify the output format for the response in the description. Include a tag for each of the input variables. Add the profile to the prompts node of the flow.
A company is developing an internal generative AI (GenAI) assistant that uses Amazon Bedrock to summarize corporate documents for multiple business units. The GenAI assistant must generate responses in a consistent format that includes a document summary, classification of business risks, and terms that are flagged for review. The GenAI assistant must adapt the tone of responses for each user's business unit, such as legal, human resources, or finance. The GenAI assistant must block hate speech, inappropriate topics, and sensitive information such as personal health information.
The company needs a solution to centrally manage prompt variants across business units and teams. The company wants to minimize ongoing orchestration efforts and maintenance for post-processing logic. The company also wants to have the ability to adjust content moderation criteria for the GenAI assistant over time.
Which solution will meet these requirements with the LEAST maintenance overhead?
A. Use Amazon Bedrock Prompt Management to configure reusable templates and business unit-specific prompt variants. Apply Amazon Bedrock guardrails that have category filters and sensitive term lists to block prohibited content.
B. Use Amazon Bedrock Prompt Management to define base templates. Enforce business unit-specific tone by using system prompt variables. Configure Amazon Bedrock guardrails to apply audience-based threshold tuning. Manage the guardrails by using an internal administration API.
C. Use Amazon Bedrock with business unit-based instruction injection in API calls. Store response formatting rules in Amazon DynamoDB. Use AWS Step Functions to validate responses. Use Amazon Comprehend to apply content filters after the GenAI assistant generates responses.
D. Use Amazon Bedrock with custom prompt templates that are stored in Amazon DynamoDB. Create one AWS Lambda function to select business unit-specific prompts. Create a second Lambda function to call Amazon Comprehend to filter prohibited content from responses.
A financial services company is building a customer support application that retrieves relevant financial regulation documents from a database based on semantic similarities to user queries. The application must integrate with Amazon Bedrock to generate responses. The application must be able to search documents that are in English, Spanish, and Portuguese. The application must filter documents by metadata such as publication date, regulatory agency, and document type.
The database stores approximately 10 million document embeddings. The company wants a solution that minimizes management and maintenance effort. The application must provide low-latency responses for real-time customer interactions.
Which solution will meet these requirements?
A. Use Amazon OpenSearch Serverless to provide vector search capabilities and metadata filtering. Connect to Amazon Bedrock Knowledge Bases to enable Retrieval Augmented Generation (RAG) capabilities that use an Anthropic Claude foundation model (FM).
B. Deploy an Amazon Aurora PostgreSQL database with the pgvector extension. Define tables to store embeddings and metadata. Use SQL queries to perform similarity searches. Send retrieved documents to Amazon Bedrock to generate responses.
C. Use Amazon S3 Vectors to configure a vector index and non-filterable metadata fields. Integrate S3 Vectors with Amazon Bedrock to enable Retrieval Augmented Generation (RAG) capabilities.
D. Set up an Amazon Neptune Analytics graph database. Configure a vector index that has appropriate dimensionality to store document embeddings. Use Amazon Bedrock to perform graph-based retrieval and to generate responses.
A medical company is building a generative AI (GenAI) application that uses Retrieval Augmented Generation (RAG) to provide evidence-based medical information. The application uses Amazon OpenSearch Service to retrieve vector embeddings. Users report that searches frequently miss results that contain exact medical terms and acronyms and return too many semantically similar but irrelevant documents. The company needs to improve retrieval quality and maintain low end-user latency, even as the document collection grows to millions of documents.
Which solution will meet these requirements with the LEAST operational overhead?
A. Configure hybrid search by combining vector similarity with keyword matching to improve semantic understanding and exact term and acronym matching.
B. Increase the dimensions of the vector embeddings from 384 to 1536. Use a post-processing AWS Lambda function to filter out irrelevant results after retrieval.
C. Replace OpenSearch Service with Amazon Kendra. Use query expansion to handle medical acronyms and terminology variants during pre-processing.
D. Implement a two-stage retrieval architecture in which initial vector search results are re-ranked by an ML model that is hosted on Amazon SageMaker AI.
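The hybrid approach in option A can be sketched as an OpenSearch query body that combines lexical (BM25) matching with k-NN vector similarity. The field names (`text`, `embedding`) are placeholders, and OpenSearch hybrid queries additionally require a search pipeline with a normalization-processor to combine the two score ranges:

```python
# Sketch of a hybrid query body: lexical matching catches exact medical
# terms and acronyms, while k-NN covers semantic similarity. Field names
# are assumptions for illustration.
def build_hybrid_query(query_text, query_vector, k=10):
    return {
        "query": {
            "hybrid": {
                "queries": [
                    # Exact term / acronym matching, e.g. "MI" or "CABG".
                    {"match": {"text": {"query": query_text}}},
                    # Semantic similarity over the embedding field.
                    {"knn": {"embedding": {"vector": query_vector, "k": k}}},
                ]
            }
        }
    }

body = build_hybrid_query("acute MI treatment", [0.1, 0.2, 0.3])
print(len(body["query"]["hybrid"]["queries"]))
```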
A company runs a generative AI (GenAI)-powered summarization application in an application AWS account that uses Amazon Bedrock. The application architecture includes an Amazon API Gateway REST API that forwards requests to AWS Lambda functions that are attached to private VPC subnets. The application summarizes sensitive customer records that the company stores in a governed data lake in a centralized data storage account. The company has enabled Amazon S3, Amazon Athena, and AWS Glue in the data storage account.
The company must ensure that calls that the application makes to Amazon Bedrock use only private connectivity between the company's application VPC and Amazon Bedrock. The company's data lake must provide fine-grained column-level access across the company's AWS accounts.
Which solution will meet these requirements?
A. In the application account, create interface VPC endpoints for Amazon Bedrock runtimes. Run Lambda functions in private subnets. Use IAM conditions on inference and data-plane policies to allow calls only to approved endpoints and roles. In the data storage account, use AWS Lake Formation LF-tag-based access control to create table and column-level cross-account grants.
B. Run Lambda functions in private subnets. Configure a NAT gateway to provide access to Amazon Bedrock and the data lake. Use S3 bucket policies and ACLs to manage permissions. Export AWS CloudTrail logs to Amazon S3 to perform weekly reviews.
C. Create a gateway endpoint only for Amazon S3 in the application account. Invoke Amazon Bedrock through public endpoints. Use database-level grants in AWS Lake Formation to manage data access. Stream AWS CloudTrail logs to Amazon CloudWatch Logs. Do not set up metric filters or alarms.
D. Use VPC endpoints to provide access to Amazon Bedrock and Amazon S3 in the application account. Use only IAM path-based policies to manage data lake access. Send AWS CloudTrail logs to Amazon CloudWatch Logs. Periodically create dashboards and allow public fallback for cross-Region reads to reduce setup time.
A retail company has a generative AI (GenAI) product recommendation application that uses Amazon Bedrock. The application suggests products to customers based on browsing history and demographics. The company needs to implement fairness evaluation across multiple demographic groups to detect and measure bias in recommendations between two prompt approaches. The company wants to collect and monitor fairness metrics in real time. The company must receive an alert if the fairness metrics show a discrepancy of more than 15% between demographic groups. The company must receive weekly reports that compare the performance of the two prompt approaches.
Which solution will meet these requirements with the LEAST custom development effort?
A. Configure an Amazon CloudWatch dashboard to display default metrics from Amazon Bedrock API calls. Create custom metrics based on model outputs. Set up Amazon EventBridge rules to invoke AWS Lambda functions that perform post-processing analysis on model responses and publish custom fairness metrics.
B. Create the two prompt variants in Amazon Bedrock Prompt Management. Use Amazon Bedrock Flows to deploy the prompt variants with defined traffic allocation. Configure Amazon Bedrock guardrails that have content filters to monitor demographic fairness. Set up Amazon CloudWatch alarms on the GuardrailContentSource dimension that use InvocationsIntervened metrics to detect recommendation discrepancy threshold violations.
C. Set up Amazon SageMaker Clarify to analyze model outputs. Publish fairness metrics to Amazon CloudWatch. Create CloudWatch composite alarms that combine SageMaker Clarify bias metrics with Amazon Bedrock latency metrics to provide a comprehensive fairness evaluation dashboard.
D. Create an Amazon Bedrock model evaluation job to compare fairness between the two prompt variants. Enable model invocation logging in Amazon CloudWatch. Set up CloudWatch alarms for InvocationsIntervened metrics with a dimension for each demographic group.
A company has deployed an AI assistant as a React application that uses AWS Amplify, an AWS AppSync GraphQL API, and Amazon Bedrock Knowledge Bases. The application uses the GraphQL API to call the Amazon Bedrock RetrieveAndGenerate API for knowledge base interactions. The company configures an AWS Lambda resolver to use the RequestResponse invocation type.
Application users report frequent timeouts and slow response times. Users report these problems more frequently for complex questions that require longer processing.
The company needs a solution to fix these performance issues and enhance the user experience.
Which solution will meet these requirements?
A. Use AWS Amplify AI Kit to implement streaming responses from the GraphQL API and to optimize client-side rendering.
B. Increase the timeout value of the Lambda resolver. Implement retry logic with exponential backoff.
C. Update the application to send an API request to an Amazon SQS queue. Update the AWS AppSync resolver to poll and process the queue.
D. Change the RetrieveAndGenerate API to the InvokeModelWithResponseStream API. Update the application to use an Amazon API Gateway WebSocket API to support the streaming response.
An ecommerce company operates a global product recommendation system that needs to switch between multiple foundation models (FMs) in Amazon Bedrock based on regulations, cost optimization, and performance requirements. The company must apply custom controls based on proprietary business logic, including dynamic cost thresholds, AWS Region-specific compliance rules, and real-time A/B testing across multiple FMs. The system must be able to switch between FMs without deploying new code. The system must route user requests based on complex rules including user tier, transaction value, regulatory zone, and real-time cost metrics that change hourly and require immediate propagation across thousands of concurrent requests.
Which solution will meet these requirements?
A. Deploy an AWS Lambda function that uses environment variables to store routing rules and Amazon Bedrock FM IDs. Use the Lambda console to update the environment variables when business requirements change. Configure an Amazon API Gateway REST API to read request parameters to make routing decisions.
B. Deploy Amazon API Gateway REST API request transformation templates to implement routing logic based on request attributes. Store Amazon Bedrock FM endpoints as REST API stage variables. Update the variables when the system switches between models.
C. Configure an AWS Lambda function to fetch routing configurations from the AWS AppConfig Agent for each user request. Run business logic in the Lambda function to select the appropriate FM for each request. Expose the FM through a single Amazon API Gateway REST API endpoint.
D. Use AWS Lambda authorizers for an Amazon API Gateway REST API to evaluate routing rules that are stored in AWS AppConfig. Return authorization contexts based on business logic. Route requests to model-specific Lambda functions for each Amazon Bedrock FM.
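The pattern in option C can be sketched as follows. The AWS AppConfig Lambda extension serves cached configuration from a local endpoint on port 2772; the application, environment, and profile names below are placeholders, and the rule document shape is a hypothetical example:

```python
import json
from urllib.request import urlopen

# Sketch: fetch routing rules from the AppConfig Lambda extension on each
# invocation, then apply them to pick a model. Names are placeholders.
APPCONFIG_URL = ("http://localhost:2772/applications/recommender"
                 "/environments/prod/configurations/fm-routing")

def fetch_routing_config():
    # Only works inside Lambda with the AppConfig extension layer attached.
    with urlopen(APPCONFIG_URL) as resp:
        return json.loads(resp.read())

def select_model(config, user_tier, region):
    # Hypothetical rule shape: ordered match rules plus a default model ID.
    for rule in config.get("rules", []):
        if rule["tier"] == user_tier and rule["region"] == region:
            return rule["modelId"]
    return config["defaultModelId"]

# Local demonstration with an in-memory config document:
sample_config = {
    "rules": [{"tier": "premium", "region": "eu-west-1",
               "modelId": "anthropic.claude-3-sonnet-20240229-v1:0"}],
    "defaultModelId": "amazon.titan-text-express-v1",
}
print(select_model(sample_config, "premium", "eu-west-1"))
```

Because the extension polls AppConfig in the background, updated rules propagate to all concurrent invocations without a code deployment.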
A financial services company is developing a Retrieval Augmented Generation (RAG) application to help investment analysts query complex financial relationships across multiple investment vehicles, market sectors, and regulatory environments. The dataset contains highly interconnected entities that have multi-hop relationships. The analysts must be able to examine the relationships holistically to provide accurate investment guidance. The application must deliver comprehensive answers that capture indirect relationships between financial entities. The application must produce responses in less than 3 seconds.
Which solution will meet these requirements with the LEAST operational overhead?
A. Use Amazon Bedrock Knowledge Bases with Graph RAG and Amazon Neptune Analytics to store the financial data. Analyze the multi-hop relationships between entities and automatically identify related information across documents.
B. Use Amazon Bedrock Knowledge Bases and an Amazon OpenSearch Service vector store to implement custom relationship identification logic that uses AWS Lambda functions to query multiple vector embeddings in sequence.
C. Use an Amazon OpenSearch Serverless vector database with k-nearest neighbor (k-NN) searches. Implement manual relationship mapping in an application layer that runs in an Amazon EC2 Auto Scaling group.
D. Use Amazon DynamoDB to store financial data in a custom indexing system. Use an AWS Lambda function to query relevant records based on input questions. Use Amazon SageMaker AI to generate responses.
An elevator service company has developed an AI assistant application by using Amazon Bedrock. The application generates elevator maintenance recommendations to support the company's elevator technicians. The company uses Amazon Kinesis Data Streams to collect the elevator sensor data.
New regulatory rules require that a human technician must review all AI-generated recommendations. The company needs to establish human oversight workflows to review and approve AI recommendations. The company must store all human technician review decisions for audit purposes.
Which solution will meet these requirements?
A. Create a custom approval workflow by using AWS Lambda functions and Amazon SQS queues for human review of AI recommendations. Store all review decisions in Amazon DynamoDB for audit purposes.
B. Create an AWS Step Functions workflow that has a human approval step that uses the waitForTaskToken API to pause execution. After a human technician completes a review, use an AWS Lambda function to call the SendTaskSuccess API that has the approval decision. Store all review decisions in Amazon DynamoDB.
C. Create an AWS Glue workflow that has a human approval step. After the human technician review, integrate the application with an AWS Lambda function that calls the SendTaskSuccess API. Store all human technician review decisions in Amazon DynamoDB.
D. Configure Amazon EventBridge rules with custom event patterns to route AI recommendations to human technicians for review. Create AWS Glue jobs to process human technician approval queues. Use Amazon ElastiCache to cache all human technician review decisions.
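The callback step described in option B can be sketched as follows: the state machine pauses at a task state whose resource uses the `.waitForTaskToken` suffix, and a Lambda function reports the technician's decision back with `SendTaskSuccess`. The decision payload shape is an assumption for illustration:

```python
import json

# Sketch of the human-review callback for a Step Functions task token.
def build_decision_output(approved, technician_id, notes=""):
    # Hypothetical decision document stored with the execution history.
    return json.dumps({
        "approved": approved,
        "technicianId": technician_id,
        "notes": notes,
    })

def report_review_decision(task_token, approved, technician_id):
    # Requires AWS credentials; illustration only. The task token is the
    # one the paused state machine handed to the review step.
    import boto3
    sfn = boto3.client("stepfunctions")
    sfn.send_task_success(
        taskToken=task_token,
        output=build_decision_output(approved, technician_id))

output = build_decision_output(True, "tech-042")
print(output)
```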
A financial services company uses an AI application to process financial documents by using Amazon Bedrock. During business hours, the application handles approximately 10,000 requests each hour, which requires consistent throughput.
The company uses the CreateProvisionedModelThroughput API to purchase provisioned throughput. Amazon CloudWatch metrics show that the provisioned capacity is unused while on-demand requests are being throttled. The company finds the following code in the application:

    response = bedrock_runtime.invoke_model(
        modelId="anthropic.claude-v2",
        body=json.dumps(payload)
    )
The company needs the application to use the provisioned throughput and to resolve the throttling issues.
Which solution will meet these requirements?
A. Increase the number of model units (MUs) in the provisioned throughput configuration.
B. Replace the model ID parameter with the ARN of the provisioned model that the CreateProvisionedModelThroughput API returns.
C. Add exponential backoff retry logic to handle throttling exceptions during peak hours.
D. Modify the application to use the InvokeModelWithResponseStream API instead of the InvokeModel API.
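The fix described in option B can be sketched as follows: pass the provisioned model ARN returned by `CreateProvisionedModelThroughput` as the `modelId`, which routes requests to the purchased capacity instead of the on-demand pool. The ARN below is a placeholder:

```python
import json

# Placeholder ARN of the provisioned model resource.
PROVISIONED_MODEL_ARN = (
    "arn:aws:bedrock:us-east-1:111122223333:"
    "provisioned-model/abcd1234efgh")

def build_invoke_kwargs(payload):
    return {
        # The provisioned model ARN in modelId targets purchased capacity
        # rather than the shared on-demand pool.
        "modelId": PROVISIONED_MODEL_ARN,
        "body": json.dumps(payload),
    }

def invoke(payload):
    # Requires AWS credentials; illustration only.
    import boto3
    runtime = boto3.client("bedrock-runtime")
    return runtime.invoke_model(**build_invoke_kwargs(payload))

kwargs = build_invoke_kwargs({"prompt": "Summarize the attached filing."})
print(kwargs["modelId"])
```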
A financial services company uses multiple foundation models (FMs) through Amazon Bedrock for its generative AI (GenAI) applications. To comply with a new regulation for GenAI use with sensitive financial data, the company needs a token management solution.
The token management solution must proactively alert when applications approach model-specific token limits. The solution must also process more than 5,000 requests each minute and maintain token usage metrics to allocate costs across business units.
Which solution will meet these requirements?
A. Develop model-specific tokenizers in an AWS Lambda function. Configure the Lambda function to estimate token usage before sending requests to Amazon Bedrock. Configure the Lambda function to publish metrics to Amazon CloudWatch and trigger alarms when requests approach thresholds. Store detailed token usage in Amazon DynamoDB to report costs.
B. Implement Amazon Bedrock Guardrails with token quota policies. Capture metrics on rejected requests. Configure Amazon EventBridge rules to trigger notifications based on Amazon Bedrock Guardrails metrics. Use Amazon CloudWatch dashboards to visualize token usage trends across models.
C. Deploy an Amazon SQS dead-letter queue for failed requests. Configure an AWS Lambda function to analyze token-related failures. Use Amazon CloudWatch Logs Insights to generate reports on token usage patterns based on error logs from Amazon Bedrock API responses.
D. Use Amazon API Gateway to create a proxy for all Amazon Bedrock API calls. Configure request throttling based on custom usage plans with predefined token quotas. Configure API Gateway to reject requests that will exceed token limits.
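The metering side of option A can be sketched as follows: after each InvokeModel call, read the token counts that Bedrock returns in its response headers and publish them as CloudWatch custom metrics, dimensioned by business unit for cost allocation. The namespace and dimension names are assumptions:

```python
# Sketch of per-request token metering from Bedrock response headers.
def extract_token_counts(response_metadata):
    headers = response_metadata["HTTPHeaders"]
    return {
        "input": int(headers["x-amzn-bedrock-input-token-count"]),
        "output": int(headers["x-amzn-bedrock-output-token-count"]),
    }

def publish_usage(counts, business_unit, model_id):
    # Requires AWS credentials; illustration only. Namespace and dimension
    # names are hypothetical.
    import boto3
    boto3.client("cloudwatch").put_metric_data(
        Namespace="GenAI/TokenUsage",
        MetricData=[
            {
                "MetricName": name,
                "Value": value,
                "Unit": "Count",
                "Dimensions": [
                    {"Name": "BusinessUnit", "Value": business_unit},
                    {"Name": "ModelId", "Value": model_id},
                ],
            }
            for name, value in (("InputTokens", counts["input"]),
                                ("OutputTokens", counts["output"]))
        ])

# Local demonstration with synthetic response metadata:
meta = {"HTTPHeaders": {"x-amzn-bedrock-input-token-count": "128",
                        "x-amzn-bedrock-output-token-count": "512"}}
print(extract_token_counts(meta))
```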
A retail company is developing a customer service application that must process 10,000 daily queries about products, orders, and warranties. The application must be able to respond to queries about 50,000 product documents that are updated every day. The application must integrate with an order management API to check the status of orders and to help process returns. The application must maintain context throughout multi-turn interactions with customers. The company must collect complete audit trails for application responses.
Which solution will meet these requirements with the LEAST operational overhead?
A. Deploy a fine-tuned Amazon Bedrock Anthropic Claude model for each product category. Create AWS Lambda functions to connect each model to the order management API. Store conversation history in Amazon DynamoDB.
B. Create a custom model that uses continued pre-training on Amazon Bedrock to handle all product documentation. Set up an Amazon API Gateway REST API that uses AWS Lambda functions to connect the model to the order management API.
C. Use Amazon SageMaker AI with containers to deploy models. Use Amazon Kendra to search product documents. Use AWS Step Functions to orchestrate calls to the order management API.
D. Use an Amazon Bedrock agent with action groups to integrate with the order management API. Associate an Amazon Bedrock knowledge base with the agent to search product documentation by using Retrieval Augmented Generation (RAG). Enable trace events to capture audit trails.
A company provides a service that helps users from around the world discover new restaurants. The service has 50 million monthly active users. The company wants to implement a semantic search solution across a database that contains 20 million restaurants and 200 million reviews. The company currently stores the data in a PostgreSQL database.
The solution must support complex natural language queries and return results for at least 95% of queries within 500 ms. The solution must maintain data freshness for restaurant details that update hourly. The solution must also scale cost-effectively during peak usage periods.
Which solution will meet these requirements with the LEAST development effort?
A. Migrate the restaurant data to Amazon OpenSearch Service. Implement keyword-based search rules that use custom analyzers and relevance tuning to find restaurants based on attributes such as cuisine type, feature, and location. Create Amazon API Gateway HTTP API endpoints to transform user queries into structured search parameters.
B. Migrate the restaurant data to Amazon OpenSearch Service. Use a foundation model (FM) in Amazon Bedrock to generate vector embeddings from restaurant descriptions, reviews, and menu items. When users submit natural language queries, convert the queries to embeddings by using the same FM. Perform k-nearest neighbors (k-NN) searches to find semantically similar results.
C. Keep the restaurant data in PostgreSQL and implement the pgvector extension. Use a foundation model (FM) in Amazon Bedrock to generate vector embeddings from restaurant data. Store the vector embeddings directly in PostgreSQL. Create an AWS Lambda function to convert natural language queries to vector representations by using the same FM. Configure the Lambda function to perform similarity searches within the database.
D. Migrate restaurant data to an Amazon Bedrock knowledge base by using a custom ingestion pipeline. Configure the knowledge base to automatically generate embeddings from restaurant information. Use the Amazon Bedrock Retrieve API with built-in vector search capabilities to query the knowledge base directly by using natural language input.
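The two calls behind option B can be sketched as follows: generate an embedding for the user's query with a Bedrock embedding model, then run a k-NN search against the OpenSearch index. The model ID, index name, and field names are assumptions for illustration:

```python
import json

# Sketch of query-time embedding generation plus k-NN retrieval.
def build_embedding_request(text):
    # Assumed embedding model; any Bedrock embedding model would work.
    return {"modelId": "amazon.titan-embed-text-v2:0",
            "body": json.dumps({"inputText": text})}

def build_knn_query(query_vector, k=10):
    return {"size": k,
            "query": {"knn": {"embedding": {"vector": query_vector,
                                            "k": k}}}}

def semantic_search(query_text, opensearch_client, bedrock_runtime):
    # Requires AWS credentials and an OpenSearch connection; illustration.
    req = build_embedding_request(query_text)
    resp = bedrock_runtime.invoke_model(**req)
    vector = json.loads(resp["body"].read())["embedding"]
    return opensearch_client.search(index="restaurants",
                                    body=build_knn_query(vector))

q = build_knn_query([0.12, -0.08, 0.33], k=5)
print(q["size"])
```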
A company uses Amazon Bedrock to generate technical content for customers. The company has recently experienced a surge in hallucinated outputs when the company's model generates summaries of long technical documents. The model outputs include inaccurate or fabricated details. The company's current solution uses a large foundation model (FM) with a basic one-shot prompt that includes the full document in a single input.
The company needs a solution that will reduce hallucinations and meet factual accuracy goals. The solution must process more than 1,000 documents each hour and deliver summaries within 3 seconds for each document.
Which combination of solutions will meet these requirements? (Choose two.)
A. Implement zero-shot chain-of-thought (CoT) instructions that require step-by-step reasoning with explicit fact verification before the model generates each summary.
B. Use Retrieval Augmented Generation (RAG) with an Amazon Bedrock knowledge base. Apply semantic chunking and tuned embeddings to ground summaries in source content.
C. Configure Amazon Bedrock guardrails to block any generated output that matches patterns that are associated with hallucinated content.
D. Increase the temperature parameter in Amazon Bedrock.
E. Prompt the Amazon Bedrock model to summarize each full document in one pass.
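The knowledge base grounding in option B can be sketched with the Bedrock `RetrieveAndGenerate` API, which retrieves relevant passages and generates the answer in one call. The knowledge base ID and model ARN below are placeholders:

```python
# Sketch of a grounded generation request against a Bedrock knowledge base.
def build_rag_request(question, kb_id, model_arn):
    return {
        "input": {"text": question},
        "retrieveAndGenerateConfiguration": {
            "type": "KNOWLEDGE_BASE",
            "knowledgeBaseConfiguration": {
                "knowledgeBaseId": kb_id,
                "modelArn": model_arn,
            },
        },
    }

def grounded_summary(question, kb_id, model_arn):
    # Requires AWS credentials; illustration only.
    import boto3
    client = boto3.client("bedrock-agent-runtime")
    resp = client.retrieve_and_generate(
        **build_rag_request(question, kb_id, model_arn))
    return resp["output"]["text"]

req = build_rag_request(
    "Summarize section 4 of the installation manual.",
    "KB123456",
    "arn:aws:bedrock:us-east-1::foundation-model/"
    "anthropic.claude-3-haiku-20240307-v1:0")
print(req["retrieveAndGenerateConfiguration"]["type"])
```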
A company is building a generative AI (GenAI) application that produces content based on a variety of internal and external data sources. The company wants to ensure that the generated output is fully traceable. The application must support data source registration and enable metadata tagging to attribute content to its original source. The application must also maintain audit logs of data access and usage throughout the pipeline.
Which solution will meet these requirements?
A. Use AWS Lake Formation to catalog data sources and control access. Apply metadata tags directly in Amazon S3. Use AWS CloudTrail to monitor API activity.
B. Use AWS Glue Data Catalog to register and tag data sources. Use Amazon CloudWatch Logs to monitor access patterns and application behavior.
C. Store data in Amazon S3 and use object tagging for attribution. Use AWS Glue Data Catalog to manage schema information. Use AWS CloudTrail to log access to S3 buckets.
D. Use AWS Glue Data Catalog to register all data sources. Apply metadata tags to attribute data sources. Use AWS CloudTrail to log access and activity across services.
A company is designing a canary deployment strategy for a payment processing API. The system must support automated gradual traffic shifting between multiple Amazon Bedrock models based on real-time inference metrics, historical traffic patterns, and service health. The solution must be able to gradually increase traffic to new model versions. The system must increase traffic if metrics remain healthy and decrease traffic if the performance degrades below acceptable thresholds.
The company needs to comprehensively monitor inference latency and error rates during the deployment phase. The company must also be able to halt deployments and revert to a previous model version without any manual intervention.
Which solution will meet these requirements?
A. Use Amazon Bedrock with provisioned throughput to host the versions of the model. Configure an Amazon EventBridge rule to invoke an AWS Step Functions workflow when a new model version is released. Configure the workflow to shift traffic in stages, wait for a specified time period, and invoke an AWS Lambda function to check Amazon CloudWatch performance metrics. Configure the workflow to increase traffic if the metrics meet thresholds and to trigger a traffic rollback if performance metrics fall below thresholds.
B. Use AWS Lambda functions to invoke various Amazon Bedrock model versions. Use an Amazon API Gateway HTTP API with stage variables and weighted routing to shift traffic gradually to new model versions. Use Amazon CloudWatch to monitor performance metrics. Use external logic to adjust traffic between model versions and to roll back if performance falls below thresholds.
C. Use Amazon SageMaker AI endpoint variants to represent multiple Amazon Bedrock model versions. Use variant weights to shift traffic. Use Amazon CloudWatch to monitor performance metrics. Use SageMaker Model Monitor to trigger AWS Lambda functions to roll back a model deployment if performance drops below a specified threshold. Configure an Amazon EventBridge rule to roll back model deployments if an anomaly is detected.
D. Use Amazon OpenSearch Service to track inference logs. Configure OpenSearch Service to invoke an AWS Systems Manager Automation runbook to update Amazon Bedrock model endpoints to shift traffic based on the inference logs.
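The staged traffic-shift decision described in the scenario can be sketched as pure logic that a Step Functions workflow would run (for example, inside a Lambda function) after each bake period. The thresholds and stage weights below are illustrative assumptions, not values from the scenario.

```python
ERROR_RATE_LIMIT = 0.01     # max acceptable error rate (illustrative)
P99_LATENCY_LIMIT_MS = 500  # max acceptable p99 latency in ms (illustrative)
STAGES = [10, 25, 50, 100]  # canary weight (%) at each deployment stage

def next_canary_weight(current_weight: int, error_rate: float, p99_ms: float):
    """Return (new_weight, rolled_back) for the new model version.

    Healthy metrics advance traffic to the next stage; degraded metrics
    revert all traffic to the previous model version with no manual step.
    """
    if error_rate > ERROR_RATE_LIMIT or p99_ms > P99_LATENCY_LIMIT_MS:
        return 0, True  # degrade detected: automatic rollback
    for stage in STAGES:
        if stage > current_weight:
            return stage, False  # healthy: shift more traffic to the canary
    return current_weight, False  # already serving 100% from the new version
```

In the workflow, the metric inputs would come from CloudWatch (for example, `GetMetricData` over the deployment window), and the returned weight would drive the routing update.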
An ecommerce company is developing a generative AI (GenAI) solution that uses Amazon Bedrock with Anthropic Claude to recommend products to customers. Customers report that some of the recommended products are not available for sale on the website or are not relevant to the customer. Customers also report that the solution takes a long time to generate some recommendations.
The company investigates the issues and finds that most interactions between customers and the product recommendation solution are unique. The company confirms that the solution recommends products that are not in the company's product catalog. The company must resolve these issues.
Which solution will meet these requirements?
A. Increase grounding within Amazon Bedrock Guardrails. Enable Automated Reasoning checks. Set up provisioned throughput.
B. Use prompt engineering to restrict the model responses to relevant products. Use streaming techniques such as the InvokeModelWithResponseStream action to reduce perceived latency for the customers.
C. Create an Amazon Bedrock knowledge base. Implement Retrieval Augmented Generation (RAG). Set the PerformanceConfigLatency parameter to optimized.
D. Store product catalog data in Amazon OpenSearch Service. Validate the model's product recommendations against the product catalog. Use Amazon DynamoDB to implement response caching.
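Grounding recommendations in a knowledge base, as in option C, can be sketched as a `RetrieveAndGenerate` request against the Bedrock Agents runtime. The knowledge base ID and model ARN below are hypothetical placeholders; the request-building logic is separated from the AWS call so it can be inspected on its own.

```python
def build_rag_request(question: str, kb_id: str, model_arn: str) -> dict:
    """Build a RetrieveAndGenerate request grounded in a knowledge base."""
    return {
        "input": {"text": question},
        "retrieveAndGenerateConfiguration": {
            "type": "KNOWLEDGE_BASE",
            "knowledgeBaseConfiguration": {
                "knowledgeBaseId": kb_id,
                "modelArn": model_arn,
            },
        },
    }

def recommend(question: str) -> str:
    import boto3  # imported here so the sketch runs without AWS access

    client = boto3.client("bedrock-agent-runtime")
    resp = client.retrieve_and_generate(**build_rag_request(
        question,
        "KB123EXAMPLE",  # placeholder knowledge base ID
        "arn:aws:bedrock:us-east-1::foundation-model/anthropic.claude-3-haiku-20240307-v1:0",
    ))
    return resp["output"]["text"]

req = build_rag_request("Which laptops are in stock?", "KB123EXAMPLE", "arn:aws:bedrock:...")
```

Because responses are generated from retrieved catalog passages rather than the model's parametric memory, recommendations for products outside the catalog become far less likely.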
A company is using Amazon Bedrock to build a customer-facing AI assistant to handle sensitive customer inquiries. The company must use defense-in-depth safety controls to block sophisticated prompt injection attacks. The company must keep audit logs of all safety interventions. The AI assistant must have cross-Region failover capabilities.
Which solution will meet these requirements?
A. Configure Amazon Bedrock guardrails to use content filters to protect against prompt injection attacks. Set the content filters to high. Use a guardrail profile to implement cross-Region guardrail inference. Use Amazon CloudWatch Logs with custom metrics to capture detailed guardrail intervention events.
B. Configure Amazon Bedrock guardrails to use content filters to protect against prompt injection attacks. Set the content filters to high. Use AWS WAF to block suspicious inputs. Use AWS CloudTrail to log API calls for audits.
C. Deploy Amazon Comprehend custom classification to detect prompt injection attacks. Use Amazon API Gateway to validate requests. Use Amazon CloudWatch Logs with custom metrics to capture detailed intervention events.
D. Configure Amazon Bedrock guardrails to use custom content filters to protect against harmful content. Set the content filters to high. Use word filters to protect against known attack patterns. Configure cross-Region guardrail replication to provide failover capabilities. Store logs in AWS CloudTrail for compliance auditing.
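Attaching a configured guardrail to a model invocation can be sketched as below. The guardrail ID, version, and prompt are hypothetical placeholders; the real call would be `boto3.client("bedrock-runtime").invoke_model(**kwargs)`, and Bedrock applies the guardrail's content filters to both the input and the generated output.

```python
import json

def build_invoke_kwargs(prompt: str, guardrail_id: str, guardrail_version: str) -> dict:
    """Build invoke_model keyword arguments with a guardrail attached."""
    return {
        "modelId": "anthropic.claude-3-haiku-20240307-v1:0",  # placeholder model
        # The guardrail identifier/version attach the safety controls to this call.
        "guardrailIdentifier": guardrail_id,
        "guardrailVersion": guardrail_version,
        "body": json.dumps({
            "anthropic_version": "bedrock-2023-05-31",
            "max_tokens": 512,
            "messages": [{"role": "user", "content": prompt}],
        }),
    }

kwargs = build_invoke_kwargs("How do I reset my password?", "gr-abc123", "1")
# Real call (requires AWS access):
# boto3.client("bedrock-runtime").invoke_model(**kwargs)
```

When a guardrail intervenes, the response indicates the intervention, which an application can emit as a custom CloudWatch metric to build the audit trail the scenario requires.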
A company is developing a generative AI (GenAI) application that analyzes customer service calls in real time and generates suggested responses for human customer service agents. The application must process 500,000 concurrent calls during peak hours with less than 200 ms end-to-end latency for each suggestion. The company uses existing architecture to transcribe customer call audio streams. The application must not exceed a predefined monthly compute budget and must maintain auto scaling capabilities.
Which solution will meet these requirements?
A. Deploy a large, complex reasoning model on Amazon Bedrock. Purchase provisioned throughput and optimize for batch processing.
B. Deploy a low-latency, real-time optimized model on Amazon Bedrock. Purchase provisioned throughput and set up automatic scaling policies.
C. Deploy a large language model (LLM) on an Amazon SageMaker AI real-time endpoint that uses dedicated GPU instances.
D. Deploy a mid-sized language model on an Amazon SageMaker AI serverless endpoint that is optimized for batch processing.
A company has a recommendation system. The system's applications run on Amazon EC2 instances. The applications make API calls to Amazon Bedrock foundation models (FMs) to analyze customer behavior and generate personalized product recommendations.
The system is experiencing intermittent issues. Some recommendations do not match customer preferences. The company needs an observability solution to monitor operational metrics and detect patterns of operational performance degradation compared to established baselines. The solution must also generate alerts with correlation data within 10 minutes when FM behavior deviates from expected patterns.
Which solution will meet these requirements?
A. Configure Amazon CloudWatch Container Insights for the application infrastructure. Set up CloudWatch alarms for latency thresholds. Add custom metrics for token counts by using the CloudWatch embedded metric format. Create CloudWatch dashboards to visualize the data.
B. Implement AWS X-Ray to trace requests through the application components. Enable CloudWatch Logs Insights for error pattern detection. Set up AWS CloudTrail to monitor all API calls to Amazon Bedrock. Create custom dashboards in Amazon QuickSight.
C. Enable Amazon CloudWatch Application Insights for the application resources. Create custom metrics for recommendation quality, token usage, and response latency by using the CloudWatch embedded metric format with dimensions for request types and user segments. Configure CloudWatch anomaly detection on the model metrics. Establish log pattern analysis by using CloudWatch Logs Insights.
D. Use Amazon OpenSearch Service with the Observability plugin. Ingest model metrics and logs by using Amazon Kinesis. Create custom Piped Processing Language (PPL) queries to analyze model behavior patterns. Establish operational dashboards to visualize anomalies in real time.
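The CloudWatch embedded metric format (EMF) mentioned in options A and C is a structured JSON log line that CloudWatch extracts metrics from automatically. A minimal sketch follows; the namespace, dimension, and metric names are illustrative, not part of the scenario.

```python
import json
import time

def emf_record(namespace: str, request_type: str, tokens: int, latency_ms: float) -> str:
    """Build one EMF log line carrying custom model metrics with a dimension."""
    return json.dumps({
        "_aws": {
            "Timestamp": int(time.time() * 1000),  # epoch milliseconds
            "CloudWatchMetrics": [{
                "Namespace": namespace,
                "Dimensions": [["RequestType"]],  # slice metrics by request type
                "Metrics": [
                    {"Name": "TokenCount", "Unit": "Count"},
                    {"Name": "ResponseLatency", "Unit": "Milliseconds"},
                ],
            }],
        },
        # Target members referenced by the metric/dimension definitions above.
        "RequestType": request_type,
        "TokenCount": tokens,
        "ResponseLatency": latency_ms,
    })

line = emf_record("RecommendationSystem", "product_recs", 742, 183.5)
```

Writing this line to a log stream that CloudWatch ingests publishes both metrics without separate `PutMetricData` calls, and anomaly detection can then be enabled on them to flag deviations from the baseline.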