
Amazon AIP-C01 Exam

AWS Certified Generative AI Developer - Professional Online Practice

Last updated: February 14, 2026

You can work through these online practice questions to gauge your knowledge of the Amazon AIP-C01 exam and then decide whether to register for it.

To pass the exam and reduce your preparation time, use the AIP-C01 dumps (latest real exam questions), which currently include the 85 most recent exam questions and answers.


Question No : 1


A medical company is building a generative AI (GenAI) application that uses Retrieval Augmented Generation (RAG) to provide evidence-based medical information. The application uses Amazon OpenSearch Service to retrieve vector embeddings. Users report that searches frequently miss results that contain exact medical terms and acronyms and return too many semantically similar but irrelevant documents. The company needs to improve retrieval quality and maintain low end-user latency, even as the document collection grows to millions of documents.
Which solution will meet these requirements with the LEAST operational overhead?

Answer:
Explanation:
Option A is the correct solution because hybrid search directly addresses the core retrieval failure modes while maintaining low latency and minimal operational overhead. In medical and scientific domains, exact terminology, abbreviations, and acronyms (for example, drug names, procedures, or conditions) are critical. Pure vector similarity search often underweights these exact matches, leading to missed results and excessive semantically related but irrelevant documents.
Amazon OpenSearch Service natively supports hybrid search, which combines keyword-based retrieval (such as BM25) with vector similarity search. Keyword search ensures precise matching for exact terms and acronyms, while vector search captures semantic meaning and contextual similarity. By blending these approaches, the retrieval system improves both precision and recall without introducing additional infrastructure.
Hybrid search operates within the same OpenSearch index and query path, which preserves low end-user latency even at large scale. This is especially important as the document collection grows to millions of documents. Because OpenSearch handles scoring and ranking internally, no additional orchestration layers or post-processing steps are required.
Option B increases computational cost and latency while failing to address exact-term recall.
Option C introduces a new service and ingestion pipeline, increasing operational overhead and latency.
Option D adds model hosting, re-ranking infrastructure, and complexity that is unnecessary when OpenSearch provides native hybrid retrieval.
Therefore, Option A delivers the best balance of retrieval quality, scalability, latency, and operational simplicity for medical RAG workloads.
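As a sketch of the hybrid retrieval described above, the following builds an OpenSearch hybrid query body that blends a BM25 keyword clause with a k-NN vector clause. The field names (`body_text`, `embedding`) are illustrative, and in practice the index must also be wired to a search pipeline with a normalization processor so the two score sets can be combined:

```python
def build_hybrid_query(query_text, query_vector, k=10):
    """Construct an OpenSearch hybrid query combining BM25 keyword matching
    with k-NN vector similarity. Field names are assumptions."""
    return {
        "size": k,
        "query": {
            "hybrid": {
                "queries": [
                    # Exact-term matching (BM25) catches medical terms and acronyms
                    {"match": {"body_text": {"query": query_text}}},
                    # Semantic similarity over the embedding field
                    {"knn": {"embedding": {"vector": query_vector, "k": k}}},
                ]
            }
        },
    }
```

Because both clauses execute in the same index and query path, no extra orchestration layer is needed to merge results.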

Question No : 2


A company is developing a generative AI (GenAI) application by using Amazon Bedrock. The application will analyze patterns and relationships in the company’s data. The application will process millions of new data points daily across AWS Regions in Europe, North America, and Asia before storing the data in Amazon S3.
The application must comply with local data protection and storage regulations. Data residency and processing must occur within the same continent. The application must also maintain audit trails of the application’s decision-making processes and provide data classification capabilities.
Which solution will meet these requirements?

Answer:
Explanation:
This scenario requires strict data residency, regional processing, classification, and auditable decision trails, which Option C addresses using AWS-native governance services.
Region-specific Amazon S3 buckets enforce geographic data boundaries. Amazon S3 Object Lock ensures immutability of stored data and logs, supporting regulatory retention and non-repudiation requirements. Pre-processing data within the same Region before invoking Amazon Bedrock ensures that inference and data handling do not cross continental boundaries.
Amazon Macie provides managed, automated data classification for sensitive data types such as PII and financial records, fulfilling the classification requirement without custom tooling.
AWS CloudTrail immutable logs provide comprehensive audit trails of all API calls, model invocations, and data access events, ensuring traceability of AI decision-making processes.
Option A violates residency rules through cross-Region inference.
Option B does not provide data classification.
Option D introduces high operational overhead and relies on manual compliance reporting.
Therefore, Option C is the most compliant, scalable, and operationally efficient solution for regionally governed GenAI workloads.
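A minimal sketch of the residency control above: the parameters for `s3.create_bucket(**...)` that pin a bucket to one Region and enable Object Lock at creation time (Object Lock cannot be enabled later on an existing bucket). The bucket name and Region are placeholders, and note that `us-east-1` does not accept a `LocationConstraint`:

```python
def regional_bucket_request(bucket_name, region):
    """Build create_bucket parameters for a Region-pinned, Object Lock-enabled
    bucket. Bucket name and Region are illustrative placeholders."""
    return {
        "Bucket": bucket_name,
        # Pins the data to a single Region to satisfy residency rules
        "CreateBucketConfiguration": {"LocationConstraint": region},
        # Object Lock must be requested at creation time for immutability
        "ObjectLockEnabledForBucket": True,
    }

# One bucket per continent, e.g.:
# boto3.client("s3", region_name="eu-central-1").create_bucket(
#     **regional_bucket_request("analytics-data-eu", "eu-central-1"))
```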

Question No : 3


An elevator service company has developed an AI assistant application by using Amazon Bedrock. The application generates elevator maintenance recommendations to support the company’s elevator technicians. The company uses Amazon Kinesis Data Streams to collect the elevator sensor data.
New regulatory rules require that a human technician must review all AI-generated recommendations. The company needs to establish human oversight workflows to review and approve AI recommendations. The company must store all human technician review decisions for audit purposes.
Which solution will meet these requirements?

Answer:
Explanation:
AWS Step Functions provides native support for human-in-the-loop workflows, making it the best fit for regulatory oversight requirements. The waitForTaskToken integration pattern is explicitly designed to pause a workflow until an external actor, such as a human reviewer, completes a task.
In this architecture, AI-generated recommendations are sent to a human technician for review. The workflow pauses execution using a task token. Once the technician approves or rejects the recommendation, an AWS Lambda function calls SendTaskSuccess or SendTaskFailure, allowing the workflow to continue deterministically.
This approach ensures full auditability, as Step Functions records every state transition, timestamp, and execution path. Storing review outcomes in Amazon DynamoDB provides durable, queryable audit records required for regulatory compliance.
Option A requires custom orchestration and lacks native workflow state management.
Option C incorrectly uses AWS Glue, which is not designed for approval workflows.
Option D uses caching instead of durable audit storage and introduces unnecessary complexity.
Therefore, Option B is the AWS-recommended, lowest-risk, and most auditable solution for mandatory human review of AI outputs.
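The callback step can be sketched as follows: a helper that maps a technician's decision to the matching boto3 Step Functions call (`send_task_success` or `send_task_failure`). The error code and decision fields are illustrative; the actual call would be made from the reviewing Lambda function, e.g. `boto3.client("stepfunctions").send_task_success(**params)`:

```python
import json

def review_callback(task_token, approved, reviewer, notes=""):
    """Build the Step Functions callback for a human review decision.
    Returns (client_method_name, parameters). Field names are illustrative."""
    decision = json.dumps({"reviewer": reviewer, "notes": notes})
    if approved:
        # Resumes the paused workflow along the success path
        return "send_task_success", {"taskToken": task_token, "output": decision}
    # Resumes the workflow along the failure/rejection path
    return "send_task_failure", {
        "taskToken": task_token,
        "error": "RecommendationRejected",
        "cause": decision,
    }
```

The same decision payload can then be written to DynamoDB as the durable audit record.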

Question No : 4


A company is developing a customer support application that uses Amazon Bedrock foundation models (FMs) to provide real-time AI assistance to the company’s employees. The application must display AI-generated responses character by character as the responses are generated. The application needs to support thousands of concurrent users with minimal latency. The responses typically take 15 to 45 seconds to finish.
Which solution will meet these requirements?

Answer:
Explanation:
This requirement explicitly calls for character-by-character streaming, long-running responses, low latency, and massive concurrency, which aligns directly with Amazon Bedrock streaming inference patterns.
Amazon Bedrock provides the InvokeModelWithResponseStream API specifically for streaming partial model outputs as tokens are generated. This enables near-instant feedback to users instead of waiting for the full response to complete, which is essential when responses last up to 45 seconds.
Amazon API Gateway WebSocket APIs are purpose-built for bidirectional, low-latency, server-initiated communication, allowing the backend to push characters or tokens to clients in real time. This eliminates inefficient polling and supports thousands of concurrent open connections.
AWS Lambda integrates natively with WebSocket APIs and scales automatically with connection volume, enabling a fully managed, serverless architecture. This approach maintains security, centralized authentication, throttling, and observability while avoiding direct client access to Bedrock APIs.
Option B introduces polling latency and unnecessary API overhead and does not provide true streaming.
Option C violates AWS security best practices by exposing Bedrock directly to clients and does not scale securely.
Option D only serves completed responses and cannot meet the real-time streaming requirement.
Therefore, Option A is the only solution that fully satisfies streaming behavior, concurrency, latency, and managed-service constraints.
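As a sketch of the streaming path, the following builds the request for `invoke_model_with_response_stream`; the model ID is an example and the body follows the Anthropic Messages schema. The commented loop shows how each chunk would be pushed to the client over the WebSocket connection:

```python
import json

def streaming_request(prompt, model_id="anthropic.claude-3-haiku-20240307-v1:0"):
    """Build parameters for bedrock-runtime invoke_model_with_response_stream.
    Model ID is an example; body follows the Anthropic Messages format."""
    body = {
        "anthropic_version": "bedrock-2023-05-31",
        "max_tokens": 1024,
        "messages": [{"role": "user", "content": prompt}],
    }
    return {"modelId": model_id, "body": json.dumps(body)}

# resp = boto3.client("bedrock-runtime").invoke_model_with_response_stream(
#     **streaming_request("How do I reset my password?"))
# for event in resp["body"]:                       # chunks arrive as tokens generate
#     chunk = json.loads(event["chunk"]["bytes"])  # forward each chunk to the client
#     # apigw.post_to_connection(ConnectionId=..., Data=...)
```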

Question No : 5


A financial services company is developing a Retrieval Augmented Generation (RAG) application to help investment analysts query complex financial relationships across multiple investment vehicles, market sectors, and regulatory environments. The dataset contains highly interconnected entities that have multi-hop relationships. Analysts must examine relationships holistically to provide accurate investment guidance. The application must deliver comprehensive answers that capture indirect relationships between financial entities and must respond in less than 3 seconds.
Which solution will meet these requirements with the LEAST operational overhead?

Answer:
Explanation:
Option A best satisfies the requirement to capture multi-hop, highly interconnected relationships with minimal operational overhead. Traditional vector similarity search excels at finding semantically similar text but is not optimized for reasoning over explicit entity-to-entity relationships, especially when analysts need indirect, multi-hop connections (for example, fund → holding → issuer → sector → regulation). Graph-based retrieval is designed specifically for these kinds of relationship traversals.
GraphRAG combines retrieval-augmented generation with graph-aware context selection. By representing entities and their relationships in a graph store, the system can traverse multiple hops to assemble a holistic set of relevant facts. This improves completeness and reduces the chance that the model misses indirect relationships that are essential for accurate investment guidance.
Amazon Neptune Analytics provides a managed graph analytics environment capable of efficiently traversing and analyzing complex relationship networks. When integrated with Amazon Bedrock Knowledge Bases, it reduces custom engineering by providing managed ingestion, retrieval, and orchestration patterns suitable for GenAI applications. This lowers operational overhead compared to building and maintaining custom multi-stage retrieval logic.
Meeting the sub-3-second requirement is also more feasible with a graph-optimized engine because multi-hop traversals can be executed efficiently compared to chaining multiple vector searches and joining results in an application layer. The managed nature of Knowledge Bases and Neptune Analytics reduces maintenance, scaling, and operational burden while enabling strong performance.
Options B and C require extensive custom logic and orchestration, increasing complexity and latency.
Option D is not designed for graph-style multi-hop exploration and would require significant custom indexing and retrieval logic.
Therefore, Option A is the most AWS-aligned and operationally efficient approach for multi-hop relationship-aware RAG with strong performance.
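The managed retrieval call can be sketched as the parameters for `bedrock-agent-runtime` `retrieve(**...)` against a Knowledge Base; the Knowledge Base ID is a placeholder, and a graph-backed Knowledge Base would assemble multi-hop context behind this same interface:

```python
def kb_retrieve_request(query_text, knowledge_base_id, top_k=5):
    """Build parameters for bedrock-agent-runtime retrieve() against a
    Knowledge Base. The KB ID is a placeholder."""
    return {
        "knowledgeBaseId": knowledge_base_id,
        "retrievalQuery": {"text": query_text},
        "retrievalConfiguration": {
            # Limits the candidate set to keep latency under the 3-second budget
            "vectorSearchConfiguration": {"numberOfResults": top_k}
        },
    }
```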

Question No : 6


A company upgraded its Amazon Bedrock-powered foundation model (FM) that supports a multilingual customer service assistant. After the upgrade, the assistant exhibited inconsistent behavior across languages. The assistant began generating different responses in some languages when presented with identical questions.
The company needs a solution to detect and address similar problems for future updates. The evaluation must be completed within 45 minutes for all supported languages. The evaluation must process at least 15,000 test conversations in parallel. The evaluation process must be fully automated and integrated into the CI/CD pipeline. The solution must block deployment if quality thresholds are not met.
Which solution will meet these requirements?

Answer:
Explanation:
Option D is the correct solution because it directly evaluates multilingual output consistency and quality in an automated, scalable, and deployment-gating workflow. Amazon Bedrock model evaluation jobs are designed to run large-scale, repeatable evaluations against defined datasets and to produce quantitative metrics that can be used as objective release criteria.
The core issue is semantic inconsistency across languages for equivalent inputs. The most reliable way to detect this is to create standardized test conversations where each language version expresses the same intent and constraints. Running those tests through the updated model and comparing results with similarity metrics (for example, semantic similarity between expected and actual answers, or between language variants) surfaces regressions that infrastructure testing cannot detect.
Bedrock evaluation jobs support running evaluations at scale and are well suited for processing large datasets quickly. By parallelizing evaluation runs across languages and conversations, the company can meet the 45-minute requirement while executing at least 15,000 conversations. Because the process is standardized, it also allows consistent baseline comparisons across releases.
Applying hallucination thresholds ensures that answers remain grounded and do not introduce fabricated details, which is particularly important when language-specific behavior shifts after a model upgrade. Integrating evaluation jobs into the CI/CD pipeline enables fully automated execution on every model or configuration update. The pipeline can enforce a hard quality gate that blocks deployment if thresholds are not met, preventing regressions from reaching production.
Option A focuses on performance and infrastructure bottlenecks, not multilingual response quality.
Option B is post-deployment and too slow to prevent regressions.
Option C normalizes inputs but does not measure multilingual output equivalence or provide robust, quantitative gating.
Therefore, Option D best meets the automation, scale, timing, and deployment-blocking requirements.
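The quality gate itself can be sketched without any AWS dependency: compare embeddings of answers to equivalent prompts across languages and block deployment when the pass rate falls below a target. The similarity threshold and pass-rate target below are illustrative, not values prescribed by Bedrock evaluation jobs:

```python
import math

def cosine(a, b):
    """Cosine similarity between two embedding vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm

def consistency_gate(embedding_pairs, min_similarity=0.85, required_pass_rate=0.995):
    """Score (expected, actual) or (language A, language B) answer embedding
    pairs; return (pass_rate, deploy_ok). Thresholds are illustrative."""
    scores = [cosine(a, b) for a, b in embedding_pairs]
    pass_rate = sum(s >= min_similarity for s in scores) / len(scores)
    # The CI/CD stage fails the build when deploy_ok is False
    return pass_rate, pass_rate >= required_pass_rate
```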

Question No : 7


A company is using Amazon Bedrock to build a customer-facing AI assistant that handles sensitive customer inquiries. The company must use defense-in-depth safety controls to block sophisticated prompt injection attacks. The company must keep audit logs of all safety interventions. The AI assistant must have cross-Region failover capabilities.
Which solution will meet these requirements?

Answer:
Explanation:
Option A provides the most complete, AWS-native defense-in-depth solution for protecting against prompt injection attacks while meeting audit and resiliency requirements. Amazon Bedrock guardrails are designed specifically to enforce safety policies on both user inputs and model outputs, including protections against prompt injection and jailbreak attempts.
Setting content filters to high increases sensitivity to malicious or manipulative inputs. Guardrail profiles allow the same guardrail configuration to be applied consistently across multiple Regions, enabling cross-Region inference and failover without configuration drift. This directly satisfies the requirement for regional resilience.
Amazon CloudWatch Logs captures detailed guardrail intervention events, including when content is blocked, modified, or flagged. Custom metrics derived from these logs enable fine-grained auditing, alerting, and reporting on safety enforcement actions. This provides a more detailed audit trail of safety interventions than API-level logs alone.
Option B adds WAF protection but lacks detailed guardrail intervention logging.
Option C introduces additional services and custom logic that increase complexity and may miss model-specific injection patterns.
Option D references replication concepts that are not aligned with Bedrock guardrail operational models and relies on word filters, which are insufficient against sophisticated prompt injection techniques.
Therefore, Option A best meets the requirements for layered protection, auditability, and cross-Region resilience using managed Amazon Bedrock safety controls.
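Applying a guardrail at inference time can be sketched as the parameters for `invoke_model(**...)` with the `guardrailIdentifier` and `guardrailVersion` fields, which cause the same guardrail to screen both the input and the model output. The IDs are placeholders; in a cross-Region setup the same guardrail configuration would be applied in each Region:

```python
import json

def guarded_invoke_request(prompt, model_id, guardrail_id, guardrail_version="1"):
    """Build parameters for bedrock-runtime invoke_model() with a guardrail
    applied to input and output. IDs are placeholders."""
    return {
        "modelId": model_id,
        "guardrailIdentifier": guardrail_id,
        "guardrailVersion": guardrail_version,
        "body": json.dumps({
            "anthropic_version": "bedrock-2023-05-31",
            "max_tokens": 512,
            "messages": [{"role": "user", "content": prompt}],
        }),
    }
```

Guardrail intervention events then land in CloudWatch Logs when invocation logging is enabled.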

Question No : 8


A financial services company is developing a real-time generative AI (GenAI) assistant to support human call center agents. The GenAI assistant must transcribe live customer speech, analyze context, and provide incremental suggestions to call center agents while a customer is still speaking. To preserve responsiveness, the GenAI assistant must maintain end-to-end latency under 1 second from speech to initial response display. The architecture must use only managed AWS services and must support bidirectional streaming to ensure that call center agents receive updates in real time.
Which solution will meet these requirements?

Answer:
Explanation:
Option B is the only solution that satisfies all strict real-time, streaming, and latency requirements. Amazon Transcribe streaming with partial results allows transcription fragments to be delivered before the speaker finishes a sentence. This significantly reduces perceived latency and enables downstream processing to begin immediately, which is essential for maintaining sub-1-second end-to-end response times.
Using Amazon Bedrock’s InvokeModelWithResponseStream API enables token-level or chunk-level streaming responses from the foundation model. This allows the GenAI assistant to begin delivering suggestions to call center agents incrementally instead of waiting for a full model response. This streaming inference capability is critical for interactive, real-time agent assistance use cases.
Amazon API Gateway WebSocket APIs provide fully managed, bidirectional communication between backend services and agent dashboards. This ensures that updates flow continuously to agents as new transcription fragments and model outputs become available, preserving real-time responsiveness without requiring custom socket infrastructure.
Option A introduces additional synchronous processing layers and storage writes that increase latency.
Option C uses batch transcription and post-call processing, which cannot meet real-time requirements.
Option D uses embeddings and asynchronous messaging, which are not suitable for live incremental suggestions and bidirectional streaming.
Therefore, Option B best aligns with AWS real-time GenAI architecture patterns by combining streaming transcription, streaming model inference, and managed bidirectional communication while maintaining low latency and operational simplicity.
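A small sketch of the partial-results handling above: extracting `(text, is_partial)` pairs from a Transcribe streaming `TranscriptEvent` payload so downstream inference can begin before the utterance ends. The event shape shown matches the documented streaming response structure, but treat the exact keys as an assumption to verify against your SDK version:

```python
def extract_partials(transcript_event):
    """Pull (text, is_partial) pairs from a Transcribe streaming
    TranscriptEvent payload. Keys assumed from the streaming response shape."""
    results = transcript_event.get("Transcript", {}).get("Results", [])
    return [
        # IsPartial=True fragments can already be forwarded to the model
        (r["Alternatives"][0]["Transcript"], r.get("IsPartial", False))
        for r in results
        if r.get("Alternatives")
    ]
```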

Question No : 9


An ecommerce company is developing a generative AI (GenAI) solution that uses Amazon Bedrock with Anthropic Claude to recommend products to customers. Customers report that some recommended products are not available for sale or are not relevant. Customers also report long response times for some recommendations.
The company confirms that most customer interactions are unique and that the solution recommends products not present in the product catalog.
Which solution will meet these requirements?

Answer:
Explanation:
Option C is the correct solution because it directly addresses both correctness and performance issues by grounding the model’s responses in authoritative product data using Retrieval Augmented Generation. Amazon Bedrock Knowledge Bases are designed to connect foundation models to trusted enterprise data sources, ensuring that generated responses are constrained to known, validated content.
By ingesting the product catalog into a knowledge base, the GenAI application retrieves only products that actually exist in the catalog. This prevents hallucinated or unavailable recommendations, which is a common issue when models rely solely on prompt instructions without retrieval grounding. RAG ensures that the model’s output is based on retrieved facts rather than learned generalizations.
Setting the PerformanceConfigLatency parameter to optimized enables Bedrock to prioritize lower-latency retrieval and inference paths, improving responsiveness for real-time recommendation scenarios. This directly addresses the reported performance issues without requiring provisioned throughput or caching strategies that are ineffective for mostly unique interactions.
Option A improves safety and latency predictability but does not ensure recommendations are limited to valid products.
Option B relies on prompt constraints, which are not sufficient to prevent hallucinations.
Option D introduces additional validation and caching layers but increases complexity and does not improve generation relevance.
Therefore, Option C best resolves both relevance and latency challenges using AWS-native, low-maintenance GenAI integration patterns.
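The grounded generation call can be sketched as the parameters for `bedrock-agent-runtime` `retrieve_and_generate(**...)`. The latency-optimized setting below mirrors the PerformanceConfigLatency parameter discussed above, though the exact field placement may vary by SDK version, and the Knowledge Base ID and model ARN are placeholders:

```python
def rag_request(query_text, knowledge_base_id, model_arn):
    """Build parameters for bedrock-agent-runtime retrieve_and_generate().
    KB ID and model ARN are placeholders; performanceConfig placement is an
    assumption to verify against the current SDK."""
    return {
        "input": {"text": query_text},
        "retrieveAndGenerateConfiguration": {
            "type": "KNOWLEDGE_BASE",
            "knowledgeBaseConfiguration": {
                "knowledgeBaseId": knowledge_base_id,
                "modelArn": model_arn,
                "generationConfiguration": {
                    # Prioritizes the lower-latency inference path
                    "performanceConfig": {"latency": "optimized"}
                },
            },
        },
    }
```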

Question No : 10


A financial services company needs to pre-process unstructured data such as customer transcripts, financial reports, and documentation. The company stores the unstructured data in Amazon S3 to support an Amazon Bedrock application.
The company must validate data quality, create auditable metadata, monitor data metrics, and customize text chunking to optimize foundation model (FM) performance.
Which solution will meet these requirements with the LEAST development effort?

Answer:
Explanation:
Option B is the most appropriate solution because it uses AWS-native, purpose-built data engineering and governance services to address data quality validation, metadata creation, monitoring, and transformation with minimal custom development. AWS Glue is designed specifically for large-scale data preparation and integrates seamlessly with Amazon S3, making it ideal for preprocessing unstructured datasets for downstream GenAI applications.
AWS Glue crawlers automatically infer schemas and populate the AWS Glue Data Catalog, creating auditable, queryable metadata for all datasets. This satisfies the requirement for traceability and governance, which is especially critical in financial services environments. Glue ETL jobs allow teams to implement customizable transformation logic, including text normalization and chunking strategies optimized for foundation model context windows.
AWS Glue Data Quality provides built-in rulesets for validating completeness, accuracy, and consistency. It also publishes quality metrics that can be monitored over time, meeting the requirement for ongoing data quality monitoring without building custom validation frameworks.
Because AWS Glue is fully managed, it eliminates the need to manage infrastructure, scaling, or orchestration. This significantly reduces development and operational effort compared to custom Lambda pipelines or EC2-based processing. The processed and validated data can then be safely ingested into Amazon Bedrock workflows or knowledge bases.
Options A and C require custom logic for validation, monitoring, and chunking, increasing development complexity.
Option D introduces unnecessary infrastructure management and services not optimized for data preprocessing.
Therefore, Option B best meets the requirements while minimizing development effort and aligning with AWS Generative AI data preparation best practices.
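Two pieces of the pipeline above can be sketched briefly: an illustrative Glue Data Quality ruleset in DQDL (column names are assumptions), and a fixed-size overlapping chunker of the kind a Glue ETL job could apply before ingestion:

```python
# Illustrative AWS Glue Data Quality ruleset (DQDL); column names are assumptions
DQDL_RULESET = """
Rules = [
    IsComplete "document_id",
    Completeness "document_text" > 0.98
]
"""

def chunk_text(text, max_chars=1000, overlap=100):
    """Fixed-size chunking with overlap, sized to the FM context window."""
    chunks, start = [], 0
    while start < len(text):
        chunks.append(text[start:start + max_chars])
        start += max_chars - overlap  # step back by the overlap each time
    return chunks
```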

Question No : 11


A company is developing a generative AI (GenAI) application that uses Amazon Bedrock foundation models. The application has several custom tool integrations. The application has experienced unexpected token consumption surges despite consistent user traffic.
The company needs a solution that uses Amazon Bedrock model invocation logging to monitor InputTokenCount and OutputTokenCount metrics. The solution must detect unusual patterns in tool usage and identify which specific tool integrations cause abnormal token consumption. The solution must also automatically adjust thresholds as traffic patterns change.
Which solution will meet these requirements?

Answer:
Explanation:
Option C best meets the requirements by combining native Amazon Bedrock logging with adaptive monitoring and minimal operational overhead. Amazon Bedrock model invocation logging can be sent directly to CloudWatch Logs, where detailed fields such as InputTokenCount, OutputTokenCount, and tool invocation metadata are captured for each request.
CloudWatch metric filters allow extraction of structured metrics from logs, including tool-specific token consumption patterns. By defining filters per tool integration, the company can isolate which tools are responsible for increased token usage without building custom log-processing pipelines.
CloudWatch anomaly detection provides automatic baseline modeling and dynamic thresholds based on historical traffic patterns. Unlike static alarms, anomaly detection adapts as usage evolves, making it ideal for applications with changing workloads or seasonal usage patterns. This directly satisfies the requirement to automatically adjust thresholds as traffic patterns change.
When abnormal token consumption occurs, anomaly detection alarms trigger immediately, enabling rapid investigation and remediation. Because this solution uses fully managed AWS services without custom analytics jobs or manual threshold tuning, it significantly reduces operational effort.
Option A fails to adapt to changing patterns.
Option B introduces batch analysis and delayed insights.
Option D requires manual intervention and custom code, increasing maintenance burden.
Therefore, Option C provides the most scalable, adaptive, and low-maintenance solution for monitoring and controlling token consumption in Amazon Bedrock-based applications.
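A per-tool metric filter can be sketched as the parameters for `logs.put_metric_filter(**...)`. The JSON field names in the filter pattern (`toolName`, `output.outputTokenCount`) are assumptions about the invocation-log schema to verify against your logging configuration; an anomaly-detection alarm would then be attached to the extracted metric:

```python
def token_metric_filter(log_group, tool_name):
    """Build put_metric_filter parameters extracting a per-tool output-token
    metric from Bedrock invocation logs. Log field names are assumptions."""
    return {
        "logGroupName": log_group,
        "filterName": f"{tool_name}-output-tokens",
        # Matches invocation log entries produced by this tool integration
        "filterPattern": f'{{ $.toolName = "{tool_name}" }}',
        "metricTransformations": [{
            "metricName": f"{tool_name}OutputTokens",
            "metricNamespace": "GenAI/Tokens",
            # Emits the token count as the metric value
            "metricValue": "$.output.outputTokenCount",
        }],
    }
```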

Question No : 12


A media company must use Amazon Bedrock to implement a robust governance process for AI-generated content. The company needs to manage hundreds of prompt templates. Multiple teams use the templates across multiple AWS Regions to generate content. The solution must provide version control with approval workflows that include notifications for pending reviews. The solution must also provide detailed audit trails that document prompt activities and consistent prompt parameterization to enforce quality standards.
Which solution will meet these requirements?

Answer:
Explanation:
Option B is the correct solution because Amazon Bedrock Prompt Management is purpose-built to manage, govern, and standardize prompt usage at scale across teams and Regions. It provides native version control, allowing teams to track prompt changes over time and ensure that only approved versions are used in production workflows.
Prompt Management supports approval workflows that align with enterprise governance requirements. Approval permissions can be enforced through IAM policies, ensuring that only authorized reviewers can approve or publish prompt versions. This removes the need for custom workflow engines or external storage systems, significantly reducing operational overhead.
Parameterized prompt templates enable consistent prompt structure while allowing controlled variation through defined variables. This ensures consistent quality standards and reduces prompt drift, which is critical when hundreds of prompts are reused across multiple applications and teams.
AWS CloudTrail integrates natively with Amazon Bedrock to provide immutable audit logs for prompt creation, updates, approvals, and usage. These detailed audit trails satisfy compliance requirements and allow security and governance teams to trace prompt activity across Regions and users.
Option A requires significant custom development to coordinate approvals and maintain state.
Option C relies on general-purpose workflow services and manual versioning mechanisms that are error-prone and difficult to scale.
Option D uses services not designed for large-scale GenAI prompt governance and introduces unnecessary complexity.
Therefore, Option B best meets the requirements for scalable, auditable, and low-overhead governance of AI-generated content using Amazon Bedrock.
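A parameterized template can be sketched as the parameters for `bedrock-agent` `create_prompt(**...)`; a version would then be published with `create_prompt_version`. The variant name, template text, and variables below are illustrative:

```python
def prompt_template(name, template_text, variables):
    """Build create_prompt parameters for a parameterized Bedrock Prompt
    Management template. Names and variables are illustrative."""
    return {
        "name": name,
        "variants": [{
            "name": "default",
            "templateType": "TEXT",
            "templateConfiguration": {
                "text": {
                    "text": template_text,
                    # Declared variables enforce consistent parameterization
                    "inputVariables": [{"name": v} for v in variables],
                }
            },
        }],
    }
```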

Question No : 13


A specialty coffee company has a mobile app that generates personalized coffee roast profiles by using Amazon Bedrock with a three-stage prompt chain. The prompt chain converts user inputs into structured metadata, retrieves relevant logs for coffee roasts, and generates a personalized roast recommendation for each customer.
Users in multiple AWS Regions report inconsistent roast recommendations for identical inputs, slow inference during the retrieval step, and unsafe recommendations such as brewing at excessively high temperatures. The company must improve the stability of outputs for repeated inputs. The company must also improve app performance and the safety of the app’s outputs. The updated solution must ensure 99.5% output consistency for identical inputs and achieve inference latency of less than 1 second. The solution must also block unsafe or hallucinated recommendations by using validated safety controls.
Which solution will meet these requirements?

Answer:
Explanation:
Option A is the only choice that simultaneously addresses all three requirements: (1) higher output consistency for identical inputs, (2) sub-1-second performance, and (3) validated safety controls that block unsafe or hallucinated recommendations.
Provisioned throughput in Amazon Bedrock reserves capacity for the chosen model, which helps stabilize latency and reduces the chance of throttling or variable response times across Regions. This is important for a mobile app with strict latency goals and users distributed across multiple Regions. While provisioned throughput primarily improves performance predictability, it also reduces variability caused by contention during peak demand.
Amazon Bedrock guardrails provide validated safety controls to filter or block unsafe content. Semantic denial rules are appropriate for preventing dangerous brewing guidance (for example, excessively high temperatures) and for reducing hallucinated instructions that violate safety policies. Guardrails can be enforced consistently regardless of prompt-chain complexity, providing a uniform safety layer around the model outputs.
Amazon Bedrock Prompt Management supports controlled prompt versioning and approval workflows. By standardizing prompts, controlling changes, and ensuring the same prompt version is used for identical inputs, the company improves output stability and reduces drift caused by unmanaged prompt edits. Combined with strict configuration control (including fixed inference parameters such as temperature where appropriate), this improves repeatability and increases the likelihood of achieving the 99.5% consistency target.
Option B improves observability and experimentation but does not provide strong safety enforcement or latency stabilization.
Option C improves performance through caching and tracing but does not provide validated safety controls and does not directly address cross-Region output consistency.
Option D may improve retrieval but does not enforce safety controls or ensure repeatable outputs.
Therefore, Option A best meets the stability, performance, and safety requirements using AWS-native controls.
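The semantic denial rule can be sketched as the parameters for `bedrock` `create_guardrail(**...)` with a denied topic; the topic name, definition, and blocked-content messages below are illustrative wording, not prescribed values:

```python
def safety_guardrail(name):
    """Build create_guardrail parameters with a denied topic blocking unsafe
    brewing guidance. Topic wording and messages are illustrative."""
    return {
        "name": name,
        "topicPolicyConfig": {
            "topicsConfig": [{
                "name": "UnsafeBrewingTemperatures",
                "definition": "Guidance to brew or roast coffee at temperatures "
                              "outside safe operating ranges.",
                "type": "DENY",  # semantic denial, not a word filter
            }]
        },
        # Returned to the user when input or output is blocked
        "blockedInputMessaging": "This request cannot be processed safely.",
        "blockedOutputsMessaging": "The recommendation was blocked by safety policy.",
    }
```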

Question No : 14


An ecommerce company is using Amazon Bedrock to build a generative AI (GenAI) application. The application uses AWS Step Functions to orchestrate a multi-agent workflow to produce detailed product descriptions. The workflow consists of three sequential states: a description generator, a technical specifications validator, and a brand voice consistency checker. Each state produces intermediate reasoning traces and outputs that are passed to the next state. The application uses an Amazon S3 bucket for process storage and to store outputs.
During testing, the company discovers that outputs between Step Functions states frequently exceed the 256 KB quota and cause workflow failures. A GenAI Developer needs to revise the application architecture to efficiently handle the Step Functions 256 KB quota and maintain workflow observability. The revised architecture must preserve the existing multi-agent reasoning and acting (ReAct) pattern.
Which solution will meet these requirements with the LEAST operational overhead?

Answer:
Explanation:
Option B is the best solution because it directly addresses the Step Functions 256 KB state payload quota by externalizing large intermediate artifacts to Amazon S3 and passing only lightweight references (URIs/keys) between states. This is a standard AWS pattern for workflows that produce large intermediate results, and it avoids introducing additional databases, compression logic, or cross-state-machine coordination that increases operational overhead.
In a multi-agent ReAct workflow, intermediate reasoning traces can be verbose and grow quickly as each agent produces chain-of-thought style artifacts, structured outputs, and supporting evidence. Step Functions is designed to orchestrate state transitions and pass JSON payloads, but large payloads should be stored outside the state machine and referenced by pointer values. Using Amazon S3 for intermediate outputs is operationally efficient because the application already uses S3 for storage, and S3 provides durable, low-cost storage with simple access patterns.
ResultPath and ResultSelector allow each state to store or reshape results so that only the required reference fields (such as s3Uri, object key, metadata, trace IDs) are forwarded to subsequent states. This preserves observability because the workflow can still log trace references, correlate steps with S3 objects, and store structured metadata for debugging. It also preserves the sequential validation design, keeping the existing ReAct pattern intact while preventing failures due to oversized payloads.
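A minimal sketch of this claim-check pattern in Amazon States Language, expressed here as a Python dict; the state names, function ARN, bucket, and field names are illustrative. The task function uploads its full reasoning trace to S3 and returns only the object reference, which ResultSelector shapes into the small payload forwarded to the next state.

```python
import json

# Illustrative ASL state: the Lambda task writes its full output to S3 and
# returns {"s3Uri": ..., "traceId": ...}; ResultSelector forwards only those
# reference fields, and ResultPath merges them into the state input.
description_generator_state = {
    "Type": "Task",
    "Resource": "arn:aws:lambda:us-east-1:111122223333:function:GenerateDescription",  # placeholder ARN
    "ResultSelector": {
        "s3Uri.$": "$.Payload.s3Uri",
        "traceId.$": "$.Payload.traceId",
    },
    "ResultPath": "$.descriptionRef",
    "Next": "ValidateTechnicalSpecs",
}

# The payload passed between states is now just a small reference object,
# regardless of how verbose the reasoning trace stored in S3 becomes.
forwarded_payload = {
    "descriptionRef": {
        "s3Uri": "s3://product-artifacts/runs/abc123/description.json",
        "traceId": "abc123",
    }
}
payload_size = len(json.dumps(forwarded_payload).encode("utf-8"))
print(payload_size < 256 * 1024)  # stays far below the 256 KB quota
```

The trace IDs and S3 URIs in the forwarded payload also preserve observability: each workflow execution can be correlated with its stored artifacts for debugging.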
Option A adds additional services and read/write patterns that increase operational complexity.
Option C introduces custom compression/decompression logic that is fragile, adds latency, and complicates troubleshooting.
Option D increases orchestration overhead by splitting workflows and coordinating with events, which makes debugging harder and increases failure modes.
Therefore, Option B meets the payload limit requirement while keeping the architecture simple and observable.

Question No : 15


A financial services company is building a customer support application that retrieves relevant financial regulation documents from a database based on semantic similarity to user queries. The application must integrate with Amazon Bedrock to generate responses. The application must search documents in English, Spanish, and Portuguese. The application must filter documents by metadata such as publication date, regulatory agency, and document type.
The database stores approximately 10 million document embeddings. To minimize operational overhead, the company wants a solution that minimizes management and maintenance effort while providing low-latency responses for real-time customer interactions.
Which solution will meet these requirements?

Answer:
Explanation:
Option A is the optimal solution because it provides scalable semantic search, rich metadata filtering, and tight integration with Amazon Bedrock while minimizing operational overhead. Amazon OpenSearch Serverless is designed for high-volume, low-latency search workloads and removes the need to manage clusters, capacity planning, or scaling policies.
With support for vector search and structured metadata filtering, OpenSearch Serverless enables efficient similarity search across 10 million embeddings while applying constraints such as language, publication date, regulatory agency, and document type. This is critical for financial services use cases where relevance and compliance depend on precise filtering.
Integrating OpenSearch Serverless with Amazon Bedrock Knowledge Bases enables a fully managed RAG workflow. The knowledge base handles embedding generation, retrieval, and context assembly, while Amazon Bedrock generates responses using a foundation model. This significantly reduces custom glue code and operational complexity.
Multilingual support is handled at the embedding and retrieval layer, allowing documents in English, Spanish, and Portuguese to be searched semantically without language-specific query logic. OpenSearch’s distributed architecture ensures consistent low-latency responses for real-time customer interactions.
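As an illustration of combining vector similarity with metadata constraints, a filtered k-NN query against OpenSearch might look like the dict below. The index field names (embedding, agency, doc_type, publication_date) and filter values are assumptions, and the query vector would come from the same embedding model used at ingestion.

```python
# Sketch of an OpenSearch k-NN query that combines vector similarity with
# metadata filters. Field names and values are illustrative placeholders.
def build_filtered_knn_query(query_vector, k=10):
    return {
        "size": k,
        "query": {
            "knn": {
                "embedding": {
                    "vector": query_vector,
                    "k": k,
                    # The filter is applied as part of the k-NN search rather
                    # than as a post-processing step, preserving recall while
                    # enforcing compliance constraints.
                    "filter": {
                        "bool": {
                            "must": [
                                {"term": {"agency": "SEC"}},
                                {"term": {"doc_type": "regulation"}},
                                {"range": {"publication_date": {"gte": "2023-01-01"}}},
                            ]
                        }
                    },
                }
            }
        },
    }

query = build_filtered_knn_query([0.1, 0.2, 0.3], k=5)
print(query["size"])  # → 5
```

In a real deployment this query body would be sent to the OpenSearch Serverless collection endpoint; the same structure works whether the embeddings were generated from English, Spanish, or Portuguese documents, because filtering and similarity operate on the shared vector space and metadata fields.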
Option B increases operational overhead by requiring database tuning and scaling for vector workloads.
Option C does not support advanced metadata filtering, which is a key requirement.
Option D introduces unnecessary complexity and is not optimized for large-scale semantic document retrieval.
Therefore, Option A best meets the requirements for performance, scalability, multilingual support, and minimal management effort in an Amazon Bedrock-based RAG application.
