NCA-GENM Free Exam Questions: NVIDIA Generative AI Multimodal Certification
You are tasked with optimizing a large multimodal AI model for deployment on edge devices with limited computational resources. Which combination of techniques would provide the BEST trade-off between model accuracy and inference speed? (Select TWO)
Correct answer: B, E
Explanation: (available to GoShiken members only)
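The answer options are not reproduced here, but the usual pairing for this accuracy/latency trade-off is weight quantization combined with pruning or distillation. As a minimal, hedged sketch of one such technique, post-training dynamic quantization in PyTorch looks roughly like this (the model and layer choices are placeholders, not the exam's options):

```python
# Illustrative sketch only: post-training dynamic quantization in PyTorch,
# a standard accuracy/latency trade-off technique for edge deployment.
import torch
import torch.nn as nn

model = nn.Sequential(
    nn.Linear(512, 256),
    nn.ReLU(),
    nn.Linear(256, 10),
).eval()

# Quantize the Linear layers' weights to INT8; activations are quantized
# dynamically at inference time, shrinking the model and speeding up CPU inference.
quantized = torch.quantization.quantize_dynamic(
    model, {nn.Linear}, dtype=torch.qint8
)

x = torch.randn(1, 512)
with torch.no_grad():
    print(quantized(x).shape)  # torch.Size([1, 10])
```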
Consider the following Python code snippet using PyTorch, designed to combine text and image embeddings before feeding them into a transformer. Assume 'text_embedding' has shape '(batch_size, seq_len, hidden_dim)' and 'image_embedding' has shape '(batch_size, image_features)'. Which of the following code snippets MOST correctly combines these embeddings for a multimodal transformer input?
Correct answer: B
Explanation: (available to GoShiken members only)
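The candidate snippets themselves are not reproduced here. A common fusion pattern, shown below as a minimal sketch rather than as the exam's option B, projects the image features into the text hidden size and concatenates them along the sequence dimension:

```python
# Minimal sketch: project image features to the text hidden size, then prepend
# them as an extra "token" along the sequence axis. Shapes follow the question.
import torch
import torch.nn as nn

batch_size, seq_len, hidden_dim, image_features = 2, 16, 768, 2048
text_embedding = torch.randn(batch_size, seq_len, hidden_dim)
image_embedding = torch.randn(batch_size, image_features)

# Linear projection so both modalities share the transformer's hidden size.
image_proj = nn.Linear(image_features, hidden_dim)
image_token = image_proj(image_embedding).unsqueeze(1)  # (batch, 1, hidden_dim)

# Concatenate along the sequence axis: (batch, seq_len + 1, hidden_dim).
multimodal_input = torch.cat([image_token, text_embedding], dim=1)
print(multimodal_input.shape)  # torch.Size([2, 17, 768])
```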
You have a multimodal model that processes images and text, and you want to deploy it on an edge device with limited computational resources. Which of the following hardware acceleration strategies would be MOST effective in improving the model's inference speed on the edge device?
Correct answer: A, C
Explanation: (available to GoShiken members only)
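As a hedged illustration of the typical NVIDIA-oriented acceleration path, the sketch below exports a placeholder PyTorch model to ONNX, which can then be compiled into a TensorRT engine for the edge device; the model and file names are assumptions, not part of the exam question:

```python
# Illustrative sketch: export a PyTorch model to ONNX, a typical first step
# before building a TensorRT engine for accelerated edge inference.
import torch
import torchvision

model = torchvision.models.mobilenet_v2(weights=None).eval()
dummy_input = torch.randn(1, 3, 224, 224)

torch.onnx.export(
    model,
    dummy_input,
    "mobilenet_v2.onnx",
    input_names=["input"],
    output_names=["logits"],
    opset_version=17,
)
# The ONNX file can then be compiled with TensorRT, for example:
#   trtexec --onnx=mobilenet_v2.onnx --fp16 --saveEngine=mobilenet_v2.engine
```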
You have a multimodal model combining video and text data for action recognition. The model performs well on standard datasets but struggles with videos containing unusual camera angles or lighting conditions. Which data augmentation strategy would be MOST effective in improving the model's robustness?
Correct answer: B
Explanation: (available to GoShiken members only)
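A minimal sketch of the kind of augmentation this scenario points at, assuming clips stored as (T, C, H, W) tensors and using torchvision transforms; geometric transforms target the unusual camera angles and photometric transforms target the lighting conditions:

```python
# Illustrative sketch: per-frame geometric and photometric augmentation.
import torch
from torchvision import transforms

augment = transforms.Compose([
    transforms.RandomRotation(degrees=25),                   # unusual camera angles
    transforms.RandomPerspective(distortion_scale=0.4, p=0.7),
    transforms.ColorJitter(brightness=0.5, contrast=0.5),    # lighting changes
])

clip = torch.rand(16, 3, 224, 224)  # 16 frames, (T, C, H, W)
augmented_clip = torch.stack([augment(frame) for frame in clip])
print(augmented_clip.shape)  # torch.Size([16, 3, 224, 224])
```

Note that for real training clips the same random parameters would normally be applied to every frame of a clip so the augmentation stays temporally consistent; the per-frame loop above is kept simple for illustration.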
Consider the following code snippet using a hypothetical Generative AI library. This code is intended to generate an image from a text prompt and then refine it based on a user-provided style image. However, it's not producing the desired results. What is the MOST likely cause of the issue?
Correct answer: C
Explanation: (available to GoShiken members only)
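The exam's snippet and its hypothetical library are not reproduced here. As a rough, hedged sketch of the intended generate-then-refine flow using Hugging Face diffusers instead, the example below performs text-to-image generation followed by an image-to-image refinement pass; the model name and the `strength` value are assumptions, and the style is approximated via the prompt rather than true style-image conditioning (which would need something like an IP-Adapter). A `strength` that is too low (refinement barely changes the image) or too high (the original content is lost) is a common cause of this pattern "not producing the desired results":

```python
# Illustrative sketch only (diffusers, not the question's hypothetical library).
import torch
from diffusers import StableDiffusionPipeline, StableDiffusionImg2ImgPipeline

device = "cuda" if torch.cuda.is_available() else "cpu"
dtype = torch.float16 if device == "cuda" else torch.float32
model_id = "runwayml/stable-diffusion-v1-5"  # placeholder model choice

# Stage 1: generate a base image from the text prompt.
text2img = StableDiffusionPipeline.from_pretrained(model_id, torch_dtype=dtype).to(device)
base_image = text2img("a castle on a cliff at sunset").images[0]

# Stage 2: refine the generated image with an image-to-image pass.
img2img = StableDiffusionImg2ImgPipeline.from_pretrained(model_id, torch_dtype=dtype).to(device)
refined = img2img(
    prompt="a castle on a cliff at sunset, oil-painting style",
    image=base_image,   # refinement starts from the generated image
    strength=0.5,       # how far the refinement may move away from it
).images[0]
refined.save("refined.png")
```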
You are working on a project to classify images of different types of flowers. You have a relatively small dataset (around 500 images per class). Which of the following techniques would be the MOST effective to improve the performance of your image classifier, considering the limited data?
Correct answer: E
Explanation: (available to GoShiken members only)
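A minimal sketch of the standard small-data recipe this question alludes to, transfer learning from a pretrained backbone combined with data augmentation; the class count and hyperparameters below are placeholders:

```python
# Illustrative sketch: fine-tune only a new classifier head on top of a frozen,
# ImageNet-pretrained backbone, with augmentation to stretch the small dataset.
import torch
import torch.nn as nn
from torchvision import models, transforms

num_classes = 5  # placeholder number of flower classes
model = models.resnet50(weights=models.ResNet50_Weights.DEFAULT)

# Freeze the pretrained feature extractor; train only the new head.
for param in model.parameters():
    param.requires_grad = False
model.fc = nn.Linear(model.fc.in_features, num_classes)

train_transform = transforms.Compose([
    transforms.RandomResizedCrop(224),
    transforms.RandomHorizontalFlip(),
    transforms.ColorJitter(brightness=0.3, saturation=0.3),
    transforms.ToTensor(),
])

optimizer = torch.optim.Adam(model.fc.parameters(), lr=1e-3)
```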
You are tasked with optimizing a multimodal AI model that processes both image and text data for generating image captions. The model exhibits slow inference times, particularly when handling high-resolution images. Which of the following optimization strategies would be MOST effective in reducing inference latency, considering the NVIDIA ecosystem?
Correct answer: D
Explanation: (available to GoShiken members only)
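As a hedged stand-in for the precision-reduction step that a TensorRT-optimized deployment in the NVIDIA ecosystem would apply, the sketch below runs FP16 inference with autocast on a placeholder vision backbone (a CUDA GPU is assumed; the model is not the exam's captioner):

```python
# Illustrative sketch: FP16 inference with autocast on NVIDIA Tensor Cores.
import torch
import torchvision

model = torchvision.models.resnet50(weights=None).cuda().eval()
image = torch.randn(1, 3, 224, 224, device="cuda")

with torch.no_grad(), torch.autocast(device_type="cuda", dtype=torch.float16):
    features = model(image)  # FP16 kernels reduce latency and memory traffic

print(features.dtype)  # torch.float16
```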
You are building a real-time image captioning system using a Transformer model. You observe significant latency issues when generating captions for high-resolution images. Which optimization strategies would be most effective in reducing the latency without significantly sacrificing caption quality? (Select all that apply)
Correct answer: B, C, D
Explanation: (available to GoShiken members only)
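A minimal sketch of two of the usual latency levers for caption generation, downscaling the high-resolution input before encoding and using greedy decoding with a bounded caption length, shown with a BLIP captioner from Hugging Face as a stand-in for the exam's model:

```python
# Illustrative sketch: cap input resolution and decoding cost for faster captions.
from PIL import Image
from transformers import BlipProcessor, BlipForConditionalGeneration

processor = BlipProcessor.from_pretrained("Salesforce/blip-image-captioning-base")
model = BlipForConditionalGeneration.from_pretrained("Salesforce/blip-image-captioning-base")

image = Image.open("photo.jpg").convert("RGB")   # placeholder path
image = image.resize((384, 384))                 # limit what the vision encoder sees

inputs = processor(images=image, return_tensors="pt")
output_ids = model.generate(**inputs, max_new_tokens=30, num_beams=1)  # greedy decoding
print(processor.decode(output_ids[0], skip_special_tokens=True))
```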
You are using NeMo to fine-tune a pre-trained language model for a specific text generation task. You want to implement a custom data augmentation technique to improve the model's robustness. Which of the following approaches is most appropriate for integrating your custom augmentation within the NeMo framework?
Correct answer: D
Explanation: (available to GoShiken members only)
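As a hedged sketch of where custom augmentation typically lives, the wrapper below applies a simple on-the-fly text augmentation inside a plain PyTorch dataset; it deliberately uses no NeMo-specific API, since in NeMo the same logic would be plugged into the dataset or dataloader used by the fine-tuning recipe rather than bolted onto the model:

```python
# Illustrative sketch only: an augmenting dataset wrapper (plain PyTorch, no NeMo API).
import random
from torch.utils.data import Dataset

def random_word_dropout(text: str, p: float = 0.1) -> str:
    """Drop each word with probability p, a simple robustness augmentation."""
    words = text.split()
    kept = [w for w in words if random.random() > p]
    return " ".join(kept) if kept else text

class AugmentedTextDataset(Dataset):
    def __init__(self, texts, augment=random_word_dropout):
        self.texts = texts
        self.augment = augment

    def __len__(self):
        return len(self.texts)

    def __getitem__(self, idx):
        return self.augment(self.texts[idx])

dataset = AugmentedTextDataset(["fine-tune the language model on domain text"])
print(dataset[0])
```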
You are developing an Avatar Cloud Engine (ACE) application for a virtual assistant that needs to generate realistic facial expressions based on user emotions detected from text. Which ACE microservice would be most directly responsible for this functionality?
Correct answer: D
Explanation: (available to GoShiken members only)
You are building a multimodal generative AI model that combines text, images, and audio. You notice that the model performs well on text and images but struggles with audio, particularly in noisy environments. Which of the following strategies would be MOST effective in improving the model's performance with audio data?
Correct answer: A, C
Explanation: (available to GoShiken members only)
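A minimal sketch of audio-side noise augmentation at a target signal-to-noise ratio, one of the standard robustness techniques this scenario suggests (often paired with a denoising front-end); the waveform here is synthetic and the SNR value is a placeholder:

```python
# Illustrative sketch: additive white-noise augmentation at a target SNR.
import torch

def add_noise(waveform: torch.Tensor, snr_db: float) -> torch.Tensor:
    """Mix white noise into `waveform` so the result has roughly `snr_db` dB SNR."""
    noise = torch.randn_like(waveform)
    signal_power = waveform.pow(2).mean()
    noise_power = noise.pow(2).mean()
    scale = torch.sqrt(signal_power / (noise_power * 10 ** (snr_db / 10)))
    return waveform + scale * noise

clean = torch.sin(torch.linspace(0, 800 * 3.1416, 16000))  # 1 s synthetic tone at 16 kHz
noisy = add_noise(clean, snr_db=10.0)
print(noisy.shape)  # torch.Size([16000])
```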