Complete Subject Question Bank (Dumps)
Which of the following is NOT a type of AI?
The source marks the correct answer as: Generative Art.
What does AI stand for?
The source marks the correct answer as: Artificial Intelligence.
In which application is Generative AI NOT typically used?
The source marks the correct answer as: Automating customer service chats.
Which of the following fields can utilize Generative AI to create new, original content or simulations?
The source marks the correct answer as: Art and Music.
Which of the following is a real-world example of Generative AI?
The source marks the correct answer as: Generating realistic human faces in movies.
Which type of AI is primarily concerned with how data is generated rather than how it's separated?
The source marks the correct answer as: Generative AI.
Generative AI is closely related to which type of models?
The source marks the correct answer as: Generative models.
Which AI type primarily focuses on labeling data?
The source marks the correct answer as: Supervised AI.
Why is Generative AI considered significant in the realm of artificial intelligence?
The source marks the correct answer as: It can produce new, previously unseen data samples.
In the context of AI, which model type is more concerned with the underlying distribution of data?
The source marks the correct answer as: Generative AI.
Which AI type is best for predicting outcomes?
The source marks the correct answer as: Regression AI.
How does Generative AI differ from Classification AI?
The source marks the correct answer as: It generates new data rather than categorizing existing data.
If an AI system is designed to label images of cats and dogs, it is primarily a _______ model.
The source marks the correct answer as: Discriminative.
What is Generative AI primarily used for?
The source marks the correct answer as: Generating new data.
Which of the following is a direct application of Generative AI in the entertainment industry?
The source marks the correct answer as: Creating realistic CGI characters.
Generative AI can be used to create which of the following?
The source marks the correct answer as: New artworks and music pieces.
Which is NOT a real-world application of Generative AI?
The source marks the correct answer as: Predicting stock market prices.
Which statement best describes the role of Generative AI?
The source marks the correct answer as: It focuses on generating data based on learned patterns.
What distinguishes Generative AI from Discriminative AI?
Generative models learn the joint probability distribution P(X,Y) of the data, enabling them to model how data is generated. Discriminative models learn the decision boundary (conditional probability P(Y|X)) between classes. Options [4] and [5] together form this correct answer: 'Generative models data distribution, while Discriminative models the boundary between classes.'
Which statement best defines Generative AI?
This question array contains two merged questions. For 'Which statement best defines Generative AI?' — option [0] 'AI that can generate new data samples' is correct, as Generative AI learns data distributions to produce new content. For the embedded sub-question 'What does AI stand for?' — option [9] 'Artificial Intelligence' is correct.
Which of the following is a real-world example of GenerativeAI?
Generating realistic human faces (e.g. via GANs or diffusion models for visual effects and deepfakes) is a direct real-world application of Generative AI. The other options — translating languages, predicting stock prices, self-driving cars, sorting emails — primarily rely on discriminative or rule-based AI rather than generative approaches.
Who introduced Generative Adversarial Networks (GANs)?
The source marks the correct answer as: Ian Goodfellow.
Which model marked a significant milestone in the use of transformers in NLP?
The source marks the correct answer as: BERT.
Which model uses a probabilistic approach to encode and decode data?
The source marks the correct answer as: VAE.
Which of the following is NOT a direct application of GANs but rather an outcome of its influence?
The source marks the correct answer as: Reinforcement learning in game playing.
Which architecture is primarily associated with attention mechanisms?
The source marks the correct answer as: Transformer.
Which of the following research papers is foundational for Variational Autoencoders (VAEs)?
The source marks the correct answer as: "Auto-Encoding Variational Bayes".
In which year were Generative Adversarial Networks (GANs) first introduced?
The source marks the correct answer as: 2014.
What is the primary purpose of generative models?
The source marks the correct answer as: Generating new data.
What are the two main components of a GAN?
The source marks the correct answer as: Generator and Discriminator.
Which model can transform horse photos into zebra photos without direct comparison?
The source marks the correct answer as: CycleGAN.
What is the main innovation introduced by the "Attention Is All You Need" paper?
The source marks the correct answer as: Transformer architecture.
Which model is known for its rules for creating stable and effective AI image-makers?
The source marks the correct answer as: DCGAN.
What is the primary advantage of Transformers over RNNs in terms of processing sequences?
The source marks the correct answer as: Parallel Processing.
What mechanism allows the Transformer model to weigh the importance of different words in a sequence?
The source marks the correct answer as: Self-Attention Mechanism.
Which AI model series by OpenAI, based on the Transformer architecture, is known for generating highly coherent content?
The source marks the correct answer as: GPT series.
In the context of GANs, what is the role of the Discriminator?
The source marks the correct answer as: To distinguish between real and generated data.
Which model demonstrated that using larger architectures can produce better images?
The source marks the correct answer as: BigGAN.
Which of the following is NOT a direct application of the Transformer architecture?
The source marks the correct answer as: Image recognition.
Which generative model introduced a stochastic layer that models data in a latent space?
The source marks the correct answer as: VAE Additional Reading 3 - Transformers.
Which pioneering research in Generative AI specifically emphasized the generation of text sequences?
"Sequence to Sequence Learning with Neural Networks" (Sutskever, Vinyals & Le, 2014) is the pioneering paper that specifically addressed generation of text sequences using encoder-decoder RNNs for tasks like machine translation. The other options cover computer vision (DeepFace, ConvNets, Neural Style) or general ML theory.
What is the primary goal of machine learning?
The source marks the correct answer as: To allow computers to learn from data.
In the context of neural networks, what does the term "backpropagation" refer to?
The source marks the correct answer as: The method of adjusting weights based on the error.
Which activation function outputs a value between 0 and 1?
The source marks the correct answer as: Sigmoid.
Which application of ML is used to group similar items?
The source marks the correct answer as: Clustering.
Which of the following is a technique to prevent overfitting in neural networks?
The source marks the correct answer as: Dropout.
Which component of a neural network is responsible for combining inputs and passing them to the next layer?
The source marks the correct answer as: Neuron (or Node).
Which of the following is NOT a type of machine learning?
The source marks the correct answer as: Recursive Learning.
Which of the following is NOT a common machine learning algorithm?
The source marks the correct answer as: Quantum Entanglement Brief History of Generative AI - Post Quiz.
Which of the following is a challenge in training deep neural networks?
The source marks the correct answer as: Vanishing/Exploding gradients.
Which function introduces non-linearity in a neural network?
The source marks the correct answer as: Activation Function.
In a neural network, what does a neuron compute?
The source marks the correct answer as: A weighted sum followed by an activation function.
Which of the following is a common activation function in neural networks?
The source marks the correct answer as: ReLU (Rectified Linear Unit).
Which application of ML is used to detect unusual patterns in data?
The source marks the correct answer as: Anomaly Detection.
What is the primary purpose of backpropagation?
The source marks the correct answer as: Adjusting weights based on the error.
How is a neural network's performance typically evaluated during training?
The source marks the correct answer as: Using a validation set.
Which of the following is NOT a layer type in a typical neural network?
The source marks the correct answer as: Quantum Layer.
In which type of ML does an agent learn by interacting with an environment?
The source marks the correct answer as: Reinforcement Learning Fundamentals of Machine Learning and Neural Networks.
What is the primary purpose of a loss function in training neural networks?
The primary purpose of a loss function is to quantify the difference between the model's predicted output and the actual (ground truth) values. This scalar error signal drives backpropagation and weight updates during training. Options [2] and [3] together form the complete answer: 'To quantify the difference between predicted and actual values.'
What is the main difference between regression and classification?
Regression predicts a continuous numeric output (e.g. house price), while classification predicts a discrete class label (e.g. cat vs. dog). Options [0] and [1] together form the complete answer: 'Regression predicts a continuous output, Classification predicts a discrete label.'
What is the role of the loss function in training a neural network?
The loss function's role in training a neural network is to quantify the difference between predicted and actual values, providing the error signal used by backpropagation to update weights. Options [1] and [2] together form the complete answer: 'To quantify the difference between predicted and actual values.'
What does likelihood measure in the context of a model?
The source marks the correct answer as: How well the model explains the observed data.
Which of the following is crucial for understanding the behavior of generative models?
The source marks the correct answer as: Probability distributions and likelihood.
Which of the following is NOT a generative model?
The source marks the correct answer as: Support Vector Machines.
Which model type is primarily concerned with determining P(y | x)?
The source marks the correct answer as: Discriminative Model.
In the context of models, what does P(x | y) typically represent?
The source marks the correct answer as: The probability of x given y.
Generative models are primarily used for which of the following tasks?
The source marks the correct answer as: Generating new data samples similar to the input data.
What is the primary goal of generative models in AI?
The source marks the correct answer as: To generate new data samples.
If a model is better at distinguishing between classes rather than generating data, it is likely a _______.
The source marks the correct answer as: Discriminative model.
In the context of generative models, what does P(x) represent?
The source marks the correct answer as: The probability distribution of the data x.
Within the architecture of Generative Adversarial Networks (GANs), which duo of fundamental elements are paramount?
The source marks the correct answer as: Generator and Discriminator.
Which model type aims to capture the joint probability P(x, y)?
The source marks the correct answer as: Generative Model.
What's a significant hurdle when training GANs?
The source marks the correct answer as: Mode collapse.
Which of the following is NOT a property of likelihood?
The source marks the correct answer as: It is not normalized like a probability.
How is the likelihood of data given a model symbolized?
The source marks the correct answer as: P(data | model).
Within generative models, what function does the discriminator serve in GANs?
The source marks the correct answer as: To distinguish between real and generated data.
For what tasks can generative models be applied?
The source marks the correct answer as: Data generation, denoising, inpainting, and more.
Which statement best differentiates generative from discriminative models?
Generative models learn the joint probability distribution P(X,Y), modeling how input features and labels are jointly distributed. Discriminative models learn the conditional probability P(Y|X) — the boundary between classes. Options [4] and [5] together form the complete answer: 'Generative models learn the joint probability distribution, while discriminative models learn the conditional probability.'
If a model is better at distinguishing between classes rather than generating data, it is likely a _______. Generative model
A model better at distinguishing between classes rather than generating data is a discriminative model. Discriminative models learn the conditional probability P(Y|X) to draw decision boundaries, rather than modeling the full joint data distribution.
What does a probability distribution provide?
A probability distribution provides a mathematical description of the likelihood of outcomes for a random variable — specifying how probability mass or density is assigned across all possible values. Options [3] and [4] together form the complete answer: 'A mathematical description of outcomes for a random variable.'
Which of the following best describes the difference between generative and discriminative models?
Generative models learn the underlying data distribution P(X) or P(X,Y), enabling them to generate new samples. Discriminative models focus on learning the decision boundary (P(Y|X)) between classes. Options [3] and [4] together form the complete answer: 'Generative models learn the data distribution, while discriminative models learn the decision boundary.'
Which claim regarding generative models isn't true?
The false claim is option [1]: 'They always require labeled data for training.' Generative models such as GANs, VAEs, and autoregressive language models can be trained in an unsupervised manner without requiring labeled data. All other options state true properties of generative models.
What does VAE stand for?
The source marks the correct answer as: Variational Autoencoder.
In which application might you use a VAE for generating new, coherent samples?
The source marks the correct answer as: Designing virtual fashion items.
Which application does NOT typically use VAEs?
The source marks the correct answer as: Text summarization.
Which component of the VAE loss function ensures the latent variables adhere to a standard distribution?
The source marks the correct answer as: KL divergence.
Which of the following is NOT a type of autoencoder?
The source marks the correct answer as: Supervised autoencoder.
What is the primary role of autoencoders in generative modeling?
The source marks the correct answer as: Data compression and reconstruction.
In the context of Variational Autoencoders (VAEs), what does variational inference help achieve?
The source marks the correct answer as: Approximation of complex posterior distributions.
Why is the reparameterization trick crucial in training VAEs?
The source marks the correct answer as: It allows backpropagation through stochastic nodes.
Reparameterization trick is used to...
The source marks the correct answer as: Deal with the non-differentiability of sampling in VAEs.
What do VAEs use to generate a distribution over latent variables?
The source marks the correct answer as: Variational inference.
Why is the reparameterization trick important in VAEs?
The source marks the correct answer as: It allows backpropagation through random nodes.
Autoencoders primarily focus on which aspect of data?
The source marks the correct answer as: Reconstruction.
Which of the following is NOT a typical use case for VAEs?
The source marks the correct answer as: Real-time speech translation.
In which application can VAEs detect unusual patterns?
The source marks the correct answer as: Anomaly detection in industrial equipment.
Why is variational inference used in VAEs?
The source marks the correct answer as: To approximate intractable posterior distributions.
In which application might VAEs be used to enhance image quality?
The source marks the correct answer as: Medical imaging.
How do VAEs differ from traditional autoencoders?
The source marks the correct answer as: VAEs introduce randomness via a probabilistic layer.
Which optimization technique is commonly used with VAEs?
The source marks the correct answer as: Stochastic gradient descent (SGD).
Which of the following is a key component of the VAE loss function?
The source marks the correct answer as: KL divergence Variational Autoencoders.
What criterion is used to determine if a data point is anomalous?
The source marks the correct answer as: If its error is above the 99th percentile.
What type of dataset does the manufacturing plant collect?
The source marks the correct answer as: Time Series Dataset.
Which is NOT a challenge in implementing VAEs for this use-case?
The source marks the correct answer as: Increasing data storage costs.
What is the VAE trained to learn effectively?
The source marks the correct answer as: A compressed representation of the data.
For how many epochs is the VAE trained?
The source marks the correct answer as: 50.
Over time, due to certain changes, what might be required of the VAE model?
The source marks the correct answer as: Continuous adaptation.
What is a primary application of VAEs mentioned in the case study?
The source marks the correct answer as: Anomaly Detection.
Why is understanding the VAE's outputs challenging?
The source marks the correct answer as: They can be complex and non-intuitive.
Why is data preprocessing required before training the VAE?
The source marks the correct answer as: To ensure it is suitable for training.
What is the y-axis label of the chart visualizing the error?
The source marks the correct answer as: Reconstruction Error.
What does the VAE attempt to minimize during training?
The source marks the correct answer as: Loss.
In the VAE, what does the sampling function introduce?
The source marks the correct answer as: Randomness.
How is the data divided for training the VAE?
The source marks the correct answer as: 80-20.
What two components combine to form the VAE's loss?
The source marks the correct answer as: MSE and KL divergence.
Which of the following is NOT an attribute in the given data?
The source marks the correct answer as: Humidity.
Why are autoencoders considered generative models?
Autoencoders are considered generative models because they learn a compressed latent representation from which they can reconstruct (and generate) new data similar to the training input. Variational Autoencoders extend this by learning a probability distribution over the latent space. Options [2] and [3] together form the complete answer: 'They can reconstruct and generate data similar to the input.'
Reparameterization trick is used to... Improve model accuracy Deal with the non-differentiability of sampling in VAEs
The reparameterization trick in VAEs is used to deal with the non-differentiability of sampling operations, enabling backpropagation through stochastic latent variables by expressing the sample as a deterministic function of the parameters plus separate Gaussian noise. The true answer ('Deal with the non-differentiability of sampling in VAEs') is stated in the question stem but not among the three remaining options [0,1,2], making option [1] 'None of the given options' the correct selection.
The training process of GANs is often likened to which game?
The source marks the correct answer as: Minimax.
What does GAN stand for?
The source marks the correct answer as: Generative Adversarial Network.
What is a challenge faced during GAN training due to the minimax game concept?
The source marks the correct answer as: Oscillations and non-convergence.
In GANs, which component is responsible for evaluating the authenticity of data?
The source marks the correct answer as: Discriminator.
Which component of a GAN is responsible for generating new data samples?
The source marks the correct answer as: Generator.
Progressive GANs are designed to address which challenge in traditional GANs?
The source marks the correct answer as: Training stability and generating high-resolution images.
Which type of GAN allows for generating data based on specific categories?
The source marks the correct answer as: Conditional GAN.
In the GAN architecture, what is the primary goal of the Discriminator?
The source marks the correct answer as: Distinguish between real and generated samples.
Which of the following is a real-world application where GANs have shown significant promise?
The source marks the correct answer as: Image-to-image translation.
What is mode collapse in the context of GANs?
The source marks the correct answer as: When the generator produces limited varieties of outputs.
Which GAN variant focuses on gradually increasing the resolution of generated images?
The source marks the correct answer as: Progressive GAN.
Which is NOT a real-world application of GANs?
The source marks the correct answer as: Real-time weather prediction.
In GANs, if the discriminator becomes too powerful, what can happen?
The source marks the correct answer as: The generator may struggle to improve.
Which statement about GANs is true?
The source marks the correct answer as: They can generate new, previously unseen data.
Mode collapse is problematic because...
The source marks the correct answer as: It limits the diversity of generated outputs.
What is a challenge in evaluating the performance of GANs?
The source marks the correct answer as: Determining the quality of generated data.
Which component of a GAN tries to produce fake data?
The source marks the correct answer as: Generator.
The generator's objective in GANs is to...
The source marks the correct answer as: Fool the discriminator.
In the minimax game of GANs, what is the discriminator's goal?
The source marks the correct answer as: Distinguish between real and fake data.
Which GAN variant can be conditioned on labels to generate specific outputs?
The source marks the correct answer as: Conditional GAN Generative Adversarial Networks.
1How many images are there in each class of the CIFAR-10 dataset?
The source marks the correct answer as: 6000.
What is used to refine the models during training?
The source marks the correct answer as: Adam Optimizer.
In the provided code, why is discriminator.trainable set to False when setting up the combined system?
The source marks the correct answer as: To make sure only the generator is trained in this step.
Which of the following is NOT a feedback given to the generator during training?
The source marks the correct answer as: This image is pixelated.
1Why might someone want to use GANs on the CIFAR-10 dataset?
The source marks the correct answer as: To generate novel and relevant images to augment dataset.
Which technique can help in dealing with training instability in GANs?
The source marks the correct answer as: Gradient clipping.
Which of the following best describes the role of the generator in a GAN?
The source marks the correct answer as: To produce images.
Which challenge refers to the generator producing limited varieties or even the same sample every time?
The source marks the correct answer as: Mode Collapse.
Which architecture can help address convergence issues in traditional GANs?
The source marks the correct answer as: WGAN.
In the generator code, what is the purpose of the Reshape layer?
The source marks the correct answer as: To reshape the dense layer into a 3D tensor for images.
During training, what does the generator use to improve itself?
The source marks the correct answer as: Feedback from the discriminator.
What does the discriminator do in a GAN?
The source marks the correct answer as: Evaluates if an image is real or fake.
In the discriminator's code, which layer helps in reducing the dimensions of the input image?
The source marks the correct answer as: Conv2D with strides.
Which activation function is used in the final layer of the generator model?
The source marks the correct answer as: tanh.
RNNs are primarily used for which type of data?
The source marks the correct answer as: Sequential.
What is the key advantage of using LSTMs over basic RNNs in sequence generation tasks?
The source marks the correct answer as: Ability to remember long-term dependencies.
Which problem in RNNs does LSTM help to address?
The source marks the correct answer as: Vanishing gradient.
When using RNNs for music generation, what does each neuron in the output layer typically represent?
The source marks the correct answer as: A possible note or rest in the musical vocabulary.
In NLP, what does RNNs help to predict?
The source marks the correct answer as: Next word.
Which RNN architecture utilizes update and reset gates to manage memory?
The source marks the correct answer as: GRU.
What does RNN stand for?
The source marks the correct answer as: Recurrent Neural Network.
During the training of RNNs for sequence generation, what is the common technique used to mitigate the vanishing gradient problem?
The source marks the correct answer as: Gradient clipping.
Which of the following is NOT a type of RNN architecture?
The source marks the correct answer as: CNN CASE STUDY - GANS - CIFAR - Quiz.
Which of the following is NOT a typical use case for RNNs?
The source marks the correct answer as: Image classification.
Which of the following is a common application of RNNs in NLP?
The source marks the correct answer as: Text generation.
Why might one use GRU over LSTM?
The source marks the correct answer as: GRU is simpler and sometimes faster.
In sequence generation tasks, what is the primary input to an RNN at each time step?
The source marks the correct answer as: Previous output.
Which RNN architecture uses a reset and update gate?
The source marks the correct answer as: GRU.
How do RNNs handle variable-length sequences in NLP?
The source marks the correct answer as: Through padding and truncation.
Which problem arises when training RNNs on long sequences?
The source marks the correct answer as: Vanishing or exploding gradients.
What is the main advantage of LSTM over basic RNN?
The source marks the correct answer as: Handling long-term dependencies.
What is the role of the `<OOV>` token?
The source marks the correct answer as: Placeholder for out-of-vocabulary words.
Which layer in the RNN model represents words as detailed feature lists?
The source marks the correct answer as: Embedding Layer.
Why is padding used in the preprocessing step?
The source marks the correct answer as: To handle variable review length.
What advantage does LSTM have over traditional RNNs?
The source marks the correct answer as: Tackles the vanishing gradient problem.
What is the purpose of the Dropout layer in the LSTM with Dropout model?
The source marks the correct answer as: Regularization to prevent overfitting.
What might be a concern if the training accuracy is high but validation accuracy is significantly low?
The source marks the correct answer as: Model is overfitting.
In which scenario might you prefer a simple RNN over an LSTM?
The source marks the correct answer as: Fast training with limited resources.
Which parameter in `model.fit()` signifies the number of times the model is exposed to the dataset?
The source marks the correct answer as: epochs.
Why is the loss function important during model compilation?
The source marks the correct answer as: Specifies how errors are measured.
How does the model handle reviews of varying lengths?
The source marks the correct answer as: Uses padding.
Why might the vanishing gradient problem be a challenge in RNNs?
The source marks the correct answer as: Impedes learning of long-range dependencies.
In the given LSTM model, which layer(s) help in retaining memory and context?
The source marks the correct answer as: LSTM layer.
When using a tokenizer with a fixed number of words, what could be a potential drawback?
The source marks the correct answer as: Limited understanding due to missed words.
What is the primary function of an Embedding Layer?
The source marks the correct answer as: Representing words in dense vector format.
After training, what can be inferred if the validation loss keeps decreasing but training loss remains high?
The source marks the correct answer as: Model is underfitting.
In the context of natural language processing, how are RNNs typically utilized for machine translation?
RNNs are used in machine translation via an encoder-decoder (seq2seq) architecture: the encoder RNN processes the source sentence into a context vector, and the decoder RNN generates the translated output sequence. Options [1] and [2] together form the complete answer: 'Encoding the input sequence and decoding the output sequence.'
What is the primary difference between LSTM and GRU?
LSTM has three gates — input, forget, and output gates — that control information flow through the cell state. GRU simplifies this with only two gates: reset and update gates, making it more computationally efficient. Options [4] and [5] together form the precise answer: 'LSTM has input, forget, and output gates; GRU has reset and update gates.'
In music generation, what might an RNN be trained to predict?
In music generation, an RNN is typically trained to predict the next note or chord given the sequence of previous musical tokens. This autoregressive approach models the temporal dependencies in musical sequences.
The Transformer architecture introduced the concept of self- attention to handle which primary challenge in sequence modeling?
The Transformer's self-attention mechanism was introduced primarily to address the challenge of capturing long-range dependencies in sequences regardless of the distance between dependent tokens in the input — a fundamental limitation of RNNs that process data sequentially. Options [2] and [3] form the complete answer: 'Capturing dependencies regardless of their distance in the input.'
What is the primary advantage of pretraining a Transformer on a large corpus before fine-tuning on a specific task?
The primary advantage of pretraining a Transformer on a large corpus is that the model acquires broad general language understanding (grammar, semantics, world knowledge) which is then efficiently adapted to specific tasks via fine-tuning on smaller labeled datasets. Options [1] and [2] together form the complete answer: 'It allows the model to leverage general language understanding.'
Why is attention particularly crucial in sequence-to-sequence tasks like translation?
Attention is crucial in sequence-to-sequence translation because it allows the decoder to dynamically focus on the most relevant parts of the encoded source sequence when generating each output token, overcoming the bottleneck of compressing the entire source into a single fixed vector. Options [2] and [3] form the complete answer: 'It allows the model to focus on relevant parts of the input when producing an output.'
Which of the following is NOT a sequence-to-sequence task?
The source marks the correct answer as: Image Classification.
In the context of Transformers for language translation, what does the encoder primarily focus on?
The source marks the correct answer as: Processing and representing the source language.
What is the primary component of the Transformer architecture that helps it handle sequences?
The source marks the correct answer as: Attention Mechanism.
What is the first step in training a Transformer model for a specific task?
The source marks the correct answer as: Pre-training.
Which application showcases the use of Transformers in image tasks?
The source marks the correct answer as: Image generation using DALL·E.
Which Transformer model is specifically designed for language translation?
The source marks the correct answer as: T5.
What does the Multi-head attention mechanism in Transformers help with?
The source marks the correct answer as: Capturing different types of information from the input.
Which model can be used for both image and text tasks?
The source marks the correct answer as: None of the options given.
Which mechanism allows Transformers to weigh the importance of different words in a sequence?
The source marks the correct answer as: Self Attention Mechanism.
What is the primary task BERT is designed for?
The source marks the correct answer as: Bidirectional understanding of text.
In sequence-to-sequence tasks, why is attention important?
The source marks the correct answer as: It helps the model focus on relevant parts of the input.
In the context of Transformers, what does "seq to seq" stand for?
The source marks the correct answer as: Sequence to Sequence.
Which of the following models is designed for image generation?
The source marks the correct answer as: DALL·E.
Which Transformer model is known for generating coherent paragraphs of text?
The source marks the correct answer as: GPT.
For which task might you use a Transformer to generate a concise summary of a long article?
The source marks the correct answer as: Summarization.
How did the processing capabilities of Transformers affect GlobeTech's translation time?
The source marks the correct answer as: Reduced it drastically.
Traditional MT models required extensive what for each new language?
The source marks the correct answer as: Re-training and fine-tuning.
The attention mechanism in Transformers allows the model to focus on what?
The source marks the correct answer as: Different parts of the input sentence.
How did GlobeTech offer real-time customer support in multiple languages?
The source marks the correct answer as: Integrating Transformer-based MT into their chatbots.
What technology does GlobeTech plan to integrate with Transformers for customer support in the future?
The source marks the correct answer as: Voice recognition.
Why can we say that Transformers brought a paradigm shift in machine translation?
The source marks the correct answer as: They made translations context-aware and faster.
How did Transformers improve GlobeTech's user interface experience for users of different languages?
The source marks the correct answer as: By providing real-time translations of UI elements.
How did Transformers improve GlobeTech's scalability issue for new languages?
The source marks the correct answer as: Leveraged pre-trained models like BERT and GPT.
Combining voice recognition and Transformers will help GlobeTech offer what?
The source marks the correct answer as: Real-time voice translations for customer support.
What was a major challenge faced by GlobeTech in their previous MT methods?
The source marks the correct answer as: Contextual Translation.
What unique mechanism in Transformers aids in understanding context?
The source marks the correct answer as: Self-attention.
After adopting Transformer-based MT, by how much did GlobeTech reduce translation-related complaints?
The source marks the correct answer as: 0.4.
Which paper introduced the Transformer architecture?
The source marks the correct answer as: "Attention Is All You Need".
How does Multi-head attention differ from standard attention?
Multi-head attention runs multiple parallel attention mechanisms (heads), each attending to different representation subspaces, allowing the model to jointly attend to information from multiple positions simultaneously and capture diverse relationships. Options [2] and [3] form the complete answer: 'It allows the model to focus on multiple parts of the input simultaneously.'
What is the main difference between pre-training and fine- tuning in Transformers?
Pre-training involves training a Transformer on a large, general corpus (e.g. web text) to learn broad language representations. Fine-tuning then adapts those representations to a specific downstream task using a smaller task-specific labeled dataset. Options [2] and [3] form the complete answer: 'Pre-training is on a large corpus and fine-tuning is task-specific.'
Why did GlobeTech's product descriptions sound off with earlier MT models?
Earlier RNN-based machine translation models without attention struggled with contextual meaning, especially for long sentences, because they compressed the entire source into a single fixed-length vector, losing information. This is the case study reason GlobeTech's product descriptions sounded off. Options [0] and [1] form the complete answer: 'Struggled with contextual meaning, especially with long sentences.'
What unique aspect is GlobeTech exploring to further enhance translations using Transformers?
GlobeTech is exploring using Transformer-based models to offer translations that account for regional dialects and cultural nuances, going beyond literal translation to provide contextually and culturally appropriate output for diverse markets. Options [4] and [5] form the complete answer: 'Offering translations considering regional dialects and nuances.'
What differentiates Google Bard's data access from ChatGPT?
The source marks the correct answer as: Bard extracts real-time information.
DALL-E's image generation can be optimal for which of the following applications?
The source marks the correct answer as: Designing book covers.
Which industry utilizes AI for personalized care programs enhancing patient recovery?
The source marks the correct answer as: Healthcare.
In the realm of manufacturing, how does generative AI impact the design process?
The source marks the correct answer as: By creating product designs.
During the harvesting phase, how does AI offer a boon to the agricultural sector?
The source marks the correct answer as: By distinguishing inferior plants.
Which conversational AI is not constructed on the Transformer neural network foundation?
The source marks the correct answer as: DALL-E.
Which AI platform, integrated into Microsoft's Bing, delivers instant query answers?
The source marks the correct answer as: Bing AI.
For which sector does generative AI replicate potential threat environments to bolster proactive defense?
The source marks the correct answer as: Cybersecurity.
Which AI model, developed by Google, is designed to engage in open-ended conversations, often generating creative responses to user prompts?
The source marks the correct answer as: LaMDA Case Study - Generative AI Applications in Key Industries.
Compared to its predecessor, DALL-E, what is an improved feature of DALL-E 2?
The source marks the correct answer as: Higher resolution.
What is a unique feature of ChatGPT that distinguishes it from Bard by Google?
The source marks the correct answer as: Retains conversation history.
For which feature might users of the basic plans of Synthesia encounter quality concerns?
The source marks the correct answer as: Audio quality.
Bard by Google has limitations in which of the following aspects?
The source marks the correct answer as: Limited to English language.
Cohere Generate primarily targets which type of content?
The source marks the correct answer as: Marketing and sales content.
Which of the following is NOT an attribute of GPT-4 by OpenAI?
The source marks the correct answer as: Audio outputs.
Which database does GitHub Copilot ground its data on?
The source marks the correct answer as: OpenAI Codex and GitHub.
What is a potential concern when using Code Whisperer by AWS with open-source projects?
The source marks the correct answer as: Potential open-source legal issues.
How does Claude by Anthropic enhance its safety features?
The source marks the correct answer as: Using "red team" prompts for safety.
Approximately what percentage of false positive rate does AlphaCode by DeepMind have?
The source marks the correct answer as: 0.04 Case Study - Generative AI Tools.
Which AI methodology specializes in data set differentiation?
Discriminative AI specializes in differentiating between datasets or classes — it learns the decision boundary between categories (e.g. classifying spam vs. legitimate email). This is distinct from Generative AI, which models the data distribution to create new samples.
If an AI system is designed to label images of cats and dogs, it is primarily a _______ model. Unsupervised
A model designed to label (classify) images of cats and dogs is a discriminative model — it learns to map input data to class labels by distinguishing between categories. Despite 'Unsupervised' appearing in the question stem (a distractor), image labeling is a supervised discriminative classification task.
The generator's objective in GANs is to... Reduce mode collapse None of the given options
The generator's objective in a GAN is to fool the discriminator — to generate samples so realistic that the discriminator classifies them as real. This adversarial objective drives the generator to continuously improve the quality of generated data.
Mode collapse is problematic because... It requires more data It makes the discriminator weak
Mode collapse in GANs is problematic because the generator produces only a limited subset of possible outputs — it collapses to generating similar/identical samples regardless of the input noise — severely limiting the diversity of generated outputs and failing to capture the full data distribution.
Based on our question bank analysis, master these concepts to score high in Generative AI.
"Focus on understanding the logic behind pseudocode loops and selection statements, as they form the bulk of technical assessments."