Skip to main content Link Search Menu Expand Document (external link)

Deep Learning

Difference between LSTMs and GRUs

Problem

Which of the following is not true with respect to LSTMs and GRUs?

Options

Answer

Solution

The update equation of the GRU cells also uses both tanh and sigmoid functions.


Discussing dropout

Problem

Which of the following are true regarding dropout?

Options

Answer

Solution

Dropout needs to be turned off during testing as we want the entire representational capacity of the model during testing. Dropout is similar to a bagging model but there are a few key differences. First in bagging the models are trained independently of each other whereas the dropout models share parameters with each other. Second, in bagging each individual model is trained to convergence while that is not the case in dropout (where each model is trained for 1 step). Further D is incorrect since dropout helps reduce overfitting.