Support passing layer instances to produce attentional hidden states (#14974)
* Support passing Layer instances to the AttentionWrapper. * Use _compute_output_shape to get the attention layer depth * compute_output_shape is now a public method * Move new argument at the end
Loading
Please sign in to comment