RNNs with only linear sequential dependencies
can be parallelized over the sequence length using the parallel scan algorithm
, parallel linear recurrence CUDA kernel
and show that new framework of linear surrogate RNNs
linear surrogate model for the long short-term memory unit, the GILR-LSTM
, that utilizes parallel linear recurrence.