Were RNNs All We Needed? A GPU Programming Perspective | Dhruv Sheth

An implementation of parallelizable GRUs and LSTMs for CS179 in CUDA.