Introduction to Deep Learning
For an overview on the topic, start with the blog post “Attention and Memory in Deep Learning and NLP” by Denny Britz.
Continue by reading the following articles that provide details on two very recently proposed approaches,
Ashish et al. “Attention Is All You Need” in NIPS (2017) (The version on arxiv seems to be most recent.)
Follow the links from the blog post to the referenced articles for further reading.