2007
‘Large-scale Deep Unsupervised Learning using Graphics Processors’ (2009), by Rajat Raina, Anand Madhavan and Andrew Ng, is probably the first really important paper to introduce GPUs to large neural networks.
Shortly after (2010), Dan Ciresan (from Schmidhuber’s lab, with Schmidhuber as the last author on the paper) broke all records on MNIST using a large convolutional neural network trained on GPUs.
Alex Krizhevsky, Ilya Sutskever and Geoffrey Hinton (2012) used a similar (but larger, and much better) network to classify ImageNet, and they trained it on GPUs.
Deep Learning introduced
Who introduced GPU to deep learning?
Answer (1 of 7): TL;DR: it was Raina 2009 followed by Ciresan in 2010 to first use GPUs for deep learning. There was some earlier work from Microsoft Research which led the way towards using deep learning for training neural networks before that. There’s a great description of this history in s...
https://www.quora.com/Who-introduced-GPU-to-deep-learning

Accelerating AI with GPUs: A New Computing Model | NVIDIA Blog
This paper accompanies a presentation given this week by NVIDIA CEO Jen-Hsun Huang at The Future of AI Symposium at New York University.
https://blogs.nvidia.com/blog/2016/01/12/accelerating-ai-artificial-intelligence-gpus/


Seonglae Cho