My Deep Learning Reading List

A list of papers that I believe are either important or useful for understanding deep learning.

Start Here

The most landmark and important papers in deep learning. Read these papers in order.

Paper	Description
Backpropagation	The original paper that described the backpropagation algorithm, the central algorithm behind how modern deep learning models work.
AlexNet	Often considered the paper that kickstarted the modern era of deep learning, this paper proposed the idea of just stacking a bunch of layers as a performance improvement method
The Adam Optimizer	A key landmark in optimization algorithms for deep learning models, this paper proposes a new framework for weight updates during backpropagation.
Long Short-Term Memory	A huge breakthrough in sequential understanding for deep learning models, allowing them to store and tune the information they saw previously and use it for future predictions.
Attention is All You Need	Arugably one of the two most important papers in modern deep learning, along with AlexNet, this paper proposed the Transformer, the building block to large language models, and a huge milestone in language understanding for deep learning models.
Deep Reinforcement Learning	A key breakthrough in reinforcement learning, this paper combined modern efforts in deep learning with goal-based learning approaches, instead of loss-based approaches.
Denoising Diffusion Models	This paper proposed an architecture and algorithm for image generation that produced highly life-like images, a key landmark in artificial image understanding.
Language Models are Few-Shot Learners	This is the paper that was released alongside the original ChatGPT, explaining how very large language models could demonstrate viable performance in tasks they had limited knowledge in.

Computer Vision

Important works in computer vision.

Paper
Residual Networks

Traditional NLP

Papers about pre-transformer NLP breakthroughs.

Paper
word2vec
Nucleus Sampling

Transformers

Papers about transformers and their applications.

Paper
BERT
Vision Transformers

Large Language Model Hype Train

Papers about large language models and related works on them.

Paper
Chain of Thought Reasoning
Instruction Tuning
Speculative Decoding

Historically Important

Historically famous papers that are still used today, but not essential reading.

Paper
ReLU
UNet
XGBoost
Batch Normalization

Generative Models

Papers about deep generative models.

Paper
Variational Autoencoders
GANs

Reinforcement Learning

Papers about reinforcement learning

Paper
Proximal Policy Optimization

Deep Learning Science

Papers about research about deep learning.

Paper
Contrastive Representation Learning (CLIP)
The Lottery Ticket Hypothesis

Applied Deep Learning

Papers using deep learning to solve huge problems.

Paper
AlphaFold

DL Theory

Papers exploring the math of deep learning advancements.

Paper
Dropout
Low Rank Adaptation
GANs as a Nash Equilibrium

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

My Deep Learning Reading List

Start Here

Computer Vision

Traditional NLP

Transformers

Large Language Model Hype Train

Historically Important

Generative Models

Reinforcement Learning

Deep Learning Science

Applied Deep Learning

DL Theory

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

My Deep Learning Reading List

Start Here

Computer Vision

Traditional NLP

Transformers

Large Language Model Hype Train

Historically Important

Generative Models

Reinforcement Learning

Deep Learning Science

Applied Deep Learning

DL Theory

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Packages