This is Part 1 of the PyTorch Primer Series.
Topics Discussed:
It’s a Python based package for serving as a replacement of Numpy and to provide flexibility as a Deep Learning Development Platform.
I encourage you to read Fast AI’s blog post for the reason of the course’s switch to PyTorch.
Or simply put:
Tensors are similar to numpy’s ndarrays, with the addition being that Tensors can also be used on a GPU to accelerate computing.
Tensors are multi dimensional Matrices.
torch.Tensor(x, y)
This will create a X by Y dimensional Tensor that has been instantiated with random values.
To Create a 5x3 Tensor with values randomly selected from a Uniform Distribution between -1 and 1,
torch.Tensor(5, 3).uniform_(-1, 1)
Tensors have a size attribute that can be called to check their size
print(x.size())
PyTorch supports various Tensor Functions with different syntax:
Consider Addition:
y = torch.rand(5, 3)print(x + y)
result = torch.Tensor(5, 3)torch.add(x, y, out=result)
y.add_(x)
Inline functions are denoted by an underscore following their name. Note: These have faster execution time (With a higher memory complexity tradeoff)
All Numpy Indexing, Broadcasting and Reshaping functions are supported
Note: PyTorch doesn’t support a negative hop so [::-1] will result in an error
print(x[:, 1])
y = torch.randn(5, 10, 15)print(y.size())print(y.view(-1, 15).size())
PyTorch supports various types of Tensors:
Note: Be careful when working with different Tensor Types to avoid type errors
Types supported:
Converting a torch Tensor to a numpy array and vice versa is a breeze.
Note: The torch Tensor and numpy array will share their underlying memory locations, and changing one will change the other.
a = torch.ones(5)b = a.numpy()
Moving the Tensors to GPU can be done as:
if torch.cuda.is_available():x = x.cuda()y = y.cuda()x + y
Central to all neural networks in PyTorch is the autograd
package. Let’s first briefly visit this, and we will then go to training our first neural network.
The autograd
package provides automatic differentiation for all operations on Tensors. It is a define-by-run framework, which means that your backprop is defined by how your code is run, and that every single iteration can be different.
Let us see this in more simple terms with some examples.
autograd.Variable
is the central class of the package. It wraps a Tensor, and supports nearly all of operations defined on it. Once you finish your computation you can call .backward()
and have all the gradients computed automatically.
You can access the raw tensor through the .data
attribute, while the gradient w.r.t. this variable is accumulated into .grad
.
Source: PyTorch Docs
x_data = [1.0, 2.0, 3.0]y_data = [2.0, 4.0, 6.0]
w = Variable(torch.Tensor([1.0]), requires_grad=True)
Calling the Backward function
l = loss(x_val, y_val)l.backward()
As explained by this Blog Post by Radek, My friend and Mentor from the Fast AI community
Feel free to ask any questions below. Also drop us a comment on the tutorials that you’d love to read, I will try to have that up ASAP.
If you want to read about Week 2 in my Self Driving Journey, here is the blog post
The Next Part in the Series will discuss about Linear Regression.
You can find me on Twitter @bhutanisanyam1, connect with me on Linkedin here
Subscribe to my Newsletter for a weekly curated list of Deep Learning and Computer Vision Reads