15,161 reads

PyTorch Basics in 4 Minutes

by Sanyam BhutaniJanuary 14th, 2018

Too Long; Didn't Read

This is Part 1 of the PyTorch <a href="https://medium.com/init27-labs/pytorch-primer-series-0-e2e5df9b31c6?source=collection_home---4------1----------------" target="_blank">Primer Series</a>.

Company Mentioned

featured image - PyTorch Basics in 4 Minutes

You can find all the accompanying code in this Github repo

This is Part 1 of the PyTorch Primer Series.

Topics Discussed:

Tensors
Basic Operations. (Inline, Tensor Indexing, Slicing)
Numpy-PyTorch Bridge
PyTorch-Numpy Bridge
Variable
Gradients

What is PyTorch?

It’s a Python based package for serving as a replacement of Numpy and to provide flexibility as a Deep Learning Development Platform.

Why PyTorch?

I encourage you to read Fast AI’s blog post for the reason of the course’s switch to PyTorch.

Or simply put:

Dynamic Graphs
More intuitive than TF (Personal View)

Tensors

Tensors are similar to numpy’s ndarrays, with the addition being that Tensors can also be used on a GPU to accelerate computing.

Tensors are multi dimensional Matrices.

torch.Tensor(x, y)

This will create a X by Y dimensional Tensor that has been instantiated with random values.

To Create a 5x3 Tensor with values randomly selected from a Uniform Distribution between -1 and 1,

torch.Tensor(5, 3).uniform_(-1, 1)

Tensors have a size attribute that can be called to check their size

print(x.size())

Operations

PyTorch supports various Tensor Functions with different syntax:

Consider Addition:

Normal Addition

y = torch.rand(5, 3)print(x + y)

Getting Result in a Tensor

result = torch.Tensor(5, 3)torch.add(x, y, out=result)

In Line

y.add_(x)

Inline functions are denoted by an underscore following their name. Note: These have faster execution time (With a higher memory complexity tradeoff)

All Numpy Indexing, Broadcasting and Reshaping functions are supported

Note: PyTorch doesn’t support a negative hop so [::-1] will result in an error

print(x[:, 1])

y = torch.randn(5, 10, 15)print(y.size())print(y.view(-1, 15).size())

Types of Tensors

PyTorch supports various types of Tensors:

Note: Be careful when working with different Tensor Types to avoid type errors

Types supported:

32-bit (Float + Int)
64-bit (Float + Int)
16-bit (Float + Int)
8-bit (Signed + Unsigned)

Numpy Bridge

Converting a torch Tensor to a numpy array and vice versa is a breeze.

Note: The torch Tensor and numpy array will share their underlying memory locations, and changing one will change the other.

a = torch.ones(5)b = a.numpy()

CUDA Tensors

Moving the Tensors to GPU can be done as:

if torch.cuda.is_available():x = x.cuda()y = y.cuda()x + y

Autograd: automatic differentiation

Central to all neural networks in PyTorch is the autograd package. Let’s first briefly visit this, and we will then go to training our first neural network.

The autograd package provides automatic differentiation for all operations on Tensors. It is a define-by-run framework, which means that your backprop is defined by how your code is run, and that every single iteration can be different.

Let us see this in more simple terms with some examples.

Variable

autograd.Variable is the central class of the package. It wraps a Tensor, and supports nearly all of operations defined on it. Once you finish your computation you can call .backward() and have all the gradients computed automatically.

You can access the raw tensor through the .data attribute, while the gradient w.r.t. this variable is accumulated into .grad.

Source: PyTorch Docs

x_data = [1.0, 2.0, 3.0]y_data = [2.0, 4.0, 6.0]

w = Variable(torch.Tensor([1.0]), requires_grad=True)

Calling the Backward function

l = loss(x_val, y_val)l.backward()

Common Pitfalls

As explained by this Blog Post by Radek, My friend and Mentor from the Fast AI community

Gradients accumulate everytime you call them, by default, be sure to call zero.gradient() to avoid that
Data Types, As mentioned in the Tensor Section, PyTorch supports various Tensor types. Be sure to check for the types to avoid Type compatibility errors.

Feel free to ask any questions below. Also drop us a comment on the tutorials that you’d love to read, I will try to have that up ASAP.

If you want to read about Week 2 in my Self Driving Journey, here is the blog post

The Next Part in the Series will discuss about Linear Regression.