Reset is probably one of the least understood git commands with the addition of having a bad reputation for being dangerous. There is a valid reason for both of these claims: yes, the reset command is a bit harder to understand and in some cases, it can be dangerous. But, it is not all that hard. So in this post, I will give my best to present you with a clear and distilled tutorial to the reset command. To make it short and not too overwhelming I have abstracted the non-essential details and simplified some things, but if you want to know more on git’s internal workings you can also check my Understanding Git series for more details of some stuff presented here.
Before we dive into the reset
command we need to take a look at what is called the trees of git: Working Directory, Staging Area and the Repository.
You can think of them as three areas where changes can reside from git’s point of view:
The reset
command operates on these threes/areas, but first, let’s take a look how do add
and commit
commands (that we use daily) affect these areas.
Say we have a web app and we do some refactoring on our index.php
file. Changes that we make are reflected in the Working Directory:
We can confirm this by running git status
:
Changes not staged for commit:(use "git add <file>..." to update what will be committed)(use "git checkout -- <file>..." to discard changes in working directory)
modified: index.php
Now we move those changes to the Staging Area by using the add
command:
Running the status
command will tell us:
Changes to be committed:(use "git reset HEAD <file>..." to unstage)
modified: index.php
Because the status
command sees that we have the same version of index.php
file both in Working Directory and in the Staging Area, but not in Repository.
To add it there, we use the commit
command:
And now the Working Directory, Staging Area and Repository all contain the same version of index.php
, and running git status
will tell us that there is:
nothing to commit, working tree clean
So, the way the status
command works is that it compares file versions in Working Directory, Staging Area and Repository and if there are different than there are files to be staged/committed.
Let’s say we do some more refactoring on the index.php
file and do the whole add/commit cycle again.
Now our Working Directory, Staging Area and the Repository will all contain the new second version of our index.php
file.
But what about the first version? If you remember, we did say that the Repository keeps all previous commits, so the first version of index.php
is still there:
To keep track which version of the index.php
file is the current one, Repository has a special pointer called HEAD
that points to the current version (and the status
command only looks at the current version that HEAD
is pointing to when comparing it to the Staging Area’s version).
Now that we have that covered we can finally go to our reset
command and see how it works by manipulating the content of these git areas (trees).
This first mode of reset
command will only do one thing:
HEAD
pointerIn our case, we will move it to the previous commit (the first version of index.php
) by runing:
git reset --soft HEAD~1
The trees of git now look like this:
And when we run git status
we see a familiar message:
Changes to be committed:(use "git reset HEAD <file>..." to unstage)
modified: index.php
So, running git reset — soft HEAD~1
has basically undone our last commit, but the changes contained in that commit are not lost — they are in our Staging Area and Working Directory.
The second mode of reset
command will do two things:
HEAD
pointerHEAD
is pointing to)So, the first step is the same as with the--soft
mode. The second step takes whatever the HEAD
points to ( in this case, it is version one of the index.php
file) and puts it into the Staging Area.
So, after running git reset --mixed HEAD~1
our areas look like this:
And running git status
now again gives us a familiar message:
Changes not staged for commit:(use "git add <file>..." to update what will be committed)(use "git checkout -- <file>..." to discard changes in working directory)
modified: index.php
So, running git reset — mixed HEAD~1
has undone our last commit, but this time the changes from that commit are (only) in our Working Directory.
And now for the notorious hard mode. Running reset — hard
will do three things:
HEAD
pointerHEAD
is pointing to)So, the first two steps are the same as with--mixed.
The third makes the Working Directory look like the Staging Area (that was already filled with what HEAD
is pointing to).
So, after running git reset --hard HEAD~1
our areas look like this:
And running git status gives us:
nothing to commit, working tree clean
So, running git reset — hard HEAD~1
has undone our last commit and the changes contained in that commit are neither in our Working Directory or the Staging Area. But they are not completely lost. Git doesn’t delete commits from Repository (actually, it does sometimes, but very rarely), so this means our commit with the second version is still in the Repository, it is just a bit hard to find (you can track it by looking at something called reflog).
So, what is then with this reputation of reset being dangerous? Well, there is one case where some changes could be permanently lost. Consider a scenario where after the second commit you make some more changes to your index.php
file, but you don’t stage and commit them:
And now you run git reset --hard HEAD~1
:
Since the reset --hard
overwrites the content of your Working Directory to match the Staging Area (that is already made to match HEAD
) and you never staged and committed your changes so there is no commit with those changes in the repository, all those changes will now be lost in time… Like tears in the rain.
The danger of hard reset lies in the fact that it is not Working Directory safe — meaning it won’t give you any warning if you have file changes in your Working Directory that will be overwritten (and lost) if you run it. So be (extra) careful with a hard reset.
And there you have it — the reset command. I hope I did a good job explaining it and that you’ll agree it is not that hard after all. And, yes it can be dangerous but only if used with --hard
option.
As said in the beginning if you would like to know more on the inner workings of git, you can check my Understanding Git series, and if you want a more in-depth explanation on the reset command you can check the git reset demystified chapter from Scot Cachon’s git pro book.
HEAD~1
as an argument for the reset
command. As you probably already know every commit in git has a unique identifier called a checksum, and we can use it as an argument for the reset command too to reset to a specific commit
HEAD
pointer usually doesn’t point directly to a commit (as shown in examples for simplicity) but to a branch pointer that then points to a specific commit