When working with local files, you might need to open, create, or update Zip files.
In this article, we will learn how to work with Zip files in C#. We will learn how to perform basic operations such as opening, extracting, and creating a Zip file.
The main class we will use is named ZipFile
and comes from the System.IO.Compression
namespace. It’s been present in C# since .NET Framework 4.5, so we can say it’s pretty stable 😉 Nevertheless, there are some tricky points that you need to know before using this class. Let’s learn!
Once you have a Zip file, you can access the internal items without extracting the whole Zip.
You can use the ZipFile.Open
method.
using ZipArchive archive = ZipFile.Open(zipFilePath, ZipArchiveMode.Read);
System.Collections.ObjectModel.ReadOnlyCollection<ZipArchiveEntry> entries = archive.Entries;
Notice that I specified the ZipArchiveMode
. This is an Enum whose values are Read
, Create
, and Update
.
Using the Entries
property of the ZipArchive
, you can access the whole list of files stored within the Zip folder, each represented by a ZipArchiveEntry
instance.
The ZipArchiveEntry
object contains several fields, like the file’s name and the full path from the root archive.
There are a few key points to remember about the entries listed in the ZipArchiveEntry
.
ReadOnlyCollection<ZipArchiveEntry>
: it means that even if you find a way to add or update the items in memory, the changes are not applied to the actual files;Name
is the actual file name (like presentation.pptx), while FullName
contains the path from the root of the archive (e.g. TestZip/InnerFolder/presentation.pptx);Name
property is empty, and their Length
is 0;
Lastly, remember that ZipFile.Open
returns an IDisposable
, so you should place the operations within a using
statement.
❓❓A question for you! Why do we see an item for the TestZip/InnerFolder folder, but there is no reference to the TestZip folder? Drop a comment below 📩
Extracting a Zip folder is easy but not obvious.
We have only one way to do that: by calling the ZipFile.ExtractToDirectory
method.
It accepts as mandatory parameters the path of the Zip file to be extracted and the path to the destination:
var zipPath = @"C:\Users\d.bellone\Desktop\TestZip.zip";
var destinationPath = @"C:\Users\d.bellone\Desktop\MyDestination";
ZipFile.ExtractToDirectory(zipPath, destinationPath);
Once you run it, you will see the content of the Zip copied and extracted to the MyDestination folder.
Note that this method creates the destination folder if it does not exist.
This method accepts two more parameters:
entryNameEncoding
, by which you can specify the encoding. The default value is UTF-8.overwriteFiles
allows you to specify whether it must overwrite existing files. The default value is false
. If set to false and the destination files already exist, this method throws a System.IO.IOException
saying that the file already exists.The key method here is ZipFile.CreateFromDirectory
, which allows you to create Zip files in a flexible way.
The first mandatory value is, of course, the source directory path.
The second mandatory parameter is the destination of the resulting Zip file.
It can be the local path to the file:
string sourceFolderPath = @"\Desktop\myFolder";
string destinationZipPath = @"\Desktop\destinationFile.zip";
ZipFile.CreateFromDirectory(sourceFolderPath, destinationZipPath);
Or it can be a Stream
that you can use later for other operations:
using (MemoryStream memStream = new MemoryStream())
{
string sourceFolderPath = @"\Desktop\myFolder";
ZipFile.CreateFromDirectory(sourceFolderPath, memStream);
var lenght = memStream.Length;// here the Stream is populated
}
You can finally add some optional parameters:
compressionLevel
, whose values are Optimal
, Fastest
, NoCompression
, SmallestSize
.includeBaseDirectory
: a flag that defines if you have to copy only the first-level files or also the root folder.As we just saw, we have four compression levels: Optimal
, Fastest
, NoCompression
, and SmallestSize
.
What happens if I use the different values to zip all the photos and videos of my latest trip?
The source folder’s size is 16.2 GB.
Let me zip it with the four compression levels:
private long CreateAndTrack(string sourcePath, string destinationPath, CompressionLevel compression)
{
Stopwatch stopwatch = Stopwatch.StartNew();
ZipFile.CreateFromDirectory(
sourceDirectoryName: sourcePath,
destinationArchiveFileName: destinationPath,
compressionLevel: compression,
includeBaseDirectory: true
);
stopwatch.Stop();
return stopwatch.ElapsedMilliseconds;
}
// in Main...
var smallestTime = CreateAndTrack(sourceFolderPath,
Path.Combine(rootFolder, "Smallest.zip"),
CompressionLevel.SmallestSize);
var noCompressionTime = CreateAndTrack(sourceFolderPath,
Path.Combine(rootFolder, "NoCompression.zip"),
CompressionLevel.NoCompression);
var fastestTime = CreateAndTrack(sourceFolderPath,
Path.Combine(rootFolder, "Fastest.zip"),
CompressionLevel.Fastest);
var optimalTime = CreateAndTrack(sourceFolderPath,
Path.Combine(rootFolder, "Optimal.zip"),
CompressionLevel.Optimal);
By executing this operation, we have this table:
We can see a bunch of weird things:
This is to say: don’t trust the names; remember to benchmark the parts where you need performance, even with a test as simple as this.
This was a quick article about one specific class in the .NET ecosystem.
As we saw, even though the class is simple and it’s all about three methods, there are some things you should keep in mind before using this class in your code.
I hope you enjoyed this article! Let’s keep in touch on Twitter or LinkedIn! 🤜🤛
Happy coding!
🐧
Also published here.