Parsing and Mapping a Docx file with Javaby@ereznik
537 reads
537 reads

Parsing and Mapping a Docx file with Java

by Evgenij Reznik7mFebruary 18th, 2021
Read on Terminal Reader
Read this story w/o Javascript
tldt arrow

Too Long; Didn't Read

The docx is a standard document format, first introduced in Microsoft Office 2007. It stores documents as a set of individual folders and files in a zip archive. The main content is located in the file document.xml in the folder word.xml. It contains the actual text and some styling information of the entire document. Java provides us with the class ZipFile. We create a new instance and pass our docx file as a.param to the constructor. To the method public ZipEntry getEntry getEnter​(String name) we pass the entry that we want to read. We return the input stream of that specific entry, so that we can read its contents.

Company Mentioned

Mention Thumbnail
featured image - Parsing and Mapping a Docx file with Java
Evgenij Reznik HackerNoon profile picture
Evgenij Reznik

Evgenij Reznik

@ereznik

Software developer from Germany

About @ereznik
LEARN MORE ABOUT @EREZNIK'S
EXPERTISE AND PLACE ON THE INTERNET.
L O A D I N G
. . . comments & more!

About Author

Evgenij Reznik HackerNoon profile picture
Evgenij Reznik@ereznik
Software developer from Germany

TOPICS

THIS ARTICLE WAS FEATURED IN...

Permanent on Arweave
Read on Terminal Reader
Read this story in a terminal
 Terminal
Read this story w/o Javascript
Read this story w/o Javascript
 Lite
Also published here