paint-brush
High Performance Text Parsing Using Finite State Machines (FSM)by@spidim
7,013 reads
7,013 reads

High Performance Text Parsing Using Finite State Machines (FSM)

by Spiros Dimopoulos5mFebruary 13th, 2021
Read on Terminal Reader
Read this story w/o Javascript
tldt arrow

Too Long; Didn't Read

In this article, I compare two parsing methods with a focus on their performance. The first method uses regular expressions for parsing and data extraction. The second method uses a Non-deterministic Finite Automata (NDFA) to parse text. The NDFA is a class of FSMs that can be used for any task that has distinct states of processing or operation. In practice, the algorithm has to follow every possible transition from a state to another and when a possible path fails it goes back and continues with another path.

People Mentioned

Mention Thumbnail
Mention Thumbnail

Coin Mentioned

Mention Thumbnail
featured image - High Performance Text Parsing Using Finite State Machines (FSM)
Spiros Dimopoulos HackerNoon profile picture
Spiros Dimopoulos

Spiros Dimopoulos

@spidim

Senior Software Architect / Engineering Lead at Behavioral Signals

About @spidim
LEARN MORE ABOUT @SPIDIM'S
EXPERTISE AND PLACE ON THE INTERNET.
L O A D I N G
. . . comments & more!

About Author

Spiros Dimopoulos HackerNoon profile picture
Spiros Dimopoulos@spidim
Senior Software Architect / Engineering Lead at Behavioral Signals

TOPICS

THIS ARTICLE WAS FEATURED IN...

Permanent on Arweave
Read on Terminal Reader
Read this story in a terminal
 Terminal
Read this story w/o Javascript
Read this story w/o Javascript
 Lite