High Performance Text Parsing Using Finite State Machines (FSM)

Written by spidim | Published 2021/02/13
Tech Story Tags: finite-automatons | regular-expressions | java | parser | parsing | programming | text-parsing | hackernoon-top-story

TLDR In this article, I compare two parsing methods with a focus on their performance. The first method uses regular expressions for parsing and data extraction. The second method uses a Non-deterministic Finite Automata (NDFA) to parse text. The NDFA is a class of FSMs that can be used for any task that has distinct states of processing or operation. In practice, the algorithm has to follow every possible transition from a state to another and when a possible path fails it goes back and continues with another path.via the TL;DR App

no story

Written by spidim | Senior Software Architect / Engineering Lead at Behavioral Signals
Published by HackerNoon on 2021/02/13