TR96-13

Introduction to Finite-State Devices in Natural Language Processing


    •  Emmanuel Roche, Yves Schabes, "Introduction to Finite-State Devices in Natural Language Processing", Tech. Rep. TR96-13, Mitsubishi Electric Research Laboratories, Cambridge, MA, June 1996.
      BibTeX TR96-13 PDF
      • @techreport{MERL_TR96-13,
      • author = {Emmanuel Roche, Yves Schabes},
      • title = {Introduction to Finite-State Devices in Natural Language Processing},
      • institution = {MERL - Mitsubishi Electric Research Laboratories},
      • address = {Cambridge, MA 02139},
      • number = {TR96-13},
      • month = jun,
      • year = 1996,
      • url = {https://www.merl.com/publications/TR96-13/}
      • }
Abstract:

The theory of finite-state automata (FSA) is rich and finite-state automata techniques have been used in a wide range of domains, such as switching theory, pattern matching, pattern recognition, speech processing, hand writing recognition, optical character recognition, encryption algorithm, data compression, indexing and operating system analysis (Petri-net). In this chapter, we describe the basic notions of finite-state automata and finite-state transducers. We also describe the fundamental properties of these machines while illustrating their use. We give simple formal language examples as well as natural language examples. We also illustrate some of the main algorithms used with finite-state automata and transducers.