SPPF-Style Parsing From Earley Recognisers

Elizabeth Scott

doi:10.1016/j.entcs.2008.03.044

SPPF-Style Parsing From Earley Recognisers

Elizabeth Scott

Department of Computer Science

Research output: Contribution to journal › Article › peer-review

Abstract

In its recogniser form, Earley's algorithm for testing whether a string can be derived from a grammar is worst case cubic on general context free grammars (CFG). Earley gave an outline of a method for turning his recognisers into parsers, but it turns out that this method is incorrect. Tomita's GLR parser returns a shared packed parse forest (SPPF) representation of all derivations of a given string from a given CFG but is worst case unbounded polynomial order. We have given a modified worst-case cubic version, the BRNGLR algorithm, that, for any string and any CFG, returns a binarised SPPF representation of all possible derivations of a given string. In this paper we apply similar techniques to develop two versions of an Earley parsing algorithm that, in worst-case cubic time, return an SPPF representation of all derivations of a given string from a given CFG.

Original language	English
Pages (from-to)	53-67
Journal	Electronic Notes in Theoretical Computer Science
Volume	203
Issue number	2
DOIs	https://doi.org/10.1016/j.entcs.2008.03.044
Publication status	Published - Apr 2008

Keywords

parsing
Grammar types

Access to Document

10.1016/j.entcs.2008.03.044

Cite this

@article{5fb7bd5424db4a11bb88023ba4f6a41f,

title = "SPPF-Style Parsing From Earley Recognisers",

abstract = "In its recogniser form, Earley's algorithm for testing whether a string can be derived from a grammar is worst case cubic on general context free grammars (CFG). Earley gave an outline of a method for turning his recognisers into parsers, but it turns out that this method is incorrect. Tomita's GLR parser returns a shared packed parse forest (SPPF) representation of all derivations of a given string from a given CFG but is worst case unbounded polynomial order. We have given a modified worst-case cubic version, the BRNGLR algorithm, that, for any string and any CFG, returns a binarised SPPF representation of all possible derivations of a given string. In this paper we apply similar techniques to develop two versions of an Earley parsing algorithm that, in worst-case cubic time, return an SPPF representation of all derivations of a given string from a given CFG.",

keywords = "parsing, Grammar types",

author = "Elizabeth Scott",

year = "2008",

month = apr,

doi = "10.1016/j.entcs.2008.03.044",

language = "English",

volume = "203",

pages = "53--67",

journal = "Electronic Notes in Theoretical Computer Science",

issn = "1571-0661",

publisher = "Elsevier",

number = "2",

}

TY - JOUR

T1 - SPPF-Style Parsing From Earley Recognisers

AU - Scott, Elizabeth

PY - 2008/4

Y1 - 2008/4

N2 - In its recogniser form, Earley's algorithm for testing whether a string can be derived from a grammar is worst case cubic on general context free grammars (CFG). Earley gave an outline of a method for turning his recognisers into parsers, but it turns out that this method is incorrect. Tomita's GLR parser returns a shared packed parse forest (SPPF) representation of all derivations of a given string from a given CFG but is worst case unbounded polynomial order. We have given a modified worst-case cubic version, the BRNGLR algorithm, that, for any string and any CFG, returns a binarised SPPF representation of all possible derivations of a given string. In this paper we apply similar techniques to develop two versions of an Earley parsing algorithm that, in worst-case cubic time, return an SPPF representation of all derivations of a given string from a given CFG.

AB - In its recogniser form, Earley's algorithm for testing whether a string can be derived from a grammar is worst case cubic on general context free grammars (CFG). Earley gave an outline of a method for turning his recognisers into parsers, but it turns out that this method is incorrect. Tomita's GLR parser returns a shared packed parse forest (SPPF) representation of all derivations of a given string from a given CFG but is worst case unbounded polynomial order. We have given a modified worst-case cubic version, the BRNGLR algorithm, that, for any string and any CFG, returns a binarised SPPF representation of all possible derivations of a given string. In this paper we apply similar techniques to develop two versions of an Earley parsing algorithm that, in worst-case cubic time, return an SPPF representation of all derivations of a given string from a given CFG.

KW - parsing

KW - Grammar types

U2 - 10.1016/j.entcs.2008.03.044

DO - 10.1016/j.entcs.2008.03.044

M3 - Article

SN - 1571-0661

VL - 203

SP - 53

EP - 67

JO - Electronic Notes in Theoretical Computer Science

JF - Electronic Notes in Theoretical Computer Science

IS - 2

ER -