Island Grammar-Based Parsing Using GLL and Tom. / Afroozeh, Ali; Bach, Jean-Christophe; van den Brand, Mark; Johnstone, Adrian; Manders, Maarten; Moreau, Pierre-Etienne; Scott, Elizabeth.

Software Language Engineering Lecture Notes in Computer Science : 5th International Conference, SLE 2012, Dresden, Germany, September 26-28, 2012, Revised Selected Papers. 2013. p. 224-243.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Published

Harvard

Afroozeh, A, Bach, J-C, van den Brand, M, Johnstone, A, Manders, M, Moreau, P-E & Scott, E 2013, Island Grammar-Based Parsing Using GLL and Tom. in Software Language Engineering Lecture Notes in Computer Science : 5th International Conference, SLE 2012, Dresden, Germany, September 26-28, 2012, Revised Selected Papers. pp. 224-243. https://doi.org/10.1007/978-3-642-36089-3_13

APA

Afroozeh, A., Bach, J-C., van den Brand, M., Johnstone, A., Manders, M., Moreau, P-E., & Scott, E. (2013). Island Grammar-Based Parsing Using GLL and Tom. In Software Language Engineering Lecture Notes in Computer Science : 5th International Conference, SLE 2012, Dresden, Germany, September 26-28, 2012, Revised Selected Papers (pp. 224-243). https://doi.org/10.1007/978-3-642-36089-3_13

Vancouver

Afroozeh A, Bach J-C, van den Brand M, Johnstone A, Manders M, Moreau P-E et al. Island Grammar-Based Parsing Using GLL and Tom. In Software Language Engineering Lecture Notes in Computer Science : 5th International Conference, SLE 2012, Dresden, Germany, September 26-28, 2012, Revised Selected Papers. 2013. p. 224-243 https://doi.org/10.1007/978-3-642-36089-3_13

Author

Afroozeh, Ali ; Bach, Jean-Christophe ; van den Brand, Mark ; Johnstone, Adrian ; Manders, Maarten ; Moreau, Pierre-Etienne ; Scott, Elizabeth. / Island Grammar-Based Parsing Using GLL and Tom. Software Language Engineering Lecture Notes in Computer Science : 5th International Conference, SLE 2012, Dresden, Germany, September 26-28, 2012, Revised Selected Papers. 2013. pp. 224-243

BibTeX

@inproceedings{1bccc7567ebc4b14abd28910ef5dcbeb,
title = "Island Grammar-Based Parsing Using GLL and Tom",
abstract = "Extending a language by embedding within it another language presents significant parsing challenges, especially if the embedding is recursive. The composite grammar is likely to be nondeterministic as a result of tokens that are valid in both the host and the embedded language. In this paper we examine the challenges of embedding the Tom language into a variety of general-purpose high level languages. Tom provides syntax and semantics for advanced pattern matching and tree rewriting facilities. Embedded Tom constructs are translated into the host language by a preprocessor, the output of which is a composite program written purely in the host language. Tom implementations exist for Java, C, C#, Python and Caml. The current parser is complex and difficult to maintain. In this paper, we describe how Tom can be parsed using island grammars implemented with the Generalised LL (GLL) parsing algorithm. The grammar is, as might be expected, ambiguous. Extracting the correct derivation relies on our disambiguation strategy which is based on pattern matching within the parse forest. We describe different classes of ambiguity and propose patterns for resolving them.",
author = "Ali Afroozeh and Jean-Christophe Bach and {van den Brand}, Mark and Adrian Johnstone and Maarten Manders and Pierre-Etienne Moreau and Elizabeth Scott",
year = "2013",
doi = "10.1007/978-3-642-36089-3_13",
language = "English",
isbn = "978-3-642-36088-6",
pages = "224--243",
booktitle = "Software Language Engineering Lecture Notes in Computer Science",

}

RIS

TY - GEN

T1 - Island Grammar-Based Parsing Using GLL and Tom

AU - Afroozeh, Ali

AU - Bach, Jean-Christophe

AU - van den Brand, Mark

AU - Johnstone, Adrian

AU - Manders, Maarten

AU - Moreau, Pierre-Etienne

AU - Scott, Elizabeth

PY - 2013

Y1 - 2013

AB - Extending a language by embedding within it another language presents significant parsing challenges, especially if the embedding is recursive. The composite grammar is likely to be nondeterministic as a result of tokens that are valid in both the host and the embedded language. In this paper we examine the challenges of embedding the Tom language into a variety of general-purpose high level languages. Tom provides syntax and semantics for advanced pattern matching and tree rewriting facilities. Embedded Tom constructs are translated into the host language by a preprocessor, the output of which is a composite program written purely in the host language. Tom implementations exist for Java, C, C#, Python and Caml. The current parser is complex and difficult to maintain. In this paper, we describe how Tom can be parsed using island grammars implemented with the Generalised LL (GLL) parsing algorithm. The grammar is, as might be expected, ambiguous. Extracting the correct derivation relies on our disambiguation strategy which is based on pattern matching within the parse forest. We describe different classes of ambiguity and propose patterns for resolving them.

U2 - 10.1007/978-3-642-36089-3_13

DO - 10.1007/978-3-642-36089-3_13

M3 - Conference contribution

SN - 978-3-642-36088-6

SP - 224

EP - 243

BT - Software Language Engineering Lecture Notes in Computer Science

ER -