Does My Rebuttal Matter? Insights from a Major NLP Conference

Yang Gao; Steffen Eger; Ilia Kuznetsov; Iryna Gurevych; Yusuke Miyao

Does My Rebuttal Matter? Insights from a Major NLP Conference

Yang Gao, Steffen Eger, Ilia Kuznetsov, Iryna Gurevych, Yusuke Miyao

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

Peer review is a core element of the scientific process, particularly in conference-centered fields such as ML and NLP. However, only few studies have evaluated its properties empirically. Aiming to fill this gap, we present a corpus that contains over 4k reviews and 1.2k author responses from ACL-2018. We quantitatively and qualitatively assess the corpus. This includes a pilot study on paper weaknesses given by reviewers and on quality of author responses. We then focus on the role of the rebuttal phase, and propose a novel task to predict after-rebuttal (i.e., final) scores from initial reviews and author responses. Although author responses do have a marginal (and statistically significant) influence on the final scores, especially for borderline papers, our results suggest that a reviewer’s final score is largely determined by her initial score and the distance to the other reviewers’ initial scores. In this context, we discuss the conformity bias inherent to peer reviewing, a bias that has largely been overlooked in previous research. We hope our analyses will help better assess the usefulness of the rebuttal phase in NLP conferences.

Original language	English
Title of host publication	Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Place of Publication	Minneapolis, Minnesota
Publisher	Association for Computational Linguistics
Pages	1274–1290
Number of pages	17
Volume	1
Publication status	Published - Jun 2019

Access to Document

https://aclweb.org/anthology/papers/N/N19/N19-1129/

Cite this

Gao, Y., Eger, S., Kuznetsov, I., Gurevych, I., & Miyao, Y. (2019). Does My Rebuttal Matter? Insights from a Major NLP Conference. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Vol. 1, pp. 1274–1290). Association for Computational Linguistics. https://aclweb.org/anthology/papers/N/N19/N19-1129/

@inproceedings{c2f3bc8600494d828b9ca10066d4ab9c,

title = "Does My Rebuttal Matter? Insights from a Major NLP Conference",

abstract = "Peer review is a core element of the scientific process, particularly in conference-centered fields such as ML and NLP. However, only few studies have evaluated its properties empirically. Aiming to fill this gap, we present a corpus that contains over 4k reviews and 1.2k author responses from ACL-2018. We quantitatively and qualitatively assess the corpus. This includes a pilot study on paper weaknesses given by reviewers and on quality of author responses. We then focus on the role of the rebuttal phase, and propose a novel task to predict after-rebuttal (i.e., final) scores from initial reviews and author responses. Although author responses do have a marginal (and statistically significant) influence on the final scores, especially for borderline papers, our results suggest that a reviewer{\textquoteright}s final score is largely determined by her initial score and the distance to the other reviewers{\textquoteright} initial scores. In this context, we discuss the conformity bias inherent to peer reviewing, a bias that has largely been overlooked in previous research. We hope our analyses will help better assess the usefulness of the rebuttal phase in NLP conferences.",

author = "Yang Gao and Steffen Eger and Ilia Kuznetsov and Iryna Gurevych and Yusuke Miyao",

year = "2019",

month = jun,

language = "English",

volume = "1",

pages = "1274–1290",

booktitle = "Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies",

publisher = "Association for Computational Linguistics",

}

Gao, Y, Eger, S, Kuznetsov, I, Gurevych, I & Miyao, Y 2019, Does My Rebuttal Matter? Insights from a Major NLP Conference. in Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. vol. 1, Association for Computational Linguistics, Minneapolis, Minnesota, pp. 1274–1290. <https://aclweb.org/anthology/papers/N/N19/N19-1129/>

Does My Rebuttal Matter? Insights from a Major NLP Conference. / Gao, Yang; Eger, Steffen; Kuznetsov, Ilia et al.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Vol. 1 Minneapolis, Minnesota: Association for Computational Linguistics, 2019. p. 1274–1290.

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

TY - GEN

T1 - Does My Rebuttal Matter? Insights from a Major NLP Conference

AU - Gao, Yang

AU - Eger, Steffen

AU - Kuznetsov, Ilia

AU - Gurevych, Iryna

AU - Miyao, Yusuke

PY - 2019/6

Y1 - 2019/6

N2 - Peer review is a core element of the scientific process, particularly in conference-centered fields such as ML and NLP. However, only few studies have evaluated its properties empirically. Aiming to fill this gap, we present a corpus that contains over 4k reviews and 1.2k author responses from ACL-2018. We quantitatively and qualitatively assess the corpus. This includes a pilot study on paper weaknesses given by reviewers and on quality of author responses. We then focus on the role of the rebuttal phase, and propose a novel task to predict after-rebuttal (i.e., final) scores from initial reviews and author responses. Although author responses do have a marginal (and statistically significant) influence on the final scores, especially for borderline papers, our results suggest that a reviewer’s final score is largely determined by her initial score and the distance to the other reviewers’ initial scores. In this context, we discuss the conformity bias inherent to peer reviewing, a bias that has largely been overlooked in previous research. We hope our analyses will help better assess the usefulness of the rebuttal phase in NLP conferences.

AB - Peer review is a core element of the scientific process, particularly in conference-centered fields such as ML and NLP. However, only few studies have evaluated its properties empirically. Aiming to fill this gap, we present a corpus that contains over 4k reviews and 1.2k author responses from ACL-2018. We quantitatively and qualitatively assess the corpus. This includes a pilot study on paper weaknesses given by reviewers and on quality of author responses. We then focus on the role of the rebuttal phase, and propose a novel task to predict after-rebuttal (i.e., final) scores from initial reviews and author responses. Although author responses do have a marginal (and statistically significant) influence on the final scores, especially for borderline papers, our results suggest that a reviewer’s final score is largely determined by her initial score and the distance to the other reviewers’ initial scores. In this context, we discuss the conformity bias inherent to peer reviewing, a bias that has largely been overlooked in previous research. We hope our analyses will help better assess the usefulness of the rebuttal phase in NLP conferences.

M3 - Conference contribution

VL - 1

SP - 1274

EP - 1290

BT - Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies

PB - Association for Computational Linguistics

CY - Minneapolis, Minnesota

ER -