Anaphoric annotation and corpus-based anaphora resolution

Standard

Anaphoric annotation and corpus-based anaphora resolution : An experiment. / Protopopova, E. V.; Bodrova, A. A.; Volskaya, S. A.; Krylova, I. V.; Chuchunkov, A. S.; Alexeeva, S. V.; Bocharov, V. V.; Granovsky, D. V.

По материалам ежегодной Международной конференции "Диалог" 2014. Российский государственный гуманитарный университет, 2014. p. 562-571 (Komp'juternaja Lingvistika i Intellektual'nye Tehnologii; Vol. 13).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

Harvard

Protopopova, EV, Bodrova, AA, Volskaya, SA, Krylova, IV, Chuchunkov, AS, Alexeeva, SV , Bocharov, VV & Granovsky, DV 2014, Anaphoric annotation and corpus-based anaphora resolution: An experiment. in По материалам ежегодной Международной конференции "Диалог" 2014. Komp'juternaja Lingvistika i Intellektual'nye Tehnologii, vol. 13, Российский государственный гуманитарный университет, pp. 562-571. <http://www.dialog-21.ru/digests/dialog2014/materials/pdf/ProtopopovaEV.pdf>

APA

Protopopova, E. V., Bodrova, A. A., Volskaya, S. A., Krylova, I. V., Chuchunkov, A. S., Alexeeva, S. V., Bocharov, V. V., & Granovsky, D. V. (2014). Anaphoric annotation and corpus-based anaphora resolution: An experiment. In По материалам ежегодной Международной конференции "Диалог" 2014 (pp. 562-571). (Komp'juternaja Lingvistika i Intellektual'nye Tehnologii; Vol. 13). Российский государственный гуманитарный университет. http://www.dialog-21.ru/digests/dialog2014/materials/pdf/ProtopopovaEV.pdf

Vancouver

Protopopova EV, Bodrova AA, Volskaya SA, Krylova IV, Chuchunkov AS, Alexeeva SV et al. Anaphoric annotation and corpus-based anaphora resolution: An experiment. In По материалам ежегодной Международной конференции "Диалог" 2014. Российский государственный гуманитарный университет. 2014. p. 562-571. (Komp'juternaja Lingvistika i Intellektual'nye Tehnologii).

Author

Protopopova, E. V. ; Bodrova, A. A. ; Volskaya, S. A. ; Krylova, I. V. ; Chuchunkov, A. S. ; Alexeeva, S. V. ; Bocharov, V. V. ; Granovsky, D. V. / Anaphoric annotation and corpus-based anaphora resolution : An experiment. По материалам ежегодной Международной конференции "Диалог" 2014. Российский государственный гуманитарный университет, 2014. pp. 562-571 (Komp'juternaja Lingvistika i Intellektual'nye Tehnologii).

BibTeX

@inproceedings{b3fa12dd16e34b82a3e33313b85afa94,

title = "Anaphoric annotation and corpus-based anaphora resolution: An experiment",

abstract = "The paper describes the noun phase and anaphora annotation in OpenCorpora and compares it to that in other corpora. We discuss the choice of representative texts for anaphoric annotation and the basic principles of syntactic annotation. In case of noun phrase annotation we followed the scheme introduced earlier for morphological annotation: it was carried out in two stages: firstly, all noun phrases and some other syntactic units were annotated by a heterogenous group of people, then a linguist compared all markup results and found the best one, or corrected mistakes. We present some annotation results and cases of annotator's disagreement and proceed to introduce our data-driven anaphora resolution system based on decision trees. We then list the features used to fit the classificator and discuss their relevance and some changes which improved the classificator performance. We also present out rule-based approach to automated noun phrase extraction using Tomita parser. A baseline for anaphora resolution is introduced and we compare it with our results.",

keywords = "Anaphora resolution, Corpora, Crowdsourcing, Syntactic annotation",

author = "Protopopova, {E. V.} and Bodrova, {A. A.} and Volskaya, {S. A.} and Krylova, {I. V.} and Chuchunkov, {A. S.} and Alexeeva, {S. V.} and Bocharov, {V. V.} and Granovsky, {D. V.}",

year = "2014",

language = "English",

isbn = "2221-7932",

series = "Komp'juternaja Lingvistika i Intellektual'nye Tehnologii",

publisher = "Российский государственный гуманитарный университет",

pages = "562--571",

booktitle = "По материалам ежегодной Международной конференции {"}Диалог{"} 2014",

address = "Russian Federation",

}

RIS

TY - GEN

T1 - Anaphoric annotation and corpus-based anaphora resolution

T2 - An experiment

AU - Protopopova, E. V.

AU - Bodrova, A. A.

AU - Volskaya, S. A.

AU - Krylova, I. V.

AU - Chuchunkov, A. S.

AU - Alexeeva, S. V.

AU - Bocharov, V. V.

AU - Granovsky, D. V.

PY - 2014

Y1 - 2014

N2 - The paper describes the noun phase and anaphora annotation in OpenCorpora and compares it to that in other corpora. We discuss the choice of representative texts for anaphoric annotation and the basic principles of syntactic annotation. In case of noun phrase annotation we followed the scheme introduced earlier for morphological annotation: it was carried out in two stages: firstly, all noun phrases and some other syntactic units were annotated by a heterogenous group of people, then a linguist compared all markup results and found the best one, or corrected mistakes. We present some annotation results and cases of annotator's disagreement and proceed to introduce our data-driven anaphora resolution system based on decision trees. We then list the features used to fit the classificator and discuss their relevance and some changes which improved the classificator performance. We also present out rule-based approach to automated noun phrase extraction using Tomita parser. A baseline for anaphora resolution is introduced and we compare it with our results.

AB - The paper describes the noun phase and anaphora annotation in OpenCorpora and compares it to that in other corpora. We discuss the choice of representative texts for anaphoric annotation and the basic principles of syntactic annotation. In case of noun phrase annotation we followed the scheme introduced earlier for morphological annotation: it was carried out in two stages: firstly, all noun phrases and some other syntactic units were annotated by a heterogenous group of people, then a linguist compared all markup results and found the best one, or corrected mistakes. We present some annotation results and cases of annotator's disagreement and proceed to introduce our data-driven anaphora resolution system based on decision trees. We then list the features used to fit the classificator and discuss their relevance and some changes which improved the classificator performance. We also present out rule-based approach to automated noun phrase extraction using Tomita parser. A baseline for anaphora resolution is introduced and we compare it with our results.

KW - Anaphora resolution

KW - Corpora

KW - Crowdsourcing

KW - Syntactic annotation

UR - http://www.scopus.com/inward/record.url?scp=84904797641&partnerID=8YFLogxK

M3 - Conference contribution

SN - 2221-7932

T3 - Komp'juternaja Lingvistika i Intellektual'nye Tehnologii

SP - 562

EP - 571

BT - По материалам ежегодной Международной конференции "Диалог" 2014

PB - Российский государственный гуманитарный университет

ER -

ID: 4682461