TY - GEN
T1 - Development of a general-purpose categorial grammar treebank
AU - Kubota, Yusuke
AU - Mineshima, Koji
AU - Hayashi, Noritsugu
AU - Okano, Shinya
N1 - Funding Information:
In ongoing work, we are also developing a syntactic parser trained on our treebank. Since the standard rules in the ABC grammar (function application and function composition) are also rules in CCG, we can make use of an off-the-shelf CCG parser (Yoshikawa et al., 2017) for our purpose. Acknowledgements Our appreciation goes to Masashi Yoshikawa (NAIST) for his help and constructive suggestions. This work is supported by JSPS KAKENHI GRANTs 18K00523 and 15H03210, and the NINJAL collaboratvi e research project ‘Cross-linguistic Studies of Japanese Prosody and Grammar’.
Publisher Copyright:
© European Language Resources Association (ELRA), licensed under CC-BY-NC
PY - 2020
Y1 - 2020
N2 - This paper introduces ABC Treebank, a general-purpose categorial grammar (CG) treebank for Japanese. It is 'general-purpose' in the sense that it is not tailored to a specific variant of CG, but rather aims to offer a theory-neutral linguistic resource (as much as possible) which can be converted to different versions of CG (specifically, CCG and Type-Logical Grammar) relatively easily. In terms of linguistic analysis, it improves over the existing Japanese CG treebank (Japanese CCGBank) on the treatment of certain linguistic phenomena (passives, causatives, and control/raising predicates) for which the lexical specification of the syntactic information reflecting local dependencies turns out to be crucial. In this paper, we describe the underlying 'theory' dubbed ABC Grammar that is taken as a basis for our treebank, outline the general construction of the corpus, and report on some preliminary results applying the treebank in a semantic parsing system for generating logical representations of sentences.
AB - This paper introduces ABC Treebank, a general-purpose categorial grammar (CG) treebank for Japanese. It is 'general-purpose' in the sense that it is not tailored to a specific variant of CG, but rather aims to offer a theory-neutral linguistic resource (as much as possible) which can be converted to different versions of CG (specifically, CCG and Type-Logical Grammar) relatively easily. In terms of linguistic analysis, it improves over the existing Japanese CG treebank (Japanese CCGBank) on the treatment of certain linguistic phenomena (passives, causatives, and control/raising predicates) for which the lexical specification of the syntactic information reflecting local dependencies turns out to be crucial. In this paper, we describe the underlying 'theory' dubbed ABC Grammar that is taken as a basis for our treebank, outline the general construction of the corpus, and report on some preliminary results applying the treebank in a semantic parsing system for generating logical representations of sentences.
KW - Annotation
KW - Categorial grammar
KW - Japanese
KW - Treebank
UR - http://www.scopus.com/inward/record.url?scp=85096557579&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85096557579&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:85096557579
T3 - LREC 2020 - 12th International Conference on Language Resources and Evaluation, Conference Proceedings
SP - 5195
EP - 5201
BT - LREC 2020 - 12th International Conference on Language Resources and Evaluation, Conference Proceedings
A2 - Calzolari, Nicoletta
A2 - Bechet, Frederic
A2 - Blache, Philippe
A2 - Choukri, Khalid
A2 - Cieri, Christopher
A2 - Declerck, Thierry
A2 - Goggi, Sara
A2 - Isahara, Hitoshi
A2 - Maegaard, Bente
A2 - Mariani, Joseph
A2 - Mazo, Helene
A2 - Moreno, Asuncion
A2 - Odijk, Jan
A2 - Piperidis, Stelios
PB - European Language Resources Association (ELRA)
T2 - 12th International Conference on Language Resources and Evaluation, LREC 2020
Y2 - 11 May 2020 through 16 May 2020
ER -