TY - GEN
T1 - Semantic annotations in Japanese frame net
T2 - 8th International Conference on Language Resources and Evaluation, LREC 2012
AU - Ohara, Kyoko Hirose
N1 - Funding Information:
The analysis presented in this paper was supported by a grant from Keio University. The JFN project currently annotates data extracted from the “Balanced Corpus of Contemporary Written Japanese (BCCWJ)”. I am indebted to Charles J. Fillmore, Michael Ellsworth, and Russell Lee-Goldman for discussing the semantic frames with me and to Collin Baker, Miriam Petruck, and other members of FrameNet for their comments at the FrameNet staff meeting. Special thanks to Toshiko Kigoshi, Junya Kato, Hiroya Hagino, Naoko Kurokawa, and Hiroaki Saito of the JFN project.
Copyright:
Copyright 2017 Elsevier B.V., All rights reserved.
PY - 2012
Y1 - 2012
N2 - Since 2008, the Japanese FrameNet (JFN, http://jfn.st.hc.keio.ac.jp/) project has been annotating the Balanced Corpus of Contemporary Written Japanese (BCCWJ), the first such corpus, officially released in October 2011. This paper reports annotation results of the book genre of BCCWJ (Ohara 2011, Ohara, Saito, Fujii &Sato 2011). Comparing the semantic frames needed to annotate BCCWJ with those that the FrameNet (FN) project (Fillmore and Baker 2009, Fillmore 2006) already has defined revealed that: 1) differences in the Japanese and English semantic frames often concern different perspectives and different lexical aspects exhibited by the two lexicons; and 2) in most of the cases where JFN defined new semantic frame for a word, the frame did not involve culture-specific scenes. We investigated the extent to which existing semantic frames originally defined for analyzing English words were used, annotating 810 sentences of the so-called core data of the book genre of BCCWJ. In the 810 sentences we were able to assign semantic frames to approximately 4000 words, although we could not assign any to 587 words. That is, of all the LUs in the sentences, we were able to identify semantic frames to about 87 per cent of them. In other words, the semantic frames already defined in FN for English could be used for 87 per cent of the Japanese LUs.
AB - Since 2008, the Japanese FrameNet (JFN, http://jfn.st.hc.keio.ac.jp/) project has been annotating the Balanced Corpus of Contemporary Written Japanese (BCCWJ), the first such corpus, officially released in October 2011. This paper reports annotation results of the book genre of BCCWJ (Ohara 2011, Ohara, Saito, Fujii &Sato 2011). Comparing the semantic frames needed to annotate BCCWJ with those that the FrameNet (FN) project (Fillmore and Baker 2009, Fillmore 2006) already has defined revealed that: 1) differences in the Japanese and English semantic frames often concern different perspectives and different lexical aspects exhibited by the two lexicons; and 2) in most of the cases where JFN defined new semantic frame for a word, the frame did not involve culture-specific scenes. We investigated the extent to which existing semantic frames originally defined for analyzing English words were used, annotating 810 sentences of the so-called core data of the book genre of BCCWJ. In the 810 sentences we were able to assign semantic frames to approximately 4000 words, although we could not assign any to 587 words. That is, of all the LUs in the sentences, we were able to identify semantic frames to about 87 per cent of them. In other words, the semantic frames already defined in FN for English could be used for 87 per cent of the Japanese LUs.
KW - Frame Semantics
KW - Japanese
KW - Semantic Annotation
UR - http://www.scopus.com/inward/record.url?scp=84906533655&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84906533655&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84906533655
T3 - Proceedings of the 8th International Conference on Language Resources and Evaluation, LREC 2012
SP - 1559
EP - 1562
BT - Proceedings of the 8th International Conference on Language Resources and Evaluation, LREC 2012
A2 - Dogan, Mehmet Ugur
A2 - Mariani, Joseph
A2 - Moreno, Asuncion
A2 - Goggi, Sara
A2 - Choukri, Khalid
A2 - Calzolari, Nicoletta
A2 - Odijk, Jan
A2 - Declerck, Thierry
A2 - Maegaard, Bente
A2 - Piperidis, Stelios
A2 - Mazo, Helene
A2 - Hamon, Olivier
PB - European Language Resources Association (ELRA)
Y2 - 21 May 2012 through 27 May 2012
ER -