Annotating Japanese Numeral Expressions for a Logical and Pragmatic Inference Dataset

Kana Koyano, Hitomi Yanaka, Koji Mineshima, Daisuke Bekki

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

Numeral expressions in Japanese are characterized by flexible quantifier positions and a wide variety of numeral suffixes. However, little work has been done to build annotated corpora that focus on these features, or datasets that test the understanding of Japanese numeral expressions. In this study, we build a corpus that annotates each numeral expression in an existing phrase structure-based Japanese treebank with its usage and numeral suffix type. We also construct an inference test set for numeral expressions based on this annotated corpus. In this test set, we pay particular attention to inferences whose correct label differs between logical entailment and implicature, and to contexts such as negations and conditionals in which entailment labels can be reversed. A baseline experiment with Japanese BERT models shows that our inference test set poses challenges for inference involving various types of numeral expressions.

Original language: English
Title of host publication: Proceedings of the 18th Joint ACL - ISO Workshop on Interoperable Semantic Annotation, ISA 2022 at LREC 2022 Workshop - Language Resources and Evaluation Conference
Editors: Harry Bunt
Publisher: European Language Resources Association (ELRA)
Pages: 127-132
Number of pages: 6
ISBN (Electronic): 9791095546818
Publication status: Published - 2022
Event: 18th Joint ACL - ISO Workshop on Interoperable Semantic Annotation, ISA 2022 - Marseille, France
Duration: 2022 Jun 20 → …

Publication series

Name: Proceedings of the 18th Joint ACL - ISO Workshop on Interoperable Semantic Annotation, ISA 2022 at LREC 2022 Workshop - Language Resources and Evaluation Conference

Conference

Conference: 18th Joint ACL - ISO Workshop on Interoperable Semantic Annotation, ISA 2022
Country/Territory: France
City: Marseille
Period: 22/6/20 → …

Keywords

  • entailment
  • implicature
  • Japanese
  • natural language inference
  • numeral expressions

ASJC Scopus subject areas

  • Industrial and Manufacturing Engineering
