An automatic sameAs link discovery from Wikipedia

Kosuke Kagawa, Susumu Tamagawa, Takahira Yamaguchi

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Citations (Scopus)

Abstract

Spelling variants of words or word sense ambiguity takes many costs in such processes as Data Integration, Information Searching, data pre-processing for Data Mining, and so on. It is useful to construct relations between a word or phrases and a representative name of the entity to meet these demands. To reduce the costs, this paper discusses how to automatically discover "sameAs" and "meaningOf" links from Japanese Wikipedia. In order to do so, we gathered relevant features such as IDF, string similarity, number of hypernym, and so on. We have identified the link-based score on salient features based on SVM results with 960,000 anchor link pairs. These case studies show us that our link discovery method goes well with more than 70% precision/ recall rate.

Original languageEnglish
Title of host publicationSemantic Technology - Third Joint International Conference, JIST 2013, Revised Selected Papers
PublisherSpringer Verlag
Pages399-413
Number of pages15
ISBN (Print)9783319068251
DOIs
Publication statusPublished - 2014
Externally publishedYes
Event3rd Joint International Semantic Technology Conference, JIST 2013 - Seoul, Korea, Republic of
Duration: 2013 Nov 282013 Nov 30

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume8388 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Other

Other3rd Joint International Semantic Technology Conference, JIST 2013
Country/TerritoryKorea, Republic of
CitySeoul
Period13/11/2813/11/30

Keywords

  • Disambiguation
  • Ontology
  • SameAs link
  • Spelling variants
  • Synonym
  • Wikipedia

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science(all)

Fingerprint

Dive into the research topics of 'An automatic sameAs link discovery from Wikipedia'. Together they form a unique fingerprint.

Cite this