TY - GEN
T1 - An information retrieval approach for text mining of medical records based on graph descriptor
AU - Dudko, Alexander
AU - Endrjukaite, Tatiana
AU - Kiyoki, Yasushi
N1 - Publisher Copyright:
© 2019 The authors and IOS Press. All rights reserved.
PY - 2019
Y1 - 2019
N2 - This paper describes a new method of data retrieval from free text documents in medical domain. Proposed approach creates the document summary and highlights most important keywords in the text. To achieve this result we process the document natural language text and build a descriptor as an internal representation of the document. This descriptor is a graph with concepts, relations between them, and concept points as a metric of relevance. By means of points in the descriptor the approach performs ambiguity resolution, selects most relevant concepts to display in the summary, and votes for keywords highlighting in the text. Besides the direct representation of identified information in the summary, this work proposes a way to provide extended summary by using additional knowledge about relations between medications, procedures, diseases and anatomy. The described approach helps to speed up analysis and decision making processes by means of providing aggregated summary for a document and highlighting most meaningful parts of the document's text. Experiment results demonstrate that automatic summary generation and keywords highlighting can be successfully performed by the proposed approach to achieve meaningful and highly relevant results.
AB - This paper describes a new method of data retrieval from free text documents in medical domain. Proposed approach creates the document summary and highlights most important keywords in the text. To achieve this result we process the document natural language text and build a descriptor as an internal representation of the document. This descriptor is a graph with concepts, relations between them, and concept points as a metric of relevance. By means of points in the descriptor the approach performs ambiguity resolution, selects most relevant concepts to display in the summary, and votes for keywords highlighting in the text. Besides the direct representation of identified information in the summary, this work proposes a way to provide extended summary by using additional knowledge about relations between medications, procedures, diseases and anatomy. The described approach helps to speed up analysis and decision making processes by means of providing aggregated summary for a document and highlighting most meaningful parts of the document's text. Experiment results demonstrate that automatic summary generation and keywords highlighting can be successfully performed by the proposed approach to achieve meaningful and highly relevant results.
KW - ambiguity resolution
KW - graph descriptor
KW - information retrieval
KW - summary generation
KW - text mining
UR - http://www.scopus.com/inward/record.url?scp=85059572767&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85059572767&partnerID=8YFLogxK
U2 - 10.3233/978-1-61499-933-1-334
DO - 10.3233/978-1-61499-933-1-334
M3 - Conference contribution
AN - SCOPUS:85059572767
T3 - Frontiers in Artificial Intelligence and Applications
SP - 334
EP - 352
BT - Information Modelling and Knowledge Bases XXX
A2 - Endrjukaite, Tatiana
A2 - Jaakkola, Hannu
A2 - Dudko, Alexander
A2 - Kiyoki, Yasushi
A2 - Thalheim, Bernhard
A2 - Yoshida, Naofumi
PB - IOS Press
ER -