TY - GEN
T1 - Medical documents processing for summary generation and keywords highlighting based on natural language processing and ontology graph descriptor approach
AU - Dudko, Alexander
AU - Endrjukaite, Tatiana
AU - Kiyoki, Yasushi
N1 - Publisher Copyright:
© 2017 Association for Computing Machinery.
PY - 2017/12/4
Y1 - 2017/12/4
N2 - In this paper a new method of data retrieval from free text documents in medical domain is proposed. Presented approach gives the document summary and highlights important keywords in the text to support further analysis of multiple medical documents. The document is processed with natural language processing techniques to find medical keywords and assign them to concepts in the medical ontology. These concepts contribute to higher levels in the hierarchy and build the document descriptor as a graph with concepts in the nodes and corresponding relevance points. The descriptor is used to generate the summary in a form of tree. Finally, we highlight the most important keywords in the original text. Presented experiments demonstrate the proposed approach, which successfully summarizes and highlights meaningful medical information.
AB - In this paper a new method of data retrieval from free text documents in medical domain is proposed. Presented approach gives the document summary and highlights important keywords in the text to support further analysis of multiple medical documents. The document is processed with natural language processing techniques to find medical keywords and assign them to concepts in the medical ontology. These concepts contribute to higher levels in the hierarchy and build the document descriptor as a graph with concepts in the nodes and corresponding relevance points. The descriptor is used to generate the summary in a form of tree. Finally, we highlight the most important keywords in the original text. Presented experiments demonstrate the proposed approach, which successfully summarizes and highlights meaningful medical information.
KW - Concept
KW - Data mining
KW - Document descriptor
KW - Information retrieval
KW - Medical documents processing
KW - Ontology
KW - Summary generation
UR - http://www.scopus.com/inward/record.url?scp=85044281145&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85044281145&partnerID=8YFLogxK
U2 - 10.1145/3151759.3151784
DO - 10.1145/3151759.3151784
M3 - Conference contribution
AN - SCOPUS:85044281145
T3 - ACM International Conference Proceeding Series
SP - 58
EP - 65
BT - 19th International Conference on Information Integration and Web-Based Applications and Services, iiWAS2017 - Proceedings
A2 - Anderst-Kotsis, Gabriele
A2 - Steinbauer, Matthias
A2 - Khalil, Ismail
A2 - Indrawan-Santiago, Maria
A2 - Salvadori, Ivan Luiz
PB - Association for Computing Machinery
T2 - 19th International Conference on Information Integration and Web-Based Applications and Services, iiWAS2017
Y2 - 4 December 2017 through 6 December 2017
ER -