TY - JOUR
T1 - Identification of hand-foot syndrome from cancer patients’ blog posts
T2 - BERT-based deep-learning approach to detect potential adverse drug reaction symptoms
AU - Nishioka, Satoshi
AU - Watanabe, Tomomi
AU - Asano, Masaki
AU - Yamamoto, Tatsunori
AU - Kawakami, Kazuyoshi
AU - Yada, Shuntaro
AU - Aramaki, Eiji
AU - Yajima, Hiroshi
AU - Kizaki, Hayato
AU - Hori, Satoko
N1 - Funding Information:
This work was supported by the Japan Society for the Promotion of Science (JSPS) KAKENHI Grant Number 21H03170. For more information on JSPS, please visit https://www.jsps.go.jp/index.html. The funder had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Publisher Copyright:
© 2022 Nishioka et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
PY - 2022/5
Y1 - 2022/5
N2 - Early detection and management of adverse drug reactions (ADRs) is crucial for improving patients’ quality of life. Hand-foot syndrome (HFS) is one of the most problematic ADRs for cancer patients. Recently, an increasing number of patients post their daily experiences to internet community, for example in blogs, where potential ADR signals not captured through routine clinic visits can be described. Therefore, this study aimed to identify patients with potential ADRs, focusing on HFS, from internet blogs by using natural language processing (NLP) deep-learning methods. From 10,646 blog posts, written in Japanese by cancer patients, 149 HFS-positive sentences were extracted after pre-processing, annotation and scrutiny by a certified oncology pharmacist. The HFS-positive sentences described not only HFS typical expressions like “pain" or “spoon nail”, but also patient-derived unique expressions like onomatopoeic ones. The dataset was divided at a 4 to 1 ratio and used to train and evaluate three NLP deep-learning models: long short-term memory (LSTM), bidirectional LSTM and bidirectional encoder representations from transformers (BERT). The BERT model gave the best performance with precision 0.63, recall 0.82 and f1 score 0.71 in the HFS user identification task. Our results demonstrate that this NLP deep-learning model can successfully identify patients with potential HFS from blog posts, where patients’ real wordings on symptoms or impacts on their daily lives are described. Thus, it should be feasible to utilize patient-generated text data to improve ADR management for individual patients.
AB - Early detection and management of adverse drug reactions (ADRs) is crucial for improving patients’ quality of life. Hand-foot syndrome (HFS) is one of the most problematic ADRs for cancer patients. Recently, an increasing number of patients post their daily experiences to internet community, for example in blogs, where potential ADR signals not captured through routine clinic visits can be described. Therefore, this study aimed to identify patients with potential ADRs, focusing on HFS, from internet blogs by using natural language processing (NLP) deep-learning methods. From 10,646 blog posts, written in Japanese by cancer patients, 149 HFS-positive sentences were extracted after pre-processing, annotation and scrutiny by a certified oncology pharmacist. The HFS-positive sentences described not only HFS typical expressions like “pain" or “spoon nail”, but also patient-derived unique expressions like onomatopoeic ones. The dataset was divided at a 4 to 1 ratio and used to train and evaluate three NLP deep-learning models: long short-term memory (LSTM), bidirectional LSTM and bidirectional encoder representations from transformers (BERT). The BERT model gave the best performance with precision 0.63, recall 0.82 and f1 score 0.71 in the HFS user identification task. Our results demonstrate that this NLP deep-learning model can successfully identify patients with potential HFS from blog posts, where patients’ real wordings on symptoms or impacts on their daily lives are described. Thus, it should be feasible to utilize patient-generated text data to improve ADR management for individual patients.
UR - http://www.scopus.com/inward/record.url?scp=85129396616&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85129396616&partnerID=8YFLogxK
U2 - 10.1371/journal.pone.0267901
DO - 10.1371/journal.pone.0267901
M3 - Article
C2 - 35507636
AN - SCOPUS:85129396616
SN - 1932-6203
VL - 17
JO - PloS one
JF - PloS one
IS - 5 May
M1 - e0267901
ER -