TY - GEN
T1 - Proposal of the kawaii search system based on the first sight of impression
AU - Hashiguchi, Kyoko
AU - Ogawa, Katsuhiko
PY - 2011
Y1 - 2011
N2 - We propose a blog search engine called "Kawaii Search" (where Kawaii means pretty) to search blogs based on the impression of their text on a printing surface, considering factors such as the format and layout of text and density of words. Particularly in Japan, blogs reveal the personality characteristics of users depending on how they place their text. For example, some writers leave more space between lines or use hieroglyphics and "Gal words[1]," which consist of slang or abbreviations. Further, words can be categorized using four types of characters: kanji, hiragana, katakana, and alphabet. Each results in a different impression that reveals a writer's personality. Given this approach, blog readers can not only read blog, but also interpret each writer's personality. By focusing on impression differences, we propose a new search algorithm specialized for Japanese blogs. To show that these differences can act as the base of our search algorithm, we conducted an experiment that successfully verified the algorithm applied to the following three blog patterns: "kawaii" (pretty or lovely), "majime" (seriousness or industrious), and "futsu" (normal). The results show that in terms of the accuracy of the algorithm, our study categorized "kawaii" well; however, "majime" and "futsu" did not show good results.
AB - We propose a blog search engine called "Kawaii Search" (where Kawaii means pretty) to search blogs based on the impression of their text on a printing surface, considering factors such as the format and layout of text and density of words. Particularly in Japan, blogs reveal the personality characteristics of users depending on how they place their text. For example, some writers leave more space between lines or use hieroglyphics and "Gal words[1]," which consist of slang or abbreviations. Further, words can be categorized using four types of characters: kanji, hiragana, katakana, and alphabet. Each results in a different impression that reveals a writer's personality. Given this approach, blog readers can not only read blog, but also interpret each writer's personality. By focusing on impression differences, we propose a new search algorithm specialized for Japanese blogs. To show that these differences can act as the base of our search algorithm, we conducted an experiment that successfully verified the algorithm applied to the following three blog patterns: "kawaii" (pretty or lovely), "majime" (seriousness or industrious), and "futsu" (normal). The results show that in terms of the accuracy of the algorithm, our study categorized "kawaii" well; however, "majime" and "futsu" did not show good results.
KW - Blog search engine
KW - Impression
KW - Japanese blogosphere
KW - information retrieval
KW - text formatting
UR - http://www.scopus.com/inward/record.url?scp=79960295662&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79960295662&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-21669-5_3
DO - 10.1007/978-3-642-21669-5_3
M3 - Conference contribution
AN - SCOPUS:79960295662
SN - 9783642216688
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 21
EP - 30
BT - Human Interface and the Management of Information
T2 - Human Interface and the Management of Information: Interacting with Information - Symposium on Human Interface 2011, Held as Part of HCI International 2011
Y2 - 9 July 2011 through 14 July 2011
ER -