Automatically Collecting and Monitoring Japanese Weblogs (blog の自動収集と監視)

Author(s)

    • 南野 朋之 NANNO Tomoyuki
    • 東京工業大学 大学院総合理工学研究科 Interdisciplinary Graduate School of Science and Engineering, Tokyo Institute of Technology
    • 鈴木 泰裕 SUZUKI Yasuhiro
    • 東京工業大学 大学院総合理工学研究科 Interdisciplinary Graduate School of Science and Engineering, Tokyo Institute of Technology
    • 奥村 学 OKUMURA Manabu
    • 東京工業大学 精密工学研究所 Precision and Intelligence Laboratory, Tokyo Institute of Technology

Abstract

Weblogs (blogs) are now thought of as a potentially useful information source. Although there is no settled definition of a blog, blogs are generally understood to be personal web pages, each authored by a single individual and made up of a sequence of dated entries recording the author's thoughts, arranged chronologically. In Japan, people were writing "diaries" on the web long before blog software became available. These web diaries are quite similar to blogs in content, and people still write them without any blog software. As we will show, hand-edited blogs are quite numerous in Japan, even though most people now think of blogs as pages published with one of the variants of public-domain blog software. It is therefore quite difficult to collect Japanese blogs exhaustively, that is, to collect both blogs made with blog software and web diaries written as normal web pages. Motivated by this, we present a system that tries to automatically collect and monitor Japanese blogs, including not only those made with blog software but also those written as normal web pages. Our approach is based on the extraction of date expressions and the analysis of HTML documents, so that it does not depend on specific blog software, RSS, or a ping server.
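
The idea of detecting diary-like pages from date expressions in HTML can be illustrated with a minimal Python sketch. It is not the authors' system: the abstract does not describe the actual patterns or decision rules, so the date pattern `DATE_PATTERN`, the helper names `extract_dates` and `looks_like_diary`, and the threshold `min_entries` are all assumptions made for illustration. The sketch pulls date expressions out of the page text and treats a page as diary-like when it contains several dates in consistent chronological (or reverse-chronological) order.

```python
import re
from datetime import date
from html.parser import HTMLParser

# Assumed date formats (e.g. 2004年11月1日, 2004/10/30, 2004-10-29).
# The abstract does not list which formats the real system handles.
DATE_PATTERN = re.compile(
    r"(\d{4})\s*(?:年|/|-|\.)\s*(\d{1,2})\s*(?:月|/|-|\.)\s*(\d{1,2})\s*日?"
)


class TextExtractor(HTMLParser):
    """Collects the text nodes of an HTML document in document order."""

    def __init__(self):
        super().__init__()
        self.chunks = []

    def handle_data(self, data):
        self.chunks.append(data)


def extract_dates(html):
    """Return the valid calendar dates found in the page text, in page order."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    dates = []
    for chunk in parser.chunks:
        for y, m, d in DATE_PATTERN.findall(chunk):
            try:
                dates.append(date(int(y), int(m), int(d)))
            except ValueError:
                continue  # skip impossible dates such as 2004/13/40
    return dates


def looks_like_diary(html, min_entries=3):
    """Hypothetical heuristic: enough dates, all in one consistent order."""
    dates = extract_dates(html)
    if len(dates) < min_entries:
        return False
    pairs = list(zip(dates, dates[1:]))
    ascending = all(a <= b for a, b in pairs)
    descending = all(a >= b for a, b in pairs)
    return ascending or descending
```

For example, a hand-edited page whose headings read 2004年11月1日, 2004/10/30, and 2004-10-29 would pass this check, while a page that merely mentions a few unordered dates would not. A real collector would need far more than this, such as using the HTML structure around each date, which this sketch ignores.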

Journal

  • Transactions of the Japanese Society for Artificial Intelligence

    Transactions of the Japanese Society for Artificial Intelligence 19, 511-520, 2004-11-01

    The Japanese Society for Artificial Intelligence

References:  14

Cited by:  12

Codes

  • NII Article ID (NAID)
    10014164934
  • NII NACSIS-CAT ID (NCID)
    AA11579226
  • Text Lang
    JPN
  • Article Type
    Journal Article
  • ISSN
    1346-0714
  • NDL Article ID
    7264454
  • NDL Source Classification
    ZM13 (Science and Technology -- Science and Technology in General -- Data Processing and Computers)
  • NDL Call No.
    Z74-C589
  • Data Source
    CJP  CJPref  NDL  J-STAGE 