大規模疫病データのための非線形モデル解析

IPSJ HANDLE Open Access

Bibliographic Information

Other Title
  • Non-linear Mining of Spatially Coevolving Epidemics

Search this article

Abstract

本論文では,大規模疫病データのための非線形モデル解析手法であるFUNNELについて述べる.FUNNELは,(disease, location, time)の三つ組で構成されたdの疫病とlの州を含む,長さnのテンソルデータが与えられたときに,そのテンソルの中から非線形パターンや外れ値を統合的に解析,要約,表現する.提案手法は(a)疫病の季節性,ワクチンの出現,外部要因による疫病の発生等の重要なパターンを発見し,(b)パラメータのチューニングを必要とせず,(c)計算量は入力データのサイズに対して線形である.さらに,本手法は(d)麻疹やインフルエンザ等の疫病だけでなく,コンピュータウィルスの感染パターンについても同様に解析することができる.実データを用いた実験では,FUNNELが大規模疫病テンソルの中から(P1)病気の季節性,(P2)ワクチン効果,(P3)地域性,(P4)外因性による疫病パターン,(P5)データ入力エラー等の重要パターンを正確に発見することを確認した.

Given a large collection of epidemiological data consisting of the count of d contagious diseases for l locations of duration n, how can we find patterns, rules and outliers? For example, the Project Tycho provides open access to the count infections for U.S. states from 1888 to 2013, for 56 contagious diseases (e.g., measles, influenza), which include missing values, possible recording errors, sudden spikes (or dives) of infections, etc. So how can we find a combined model, for all these diseases, locations, and time-ticks? In this paper, we present FUNNEL, a unifying analytical model for large scale epidemiological data, as well as a novel fitting algorithm, FUNNELFIT, which solves the above problem. Our method has the following properties: (a) Sense-making: it detects important patterns of epidemics, such as periodicities, the appearance of vaccines, external shock events, and more; (b) Parameter-free: our modeling framework frees the user from providing parameter values; (c) Scalable: FunnelFit is carefully designed to be linear on the input size; (d) General: our model is general and practical, which can be applied to various types of epidemics, including computer-virus propagation, as well as human diseases. Extensive experiments on real data demonstrate that FUNNELFIT does indeed discover important properties of epidemics: (P1) disease seasonality, e.g., influenza spikes in January, Lyme disease spikes in July and the absence of yearly periodicity for gonorrhea; (P2) disease reduction effect, e.g., the appearance of vaccines; (P3) local/state-level sensitivity, e.g., many measles cases in NY; (P4) external shock events, e.g., historical flu pandemics; (P5) detect incongruous values, i.e., data reporting errors.

Journal

Related Projects

See more

Details 詳細情報について

Report a problem

Back to top