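Software Theft Detection with Software Birthmarks for a Large Number of Programs: A Method for Narrowing Down Inspection Targets Using a Full-Text Search System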

Jun Nakamura (中村 潤), Haruaki Tamada (玉田 春昭)

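Software birthmarks have been proposed as a way to find software theft among a large number of programs. A birthmark is a technique that extracts characteristics from programs and compares them to detect theft. Conventional birthmarks assume a few hundred to a few thousand target programs; beyond that, the comparison may not finish in a practical amount of time. This paper therefore aims to scale up the number of target programs substantially while shortening the processing time required for theft detection. To that end, before the target programs are compared, we use a method that is not highly accurate but compares quickly in order to exclude unrelated programs. We introduce this as a narrowing phase between the conventional extraction and comparison phases. Based on the proposed method, we built Mituba, a birthmark narrowing system, and ran experiments. The six evaluation items are required time, narrowing rate, false positives, false negatives, accuracy, and preservation. With a threshold of 0.2 for judging theft, the required time was held to 40% or less of the conventional method's, and more than 80% of the programs were judged unrelated. Among the remaining programs, the false positive rate was very high at about 90%, but the false negative rate was 0% and accuracy was above 70%. In the preservation evaluation, even when the strongest obfuscation was applied, more than 80% of the programs were still found, which is a good result. Discussing the optimal threshold on the basis of these results, we show that a threshold of about 0.6 is optimal in the standard case, but that the method is also effective at a threshold of 0.2, depending on the user's problem setting.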

Software birthmarks have been proposed for detecting software theft among a large number of suspected programs. A birthmark technique extracts characteristics from programs as software birthmarks and computes the similarity between two birthmarks. With conventional birthmarks, the time required grows as the number of target programs for comparison increases, and this is the bottleneck of conventional birthmarking techniques. In this paper, we propose a method for reducing total processing time by introducing a narrowing phase between the conventional extraction and comparison phases. The narrowing phase employs a fast but rough comparison algorithm to eliminate unrelated programs. We developed a narrowing system for birthmarks, named Mituba, and conducted an experimental evaluation with it. The evaluation criteria are required time, narrowing rate, false positives, false negatives, accuracy, and the preservation property of the birthmarks. With a theft-decision threshold of 0.2, the required time was reduced to 20% and the system narrowed out 80% of the programs. The false positive rate among the remaining programs was quite high, at about 80%. However, the false negative rate was 0%, and accuracy was over 70%. Additionally, the proposed method satisfied the preservation property for 80% of programs under the strongest obfuscation method used in the experiments. Overall, the results were good. Based on the experimental results, we discuss the suitable threshold: in the ordinary case it is 0.6, but the proposed method is also acceptable at a threshold of 0.2, at the user's discretion.
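
The three-phase flow described in the abstract (extraction, narrowing, comparison) can be illustrated with a minimal sketch. It is not Mituba's implementation: the birthmark representation (a set of feature tokens), the Jaccard overlap standing in for the full-text search score, and all function names are assumptions made for illustration only.

```python
from typing import Callable, Dict, List, Set

# Assumption: a birthmark is modelled as a set of extracted feature tokens.
Birthmark = Set[str]


def rough_similarity(a: Birthmark, b: Birthmark) -> float:
    """Cheap Jaccard overlap; a stand-in for a full-text search score."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def narrow(query: Birthmark, corpus: Dict[str, Birthmark],
           threshold: float) -> List[str]:
    """Narrowing phase: keep only programs whose rough score reaches the threshold."""
    return [name for name, bm in corpus.items()
            if rough_similarity(query, bm) >= threshold]


def detect(query: Birthmark, corpus: Dict[str, Birthmark], threshold: float,
           precise_compare: Callable[[Birthmark, Birthmark], float]) -> Dict[str, float]:
    """Run the expensive comparison only on the candidates that survive narrowing."""
    candidates = narrow(query, corpus, threshold)
    return {name: precise_compare(query, corpus[name]) for name in candidates}


def narrowing_rate(total: int, remaining: int) -> float:
    """Fraction of programs excluded before the comparison phase."""
    return 1.0 - remaining / total


# Hypothetical toy corpus, for illustration only.
corpus = {
    "copy": {"a", "b", "c", "d"},
    "partial": {"a", "b", "x", "y"},
    "unrelated": {"p", "q"},
}
query = {"a", "b", "c", "d"}

loose = narrow(query, corpus, 0.2)   # keeps more candidates, fewer misses
strict = narrow(query, corpus, 0.6)  # keeps fewer candidates
```

A lower threshold such as 0.2 lets more programs through to the comparison phase, which matches the trade-off in the abstract: many false positives remain, but related programs are less likely to be dropped.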
