AutoSort: A Supporting Method of Programming Language Detection for Analyzing Legacy Systems

岡田, 譲二; 石尾, 隆; 坂田, 祐司; 井上, 克郎; Joji, Okada; Takashi, Ishio; Yuji, Sakata; Katsuro, Inoue

https://ipsj.ixsq.nii.ac.jp/records/190000

Name / File	License	Action
IPSJ-JNL5906004.pdf (513.5 kB)	Copyright (c) 2018 by the Information Processing Society of Japan
Open access

Item type

Journal(1)

Publication date

2018-06-15

Title

AutoSort：レガシーシステム分析のためのプログラミング言語の判定支援手法

Title (English)

AutoSort: A Supporting Method of Programming Language Detection for Analyzing Legacy Systems

Language

jpn

Keywords

Subject scheme

Other

Subject

[Regular Paper] legacy migration, reverse engineering, program comprehension, clustering

Resource type

Resource type identifier

http://purl.org/coar/resource_type/c_6501

Resource type

journal article

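Author affiliation

Technology and Innovation General Headquarters, NTT DATA Corporation / Graduate School of Information Science and Technology, Osaka University

Author affiliation

Graduate School of Information Science, Nara Institute of Science and Technology

Author affiliation

Technology and Innovation General Headquarters, NTT DATA Corporation

Author affiliation

Graduate School of Information Science and Technology, Osaka University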

Author affiliation (English)

NTT DATA Corporation / Osaka University

Author affiliation (English)

Nara Institute of Science and Technology

Author affiliation (English)

NTT DATA Corporation

Author affiliation (English)

Osaka University

Author names

岡田, 譲二
石尾, 隆
坂田, 祐司
井上, 克郎

Author names (English)

Joji, Okada
Takashi, Ishio
Yuji, Sakata
Katsuro, Inoue

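Abstract

Description type

Other

Description

Legacy mainframe systems contain many source code files that have no file extension and whose programming language is unknown. Analyzing a legacy system requires identifying the programming language of each source code file, but doing so by hand takes a great deal of effort. To support this identification task, we propose a method that uses clustering to select representative files to be identified manually. To support accurate identification, the proposed method combines clustering with automatic identification by pattern matching and with correction of misidentifications through an analysis that uses the results of manual identification. To evaluate the proposed method, we applied it to the file sets of two actual legacy systems whose files had been manually labeled with the correct programming languages. The method correctly classified 99.49% of the 190,000 files. Moreover, whereas manual identification of these files had required 3.3 person-months of effort, the proposed method completed the identification with 8 hours of computation and only 15 minutes of manual checking.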

Abstract (English)

Description type

Other

Description

Legacy mainframe systems contain many source code files without file extensions, and their programming languages are undocumented, so the language of each file is not easily judged by hand. Although detecting the programming language of each file is necessary for various program analysis tasks, manually analyzing a large number of files is time-consuming. In this research, we propose a method to support programming language detection that employs a clustering technique to select a small number of representative files for manual detection. To improve accuracy, our method combines clustering with pattern matching and a static analysis that uses the results of manual detection. In the experiment, we applied our method to two actual legacy systems whose source code files had been manually analyzed. Our method correctly classified 99.49% of the 190,000 files. While manual detection of programming languages for the systems required 3.3 man-months, our method completed the analysis with eight hours of computation and fifteen minutes of manual checking.
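
The workflow outlined in the abstract can be illustrated with a rough sketch. This is not the authors' implementation. The pattern rules, the token-based Jaccard similarity, the greedy clustering, the threshold, and all function names below are assumptions chosen only to show the idea of labeling some files by pattern matching, clustering the rest, and asking a human to label one representative per cluster. The error-correction analysis mentioned in the abstract is not reproduced here.

```python
import re

# Hypothetical pattern rules; the abstract does not list the patterns actually used.
PATTERNS = {
    "COBOL": re.compile(r"^\s*IDENTIFICATION\s+DIVISION\.", re.MULTILINE | re.IGNORECASE),
    "JCL": re.compile(r"^//\S+\s+JOB\b", re.MULTILINE),
}


def detect_by_pattern(text):
    """Return the first language whose rule matches, or None if no rule matches."""
    for language, pattern in PATTERNS.items():
        if pattern.search(text):
            return language
    return None


def tokens(text):
    """Set of upper-cased alphabetic tokens, used as a crude file fingerprint."""
    return set(re.findall(r"[A-Za-z_]+", text.upper()))


def jaccard(a, b):
    """Jaccard similarity of two token sets (1.0 when both are empty)."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def cluster(files, threshold=0.5):
    """Greedy clustering (an assumed stand-in for the paper's clustering):
    a file joins the first cluster whose representative is similar enough,
    otherwise it becomes the representative of a new cluster."""
    clusters = []  # each entry: (representative name, representative tokens, member names)
    for name, text in files.items():
        toks = tokens(text)
        for _rep_name, rep_toks, members in clusters:
            if jaccard(toks, rep_toks) >= threshold:
                members.append(name)
                break
        else:
            clusters.append((name, toks, [name]))
    return clusters


def classify(files, ask_human, threshold=0.5):
    """Label files by pattern first; for the rest, ask a human about one
    representative per cluster and copy that answer to the whole cluster."""
    labels, unknown = {}, {}
    for name, text in files.items():
        language = detect_by_pattern(text)
        if language is not None:
            labels[name] = language
        else:
            unknown[name] = text
    for rep_name, _toks, members in cluster(unknown, threshold):
        language = ask_human(rep_name)
        for member in members:
            labels[member] = language
    return labels
```

In this sketch, `ask_human` is any callable that takes a file name and returns a language name, so the number of times it is called corresponds to the number of files a person would have to inspect.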

Bibliographic record ID

Source identifier type

NCID

Source identifier

AN00116647

Bibliographic information

IPSJ Journal (情報処理学会論文誌)

Vol. 59, No. 6, pp. 1405-1414, issue date 2018-06-15

ISSN

Source identifier type

ISSN

Source identifier

1882-7764
