高性能な8倍精度浮動小数点演算機構の実現

泊, 久信; 平木, 敬; Hisanobu, Tomari; Kei, Hiraki

WEKO3

インデックスツリー

RootNode

アイテム

高性能な8倍精度浮動小数点演算機構の実現

https://ipsj.ixsq.nii.ac.jp/records/75586

名前 / ファイル	ライセンス	アクション
IPSJ-HPC11130045.pdf (125.7 kB)	Copyright (c) 2011 by the Information Processing Society of Japan
オープンアクセス

Item type

SIG Technical Reports(1)

公開日

2011-07-20

タイトル

高性能な8倍精度浮動小数点演算機構の実現

タイトル

言語

タイトル

High-performance Octuple Precision Floating Point Processor

言語

jpn

キーワード

主題Scheme

Other

主題

数値計算アルゴリズム

資源タイプ

資源タイプ識別子

http://purl.org/coar/resource_type/c_18gh

資源タイプ

technical report

著者所属

東京大学大学院情報理工学系研究科

著者所属

東京大学大学院情報理工学系研究科

著者所属(英)

The University of Tokyo

著者所属(英)

The University of Tokyo

著者名

泊, 久信平木, 敬

著者名(英)

Hisanobu, Tomari Kei, Hiraki

論文抄録

内容記述タイプ

Other

内容記述

計算機が高性能になったことにより，より大きな問題を解くことができるようになった．入力が計算結果として出力されるまでに演算器を通る回数も，問題の規模と反復回数に応じて大きくなった．計算アルゴリズムの中には，演算器を通る回数が増えると誤差が蓄積していくものがある．このようなアルゴリズムを，より高性能な計算機を用い大規模な問題に対して適用するためには，より高精度な浮動小数点演算が必要である．ところが，高精度な浮動小数点数を扱うハードウェアは市販品としては少なく，結果としてソフトウェア実装を用いるのが一般的であった．ソフトウェアによる実装は幅広い環境で動作させることができる利点がある一方，性能を出しにくいという欠点がある．性能が出ない場合，そもそも高精度な浮動小数点数を扱う必要性は低い．本研究では，IEEE 754 規格を拡張して，8 倍精度 (256-bit) 浮動小数点数を定義した．評価では，POWER7 マシンでの倍精度の演算と，8 倍精度演算の 64 ビットPowerPC アセンブリでの実装との性能を比較し，8 倍精度が倍精度の 1/44 程度の性能の劣化になることを確認した．ハードウェア実装として，CPU の FSB に FPGA が結合された，Convey HC-1 を用いて，高性能な演算器を実装した．この FPGA ベースの実装を用いた場合，POWER7 の 8 コアのシステムに比べ，約 4.5 倍の 8 倍精度浮動小数点処理性能を実現した．

論文抄録(英)

内容記述タイプ

Other

内容記述

The faster the processor becomes, the larger grows the size of the problem that the processor is capable of solving. The number of operations that are applied to input data is subject to the size and the number of iterations. There are algorithms where the error accumulates as the size or the number of iterations increases. To apply these algorithms to the larger set of problems that are solved on the next-generation computers, a higher-precision floating point format is required. Notwithstanding the need, there are little support for arithmetic on floating point numbers of quadruple or more precisions. When they really needed it they tend to implement them using software. Using software to process higher-precision floating point number benefits from portability, but at the grave cost of the performance. When the performance is limited, we often do not need higher precision floating point numbers in the first place. We propose an extension to the IEEE 754 floating point number formats to define a octuple-precision (256-bit) floating point numbers. We compared the performance of our octuple precision implementation to the double-precision operations on IBM POWER7. On POWER7, octuple precision operations take about 44 times more processing time than double-precision counterparts. We implemented FPGA-based arithmetic unit for the data format on Convey HC-1 system, where FPGA chips are connected to the host using the front side bus. On this system, octuple precision operations are 4.5 times faster than those on the 8-core POWER7 system.

書誌レコードID

収録物識別子タイプ

NCID

収録物識別子

AN10463942

書誌情報

研究報告ハイパフォーマンスコンピューティング（HPC）

巻 2011-HPC-130, 号 45, p. 1-7, 発行日 2011-07-20

Notice

SIG Technical Reports are nonrefereed and hence may later appear in any journals, conferences, symposia, etc.

出版者

言語

出版者

情報処理学会

戻る

views

See details

	Views

Versions

Ver.1

2025-01-21 21:10:06.795547

Show All versions

Cite as

エクスポート

OAI-PMH

JPCOAR
DublinCore
DDI

Other Formats

JSON
BIBTEX

インデックスリンク

インデックスツリー

アイテム

高性能な8倍精度浮動小数点演算機構の実現

× 泊, 久信平木, 敬

× Hisanobu, Tomari Kei, Hiraki

Versions

Share

Cite as

エクスポート

インデックスリンク

インデックスツリー

アイテム

高性能な8倍精度浮動小数点演算機構の実現

× 泊, 久信 平木, 敬

× Hisanobu, Tomari Kei, Hiraki

Versions

Share

Cite as

エクスポート

× 泊, 久信平木, 敬