スペクトログラムの階層的クラスタリングを用いたタイムスパン・セグメンテーション抽出について

澤田, 隼; 竹川, 佳成; 平田, 圭二; Shun, Sawada; Yoshinari, Takegawa; Keiji, Hirata

WEKO3

インデックスツリー

RootNode

アイテム

スペクトログラムの階層的クラスタリングを用いたタイムスパン・セグメンテーション抽出について

https://ipsj.ixsq.nii.ac.jp/records/186826

名前 / ファイル	ライセンス	アクション
IPSJ-JNL5903017.pdf (2.2 MB)	Copyright (c) 2018 by the Information Processing Society of Japan
オープンアクセス

Item type

Journal(1)

公開日

2018-03-15

タイトル

スペクトログラムの階層的クラスタリングを用いたタイムスパン・セグメンテーション抽出について

タイトル

言語

タイトル

On Extracting Time-span Segmentation Using Hierarchical Clustering of Spectrogram

言語

jpn

キーワード

主題Scheme

Other

主題

[特集：若手研究者] generative theory of tonal music，time-span segmentation，グレーレベル同時生起行列，自己相似性行列，系統樹

資源タイプ

資源タイプ識別子

http://purl.org/coar/resource_type/c_6501

資源タイプ

journal article

著者所属

公立はこだて未来大学／現在，公立はこだて未来大学大学院

著者所属

公立はこだて未来大学

著者所属

公立はこだて未来大学

著者所属(英)

Future University Hakodate / Presently with Intelligent Information Systems, Future University Hakodate

著者所属(英)

Future University Hakodate

著者所属(英)

Future University Hakodate

著者名

澤田, 隼
竹川, 佳成
平田, 圭二

著者名(英)

Shun, Sawada
Yoshinari, Takegawa
Keiji, Hirata

論文抄録

内容記述タイプ

Other

内容記述

本稿では，Generative Theory of Tonal Music（GTTM）を音楽のスペクトログラムに直接適用して階層的クラスタリングによってタイムスパン・セグメンテーションを生成する新しい方法を提案する．まず初めに，スペクトログラムを時間軸方向に分割し，周波数方向に縦長の矩形（bin）をピッチイベントとして，スペクトログラムを一連のbinの集合として考える．binのテクスチャの特徴は，グレーレベル同時生起行列（Gray level co-occurrence matrix: GLCM）を使用して抽出され，テクスチャ特徴量の時系列データを生成する．テクスチャ特徴量による隣接bin間の類似度によってフレーズの近接度および変化量が計算される．並列性および反復性などの大域的な構造は，一連のbinの自己相似性行列（Self-similarity matrix: SSM）によって検出される．隣接するbin間の境界の強さを表す時系列データが与えられ，隣接するbinをボトムアップに反復的に併合していくことで，最終的にタイムスパン・セグメンテーションに対応する系統樹を生成するアルゴリズムを開発する．MozartのK.331とK.550を入力して実験を行った結果，音高や調和などの音楽知識をほとんど考慮していないにもかかわらず，有望な結果が得られた．

論文抄録(英)

内容記述タイプ

Other

内容記述

We propose a new method of applying Generative Theory of Tonal Music directly to a spectrogram of music to produce a time-span segmentation as hierarchical clustering. We first consider a vertically long rectangle in a spectrogram (bin) as a pitch event and a spectrogram as a sequence of bins. The texture feature of a bin is extracted using a gray level co-occurrence matrix to generate a sequence of the texture features. The proximity and change of phrases are calculated by the distance between the adjacent bins by their texture features. The global structures such as parallelism and repetition are detected by a self-similarity matrix of a sequence of bins. We develop an algorithm which is given a sequence of the boundary strength between adjacent bins, iteratively merges adjacent bins in the bottom-up manner, and finally generates a dendrogram, which corresponds to a time-span segmentation. We conducted an experiment with inputting Mozart's K.331 and K.550 and obtained promising results although the algorithm does not take into account almost any musical knowledge such as pitch and harmony.

書誌レコードID

収録物識別子タイプ

NCID

収録物識別子

AN00116647

書誌情報

情報処理学会論文誌

巻 59, 号 3, p. 941-950, 発行日 2018-03-15

ISSN

収録物識別子タイプ

ISSN

収録物識別子

1882-7764

戻る

views

See details

	Views

Versions

Ver.1

2025-01-20 02:26:49.094118

Show All versions

Cite as

澤田, 隼, 竹川, 佳成, 平田, 圭二, 2018: 941–950 p.

エクスポート

OAI-PMH

JPCOAR
DublinCore
DDI

Other Formats

JSON
BIBTEX

インデックスリンク

インデックスツリー

アイテム

スペクトログラムの階層的クラスタリングを用いたタイムスパン・セグメンテーション抽出について

× 澤田, 隼

× 竹川, 佳成

× 平田, 圭二

× Shun, Sawada

× Yoshinari, Takegawa

× Keiji, Hirata

Versions

Share

Cite as

エクスポート