情報学広場：情報処理学会電子図書館

WEKO3

インデックスツリー

RootNode

アイテム

マルチメディアコンテンツにおける音楽と映像の調和度計算モデル

https://ipsj.ixsq.nii.ac.jp/records/55812

名前 / ファイル	ライセンス	アクション
IPSJ-MUS07069006.pdf (1.2 MB)	Copyright (c) 2007 by the Information Processing Society of Japan
オープンアクセス

Item type

SIG Technical Reports(1)

公開日

2007-02-23

タイトル

タイトル

マルチメディアコンテンツにおける音楽と映像の調和度計算モデル

タイトル

言語

en

タイトル

A Computational Model of Congruency between Music and Video in Multimedia Content

言語

言語

jpn

資源タイプ

資源タイプ識別子

http://purl.org/coar/resource_type/c_18gh

資源タイプ

technical report

著者所属

京都大学大学院情報学研究科

著者所属

京都大学大学院情報学研究科

著者所属

京都大学大学院情報学研究科

著者所属

京都大学大学院情報学研究科

著者所属

京都大学大学院情報学研究科

著者所属(英)

en

Graduate School of Informatics, Kyoto University

著者所属(英)

en

Graduate School of Informatics, Kyoto University

著者所属(英)

en

Graduate School of Informatics, Kyoto University

著者所属(英)

en

Graduate School of Informatics, Kyoto University

著者所属(英)

en

Graduate School of Informatics, Kyoto University

著者名

西山, 正紘北原, 鉄朗駒谷, 和範尾形, 哲也奥乃, 博

著者名(英)

Masahiro, NISHIYAMA Tetsuro, KITAHARA Kazunori, KOMATANI Tetsuya, OGATA Hiroshi, G.OKUNO

論文抄録

内容記述タイプ

Other

内容記述

本稿では、アクセント構造およびムードの一致に基づいて、音楽と映像の調和の度合い（調和度）を計算する枠組を提案する。一般に、音楽と映像の調和要因としては、時間的なアクセントの一致による時間的調和と、ムードの一致による意味的調和の２つが存在する。従来の研究では、それぞれの要因のみしか扱っておらず、両要因を統一的に扱った事例は存在しない。そこで本稿では、音楽と映像の調和度を、アクセント構造の一致に基づいて定量化した調和度とムードの一致に基づいて定量化した調和度の重み付き線形和で表現する。アクセント構造の一致は音楽と映像それぞれの特徴量系列間の相関に基づいて、ムードの一致はそれぞれの特徴部分空間内における相互の特徴量の連想に基づいて定量化する。実映像作品を対象とし、本手法の有効性を実験により評価した。

論文抄録(英)

内容記述タイプ

Other

内容記述

In this paper, we propose a framework that understands congruency between music and video based on similarity of accent structure and mood. There are two types of congruency between music and video: temporal congruency related to synchronization of accents and semantic congruency related to similarity of mood. Previous works, however, have dealt only with either congruency. We model the temporal congruency based on the correlation between accent feature sequences extracted from audio and visual content, and the semantic congruency based on mutual mapping between two feature spaces representing music and video respectively. Then, we integrate the two types of congruency as a weighted linear sum. Our experiments with real-world content show the effects of our method.

書誌レコードID

収録物識別子タイプ

NCID

収録物識別子

AN10438388

書誌情報

情報処理学会研究報告音楽情報科学（MUS）

巻 2007, 号 15(2007-MUS-069), p. 31-36, 発行日 2007-02-23

Notice

SIG Technical Reports are nonrefereed and hence may later appear in any journals, conferences, symposia, etc.

出版者

言語

ja

出版者

情報処理学会

戻る

0

views

	Views

Versions

Ver.1

2025-01-22 05:16:31.731346

Show All versions

Share

Cite as

エクスポート

OAI-PMH

JPCOAR
DublinCore
DDI

Other Formats

JSON
BIBTEX