A Real-time Audio-to-audio Karaoke Generation System for Monaural Recordings Based on Singing Voice Suppression and Key Conversion Techniques

Hideyuki, Tachibana; Yu, Mizuno; Nobutaka, Ono; Shigeki, Sagayama; Hideyuki, Tachibana; Yu, Mizuno; Nobutaka, Ono; Shigeki, Sagayama

WEKO3

インデックスツリー

RootNode

アイテム

A Real-time Audio-to-audio Karaoke Generation System for Monaural Recordings Based on Singing Voice Suppression and Key Conversion Techniques

https://ipsj.ixsq.nii.ac.jp/records/160365

名前 / ファイル	ライセンス	アクション
IPSJ-JNL5705002.pdf (1.7 MB)	Copyright (c) 2016 by the Information Processing Society of Japan
オープンアクセス

Item type

Journal(1)

公開日

2016-05-15

タイトル

A Real-time Audio-to-audio Karaoke Generation System for Monaural Recordings Based on Singing Voice Suppression and Key Conversion Techniques

タイトル

言語

タイトル

A Real-time Audio-to-audio Karaoke Generation System for Monaural Recordings Based on Singing Voice Suppression and Key Conversion Techniques

言語

eng

キーワード

主題Scheme

Other

主題

[特集：音楽情報処理技術の進歩とその拡がり] karaoke, music signal processing, singing voice, music application

資源タイプ

資源タイプ識別子

http://purl.org/coar/resource_type/c_6501

資源タイプ

journal article

著者所属

The University of Tokyo / Presently with PKSHA Technology Inc.

著者所属

The University of Tokyo / Presently with Aichi Prefectural Government

著者所属

National Institute of Informatics

著者所属

School of Interdisciplinary Mathematical Sciences, Meiji University

著者所属(英)

The University of Tokyo / Presently with PKSHA Technology Inc.

著者所属(英)

The University of Tokyo / Presently with Aichi Prefectural Government

著者所属(英)

National Institute of Informatics

著者所属(英)

School of Interdisciplinary Mathematical Sciences, Meiji University

著者名

Hideyuki, Tachibana
Yu, Mizuno
Nobutaka, Ono
Shigeki, Sagayama

著者名(英)

Hideyuki, Tachibana
Yu, Mizuno
Nobutaka, Ono
Shigeki, Sagayama

論文抄録

内容記述タイプ

Other

内容記述

This paper describes an automatic karaoke generation system, which can suppress the singing voice in audio music signals, and can also change the pitch of the song. Furthermore, this system accepts the streaming input, and it works in real-time. To the best of our knowledge, there have been no real-time audio-to-audio karaoke system that has the two functions above. This paper particularly describes the two technical components, as well as some comments on the implementation. In this system, the authors employed two signal processing techniques: singing voice suppression that is based on two-stage HPSS, a vocal enhancement technique that the authors proposed previously, and a pitch shift technique that is based on the spectrogram stretch and phase vocoder. The attached video file shows that the system works in real-time, and the sound quality may be practically acceptable.
\n------------------------------
This is a preprint of an article intended for publication Journal of
Information Processing(JIP). This preprint should not be cited. This
article should be cited as: Journal of Information Processing Vol.24(2016) No.3 (online)
DOI　http://dx.doi.org/10.2197/ipsjjip.24.470
------------------------------

論文抄録(英)

内容記述タイプ

Other

内容記述

書誌レコードID

収録物識別子タイプ

NCID

収録物識別子

AN00116647

書誌情報

情報処理学会論文誌

巻 57, 号 5, 発行日 2016-05-15

ISSN

収録物識別子タイプ

ISSN

収録物識別子

1882-7764

サプリメンタルコンテンツ

Versions

Ver.1

2025-01-20 06:55:43.565186

Show All versions

Cite as

Hideyuki, Tachibana, Yu, Mizuno, Nobutaka, Ono, Shigeki, Sagayama, 2016.

エクスポート

OAI-PMH

JPCOAR
DublinCore
DDI

Other Formats

JSON
BIBTEX

インデックスリンク

インデックスツリー

アイテム

A Real-time Audio-to-audio Karaoke Generation System for Monaural Recordings Based on Singing Voice Suppression and Key Conversion Techniques

× Hideyuki, Tachibana

× Yu, Mizuno

× Nobutaka, Ono

× Shigeki, Sagayama

× Hideyuki, Tachibana

× Yu, Mizuno

× Nobutaka, Ono

× Shigeki, Sagayama

Versions

Share

Cite as

エクスポート