A Real-time Audio-to-audio Karaoke Generation System for Monaural Recordings Based on Singing Voice Suppression and Key Conversion Techniques
https://ipsj.ixsq.nii.ac.jp/records/160365
License | Copyright (c) 2016 by the Information Processing Society of Japan |
Access | Open access |
Item type | Journal(1) |
---|---|
Publication date | 2016-05-15 |
Title | A Real-time Audio-to-audio Karaoke Generation System for Monaural Recordings Based on Singing Voice Suppression and Key Conversion Techniques |
Title language | en |
Language | eng |
Subject scheme | Other |
Subject | [Special Issue: Advances and Expansion of Music Information Processing Technology] karaoke, music signal processing, singing voice, music application |
Resource type identifier | http://purl.org/coar/resource_type/c_6501 |
Resource type | journal article |
Author affiliation | The University of Tokyo / Presently with PKSHA Technology Inc. |
Author affiliation | The University of Tokyo / Presently with Aichi Prefectural Government |
Author affiliation | National Institute of Informatics |
Author affiliation | School of Interdisciplinary Mathematical Sciences, Meiji University |
Author name | Hideyuki, Tachibana |
Author name | Yu, Mizuno |
Author name | Nobutaka, Ono |
Author name | Shigeki, Sagayama |
Abstract description type | Other |
Abstract | This paper describes an automatic karaoke generation system that can suppress the singing voice in audio music signals and can also change the pitch of the song. The system accepts streaming input and works in real time. To the best of our knowledge, no previous real-time audio-to-audio karaoke system has offered both of these functions. This paper describes the two technical components in particular, along with some comments on the implementation. The authors employed two signal processing techniques: singing voice suppression based on two-stage HPSS (a vocal enhancement technique that the authors proposed previously), and pitch shifting based on spectrogram stretching and a phase vocoder. The attached video file shows that the system works in real time and that the sound quality may be acceptable in practice. This is a preprint of an article intended for publication in the Journal of Information Processing (JIP). This preprint should not be cited. This article should be cited as: Journal of Information Processing Vol.24(2016) No.3 (online) DOI http://dx.doi.org/10.2197/ipsjjip.24.470 |
Bibliographic record ID type | NCID |
Bibliographic record ID | AN00116647 |
Bibliographic information | 情報処理学会論文誌 (IPSJ Journal), Vol. 57, No. 5, issue date 2016-05-15 |
ISSN identifier type | ISSN |
ISSN | 1882-7764 |
Supplemental content relation type | isSupplementedBy |
Supplemental content identifier type | URI |
Supplemental content identifier | http://id.nii.ac.jp/1012/00000006/ |
Supplemental content name | A Real-time Audio-to-audio Karaoke Generation System for Monaural Recordings Based on Singing Voice Suppression and Key Conversion Techniques (Supplementary material) |
Supplemental content name language | en |
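
The abstract mentions key conversion by pitch shifting based on spectrogram stretching and a phase vocoder. The record itself gives no implementation details, so the following is only a minimal sketch of the stretching idea under stated assumptions: a key change of n semitones in 12-tone equal temperament scales every frequency by 2^(n/12), and one magnitude-spectrum frame is resampled along the frequency axis by that factor using linear interpolation. The function names `semitone_ratio` and `stretch_spectrum` are hypothetical, and the phase handling that a phase vocoder would provide is deliberately left out.

```python
import math


def semitone_ratio(semitones: float) -> float:
    """Frequency scaling factor for a shift of `semitones` (12-tone equal temperament)."""
    return 2.0 ** (semitones / 12.0)


def stretch_spectrum(magnitudes, ratio):
    """Stretch one magnitude-spectrum frame along the frequency axis.

    Output bin k takes the linearly interpolated value found at input
    position k / ratio, so a peak at bin b moves to roughly bin b * ratio.
    Positions beyond the last input bin are filled with 0.
    This is an illustrative assumption, not the method from the paper.
    """
    n = len(magnitudes)
    out = []
    for k in range(n):
        src = k / ratio              # fractional position in the input frame
        lo = int(math.floor(src))
        if lo >= n - 1:
            # At or past the last bin: copy it only on an exact hit.
            out.append(magnitudes[n - 1] if src == n - 1 else 0.0)
            continue
        frac = src - lo
        out.append((1.0 - frac) * magnitudes[lo] + frac * magnitudes[lo + 1])
    return out


# Usage: a frame with a single peak at bin 4, shifted up and down one octave.
frame = [0.0] * 16
frame[4] = 1.0
up = stretch_spectrum(frame, semitone_ratio(12))
down = stretch_spectrum(frame, semitone_ratio(-12))
```

In a full pitch shifter this resampling would be applied to every frame of a short-time Fourier transform, and the phases would then need to be made consistent between frames before resynthesis, which is the role the abstract assigns to the phase vocoder.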