Highly contributed researchers
In recent 10 years: Junichi Yamagishi (4)
Tomoki Toda (3)
Since 2007: Junichi Yamagishi (5)
Tomoki Toda (3) / Takahiro Shinozaki (3)
Statistics
Editorial Introduction to the Special Issue on Biometrics at a Distance in the Deep Learning Era
Authors: Manuel J. Marn-Jimnez, Shiqi Yu, Yasushi Makihara, Vishal M. Patel, Maneet Singh, Maria De Marsico
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing
Authors: Sanyuan Chen, Chengyi Wang, Zhengyang Chen, Yu Wu, Shujie Liu, Zhuo Chen, Jinyu Li, Naoyuki Kanda, Takuya Yoshioka, Xiong Xiao, Jian Wu, Long Zhou, Shuo Ren, Yanmin Qian, Yao Qian, Jian Wu, Michael Zeng, Xiangzhan Yu, Furu Wei
Momentum Pseudo-Labeling: Semi-Supervised ASR With Continuously Improving Pseudo-Labels
Authors: Yosuke Higuchi, Niko Moritz, Jonathan Le Roux, Takaaki Hori
A Comparative Study of Self-Supervised Speech Representation Based Voice Conversion
Authors: Wen-Chin Huang, Shu-Wen Yang, Tomoki Hayashi, Tomoki Toda
Automatic Spoken Language Acquisition Based on Observation and Dialogue
Authors: Ryota Komatsu, Shengzhou Gao, Wenxin Hou, Mingxin Zhang, Tomohiro Tanaka, Keisuke Toyoda, Yusuke Kimura, Kent Hino, Yu Iwamoto, Kosuke Mori, Takuma Okamoto, Takahiro Shinozaki
Investigation of Japanese PnG BERT Language Model in Text-to-Speech Synthesis for Pitch Accent Language
Authors: Yusuke Yasuda and Tomoki Toda
Uncertainty-Aware Semantic Guidance and Estimation for Image Inpainting
Authors: Liang Liao, Jing Xiao, Zheng Wang, Chia-Wen Lin, Shin'ichi Satoh
Deep Griffin-Lim Iteration: Trainable Iterative Phase Reconstruction Using Neural Network
Authors: Yoshiki Masuyama, Kohei Yatabe, Yuma Koizumi, Yasuhiro Oikawa, Noboru Harada
Iteratively Training Look-Up Tables for Network Quantization
Authors: Fabien Cardinaux, Stefan Uhlich, Kazuki Yoshiyama, Javier Alonso Garca, Lukas Mauch, Stephen Tiedemann, Thomas Kemp, Akira Nakamura
A 1.15-TOPS 6.57-TOPS/W Neural Network Processor for Multi-Scale Object Detection With Reduced Convolutional Operations
Authors: Reiya Kawamoto, Masakazu Taichi, Masaya Kabuto, Daisuke Watanabe, Shintaro Izumi, Masahiko Yoshimoto, Hiroshi Kawaguchi, Go Matsukawa, Toshio Goto, Motoshi Kojima
Differential-Detection Aided Large-Scale Generalized Spatial Modulation is Capable of Operating in High-Mobility Millimeter-Wave Channels
Authors: Naoki Ishikawa, Rakshith Rajashekar, Chao Xu, Mohammed El-Hajjar, Shinya Sugiura, Lie-Liang Yang, Lajos Hanzo
Constant-Envelope Space-Time Shift Keying
Authors: Chao Xu, Tong Bai, Jiankang Zhang, Robert G. Maunder, Shinya Sugiura, Zhaocheng Wang, Lajos Hanzo
An Overview of Enhanced Massive MIMO With Array Signal Processing Techniques
Authors: Mingjin Wang, Feifei Gao, Shi Jin, Hai Lin
Introduction to the Issue on Far-Field Speech Processing in the Era of Deep Learning: Speech Enhancement, Separation, and Recognition
Authors: Shinji Watanabe, Shoko Araki, Michiel Bacchiani, Reinhold Haeb-Umbach, Michael L. Seltzer
SpeakerBeam: Speaker Aware Neural Network for Target Speaker Extraction in Speech Mixtures
Authors: Katerina Zmolkov, Marc Delcroix, Keisuke Kinoshita, Tsubasa Ochiai, Tomohiro Nakatani, Luks Burget, Jan Cernock
Design of Non-Orthogonal Beamspace Multiple Access for Cellular Internet-of-Things
Authors: Rundong Jia, Xiaoming Chen, Caijun Zhong, Derrick Wing Kwan Ng, Hai Lin, Zhaoyang Zhang
Angle Feedback for NOMA Transmission in mmWave Drone Networks
Authors: Nadisanka Rupasinghe, Yavuz Yapici, Ismail Gven, Monisha Ghosh, Yuichi Kakishima
Adversarial Training for Speech Super-Resolution
Authors: Sefik Emre Eskimez, Kazuhito Koishida, Zhiyao Duan
Sparse Representation of a Spatial Sound Field in a Reverberant Environment
Authors: Shoichi Koyama and Laurent Daudet
Subspace-Based Algorithms for Localization and Tracking of Multiple Near-Field Sources
Authors: Weiliang Zuo, Jingmin Xin, Hiromitsu Ohmori, Nanning Zheng, Akira Sano
Wasserstein Stationary Subspace Analysis
Authors: Stephan Kaltenstadler, Shinichi Nakajima, Klaus-Robert Mller, Wojciech Samek
Estimation of Deterioration Levels of Transmission Towers via Deep Learning Maximizing Canonical Correlation Between Heterogeneous Features
Authors: Keisuke Maeda, Sho Takahashi, Takahiro Ogawa, Miki Haseyama
Unified Architecture for Multichannel End-to-End Speech Recognition With Neural Beamforming
Authors: Tsubasa Ochiai, Shinji Watanabe, Takaaki Hori, John R. Hershey, Xiong Xiao
Hybrid CTC/Attention Architecture for End-to-End Speech Recognition
Authors: Shinji Watanabe, Takaaki Hori, Suyoun Kim, John R. Hershey, Tomoki Hayashi
Almost Tight Spectral Graph Wavelets With Polynomial Filters
Authors: David B. H. Tay, Yuichi Tanaka, Akie Sakiyama
Spoofing Speech Detection Using Modified Relative Phase Information
Authors: Longbiao Wang, Seiichi Nakagawa, Zhaofeng Zhang, Yohei Yoshida, Yuta Kawakami
ASVspoof: The Automatic Speaker Verification Spoofing and Countermeasures Challenge
Authors: Zhizheng Wu, Junichi Yamagishi, Tomi Kinnunen, Cemal Hanili, Md. Sahidullah, Aleksandr Sizov, Nicholas W. D. Evans, Massimiliano Todisco
Introduction to the Issue on Spoofing and Countermeasures for Automatic Speaker Verification
Authors: Junichi Yamagishi, Tomi Kinnunen, Nicholas W. D. Evans, Phillip L. De Leon, Isabel Trancoso
Wireless Power Transfer for Distributed Estimation in Sensor Networks
Authors: Vien V. Mai, Won-Yong Shin, Koji Ishibashi
QUVE: QoE Maximizing Framework for Video-Streaming
Authors: Takuto Kimura, Masahiro Yokota, Arifumi Matsumoto, Kei Takeshita, Taichi Kawano, Kazumichi Sato, Hiroshi Yamamoto, Takanori Hayashi, Kohei Shiomoto, Kenichi Miyazaki
Closing the Gap: Human Factors in Cross-Device Media Synchronization
Authors: Mu Mu, Lyndon Fawcett, Jamie Bird, Jamie Jellicoe, Steven Simpson, Hans Stokking, Nicholas J. P. Race
Multilinear Discriminant Analysis With Subspace Constraints for Single-Trial Classification of Event-Related Potentials
Authors: Hiroshi Higashi, Tomasz M. Rutkowski, Toshihisa Tanaka, Yuichi Tanaka
Proposal on Millimeter-Wave Channel Modeling for 5G Cellular System
Authors: Sooyoung Hur, Sangkyu Baek, Byungchul Kim, Youngbin Chang, Andreas F. Molisch, Theodore S. Rappaport, Katsuyuki Haneda, Jeongho Park
Introduction to the Issue on Spatial Audio
Authors: Lauri Savioja, Akio Ando, Ramani Duraiswami, Emanul A. P. Habets, Sascha Spors
Spring Model Based Collaborative Indoor Position Estimation With Neighbor Mobile Devices
Authors: Daisuke Taniuchi, Xiaopeng Liu, Daisuke Nakai, Takuya Maekawa
Glottal Spectral Separation for Speech Synthesis
Authors: Joo P. Cabral, Korin Richmond, Junichi Yamagishi, Steve Renals
Integrated Expression Prediction and Speech Synthesis From Text
Authors: Langzhou Chen, Mark J. F. Gales, Norbert Braunschweiler, Masami Akamine, Kate Knill
Statistical Parametric Speech Synthesis Based on Gaussian Process Regression
Authors: Tomoki Koriyama, Takashi Nose, Takao Kobayashi
A Parameter Generation Algorithm Using Local Variance for HMM-Based Speech Synthesis
Authors: Takashi Nose, Vataya Chunwijitra, Takao Kobayashi
Combining Vocal Tract Length Normalization With Hierarchical Linear Transformations
Authors: Lakshmi Saheer, Junichi Yamagishi, Philip N. Garner, John Dines
Contextual Additive Structure for HMM-Based Speech Synthesis
Authors: Shinji Takaki, Yoshihiko Nankaku, Keiichi Tokuda
Parameter Generation Methods With Rich Context Models for High-Quality and Flexible Text-To-Speech Synthesis
Authors: Shinnosuke Takamichi, Tomoki Toda, Yoshinori Shiga, Sakriani Sakti, Graham Neubig, Satoshi Nakamura
Introduction to the Issue on Statistical Parametric Speech Synthesis
Authors: Jianhua Tao, Keikichi Hirose, Keiichi Tokuda, Alan W. Black, Simon King
Building HMM-TTS Voices on Diverse Data
Authors: Vincent Wan, Javier Latorre, Kayoko Yanagisawa, Norbert Braunschweiler, Langzhou Chen, Mark J. F. Gales, Masami Akamine
Multipath Exploitation in Non-LOS Urban Synthetic Aperture Radar
Authors: Pawan Setlur, Tadahiro Negishi, Natasha Devroye, Danilo Erricolo
Adaptive Loop Filtering for Video Coding
Authors: Chia-Yang Tsai, Ching-Yeh Chen, Tomoo Yamakage, In Suk Chong, Yu-Wen Huang, Chih-Ming Fu, Takayuki Itoh, Takashi Watanabe, Takeshi Chujoh, Marta Karczewicz, Shawmin Lei
Minkovskian Gradient for Sparse Optimization
Authors: Shun-ichi Amari and Masahiro Yukawa
Learning Ancestral Atom via Sparse Coding
Authors: Toshimitsu Aritake, Hideitsu Hino, Noboru Murata
Robust Independent Component Analysis via Minimum Gamma -Divergence Estimation
Authors: Pengwen Chen, Hung Hung, Osamu Komori, Su-Yun Huang, Shinto Eguchi
Introduction to the issue on differential geometry in signal processing
Authors: Jonathan H. Manton, David Applebaum, Shiro Ikeda, Nicolas Le Bihan
Introduction to the issue on adaptation and learning over complex networks
Authors: Ali H. Sayed, Sergio Barbarossa, Sergios Theodoridis, Isao Yamada
Enhancement of Depth Maps With Alpha Channel Estimation for 3-D Video
Authors: Ji-Ho Cho, Kwan H. Lee, Kiyoharu Aizawa
New Video Coding Scheme Optimized for High-Resolution Video Sources
Authors: Kohtaro Asai, Tokumichi Murakami, Shuichi Yamagishi, Akira Minezawa, Yusuke Itani, Kazuo Sugimoto, Shun-ichi Sekiguchi, Yoshihisa Yamada, Yoshiaki Kato
Introduction to the Issue on Emerging Technologies for Video Compression
Authors: David R. Bull, Edward J. Delp, Seishi Takamura, Thomas Wiegand, Feng Wu
LyricSynchronizer: Automatic Synchronization System Between Musical Audio Signals and Lyrics
Authors: Hiromasa Fujihara, Masataka Goto, Jun Ogata, Hiroshi G. Okuno
Introduction to the Special Issue on Music Signal Processing
Authors: Meinard Mller, Daniel P. W. Ellis, Anssi Klapuri, Gal Richard, Shigeki Sagayama
Polyphonic Pitch Estimation and Instrument Identification by Joint Modeling of Sustained and Attack Sounds
Authors: Jun Wu, Emmanuel Vincent, Stanislaw Andrzej Raczynski, Takuya Nishimoto, Nobutaka Ono, Shigeki Sagayama
Distributed Asymptotic Minimization of Sequences of Convex Functions by a Broadcast Adaptive Subgradient Method
Authors: Renato L. G. Cavalcante, Alex Rogers, Nicholas R. Jennings, Isao Yamada
Dynamic Coded Cooperation Using Multiple Turbo Codes in Wireless Relay Networks
Authors: Koji Ishibashi, Koji Ishii, Hideki Ochiai
Online Unsupervised Classification With Model Comparison in the Variational Bayes Framework for Voice Activity Detection
Authors: David Cournapeau, Shinji Watanabe, Atsushi Nakamura, Tatsuya Kawahara
Measuring the Gap Between HMM-Based ASR and TTS
Authors: John Dines, Junichi Yamagishi, Simon King
A Sequential Pattern Classifier Based on Hidden Markov Kernel Machine and Its Application to Phoneme Classification
Authors: Yotaro Kubo, Shinji Watanabe, Atsushi Nakamura, Erik McDermott, Tetsunori Kobayashi
Unsupervised Acoustic Model Adaptation Based on Ensemble Methods
Authors: Takahiro Shinozaki, Yu Kubota, Sadaoki Furui
Long-Term Spectro-Temporal and Static Harmonic Features for Voice Activity Detection
Authors: Takashi Fukuda, Osamu Ichikawa, Masafumi Nishimura
Dynamic Features in the Linear-Logarithmic Hybrid Domain for Automatic Speech Recognition in a Reverberant Environment
Authors: Osamu Ichikawa, Takashi Fukuda, Masafumi Nishimura
Gaussian Mixture Optimization Based on Efficient Cross-Validation
Authors: Takahiro Shinozaki, Sadaoki Furui, Tatsuya Kawahara
Trellis Shaping for Controlling Envelope of Single-Carrier High-Order QAM Signals
Authors: Makoto Tanahashi and Hideki Ochiai
Comparison of Segmentation Methods for Melanoma Diagnosis in Dermoscopy Images
Authors: Margarida Silveira, Jacinto C. Nascimento, Jorge S. Marques, Andr R. S. Maral, Teresa Mendona, Syogo Yamauchi, Junji Maeda, Jorge Rozeira
Design of the Family of Orthogonal and Spectrally Efficient UWB Waveforms
Authors: Igor Dotlic and Ryuji Kohno