Yosuke Higuchi at Waseda University

University	Waseda University
Citations(all)	704
Citations(since 2020)	702
Cited By	42
hIndex(all)	13
hIndex(since 2020)	13
i10Index(all)	14
i10Index(since 2020)	14

Parody Detection Using Source-Target Attention with Teacher-Forced Lyrics

2024/4/14

Yosuke Higuchi

H-Index: 3

Naoki Okamoto

H-Index: 11

Tetsuji Ogawa

H-Index: 10

Mask-Conformer: Augmenting Conformer with Mask-Predict Decoder

2023/12/16

Yosuke Higuchi

H-Index: 3

Yuan Wang

H-Index: 4

Murali Karthick Baskar

H-Index: 9

Segment-Level Vectorized Beam Search Based on Partially Autoregressive Inference

2023/12/16

Nicholas Eng

H-Index: 1

Yosuke Higuchi

H-Index: 3

Shinji Watanabe

H-Index: 45

Harnessing the Zero-Shot Power of Instruction-Tuned Large Language Model in End-to-End Speech Recognition

arXiv preprint arXiv:2309.10524

2023/9/19

Yosuke Higuchi

H-Index: 3

Tetsuji Ogawa

H-Index: 10

Tetsunori Kobayashi

H-Index: 15

Spotting Parodies: Detecting Alignment Collapse Between Lyrics and Singing Voice

2023/9

Yosuke Higuchi

H-Index: 3

Naoki Okamoto

H-Index: 11

Tetsuji Ogawa

H-Index: 10

Mask-CTC-based Encoder Pre-training for Streaming End-to-End Speech Recognition

2023/9

Yosuke Higuchi

H-Index: 3

Tetsuji Ogawa

H-Index: 10

Tetsunori Kobayashi

H-Index: 15

Parody Detection Based on Alignment Collapse Between Lyrics and Singing Voice

IEICE Technical Report; IEICE Tech. Rep.

2023/6/16

Yosuke Higuchi

H-Index: 3

Naoki Okamoto

H-Index: 11

Tetsuji Ogawa

H-Index: 10

BECTRA: Transducer-based end-to-end ASR with BERT-enhanced encoder

2023/6/4

Yosuke Higuchi

H-Index: 3

Tetsuji Ogawa

H-Index: 10

Tetsunori Kobayashi

H-Index: 15

Shinji Watanabe

H-Index: 45

InterMPL: Momentum Pseudo-Labeling With Intermediate CTC Loss

2023/6/4

Yosuke Higuchi

H-Index: 3

Tetsuji Ogawa

H-Index: 10

Tetsunori Kobayashi

H-Index: 15

Shinji Watanabe

H-Index: 45

Metric learning of speaker diarization

2023/5/16

A study on the integration of pre-trained SSL, ASR, LM and SLU models for spoken language understanding

2023/1/9

Yifan Peng

H-Index: 0

Siddhant Arora

H-Index: 3

Yosuke Higuchi

H-Index: 3

Karthik Ganesan

H-Index: 3

Siddharth Dalmia

H-Index: 7

Xuankai Chang

H-Index: 11

Shinji Watanabe

H-Index: 45

BERT meets CTC: New formulation of end-to-end speech recognition with pre-trained masked language model

arXiv preprint arXiv:2210.16663

2022/12

Yosuke Higuchi

H-Index: 3

Brian Yan

H-Index: 0

Siddhant Arora

H-Index: 3

Tetsuji Ogawa

H-Index: 10

Tetsunori Kobayashi

H-Index: 15

Shinji Watanabe

H-Index: 45

CTC alignments improve autoregressive translation

arXiv preprint arXiv:2210.05200

2022/10/11

Brian Yan

H-Index: 0

Siddharth Dalmia

H-Index: 7

Yosuke Higuchi

H-Index: 3

Graham Neubig

H-Index: 46

Florian Metze

H-Index: 30

Alan W Black

H-Index: 44

Shinji Watanabe

H-Index: 45

Momentum pseudo-labeling: Semi-supervised ASR with continuously improving pseudo-labels

IEEE Journal of Selected Topics in Signal Processing

2022/8/1

Yosuke Higuchi

H-Index: 3

Improving non-autoregressive end-to-end speech recognition with pre-trained acoustic and language models

2022/5/23

Shinji Watanabe

H-Index: 45

Yosuke Higuchi

H-Index: 3

Advancing momentum pseudo-labeling with Conformer and initialization strategy

2022/5/23

Yosuke Higuchi

H-Index: 3

Hierarchical conditional end-to-end ASR with CTC and multi-granular subword units

2022/5/23

Yosuke Higuchi

H-Index: 3

Tetsuji Ogawa

H-Index: 10

Tetsunori Kobayashi

H-Index: 15

An investigation of enhancing CTC model for triggered attention-based streaming ASR

2021/12

Tetsuji Ogawa

H-Index: 10

Tetsunori Kobayashi

H-Index: 15

A comparative study on non-autoregressive modelings for speech-to-text generation

2021/12/13

Yosuke Higuchi

H-Index: 3

Nanxin Chen

H-Index: 18

Hirofumi Inaguma

H-Index: 9

Tianzi Wang

H-Index: 2

Shinji Watanabe

H-Index: 45

Non-autoregressive end-to-end speech translation with parallel autoregressive rescoring

arXiv preprint arXiv:2109.04411

2021/9/9

Hirofumi Inaguma

H-Index: 9

Yosuke Higuchi

H-Index: 3

Kevin Duh

H-Index: 29

Tatsuya Kawahara

H-Index: 23

Shinji Watanabe

H-Index: 45
