Sho Takase
Tokyo Institute of Technology
H-index: 14
Asia-Japan
Top articles of Sho Takase
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
Device, method and program for natural language processing | 2024/1/4 | ||
Exploring effectiveness of gpt-3 in grammatical error correction: A study on performance and controllability in prompt-based methods | arXiv preprint arXiv:2305.18156 | Mengsay Loem Masahiro Kaneko Sho Takase Naoaki Okazaki | 2023/5/29 |
Nearest neighbor non-autoregressive text generation | Journal of Information Processing | Ayana Niwa Sho Takase Naoaki Okazaki | 2023 |
Spike No More: Stabilizing the Pre-training of Large Language Models | arXiv preprint arXiv:2312.16903 | Sho Takase Shun Kiyono Sosuke Kobayashi Jun Suzuki | 2023/12/28 |
Bridging the Gap between Subword and Character Segmentation in Pretrained Language Models | Shun Kiyono Sho Takase Shengzhe Li Toshinori Sato | 2023/9 | |
Dynamic Structured Neural Topic Model with Self-Attention Mechanism | Nozomu Miyamoto Masaru Isonuma Sho Takase Junichiro Mori Ichiro Sakata | 2023/7 | |
B2t connection: Serving stability and performance in deep transformers | arXiv preprint arXiv:2206.00330 | Sho Takase Shun Kiyono Sosuke Kobayashi Jun Suzuki | 2022/6/1 |
Word-level perturbation considering word length and compositional subwords | Tatsuya Hiraoka Sho Takase Kei Uchiumi Atsushi Keyaki Naoaki Okazaki | 2022/5 | |
Single Model Ensemble for Subword Regularized Models in Low-Resource Machine Translation | arXiv preprint arXiv:2203.13528 | Sho Takase Tatsuya Hiraoka Naoaki Okazaki | 2022/3/25 |
NT5 at WMT 2022 General Translation Task | Makoto Morishita Keito Kudo Yui Oka Katsuki Chousa Shun Kiyono | 2022/12 | |
Interpretability for language learners using example-based grammatical error correction | arXiv preprint arXiv:2203.07085 | Masahiro Kaneko Sho Takase Ayana Niwa Naoaki Okazaki | 2022/3/14 |
Are Neighbors Enough? Multi-Head Neural n-gram can be Alternative to Self-attention | arXiv preprint arXiv:2207.13354 | Mengsay Loem Sho Takase Masahiro Kaneko Naoaki Okazaki | 2022/7/27 |
Extraphrase: Efficient data augmentation for abstractive summarization | arXiv preprint arXiv:2201.05313 | Mengsay Loem Sho Takase Masahiro Kaneko Naoaki Okazaki | 2022/1/14 |
Information processing apparatus, information learning apparatus, information processing method, information learning method and program | 2022/7/21 | ||
Word coding device, analysis device, language model learning device, method, and program | 2021/11/4 | ||
Recurrent neural hidden Markov model for high-order transition | Transactions on Asian and Low-Resource Language Information Processing | Tatsuya Hiraoka Sho Takase Kei Uchiumi Atsushi Keyaki Naoaki Okazaki | 2021/10/31 |
Joint optimization of tokenization and downstream model | arXiv preprint arXiv:2105.12410 | Tatsuya Hiraoka Sho Takase Kei Uchiumi Atsushi Keyaki Naoaki Okazaki | 2021/5/26 |
Lessons on parameter sharing across layers in transformers | arXiv preprint arXiv:2104.06022 | Sho Takase Shun Kiyono | 2021/4/13 |
Rethinking perturbations in encoder-decoders for fast training | arXiv preprint arXiv:2104.01853 | Sho Takase Shun Kiyono | 2021/4/5 |
Optimizing word segmentation for downstream task | Tatsuya Hiraoka Sho Takase Kei Uchiumi Atsushi Keyaki Naoaki Okazaki | 2020/11 |