Komei Sugiura
Keio University
H-index: 21
Asia-Japan
Top articles of Komei Sugiura
Title | Journal | Author(s) | Publication Date |
---|---|---|---|
Trimodal Navigable Region Segmentation Model: Grounding Navigation Instructions in Urban Areas | IEEE Robotics and Automation Letters | Naoki Hosomi, Shumpei Hatanaka, Yui Iioka, Wei Yang, Katsuyuki Kuyo | 2024/3/18 |
Polos: Multimodal Metric Learning from Human Feedback for Image Captioning | arXiv preprint arXiv:2402.18091 | Yuiga Wada, Kanta Kaneda, Daichi Saito, Komei Sugiura | 2024/2/28 |
Learning-To-Rank Approach for Identifying Everyday Objects Using a Physical-World Search Engine | IEEE Robotics and Automation Letters | Kanta Kaneda, Shunya Nagashima, Ryosuke Korekata, Motonari Kambara, Komei Sugiura | 2024/1/10 |
DialMAT: Dialogue-Enabled Transformer with Moment-Based Adversarial Training | arXiv preprint arXiv:2311.06855 | Kanta Kaneda, Ryosuke Korekata, Yuiga Wada, Shunya Nagashima, Motonari Kambara | 2023/11/12 |
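Instruction Comprehension and Object Manipulation for Target Objects and Placement Destinations Using Switching Head-Tail Funnel UNITER | | 是方諒介, 神原元就, 吉田悠, 石川慎太朗, 川崎陽祐, 髙橋正樹, 杉浦孔明 | 2023 |
Referring Expression Segmentation Using Large Vision-Language Models and Diffusion Probabilistic Models in Daily Life Support Tasks | | 飯岡雄偉, 吉田悠, 和田唯我, 畑中駿平, 杉浦孔明 | 2023 |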
Action Q-Transformer: Visual Explanation in Deep Reinforcement Learning with Encoder-Decoder Model using Action Query | arXiv preprint arXiv:2306.13879 | Hidenori Itaya, Tsubasa Hirakawa, Takayoshi Yamashita, Hironobu Fujiyoshi, Komei Sugiura | 2023/6/24 |
JaSPICE: Automatic Evaluation Metric Using Predicate-Argument Structures for Image Captioning Models | arXiv preprint arXiv:2311.04192 | Yuiga Wada, Kanta Kaneda, Komei Sugiura | 2023/11/7 |
Co-Scale Cross-Attentional Transformer for Object Rearrangement Tasks | | 松尾榛夏, 石川慎太朗, 杉浦孔明 | 2023 |
Nearest Neighbor Future Captioning: Generating Descriptions of Collision Risks in Object Placement Tasks | | 小松拓実, 神原元就, 畑中駿平, 松尾榛夏, 平川翼, 山下隆義, 藤吉弘亘, 杉浦孔明 | 2023 |
Switching Text-Based Image Encoders for Captioning Images With Text | IEEE Access | Arisa Ueda, Wei Yang, Komei Sugiura | 2023/6/2 |
Fully Automated Task Management for Generation, Execution, and Evaluation: A Framework for Fetch-and-Carry Tasks with Natural Language Instructions in Continuous Space | arXiv preprint arXiv:2311.04260 | Motonari Kambara, Komei Sugiura | 2023/11/7 |
Learning to Rank Physical Objects: A Physical-World Search Engine Based on Learning to Rank | | 兼田寛大, 神原元就, 杉浦孔明 | 2023 |
Automation and Execution of Fetch-and-Carry Tasks Based on Multimodal Language Processing | | 神原元就, 杉浦孔明 | 2023 |
Affective image captioning for visual artworks using emotion-based cross-attention mechanisms | IEEE Access | Shintaro Ishikawa, Komei Sugiura | 2023/3/10 |
Switching Head-Tail Funnel UNITER for Dual Referring Expression Comprehension with Fetch-and-Carry Tasks | | Ryosuke Korekata, Motonari Kambara, Yu Yoshida, Shintaro Ishikawa, Yosuke Kawasaki | 2023/10/1 |
Multimodal encoder with gated cross-attention for text-vqa tasks | 29th Annual Conference of the Language Processing Society | Wei Yang, Arisa Ueda, Komei Sugiura | 2023 |
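Automatic Evaluation and Analysis of Image Captioning Models Based on Scene Graphs | | 田中励雄, 和田唯我, 杉浦孔明 | 2023 |
Caption Generation for Images Containing Text Using Multi-Granularity Multimodal Information | | 楊巍, 植田有咲, 杉浦孔明 | 2023 |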
Multimodal Diffusion Segmentation Model for Object Segmentation from Manipulation Instructions | | Yui Iioka, Yu Yoshida, Yuiga Wada, Shumpei Hatanaka, Komei Sugiura | 2023/10/1 |