Search Results - Springer

Page %P

Close Plain text

Sort By Newest First Oldest First

Chapter and Conference Paper

PANDA: Prompt-Based Context- and Indoor-Aware Pretraining for Vision and Language Navigation

Pretrained visual-language models have extensive world kno- wledge and are widely used in visual and language navigation (VLN). However, they are not sensitive to indoor scenarios for VLN tasks. Another challe...

Ting Liu, Yue Hu, Wansen Wu, Youkai Wang, Kai Xu, Quanjun Yin in MultiMedia Modeling (2024)
Chapter and Conference Paper

ACT: Action-assoCiated and Target-Related Representations for Object Navigation

Object navigation tasks require an agent to find a target in an unknown environment based on its observations. Researchers employ various techniques, such as extracting high-level semantic information and buil...

Youkai Wang, Yue Hu, Wansen Wu, Ting Liu, Yong Peng in MultiMedia Modeling (2024)