To solve the above problems, we propose a novel zero-shot goal-directed scanpath prediction model named CLIPGaze. We use CLIP to extract pre-matched features for the target prompt and input image, ...