DiffGaze: A Diffusion Model for Modelling Fine-grained Human Gaze Behaviour on 360 \({}^{\circ}\) Images
\({}^{\circ}\)
images is important for various human–computer interaction applications. However, existing methods are limited to predicting discrete fixation sequences or aggregated saliency maps, thereby neglecting fine-grained gaze behaviour such as saccadic eye movements that can be captured by commercial eye-trackers. We introduce a more challenging task—
fine-grained gaze sequence generation
. This task aims to generate eye-tracker-like gaze data for given stimuli. We propose
DiffGaze
, a diffusion-based method for generating realistic and diverse fine-grained human gaze sequences conditioned on 360
\({}^{\circ}\)
images. We evaluate DiffGaze on two 360
\({}^{\circ}\)
image benchmarks for fine-grained gaze sequence generation as well as two downstream tasks, scanpath prediction and saliency prediction. Our evaluations show that DiffGaze outperforms the fine-grained gaze generation baselines in all tasks on both benchmarks. We also report a 21-participant survey study showing that our method generates gaze sequences that are indistinguishable from real human sequences. Taken together, our evaluations not only demonstrate the effectiveness of DiffGaze but also point towards a new generation of methods that faithfully model the rich spatial and temporal nature of natural human gaze behaviour.
No keywords indexed for this article. Browse by subject →
Zhiming Hu, Congyi Zhang, Sheng Li et al.
L. Itti, C. Koch, E. Niebur
Matthias Kümmerer, Matthias Bethge, Thomas S. A. Wallis
Ludwig Sidenmark, Hans Gellersen
Showing 50 of 69 references
- Published
- Jan 09, 2026
- Vol/Issue
- 16(1)
- Pages
- 1-23
You May Also Like
Sina Mohseni, Niloofar Zarei · 2021
448 citations
Cecilia Panigutti, Andrea Beretta · 2023
88 citations