High-resolution image reconstruction with latent diffusion models from human brain activity
National Institute of Information and Communications Technology · Ube Frontier University · +1 more institution
Abstract
Reconstructing visual experiences from human brain activity offers a unique way to understand how the brain represents the world, and to interpret the connection between computer vision models and our visual system. While deep generative models have recently been employed for this task, reconstructing realistic images with high semantic fidelity is still a challenging problem. Here, we propose a new method based on a diffusion model (DM) to reconstruct images from human brain activity obtained via functional magnetic resonance imaging (fMRI). More specifically, we rely on a latent diffusion model (LDM) termed Stable Diffusion. This model reduces the computational cost of DMs, while preserving their high…
Citation impact
- FWCI
- 46.42
- Percentile
- 100%
- References
- 72
Authors
2Topics & keywords
- Computer science
- Artificial intelligence
- Generative model
- Fidelity
- Perspective (graphical)
- Deep learning
- Generative grammar
- Iterative reconstruction