Apple researchers have developed SHARP technology that can generate realistic 3D scene representations from a single photo in under a second on a standard GPU. This technology uses a neural network with a single forward pass to regress 3D Gaussian representation parameters of the scene, supporting real-time rendering of high-resolution, detailed nearby views at over 100fps. Experimental results show that SHARP achieves state-of-the-art effects on multiple datasets, with LPIPS error reduced by 25-34% and DISTS error reduced by 21-43% compared to previous best models, while synthesis time is reduced by three orders of magnitude. The technology has metric properties, supports absolute scale and camera movement, and demonstrates strong zero-shot generalization capabilities, potentially bringing revolutionary applications in AR/VR, autonomous driving, and gaming.
Original link:Hacker News
最新评论
照片令人惊艳。万分感谢 温暖。
氛围绝佳。由衷感谢 感受。 你的博客让人一口气读完。敬意 真诚。
实用的 杂志! 越来越好!
又到年底了,真快!
研究你的文章, 我体会到美好的心情。
感谢激励。由衷感谢
好久没见过, 如此温暖又有信息量的博客。敬意。
很稀有, 这么鲜明的文字。谢谢。