Imagine if robots could fill in the blanks in cluttered scenes.
✨ Enter RaySt3R: a single masked RGB-D image in, complete 3D out.
It infers depth, object masks, and confidence for novel views, and merges the predictions into a single point cloud. rayst3r.github.io
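For the curious, here is a minimal sketch of what the merge step could look like. It is not the RaySt3R code: the `views` dictionary layout, the pinhole intrinsics `K`, the camera-to-world poses, and the `conf_threshold` value are all assumptions made for illustration.

```python
import numpy as np

def unproject(depth, K, cam_to_world):
    """Lift an (H, W) depth map to (H*W, 3) world-space points (pinhole model)."""
    h, w = depth.shape
    u, v = np.meshgrid(np.arange(w), np.arange(h))  # pixel coordinates, shape (H, W)
    pix = np.stack([u, v, np.ones_like(u)], axis=-1).reshape(-1, 3).astype(np.float64)
    rays = pix @ np.linalg.inv(K).T                 # camera-frame rays with z = 1
    pts_cam = rays * depth.reshape(-1, 1)           # scale each ray by its depth
    R, t = cam_to_world[:3, :3], cam_to_world[:3, 3]
    return pts_cam @ R.T + t                        # rotate and translate into the world frame

def merge_views(views, conf_threshold=0.5):
    """Keep confident, in-mask points from every view and stack them into one cloud.

    Each view is a hypothetical dict with keys "depth", "mask", "conf",
    "K" (3x3 intrinsics), and "cam_to_world" (4x4 pose).
    """
    clouds = []
    for view in views:
        pts = unproject(view["depth"], view["K"], view["cam_to_world"])
        keep = (
            view["mask"].reshape(-1).astype(bool)
            & (view["conf"].reshape(-1) >= conf_threshold)
            & (view["depth"].reshape(-1) > 0)
        )
        clouds.append(pts[keep])
    return np.concatenate(clouds, axis=0) if clouds else np.empty((0, 3))
```

Each predicted view contributes only the pixels that are inside its object mask and above the confidence threshold, so low-confidence predictions do not pollute the merged cloud.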
Big thanks to the awesome contributors to this project: Jan Oberst, @bowenwen_me, @BirchfieldStan, @RamananDeva and @jeff_ichnowski. Also thanks to OctMAE author @s1wase, @nvidia for sponsoring compute 🖥️, and the scientists at @naverlabseurope for the inspiration!