The program will be available shortly. Please check back later.
Young Min Kim, Associate Professor at Seoul NationalUniversity, South Korea
Abstract:
Foundational models contain impressive emergent knowledge demonstrated by seminal success in various multimodal generative tasks. Nonetheless, these models are known to fail in basic spatial perception tasks despite requiring a significant amount of energy and training data. In this talk, I would like to share various 3D representations that encapsulate necessary 3D knowledge. Deep learning architecture on 3D data often utilizes regular structure but may take advantage of the sparsity of surface geometry. However, the information can still be extensive, and we propose using line structure to convey compact structural information. Additionally, functional representations, combined with latent representations, may provide compressed embedding that might be useful for various downstream applications. Finally, I will conclude with future directions incorporating natural dynamics to blend physical interpretation into the representations.
.
In compliance with GDPR consent requirements, presentations given in a Visual Intelligence context may be recorded with the consent of the speaker. All recordings are edited to remove all faces, names and voices of other participants. Questions and comments by the audience will hence be removed and will not appear in the recording. With the freely given consent from the speaker, recorded presentation may be posted on the Visual Intelligence YouTube channel.
This seminar is open for members of the consortium. If you want to participate as a guest please sign up.
Young Min Kim, Associate Professor at Seoul National University, South Korea
This seminar is open for members of the consortium. If you want to participate as a guest please sign up.
Young Min Kim, Associate Professor at Seoul National University, South Korea
This seminar is open for members of the consortium. If you want to participate as a guest please sign up.