VCLA provides research opportunities for Undergraduate and Master's students at UCLA to work together with our Ph.D students and Postdocs.
See our Project Bulletin.

AI: Commonsense Reasoning, Multi-agent System, VR/AR Task Platforms

 

Cognition: Functionality, Physics, Intentionality, Causality, Value

A Massively Parallel and Scalable Multi-GPU Material Point Method

Xinlei Wang*, Yuxing Qiu*, Stuart R. Slattery, Yu Fang, Minchen Li, Song-Chun Zhu, Yixin Zhu, Min Tang, Dinesh Manocha, and Hongjing Lu
*Equal contributors

Joint Inference of States, Robot Knowledge, and Human (False-)Beliefs

Tao Yuan, Hangxin Liu, Lifeng Fan, Zilong Zheng, Tao Gao, Yixin Zhu, and Song-Chun Zhu

Human Causal Transfer: Challenges for Deep Reinforcement Learning

Mark Edmonds*, James Kubricht*, Colin Summers, Yixin Zhu, Brandon Rothrock, Song-Chun Zhu, and Hongjing Lu
*Equal first author

Spatially Perturbed Collision Sounds Attenuate Perceived Causality in 3D Launching Events

James Kubricht*, Yixin Zhu*, Wei Liang, Song-Chun Zhu, Chenfanfu Jiang, and Hongjing Lu
*Equal first author

Probabilistic Simulation Predicts Human Performance on Viscous Fluid-Pouring Problem

James Kubricht*, Chenfanfu Jiang*, Yixin Zhu*, Song-Chun Zhu, Demetri Terzopoulos, and Hongjing Lu
*Equal first author

Evaluating Human Cognition of Containing Relations with Physical Simulation

Wei Liang, Yibiao Zhao, Yixin Zhu, and Song-Chun Zhu

Detecting Potential Falling Objects by Inferring Human Action and Natural Disturbance

Bo Zheng*, Yibiao Zhao*, Joey C. Yu, Katsushi Ikeuchi, and Song-Chun Zhu
(*equal contribution)

Scene Parsing by Integrating Function, Geometry and Appearance Models

Yibiao Zhao, and Song-Chun Zhu

Beyond Point Clouds: Scene Understanding by Reasoning Geometry and Physics

Bo Zheng, Yibiao Zhao, Joey C. Yu, Katsushi Ikeuchi, and Song-Chun Zhu

 

Robotics: Task and Motion Planning, Robot Perception, Sensoring

 

Vision: Parsing Objects, Scenes and Events

PerspectiveNet: 3D Object Detection from a Single RGB Image via Perspective Points
Siyuan Huang, Yixin Chen, Tao Yuan Siyuan Qi, Yixin Zhu, and Song-Chun Zhu
Hoilistc++ Scene Understanding: Single-view 3D Holistic Scene Parsing and Human Pose Estimation with Human-Object Interaction and Physical Commensense
Yixin Chen*, Siyuan Huang*, Tao Yuan Siyuan Qi, Yixin Zhu, and Song-Chun Zhu
Cooperative Holistic Scene Understanding: Unifying 3D Object, Layout, and Camera Pose Estimation
Siyuan Huang, Siyuan Qi, Yinxue Xiao, Yixin Zhu, Ying Nian Wu,and Song-Chun Zhu
Holistic 3D Scene Parsing and Reconstruction from a Single RGB Image
Siyuan Huang, Siyuan Qi, Yixin Zhu, Yinxue Xiao, Yuanlu Xu,and Song-Chun Zhu
*Equal contributors
Tracking Occluded Objects and Recovering Incomplete Trajectories by Reasoning about Containment Relations and Human Actions
Wei Liang, Yixin Zhu, and Song-Chun Zhu
What is Where: Inferring Containment Relations from Videos
Wei Liang, Yibiao Zhao, Yixin Zhu, and Song-Chun Zhu
Mining And-Or Graphs for Graph Matching and Object Discovery

Quanshi Zhang, Ying Nian Wu, and Song-Chun Zhu

Learning Near-Optimal Cost-Sensitive Decision Policies for Object Detection

Tianfu Wu, and Song-Chun Zhu

Concurrent Action Detection with Structural Prediction
Modeling 4D Human-Object Interactions for Event and Object Recognition

Ping Wei, Yibiao Zhao, Nanning Zheng, and Song-Chun Zhu

Discriminatively Trained And-Or Tree Models for Object Detection

Xi Song, Tianfu Wu, Yunde Jia, and Song-Chun Zhu

Weakly Supervised Learning for Attribute Localization in Outdoor Scenes

Shuo Wang, Jungseock Joo, Yizhou Wang, and Song-Chun Zhu

Cost-Sensitive Top-down/Bottom-up Inference for Multiscale Activity Recognition

Mohamed R. Amer, Dan Xie, Mingtian Zhao, Sinisa Todorovic, and Song-Chun Zhu

Human Parsing using Stochastic And-Or Grammars and Rich Appearances

Brandon Rothrock, and Song-Chun Zhu

LEMMA: A Multi-view Dataset for LEarning Multi-agent Multi-task Activities

Baoxiong Jia, Yixin Chen, Siyuan Huang, Yixin Zhu, and Song-Chun Zhu

Joint Inference of States, Robot Knowledge, and Human (False-)Beliefs

Tao Yuan, Hangxin Liu, Lifeng Fan, Zilong Zheng, Tao Gao, Yixin Zhu, and Song-Chun Zhu

Configurable, Photorealistic Image Rendering and Ground Truth Synthesis by Sampling Stochastic Grammars Representing Indoor Scenes
Chenfanfu Jiang*, Yixin Zhu*, Siyuan Qi*, Siyuan Huang*, Lap-Fai Yu, Demetri Terzopoulos, and Song-Chun Zhu
*Equal contributors
Inferring Hidden Statuses and Actions
in Video by Causal Reasoning
Amy Fire, and Song-Chun Zhu

Joint Inference of Groups, Events and Human Roles in Aerial Videos

Tianmin Shu, Dan Xie, Brandon Rothrock, Sinisa Todorovic, and Song-Chun Zhu

Cosegmentation and Cosketch by Unsupervised Learning

Jifeng Dai, Ying Nian Wu, Jie Zhou, and Song-Chun Zhu

Human Attribute Recognition By Rich Appearance Dictionary

Jungseock Joo, Shuo Wang, and Song-Chun Zhu

Bottom-up / Top-down Inference Processes -- a Numerical Answer

Tianfu Wu, and Song-Chun Zhu

Image Parsing via Stochastic Scene Grammar

Yibiao Zhao, and Song-Chun Zhu

Video Primal Sketch

Z. Han, Z. Xu, and S.-C. Zhu

A Dynamic Model for Face Aging Simulation

Jinli Suo, Song-Chun Zhu, Shiguang Shan, and Xilin Chen

A Hierarchical and Contextual Model for Aerial Image Parsing

Jacob Porway, Qiongchen Wang, and Song-Chun Zhu

Restricted Visual Turing Test for Deep Scene and Event Understanding

Hang Qi, Tianfu Wu, Mun Wai Lee, and Song-Chun Zhu

 

Language: Joint Parsing, Grounding, Alignment, Communication

Learning: Statistics, Causality, Utility