Recent News

We will be hosting the tutorial of “Object-centric Representations from the definitions, feature learning to real-world applications” in CVPR 2024. Stay tuned and see you in Seattle!
Introduce our ICLR 2024 work, 🔥Instruct Video-to-Video🔥, an efficient approach for video editing that eliminates the need for per-video-per-model finetuning by constructing a synthetic paired video dataset. (Paper, Code)
Four papers got accpeted to ICCV 2023: OC-MOT (Paper, Code), Slot-Naming (Paper), C2F-Seg(Paper, Project Page), EoRaS(Paper).
One paper is accepted to ICLR 2023: Bridging the Gap to Real-World Object-Centric Learning. Paper link and code link.
One paper is accepted to NeurIPS 2022 (Spotlight): Self-supervised Amodal Video Object Segmentation. Paper link and code link.
One paper is accepted to ECCV 2022: PSS: Progressive Sample Selection for Open-World Visual Representation Learning. Check out our paper and code.
One paper is accepted to NeurIPS 2021: GRIN: Generative Relation and Intention Network for Multi-agent Trajectory Prediction. Check it out!

About Me

I am an Applied Science Manager at Amazon Web Service AI Shanghai Lablet, leading computer vision efforts. I play a lot with objects. In this period, I will be focusing on object-centric learning, visual-language model, graph neural network and causal representation learning, exploring and exploiting their usage in applications like video analysis, 3D vision, autonomous driving and robotics. I also contributed to the Graph Neural Network framework DGL and Object-centric Learning Framework OCLF.
Before joining Amazon, I was a Staff Machine Learning Scientist at Tesla Autopilot AI/Vision team, working with Dr. Andrej Karpathy. I was one of the major contributors of the Autopilot vision neural network stack and the task owner of Autopilot (Dynamic and Static) Object Detection during 2017 - 2020. My working items have been shipped into hundreds of thousands of Tesla cars worldwide during major Autopilot releases, contributing to Autopilot functionalities like Traffic-Aware Cruise Control, Auto Lane Change, Automatic Emergency Braking, Navigation on Autopilot, Smart Summon, etc.
Prior to Tesla, I spent 3.25 years at Microsoft. I was a Software Engineer 2 at Microsoft Bing Multimedia team (now under Microsoft AI & Research Org) working with Dr. Linjun Yang, where I was working on Image-Text Semantic Embedding to contribute to functionalities like Image Annotation and Image Search in Bing Search Engine. And during my graduate years, I interned at Microsoft Research Asia, advised by Prof. Zheng Zhang and Dr. Kuiyuan Yang, where I was working on both training platform and vision applications of deep learning. I was a major contributor of the open-source deep learning training framework Minerva and also contributed to the machine learning library MXNet.
I received M.S degree in Computer Science from Wangxuan Institute Of Computer Technology, Peking University, advised by Prof. Yuxin Peng. And B.S degree in Computer Science from Nankai University.
My enthusiasm is to apply machine learning to large-scale, life-changing technologies, currently with a focus on computer vision related applications.

Tianjun Xiao

Recent News

About Me