Personal Homepage

Hetian Guo Embodied AI, Robotics, and Intelligent Transportation Research

My current work centers on policy learning for embodied AI, dual-arm and humanoid VLA systems, and task planning and execution frameworks for robots. I have also worked on intelligent transportation systems, including multi-agent trajectory prediction, traffic flow modeling, and co-simulation.

Email Selected Work View CV

Current Stage

AI Algorithm Intern

Shenzhen Daohe Tongtai Robotics Co., Ltd. · Jan 2026 – Present

Incoming PhD student at the School of Artificial Intelligence, Jilin University, expected to start in Aug. 2026

Current Focus Embodied AI, dual-arm and humanoid VLA, robot task planning and execution

Research Background Intelligent transportation systems, trajectory prediction, traffic flow modeling, and co-simulation

About

Research Interests and Overview

My research interests include embodied AI, vision-language-action models, latent world models, multi-agent systems, and agent operating systems for robots. In my current work, I participate in building an agent operating system for robots by organizing language understanding, navigation, action generation, and speech interaction into reusable skills, and deploying these capabilities on platforms such as Unitree G1. I also study VLA frameworks with latent world models to improve modeling of future state evolution.

In intelligent transportation systems, I have also worked on multi-agent trajectory prediction, traffic flow and occupancy prediction, lane-level road network extraction from remote sensing imagery, and energy evaluation based on co-simulation and vehicle dynamics modeling.

Focus

Research Areas

Embodied AI and VLA Systems

I work on policy learning, VLA reproduction, action generation, and real-world deployment for dual-arm and humanoid robots, with emphasis on the coordination of language interaction, perception, and execution at the system level.

Latent World Models for Robot Manipulation

I explore integrating latent world models into robot manipulation so that future state evolution can be modeled from heterogeneous multi-source data, improving policy generalization, temporal modeling, and physical consistency.

ITS and Spatiotemporal Multi-Agent Modeling

I study trajectory prediction, traffic flow modeling, road network extraction, and co-simulation in complex traffic scenarios, integrating multi-agent interactions, map priors, and spatiotemporal dependencies into interpretable learning frameworks.

Timeline

Education and Internships

Education

Aug. 2026 onward

School of Artificial Intelligence, Jilin University

PhD in Artificial Intelligence · Changchun, China

Expected to start in Aug. 2026. Research interests include embodied AI, dual-arm and humanoid VLA algorithms, and embodied multi-agent systems.

2024 — 2025

University of Georgia

M.S. in Engineering (Mechanical Engineering Track) · Athens, GA, USA

Conducted research on intelligent transportation systems, autonomous driving, and co-simulation under the supervision of Prof. Yunli Shao.

2020 — 2024

Southern University of Science and Technology

B.Eng. in Computer Science and Technology · Shenzhen, China

Focused on spatiotemporal data mining and traffic prediction.

Internships

Jan. 2026 — Present

Shenzhen Daohe Tongtai Robotics Co., Ltd.

AI Algorithm Intern · Shenzhen, China

Designed a collaborative "big-brain/small-brain" policy system for embodied agents based on Aggno and the Qwen family; abstracted capabilities such as TTS, VLA, and navigation into reusable skills; and built a Pipecat-based online speech interaction agent to close the loop from natural language input to robot execution.

Aug. 2023 — Oct. 2023

LocationMind Inc.

Research Intern · Tokyo, Japan

Worked on AIS data cleaning, trajectory reconstruction, and ship behavior recognition, and extracted key maritime nodes using trajectory clustering and spatial statistics.

Jul. 2022 — Aug. 2022

Shenzhen Institute of Computational Science

Research Intern · Shenzhen, China

Built workflows for multimodal time-series data cleaning and causal discovery, and reproduced and evaluated algorithms such as PC, GES, and NOTEARS for dependency analysis in complex systems.

Selected Work

Projects and Publications

Ongoing 2026

VLA Frameworks with Latent World Models for Robotic Manipulation

I investigate how latent world models can be integrated into robot manipulation pipelines to address the lack of temporal modeling and physical priors in existing VLA systems. The goal is to improve policy generalization and future-state reasoning with heterogeneous data from multiple robot platforms.

ICTD 2026

Automatic Lane-Level Road Network Extraction from Aerial Imagery

Proposed a lane-level road network extraction framework for transportation digital twins, together with a heuristic intersection topology inference algorithm that supports export to standard formats such as OpenDRIVE and SUMO.

Guo, H., Shao, Y., Saroj, A., Xu, G., Yuan, J., Luo, X., Wang, C. (2026). Automatic lane-level road network extraction from aerial imagery for transportation digital twins. ICTD.

TRB 2025

Energy Evaluation with Vehicle Dynamics in XIL Co-Simulation

Built an XIL co-simulation framework integrating SUMO, a connected vehicle controller, and a Simulink vehicle dynamics model, and analyzed how model granularity and penetration rate affect system-level energy evaluation.

Yuan, J., Guo, H., Shao, Y., Xu, G., Saroj, A., Wang, C. (2025). Impact of vehicle dynamics on the energy evaluation of connected and automated vehicles via anything-in-the-loop co-simulation. TRB Annual Meeting.

MECC 2025

Traffic Flow-Aware Occupancy Prediction

Proposed a traffic flow-aware occupancy prediction framework for connected and automated vehicles, explicitly modeling spatiotemporal coupling between adjacent road segments to improve long-horizon consistency.

Guo, H., Shao, Y. (2025). Traffic flow aware occupancy prediction for energy and mobility centric connected and automated vehicles. MECC.

TR Part E 2024

Marine Traffic Flow Forecasting from AIS Data

Built an end-to-end marine traffic flow forecasting framework that decomposes traffic flow into trend and periodic components, jointly modeling spatiotemporal and static-dynamic dependencies for both accuracy and interpretability.

Yuan, Y., Guo, H., Fan, Z., Peng, Y., Zhang, J., Song, X., Shibasaki, R. (2024). A decomposable multi-fusion spatiotemporal marine traffic flow forecasting algorithm. Transportation Research Part E.

ICRA 2024

Multi-Agent Trajectory Prediction with Heterogeneous Hypergraphs

Proposed HHGNN to model high-order group interactions in complex traffic scenes through heterogeneous hypergraphs, with type-aware message passing and local high-definition map information.

Guo, H., Peng, Y., Fan, Z., Zhu, H., Song, X. (2024). HHGNN: Heterogeneous hypergraph neural network for traffic agents trajectory prediction in complex scenarios. IEEE ICRA.

Capabilities

Technical Skills

Programming and Engineering

Python, C++, MATLAB
PyTorch, PyTorch Lightning
Hugging Face Transformers / Diffusers / Accelerate

Deep Learning and Robotics

ROS / ROS2
Vision-Language-Action Models
Policy Learning, Imitation Learning, Inference & Fine-tuning

Embodied AI Experience

Reproduced dual-arm VLA baselines including PI0 / PI0.5, F1-VLA, and VLA-OS
Experience with humanoid robot policy deployment, teleoperation, and real-world execution
Built the policy integration and control pipeline for Unitree G1

Data and Systems

Worked with AgiBot, InternData, RobotWin, and EgoDex datasets
Experience in skill abstraction, task orchestration, and multimodal interaction pipelines
System design for robot task planning and execution

Simulation and ITS

SUMO, CARLA, Vissim
Simulink, IPG-CarMaker
Traffic flow modeling, trajectory prediction, and co-simulation

Contact

Feel Free to Reach Out

If you are interested in embodied AI, robotics, VLA systems, intelligent transportation systems, or interdisciplinary research collaboration, feel free to contact me by email.

hetianguoml@gmail.com +86 186 8239 8096 View CV PDF