Sicong Jiang

I'm a final-year Ph.D. candidate at McGill University's Centre for Intelligent Machines (CIM) and also as Research Director at Abaka AI. In the open-source AI community, I'm a core contributor at 2077AI2077AI and work closely with M-A-PM-A-P. I received my Master's degree in Electrical and Computer Engineering from Georgia Institute of Technology.

My research focuses on building reliable and robust AI agents powered by Large Language Models and multimodal foundation models. I develop structured benchmarks, agent environments, and reward models to enhance open-ended reasoning, long-horizon planning, and real-world robustness. My work aims to bridge foundation models and real-world autonomous systems, enabling agents that are both capable and trustworthy.

Scholar  •   LinkedIn  •   GitHub  •   X  •   WeChat  •   Email

Sicong Jiang

Shot in Tamarindo, Costa Rica

Feb 2026

🎉 Two papers accepted by ICRA 2026. Check FASIONAD+, MTRDrive

Jan 2026

🎉 One paper accepted by ICLR 2026. Check EditReward

Nov 2025

🎉 One paper accepted (oral) by Bridge Program of AAAI 2026.

Aug 2025

🎉 One paper accepted by EMNLP 2025. Check AgentThink

Aug 2025

Joined 2077AI-Foundation—thrilled to contribute to the AI open-source community!

Jul 2025

Joined Abaka AI as Research Scientist in Palo Alto, California

Jul 2025

🎉 One paper accepted by ICCV 2025 Foundation Models for AD Workshop. Check VLA4AD Survey

Apr 2025

Invited to contribute to Humanity's Last Exam AGI safety benchmark

Feb 2025

🎉 One paper accepted by ICLR 2025 Trustworthy LLM Workshop. Check SparseAttack-LLM4TS

Jan 2025

🎉 One paper accepted by AISTATS 2025. Check Attack-LLM4TS
* indicates equal contribution. For full list, visit Google Scholar
EditReward pipeline

EditReward: A Human-Aligned Reward Model for Instruction-Guided Image Editing

K. Wu*, S. Jiang*, M. Ku, P. Nie, M. Liu, W. Chen
ICLR 2026
Website  •  Paper  •  GitHub ⭐ 119

AgentThink: Tool-Augmented Reasoning in VLMs for Autonomous Driving

AgentThink: Tool-Augmented Reasoning in VLMs for Autonomous Driving

K. Qian*, S. Jiang*, Y. Zhong*, Z. Luo, Z. Huang, et al.
EMNLP 2025
Website  •  Paper  •  GitHub ⭐ 138

Survey on Vision-Language-Action Models for Autonomous Driving

A Survey on Vision–Language–Action Models for Autonomous Driving

S. Jiang*, Z. Huang*, K. Qian*, Z. Luo, T. Zhu, et al.
ICCV Workshop, 2025
Paper  •  GitHub ⭐ 504

FASIONAD+ framework

FASIONAD+: Enhanced Safety in Autonomous Driving with Adaptive Feedback

Z. Luo*, S. Jiang*, K. Qian*, Z. Huang, J. Miao, et al.
ICRA 2026
Paper

Attack-LLM4TS

Adversarial Vulnerabilities in Large Language Models for Time Series Forecasting

F. Liu*, S. Jiang*, L. Miranda-Moreno, S. Choi, L. Sun
AISTATS 2025
Paper  •  GitHub ⭐ 15

Communication-Aware RL

Communication-Aware Reinforcement Learning for Cooperative Adaptive Cruise Control

S. Jiang, S. Choi, L. Sun
TRB Annual Meeting (Oral), 2024
Paper

Director of Research
Aug 2025 - Feb 2026 · Palo Alto, CA, United States

Research: As a founding member of the Research team, I lead benchmarking and evaluation for agentic and multimodal LLMs. I led the EditReward project and co-developed large-scale benchmarks including SuperGPQA and VeriWeb.

Advanced Dataset & Pipeline Design: Architected and deployed high-difficulty dataset solutions and production pipelines across coding, IMO-level math, multimodal data, agentic trajectories, and RL environments. These datasets and pipelines are directly used for model training and evaluation for multiple frontier AI labs.

Data Scientist Intern
May 2025 – Aug 2025 · Remote

Multimodal Data Pipelines: Built data pipelines and multi-stage QA systems for multimodal LLM projects, overseeing large-scale annotation workflows and label consistency.

Dataset Quality & Validation: Conducted analysis and validation to refine annotations and ensure robust datasets for LLM post-training.

Research Assistant
2022 – Present · Montreal, QC, Canada

AgentThink (Agent Reasoning): Led a collaboration with Xiaomi and Tsinghua on tool-augmented reasoning for vision-language models in autonomous driving, achieving +54% answer accuracy on open-source models.

Adversarial LLM4TS: Developed a black-box attack framework and public benchmarks for LLM-based time-series forecasting, in collaboration with the Amazon Chronos and Nixtla teams.

Research Assistant
Aug 2019 – Dec 2020 · Atlanta, GA, United States

Multi-Agent RL Exploration: Developed a multi-agent search strategy combining MADDPG with frontier-based exploration, and built evaluation benchmarks for exploration efficiency.

Awards

2024

McGill Engineering Doctoral Award (MEDA)

2021

TISED Doctoral Recruitment Award (DRA), McGill University

2019

Outstanding Graduate of Liaoning Province; Most Influential Graduate, Northeastern University

2017

National 1st Prize, China Undergraduate Mathematical Contest in Modeling

2017

1st Class Academic Scholarship, Northeastern University

Academic Service

Conf.

NeurIPS, ICLR, AISTATS, CVPR, ICCV, COLM, EMNLP, AAAI, IROS, ICRA, ITSC Reviewer

Journ.

IEEE RA-L, Transportation Research Part C (TRC), IEEE T-ITS Reviewer

I enjoy music by Tyler, the Creator, SZA and Chappell Roan.

My favorite influencer is Allywoo on RedNote.

Cat: Bobo, a golden shaded British Shorthair who is good at programming with buttons.

Bobo