Publications
* equal contribution. † corresponding author.
Remote Sensing Image Interpretation
arXiv 2025

DescribeEarth: Describe Anything for Remote Sensing Images
Kaiyu Li*, Zixuan Jiang*, Xiangyong Cao†, Jiayu Wang, Yuchen Xiao, Deyu Meng, Zhi Wang
Resources: Code, Dataset, Benchmark
- We introduce geo-spatial detailed localized captioning.
- We build the first describe-anything model in remote sensing.
- We release the related dataset and benchmark.
- CVPRW 2026The Second Challenge on Cross-Domain Few-Shot Object Detection at NTIRE 2026: Methods and Results
Xingyu Qiu, Yuqian Fu, Jiawei Geng, Bin Ren, ..., Kaiyu Li, Bowen Fu, Zixuan Jiang, Ke Li, Hui Qiao, Xiangyong Cao, ... - CJIG 2026Advances in Open-Vocabulary Perception for Remote-Sensing Images
Kaiyu Li, Xiangyong Cao†, Zixuan Jiang, Deyu Meng. - arXiv 2025Annotation-Free Open-Vocabulary Segmentation for Remote-Sensing Images
Kaiyu Li, Xiangyong Cao†, Ruixun Liu, Shihong Wang, Zixuan Jiang, Zhi Wang, Deyu Meng.
Audio Intelligence
arXiv 2026

Towards Human-Like Interactive Speech Recognition With Agentic Correction and Semantic Evaluation
Zixuan Jiang*, Yanqiao Zhu*, Peng Wang*, Qinyuan Chen, Xinjian Zhao, Xipeng Qiu, Wupeng Wang, Zhifu Gao, Xiangang Li, Kai Yu, Xie Chen†
Resources: Project Page, Live Demo
- We propose Interactive ASR, extending one-pass ASR into an interactive system with user feedback and semantic correction.
- We propose Agentic ASR, an agent-based framework enabling interactive speech recognition.
- We develop the semantic consistency metric $S^2ER$ and a simulation framework, ISS, for evaluating Interactive ASR.
- arXiv 2026MMAE: A Massive Multitask Audio Editing Benchmark
Ziyang Ma, Ruiqi Yan, Ruiyang Xu, Jie Fang, ..., Yanru Huo, Zixuan Jiang, Xiquan Li, Yalin Li, ..., Xie Chen.
Code | Dataset - arXiv 2026Interactive ASR: Towards Human-Like Interaction and Semantic Coherence Evaluation for Agentic Speech Recognition
Peng Wang*, Yanqiao Zhu*, Zixuan Jiang*, Qinyuan Chen, Xingjian Zhao, Xipeng Qiu, Wupeng Wang, Zhifu Gao, Xiangang Li, Kai Yu, Xie Chen†