Publications

Publications by category in reverse chronological order. Generated by jekyll-scholar.

2026

  1. StreamingTOM: Streaming Token Compression for Efficient Video Understanding
    Xueyi Chen, Keda Tao, Kele Shao, and Huan Wang
    In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2026
  2. MobileKernelBench: Can LLMs Write Efficient Kernels for Mobile Devices?
    Xingze Zou*, Jing Wang*, Yuhua Zheng, Haolei Bai, Xueyi Chen, Lingcheng Kong, Zhaode Wang, Chengfei Lv, Syed A.R. Abu-Bakar, Haoji Hu, and Huan Wang
    In ICLR Workshop on Data-centric Foundation Models (DATA-FM), Mar 2026
  3. LVOmniBench: Pioneering Long Audio-Video Understanding Evaluation for Omnimodal LLMs
    Keda Tao*, Yuhua Zheng*, Jia Xu, Wenjie Du, Kele Shao, Hesong Wang, Xueyi Chen, Xin Jin, Junhan Zhu, Bohan Yu, Weiqiang Wang, Jian Liu, Can Qin, Yulun Zhang, Ming-Hsuan Yang, and Huan Wang
    arXiv preprint arXiv:2603.19217, Mar 2026
    Under review
  4. DICE: Diffusion Large Language Models Excel at Generating CUDA Kernels
    Haolei Bai, Lingcheng Kong, Xueyi Chen, Jianmian Wang, Zhiqiang Tao, and Huan Wang
    arXiv preprint arXiv:2602.11715, Feb 2026
    Under review

2025

  1. On-Policy Optimization with Group Equivalent Preference for Multi-Programming Language Understanding
    Haoyuan Wu*, Rui Ming*, Jilong Gao*, Hangyu Zhao, Xueyi Chen, Yikai Yang, Haisheng Zheng, Zhuolun He, and Bei Yu
    In Advances in Neural Information Processing Systems (NeurIPS), Sep 2025
  2. ToTRL: Unlock LLM Tree-of-Thoughts Reasoning Potential through Puzzles Solving
    Haoyuan Wu*, Xueyi Chen*, Rui Ming, Jilong Gao, Shoubo Hu, Zhuolun He, and Bei Yu
    arXiv preprint arXiv:2505.12717, May 2025
    Under review