Lianyu Hu received his PhD from Tianjin University, China, where he was supervised by Prof Wei Feng. During his graduate studies, he worked closely with Prof Shenglan Liu. His research interests include Multimodal Learning, Embodied Intelligence, and Video Understanding.

🔥 News

  • We release TennisExpert, which aims to provide high-quality real-time commentary for tennis matches. It first introduces a large-scale, fine-grained, and diverse tennis dataset, termed TennisVL, to help MLLMs understand tennis. An expert model, termed TennisExpert, is then trained and achieves superior performance to Gemini-3.0-pro and GPT-5.2.

  • We release LightVLM, a highly efficient method for large vision-language models with a two-stage design. It improves model efficiency by first conducting visual token merging in the encoding stage and then adopting KV cache compression in the decoding stage. It achieves about 2× throughput across different benchmarks and a 3.21× throughput boost when outputting longer sequences.

πŸ–ŠοΈ Selected Publications ($\dagger$ denotes Corresponding Author)

📖 Technical Report

Accepted Publications

🎖 Honors and Awards

  • 2025.06, Outstanding Graduate
  • 2024.12, Outstanding Student Pacesetter (ten per year)
  • 2024.10, National Scholarship
  • 2023.10, National Scholarship

📖 Education

  • 2021-2025, PhD in Computer Science and Technology, Tianjin University
  • 2018-2021, MEng in Computer Science and Technology, Dalian University of Technology
  • 2014-2018, BSc in Electronics and Information Engineering, Dalian University of Technology