Lianyu Hu is a PhD graduate of Tianjin University, China, where he was supervised by Prof. Wei Feng. During his graduate studies, he worked closely with Prof. Shenglan Liu. His research interests include multimodal learning, embodied intelligence, and video understanding.

🔥 News

  • We release LightVLM, a highly efficient method for large vision-language models with a two-stage design. It improves model efficiency by first merging visual tokens in the encoding stage and then applying KV cache compression in the decoding stage. It achieves about 2× throughput across different benchmarks and a 3.21× throughput boost when generating longer sequences.

📝 Publications

📖 Technical Report

🖊️ Selected Publications ($\dagger$ denotes corresponding author)

🎖 Honors and Awards

  • 2025.06, Outstanding Graduate
  • 2024.12, Outstanding Student Pacesetter (ten awarded per year)
  • 2024.10, National Scholarship
  • 2023.10, National Scholarship

📖 Education

  • 2021-2025, PhD in Computer Science and Technology, Tianjin University
  • 2018-2021, MEng in Computer Science and Technology, Dalian University of Technology
  • 2014-2018, BSc in Electronics and Information Engineering, Dalian University of Technology