Efficient LLM Inference
KV-cache reuse, semantic cache distillation, state transfer, selective recomputation, and bandwidth-aware serving.
AI systems, efficient inference, trustworthy machine learning
I am a Ph.D. student at Beijing Normal University, working on efficient LLM systems, semantic state transfer, multi-candidate reasoning, and security-oriented machine learning.
Research
KV-cache reuse, semantic cache distillation, state transfer, selective recomputation, and bandwidth-aware serving.
Candidate construction, fixed-verifier reranking, coverage-conversion gaps, and answer-mode compatibility.
Backdoor attacks, frequency-domain robustness, model behavior under perturbation, and security evaluation.
Vision pipelines for smart livestock systems, identity-related signals, weight estimation, and agricultural AI.
Selected publications
Qianli Ma, Zhiqing Tang, Hanshuai Cui, Zhi Yao, Weijia Jia
Proceedings of the 43rd International Conference on Machine Learning.
Qianli Ma
Manuscript under review.
Qianli Ma, Junping Qin, Kai Yan, Lei Wang, Hao Sun
Qianli Ma, Junping Qin, Yin Cao, Jiaqi Ren
Jiaqi Ren, Junping Qin, Qianli Ma, Yin Cao
Experience
Designed REUSE and PATCH mechanisms for state transfer between shared-architecture, weight-mismatched models.
Studied stealthy backdoor mechanisms, trigger design, and robustness-oriented model analysis.
Built applied pipelines for cattle image acquisition, weight estimation, visual signal analysis, and farm decision support.
Contact