My path started in control engineering, not ML — and that framing never left. My CVPR Spotlight work came from a simple reframe: training a deep network is a feedback-control problem, so a PID controller can drive its optimizer (later extended to IEEE TNNLS, IF 11.368). I did my Master's at Tsinghua on deep learning optimization, pose estimation, and face recognition, with work at CVPR, IEEE TNNLS, and Pattern Recognition, plus a U.S. patent for automated-checkout tracking.
Across Instacart, TikTok, Meta FAIR, and AiFi, the recurring lesson is that production ML lives or dies below the model. At TikTok, collapsing ~800 narrow perception models into one multimodal LLM only became viable once INT8 and tensor-parallel serving on Inf2 cut inference cost in half. At AiFi, going RGB-only — no shelf sensors — forced detection and tracking to be good enough that the hardware could be cheap. I work the full stack: research, large-scale training and inference, deployment on Docker, Kubernetes, TensorRT and edge, and the serving and product layers on top. I also review for CVPR, ECCV, ICCV, and IEEE TPAMI.