|
Jiale Yan (严佳乐)
Email: yjl16@tsinghua.org.cn / yanjiale@lixiang.com
NPU Architecture Senior Engineer | Li Auto, Shanghai, China
I work on computer architecture, efficient deep learning, and hardware-friendly algorithms for LLMs.
A current focus is LLM acceleration along two axes: scale-up (stronger single-node compute, memory, and dataflow) and scale-out (parallel training/inference across many accelerators with efficient communication).
- Hardware acceleration: LLM/NPU accelerators, dataflow and memory-system co-design, GNN/SpMM accelerators.
- Efficient ML: low-bit and sparse networks, memory-efficient transformers and large-model serving.
- Broader interests: VLSI design, ray-tracing hardware, reconfigurable computing.
Hot! 🔥🔥🔥 I am always looking for self-motivated interns interested in these topics. Feel free to reach out!
|