C++性能优化新书:从CPU物理层讲起
推荐指数 56.0 NO. 017 · 2026.06.14
发布2026/06/13Score62Comments9
为什么值得看
新书《Efficient C++ Programming for Modern 64-bit CPUs》第四章草稿公开,从CPU物理结构和时钟周期底层原理切入讲解C++优化。对写高性能AI推理引擎、游戏引擎或量化系统的工程师有直接参考价值,能帮你建立"为什么这条代码慢"的物理直觉。
媒体预览
编辑判断
这本书的写法很反常识——不是从算法复杂度讲起,而是从主板布线、CPU引脚到内存控制器的物理距离开始。这种视角在AIinfra圈其实被低估了:很多推理优化的瓶颈根本不在算子本身,而在CPU端的prefetch调度、跨NUMA节点的内存访问模式。
作者Sherry Ignatchenko是游戏行业老兵,做过MMO服务端架构,书里提到的"representative motherboard"分析框架可以直接套用到AI推理服务器的硬件选型上。如果你在用vLLM或TensorRT-LLM做部署优化,建议对照看看CPU端的batch调度逻辑和这里的物理层分析有没有gap。
目前还是草稿状态,评论区可以直接提技术问题,作者会回复——这种早期介入的机会对想深入理解底层的人很划算。
社区反馈
负面 9 条评论
核心争论:新书插图风格被质疑为"AI slop",内容深度与呈现方式引发争议
The little pull-quotes marked by illustrations feel chosen at random and not particularly worthy of being so, i.e. > characteristic times of electronic signals are restricted by so-called parasitic capacitances, and parasitic capacitances in general are proportional to the length of the connection >
You probably noticed, but each of these is also just a quote from the paragraph right next to them. Also here the drawings are cute, so that's nice. Maybe you're not used to that style, but it's pretty common in educational literature (especially for younger audiences). They're mostly navigation aid
ai slop?