Month: January 2026

DeekSeek mHC Explained – How DeepSeek Rewires LLMs for 2026

DeepSeek's Manifold-Constrained Hyper-Connections (mHC) paper introduces a breakthrough approach to LLM architecture by reimagining residual connections—unchanged since 2016. By applying mathematical constraints to ByteDance's Hyper-Connections framework, mHC preserves training stability while expanding model expressiveness. Using doubly stochastic matrices enforced through the Sinkhorn-Knopp algorithm, this innovation achieves superior performance across benchmarks while maintaining stable gradient flow, positioning it as a potential driver for major AI advancements in 2026.

【人工智能】AGI是彻头彻尾的胡扯 – Yann LeCun访谈总结

图灵奖得主Yann LeCun在访谈中对硅谷AI发展路径提出颠覆性批判,认为依靠扩大语言模型规模通往超级智能是"彻头彻尾的胡扯"。他指出大语言模型只是记忆型系统,无法真正理解世界,甚至达不到"狗水平智能"。这位65岁的AI先驱选择离开Meta创办AMI公司,押注世界模型(World Models)技术路线——在抽象表征空间预测世界动力学规律,而非像素级生成。他警告行业技术单一化风险,坚持基础研究和多元化探索才是AI未来的关键。