Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Appearance settings
Discussion options

Hello! First and foremost, thank you deeply for your incredible work on this project—especially Chapter 3, which has been an invaluable resource for understanding transformer mechanisms clearly and comprehensively.

I’m writing to share an exciting recent development: DeepMind has published new research on MoR (Mixture of Reasoning), a framework that shows significant advantage in enhancing model reasoning capabilities. From what I’ve gathered, MoR’s key strengths include:

  • Parameter Sharing: Significantly reduce the number of parameters through weight binding.
  • Computational Routing: Avoid redundant computations (and reduce ineffective FLOPs) via token routing.
  • Recursion-Level Caching: Cache key-value pairs on demand to reduce memory access overhead and improve throughput efficiency.

Given these advantages, I wonder if you would consider integrating a section on MoR into Chapter 3 or as a bonus. It could complement the existing content by connecting foundational concepts to cutting-edge reasoning techniques, enriching readers’ understanding of advanced reasoning frameworks.

Here's link of MoR: Paper, Code

You must be logged in to vote

Replies: 1 comment · 1 reply

Comment options

Hi there,
I am glad to hear that you liked the book! Also, congrats on the paper, and thanks for suggesting! However, this would not be a good fit here for now as I want to keep this project a bit more foundational (versus covering the latest research trends). I am not saying that I am not interested in the latest research trends, but I prefer to cover them in my blog (https://magazine.sebastianraschka.com/)

You must be logged in to vote
1 reply
@gzqccnu
Comment options

Got it. Looking forward to your blog about it!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
2 participants
Converted from issue

This discussion was converted from issue #755 on July 24, 2025 15:39.

Morty Proxy This is a proxified and sanitized view of the page, visit original site.