Free to Join

Unlock New LLM Architectural Capabilities By Retraining

Friday, November 21, 11 AM ET

Key Takeaways

  • Understand how different attention (and retention) mechanisms affect LLM behaviour, performance, and cost.
  • Learn the process of retraining large language models by repurposing pretrained weights and exploring new architectures.
  • Discover the latest research techniques—including long-context inference, hardware-efficient kernels, and scalable LLM families—and how to apply them in your projects.

Your Presenter

Jacob Buckman

CEO at Manifest AI

Jacob runs the AI research company Manifest AI. He is a co-creator of the power attention mechanism for long-context LLMs and an expert in deep learning and reinforcement learning. Previously, he was a resident at Google Brain.

Why this matters

Large language models (LLMs) are evolving fast, and retraining offers a powerful path to new architectural capabilities, lower cost, and better performance. Recent research on the Brumby-14B-Base model shows how attention-free retention layers, efficient retraining from pretrained weights, and long-context processing can reshape what’s possible in generative AI. This session dives deep into these techniques and how you can apply them in your own work.

In this code-along, Jacob Buckman, CEO of Manifest AI, will guide you through cutting-edge methods for LLM design, retraining, and deployment. You’ll explore how alternative attention mechanisms like power retention change model behaviour, how to repurpose pretrained models for new architectures, and how to engineer large-scale training pipelines. Whether you’re building foundation models or pushing boundaries in generative AI research, this session gives you the tools and insights to lead the next wave.
