Skip to Content

Supervised Learning

2 posts

Posts tagged with Supervised Learning

Sudden Leaps - Why Supervised Fine-Tuning Feels Like Evolution’s Punctuated Equilibrium

Supervised fine-tuning in large language models causes sudden, transformative leaps in reasoning abilities, much like evolutionary punctuated equilibrium, rather than gradual improvement.

Sudden Leaps - Why Supervised Fine-Tuning Feels Like Evolution’s Punctuated Equilibrium

I recently read Climbing the Ladder of Reasoning: What LLMs Can—and Still Can’t—Solve after SFT, and it clarified something I’d been suspecting for a while: supervised fine-tuning really can make language models smarter, but only up to a point. The paper lays out a kind of "reasoning ladder" to sort problems by difficulty, from Easy to Extremely Hard, and then looks at how well large language models do at each level after different amounts of fine-tuning.

The results are striking. With just a small number of high-quality examples, models get dramatically better at intermediate tasks, especially

Sudden Leaps - Why Supervised Fine-Tuning Feels Like Evolution’s Punctuated Equilibrium Read more

Video Review: Opportunities in AI by Andrew Ng Featured Post

Forget killer robots - AI's real power is its ability to boost business. - A review of Opportunities in AI speech by Andrew Ng

Video Review: Opportunities in AI by Andrew Ng

I recently watched a video featuring Andrew Ng, a pioneering thought leader in artificial intelligence, as he discussed current trends and future opportunities in AI.

💡
I don't usually publish my notes but I will make an exception here. If this is a format you find useful let me know we can do this more regularly

As founder of Google Brain and former chief scientist at Baidu, Ng has unique insight into the field. His talk highlighted two significant forces shaping the landscape for AI innovation.

Key Takeaways by Viewpoints

For the average person:

  1. AI will increasingly automate tasks in many
Video Review: Opportunities in AI by Andrew Ng Read more