.

Technology

Apple SHARP

SHARP optimizes large-scale model training by utilizing a Smooth Hamiltonian Ascent approach to find flatter minima and improve generalization.

Apple researchers developed SHARP (Smooth Hamiltonian Ascent for Resilient Protocol) to tackle the sharpness-aware minimization challenge in deep learning. By leveraging a Hamiltonian dynamics framework, the optimizer efficiently navigates loss landscapes to locate flatter minima, which directly correlates to better test-time performance. In benchmarks against standard SGD and Adam, SHARP demonstrates superior robustness across ImageNet and various Transformer architectures while maintaining computational efficiency (reducing the overhead typically associated with second-order optimization methods).

https://machinelearning.apple.com/research/sharp-smooth-hamiltonian-ascent
1 project · 1 city

Related technologies

Recent Talks & Demos

Showing 1-1 of 1

Members-Only

Sign in to see who built these projects