Paper
Document
Download
Flag content
280 Bounty
$0.00
5

Mixture-of-Depths: Dynamically allocating compute in transformer-based language models

Authors
David Raposo,Sam Ritter
Blake Richards,Timothy Lillicrap,Peter Humphreys
+3 authors
,A. Santoro
Are you the author?
Published
Apr 2, 2024
Show more
Save
TipTip
Document
Download
Flag content
5
TipTip
Save
Document
Download
Flag content