Papers I want to read, at some point.
-
Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation
Paper β’ 2108.12409 β’ Published β’ 5 -
YaRN: Efficient Context Window Extension of Large Language Models
Paper β’ 2309.00071 β’ Published β’ 78 -
MIMIC-IT: Multi-Modal In-Context Instruction Tuning
Paper β’ 2306.05425 β’ Published β’ 11 -
Music ControlNet: Multiple Time-varying Controls for Music Generation
Paper β’ 2311.07069 β’ Published β’ 45