Llama 3.2
llama.com (opens in a new tab)
New Quantized Versions of Llama 3.2 (1B & 3B) (opens in a new tab): Delivering Up To 2-4x Increases in Inference Speed and 56% Reduction in Model Size
llama.com (opens in a new tab)
New Quantized Versions of Llama 3.2 (1B & 3B) (opens in a new tab): Delivering Up To 2-4x Increases in Inference Speed and 56% Reduction in Model Size