Meta Llama 3.2 1B, Meta Llama 3.2 3B Quantized AI Models Released With Reduced Memory Footprint, Faster Inference and Balanced Accuracy
Meta has released quantized versions of two AI models, Meta Llama 3.2 1B and Meta Llama 3.2 3B, offering a reduced memory footprint and faster on-device inference while balancing accuracy. Here is how to access them.
Mark Zuckerberg's Meta has released new quantized versions of its Llama 3.2 1B and 3B models. According to the company, these models deliver up to 2-4x faster inference while reducing model size by an average of 56%. The company said the new Meta Llama 3.2 1B and Meta Llama 3.2 3B models also reduce memory footprint by an average of 41%. Using Quantization-Aware Training with LoRA adaptors, the new Meta AI models balance performance, accuracy and portability, making them suitable for resource-constrained devices. Developers can now download the latest models from Meta and Hugging Face.
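To put the reported averages in perspective, the short Python sketch below applies them to a set of baseline figures. The baseline size, memory and latency values, as well as the helper names `reduced` and `time_after_speedup`, are hypothetical and chosen only for illustration; they are not measurements published by Meta.

```python
def reduced(baseline: float, reduction_pct: float) -> float:
    """Return what remains of `baseline` after cutting it by `reduction_pct` percent."""
    if not 0 <= reduction_pct <= 100:
        raise ValueError("reduction_pct must be between 0 and 100")
    return baseline * (1 - reduction_pct / 100)


def time_after_speedup(baseline_seconds: float, speedup: float) -> float:
    """Return the time a task takes once it runs `speedup` times faster."""
    if speedup <= 0:
        raise ValueError("speedup must be positive")
    return baseline_seconds / speedup


# Hypothetical baselines for an unquantized model (illustrative only, not from Meta).
baseline_size_mb = 1000.0
baseline_memory_mb = 2000.0
baseline_latency_s = 8.0

# Average reductions reported by Meta: 56% model size, 41% memory footprint.
quantized_size_mb = reduced(baseline_size_mb, 56)
quantized_memory_mb = reduced(baseline_memory_mb, 41)

# Reported speedup range: up to 2-4x faster inference.
latency_at_2x = time_after_speedup(baseline_latency_s, 2)
latency_at_4x = time_after_speedup(baseline_latency_s, 4)

print(f"Model size: {baseline_size_mb:.0f} MB -> {quantized_size_mb:.0f} MB")
print(f"Memory footprint: {baseline_memory_mb:.0f} MB -> {quantized_memory_mb:.0f} MB")
print(f"Latency: {baseline_latency_s:.1f} s -> {latency_at_4x:.1f} s to {latency_at_2x:.1f} s")
```

Because the percentages are averages and the speedup is an upper range, actual results on a given device may differ from what this calculation suggests.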
Meta Launched Meta Llama 3.2 1B, Meta Llama 3.2 3B Quantized AI Models
We want to make it easier for more people to build with Llama — so today we’re releasing new quantized versions of Llama 3.2 1B & 3B that deliver up to 2-4x increases in inference speed and, on average, 56% reduction in model size, and 41% reduction in memory footprint.
Details… pic.twitter.com/GWETOfhCTD
— AI at Meta (@AIatMeta) October 24, 2024
(The above story first appeared on LatestLY on Oct 25, 2024 01:38 PM IST.)