Meta unveils Llama 3.1 AI model with 405B parameters, Meta AI gains multilingual support and more

Meta, led by Mark Zuckerberg, has introduced Llama 3.1 405B, the largest and most capable open-source AI model. With over 300 million downloads of all Llama versions, Meta aims to continue driving innovation.

Llama 3.1 405B Features
  • First Frontier-Level Open Source AI: Matches top AI models in general knowledge, steerability, math, tool use, and multilingual translation.
  • Enhanced Models: Includes upgraded versions of the 8B and 70B models with a longer context length of 128K and stronger reasoning capabilities.
  • Use Cases: Supports long-form text summarization, multilingual conversational agents, and coding assistants.
  • Licensing: Allows developers to use Llama model outputs, including the 405B, to improve other models.

Model Performance and Architecture

Meta evaluated Llama 3.1 405B on over 150 benchmark datasets and through extensive human evaluations. The model is competitive with top AI models like GPT-4 and Claude 3.5 Sonnet.

The 405B was trained on over 15 trillion tokens using 16,000 H100 GPUs, marking a significant achievement in model training.

Key architectural details include:
  • A standard decoder-only transformer model for stability.
  • An iterative post-training process with supervised fine-tuning and direct preference optimization.
  • Enhanced data quality through improved pre-processing and quality assurance methods.

  • Quantization from 16-bit to 8-bit to reduce compute requirements and support large-scale inference.
  • Llama 3.1 405B improves instruction-following and safety. Post-training includes multiple rounds of fine-tuning, rejection sampling, and preference optimization, using high-quality synthetic data.

The Llama System and Ecosystem

Meta envisions Llama models as part of a broader system with components like Llama Guard 3 and Prompt Guard.

They are releasing the “Llama Stack” on GitHub, a set of standardized interfaces for building toolchain components and agentic applications. Also, Meta seeks community feedback to enhance interoperability.

Support for Developers

Developers can leverage the 405B model for various tasks, including real-time and batch inference, fine-tuning, and synthetic data generation. Meta has partnered with AWS, NVIDIA, and Databricks for cloud solutions and optimized inference with Groq and Dell.

Commitment to Open Source

Unlike closed models, Llama model weights are downloadable, allowing developers to fully customize them for specific applications. Meta highlights that Llama models offer low-cost tokens, enabling widespread access to AI.

Llama 3.1 Models with Safety

Meta ensures the safe use of its models through measures like red teaming and safety fine-tuning. The company encourages the community to build new experiences using the multilinguality and extended context length of Llama 3.1, supported by new safety tools and the Llama Stack.

Updates to Meta AI

Meta, alongside the introduction of Llama 3.1, has announced several updates to Meta AI, enhancing its capabilities and availability.

Meta AI Expands Multilingual Support

Meta AI is now accessible in 22 countries, including Argentina, Chile, Colombia, Ecuador, Mexico, Peru, and Cameroon.

  • It supports interactions in French, German, Hindi, Hindi-Romanized Script, Italian, Portuguese, and Spanish, with plans to add more languages.
  • Users can access Meta AI through WhatsApp, Instagram, Messenger, Facebook, and meta.ai.

New Features and Enhancements

Creative Tools: Meta AI introduces “Imagine me” prompts, allowing users to visualize themselves in different scenarios, such as surfing or vacationing.

This feature, currently in beta in the U.S., generates images based on user photos and prompts. Users can further customize these images by adding or changing elements with the upcoming “Edit with AI” button, set to launch next month.

Image Integration: Users can now create and share Meta AI-generated images directly within Facebook posts, stories, comments, and messages, with initial availability in English and plans to extend to other languages and apps.

Advanced Model for Complex Queries: Meta AI now includes Llama 405B, its largest and most advanced open-source model, which improves the assistant’s ability to handle complex questions, particularly in math and coding. This model provides detailed explanations, debugging support, and optimization suggestions.

Meta AI on Meta Quest and Ray-Ban Meta Glasses

Meta AI will soon be available on Ray-Ban Meta smart glasses and, starting next month, on Meta Quest in the U.S. and Canada.

It will replace the current Voice Commands on Quest, enabling hands-free control of the headset, real-time information updates, and interaction with physical surroundings through Passthrough Vision.

Availability

Llama 3.1 models are available for download on llama.meta.com and Hugging Face, and for immediate development on partner platforms.


Related Post