Llama

Fine-tune and deploy Meta Llama 3.2 Vision for generative AI-powered web automation using AWS DLCs, Amazon EKS, and Amazon Bedrock | Amazon Web Services

Fine-tuning of large language models (LLMs) has emerged as a crucial technique for organizations seeking to adapt powerful foundation models (FMs) to their specific needs. Rather than training models from scratch, a process that can cost millions of dollars and require extensive computational resources, companies can customize existing models with domain-specific data...

Training Llama 3.3 Swallow: A Japanese sovereign LLM on Amazon SageMaker HyperPod | Amazon Web Services

In: Artificial Intelligence

This post is based on a technical report written by Kazuki Fujii, who led the Llama 3.3 Swallow model development. The Institute of Science Tokyo has successfully trained Llama 3.3 Swallow, a 70-billion-parameter large language model (LLM) with enhanced Japanese capabilities, using Amazon SageMaker HyperPod. The model demonstrates superior performance...

Best practices for Meta Llama 3.2 multimodal fine-tuning on Amazon Bedrock | Amazon Web Services

In: Artificial Intelligence

Multimodal fine-tuning represents a powerful approach for customizing foundation models (FMs) to excel at specific tasks that involve both visual and textual information. Although base multimodal models offer impressive general capabilities, they often fall short when faced with specialized visual tasks, domain-specific content, or particular output formatting requirements. Fine-tuning addresses...

Llama 4 family of models from Meta are now available in SageMaker JumpStart | Amazon Web Services

In: Artificial Intelligence

Today, we’re excited to announce the availability of Llama 4 Scout and Maverick models in Amazon SageMaker JumpStart, with availability in Amazon Bedrock coming soon. Llama 4 represents Meta’s most advanced multimodal models to date, featuring a mixture of experts (MoE) architecture and context window support up to 10 million tokens. With...

NeMo Retriever Llama 3.2 text embedding and reranking NVIDIA NIM microservices now available in Amazon SageMaker JumpStart | Amazon Web Services

In: Artificial Intelligence

Today, we are excited to announce that the NeMo Retriever Llama 3.2 Text Embedding and Reranking NVIDIA NIM microservices are available in Amazon SageMaker JumpStart. With this launch, you can now deploy NVIDIA’s optimized reranking and embedding models to build, experiment, and responsibly scale your generative AI ideas on AWS. In...

Deploy DeepSeek-R1 distilled Llama models with Amazon Bedrock Custom Model Import | Amazon Web Services

In: Artificial Intelligence

Open foundation models (FMs) have become a cornerstone of generative AI innovation, enabling organizations to build and customize AI applications while maintaining control over their costs and deployment strategies. By providing high-quality, openly available models, the AI community fosters rapid iteration, knowledge sharing, and cost-effective solutions that benefit both developers...

Deploy DeepSeek-R1 Distilled Llama models in Amazon Bedrock | Amazon Web Services

In: Artificial Intelligence

PEFT fine tuning of Llama 3 on SageMaker HyperPod with AWS Trainium | Amazon Web Services

In: Artificial Intelligence

Training large language models (LLMs) has become a significant expense for businesses. For many use cases, companies are looking to use LLM foundation models (FMs) with their domain-specific data. However, companies are discovering that performing full fine-tuning of these models with their data isn’t cost-effective. To reduce costs...

Llama 3.3 70B now available in Amazon SageMaker JumpStart | Amazon Web Services

In: Artificial Intelligence

Today, we are excited to announce that Llama 3.3 70B from Meta is available in Amazon SageMaker JumpStart. Llama 3.3 70B marks an exciting advancement in large language model (LLM) development, offering performance comparable to larger Llama versions with fewer computational resources. In this post, we explore how to...

Deploy Meta Llama 3.1-8B on AWS Inferentia using Amazon EKS and vLLM | Amazon Web Services

In: Artificial Intelligence

With the rise of large language models (LLMs) like Meta Llama 3.1, there is an increasing need for scalable, reliable, and cost-effective solutions to deploy and serve these models. AWS Trainium and AWS Inferentia based instances, combined with Amazon Elastic Kubernetes Service (Amazon EKS), provide a performant and low cost...
