SageMaker (Page 7)

NeMo Retriever Llama 3.2 text embedding and reranking NVIDIA NIM microservices now available in Amazon SageMaker JumpStart | Amazon Web Services

Today, we are excited to announce that the NeMo Retriever Llama3.2 Text Embedding and Reranking NVIDIA NIM microservices are available in Amazon SageMaker JumpStart. With this launch, you can now deploy NVIDIA’s optimized reranking and embedding models to build, experiment, and responsibly scale your generative AI ideas on AWS. InContinue Reading

Running NVIDIA NeMo 2.0 Framework on Amazon SageMaker HyperPod | Amazon Web Services

In: Artificial Intelligence

This post is cowritten with Abdullahi Olaoye, Akshit Arora and Eliuth Triana Isaza at NVIDIA. As enterprises continue to push the boundaries of generative AI, scalable and efficient model training frameworks are essential. The NVIDIA NeMo Framework provides a robust, end-to-end solution for developing, customizing, and deploying large-scale AI models,Continue Reading

Unleash AI innovation with Amazon SageMaker HyperPod | Amazon Web Services

In: Artificial Intelligence

The rise of generative AI has significantly increased the complexity of building, training, and deploying machine learning (ML) models. It now demands deep expertise, access to vast datasets, and the management of extensive compute clusters. Customers also face the challenges of writing specialized code for distributed training, continuously optimizing models,Continue Reading

Optimize hosting DeepSeek-R1 distilled models with Hugging Face TGI on Amazon SageMaker AI | Amazon Web Services

In: Artificial Intelligence

DeepSeek-R1, developed by AI startup DeepSeek AI, is an advanced large language model (LLM) distinguished by its innovative, multi-stage training process. Instead of relying solely on traditional pre-training and fine-tuning, DeepSeek-R1 integrates reinforcement learning to achieve more refined outputs. The model employs a chain-of-thought (CoT) approach that systematically breaks downContinue Reading

Deploy DeepSeek-R1 distilled models on Amazon SageMaker using a Large Model Inference container | Amazon Web Services

In: Artificial Intelligence

DeepSeek-R1 is a large language model (LLM) developed by DeepSeek AI that uses reinforcement learning to enhance reasoning capabilities through a multi-stage training process from a DeepSeek-V3-Base foundation. A key distinguishing feature is its reinforcement learning step, which was used to refine the model’s responses beyond the standard pre-training andContinue Reading

Customize DeepSeek-R1 distilled models using Amazon SageMaker HyperPod recipes – Part 1 | Amazon Web Services

In: Artificial Intelligence

Increasingly, organizations across industries are turning to generative AI foundation models (FMs) to enhance their applications. To achieve optimal performance for specific use cases, customers are adopting and adapting these FMs to their unique domain requirements. This need for customization has become even more pronounced with the emergence of newContinue Reading

Mistral-Small-24B-Instruct-2501 is now available on SageMaker Jumpstart and Amazon Bedrock Marketplace | Amazon Web Services

In: Artificial Intelligence

Today, we’re excited to announce that Mistral-Small-24B-Instruct-2501—a twenty-four billion parameter large language model (LLM) from Mistral AI that’s optimized for low latency text generation tasks—is available for customers through Amazon SageMaker JumpStart and Amazon Bedrock Marketplace. Amazon Bedrock Marketplace is a new capability in Amazon Bedrock that developers can use to discover,Continue Reading

LLM continuous self-instruct fine-tuning framework powered by a compound AI system on Amazon SageMaker | Amazon Web Services

In: Artificial Intelligence

Fine-tuning a pre-trained large language model (LLM) allows users to customize the model to perform better on domain-specific tasks or align more closely with human preferences. It is a continuous process to keep the fine-tuned model accurate and effective in changing environments, to adapt to the data distribution shift (conceptContinue Reading

Best practices for Amazon SageMaker HyperPod task governance | Amazon Web Services

In: Artificial Intelligence

At AWS re:Invent 2024, we launched a new innovation in Amazon SageMaker HyperPod on Amazon Elastic Kubernetes Service (Amazon EKS) that enables you to run generative AI development tasks on shared accelerated compute resources efficiently and reduce costs by up to 40%. Administrators can use SageMaker HyperPod task governance toContinue Reading

Achieve ~2x speed-up in LLM inference with Medusa-1 on Amazon SageMaker AI | Amazon Web Services

In: Artificial Intelligence

This blog post is co-written with Moran beladev, Manos Stergiadis, and Ilya Gusev from Booking.com. Large language models (LLMs) have revolutionized the field of natural language processing with their ability to understand and generate humanlike text. Trained on broad, generic datasets spanning a wide range of topics and domains, LLMsContinue Reading

SageMaker (Page 7)

NeMo Retriever Llama 3.2 text embedding and reranking NVIDIA NIM microservices now available in Amazon SageMaker JumpStart | Amazon Web Services

Running NVIDIA NeMo 2.0 Framework on Amazon SageMaker HyperPod | Amazon Web Services

Unleash AI innovation with Amazon SageMaker HyperPod | Amazon Web Services

Optimize hosting DeepSeek-R1 distilled models with Hugging Face TGI on Amazon SageMaker AI | Amazon Web Services

Deploy DeepSeek-R1 distilled models on Amazon SageMaker using a Large Model Inference container | Amazon Web Services

Customize DeepSeek-R1 distilled models using Amazon SageMaker HyperPod recipes – Part 1 | Amazon Web Services

Mistral-Small-24B-Instruct-2501 is now available on SageMaker Jumpstart and Amazon Bedrock Marketplace | Amazon Web Services

LLM continuous self-instruct fine-tuning framework powered by a compound AI system on Amazon SageMaker | Amazon Web Services

Best practices for Amazon SageMaker HyperPod task governance | Amazon Web Services

Achieve ~2x speed-up in LLM inference with Medusa-1 on Amazon SageMaker AI | Amazon Web Services

Life without sunlight? Earthquake fractures fuel deep underground microbes

The DIVA logistics agent, powered by Amazon Bedrock | Amazon Web Services