Deploy

Train and deploy models on Amazon SageMaker HyperPod using the new HyperPod CLI and SDK | Amazon Web Services

Training and deploying large AI models requires advanced distributed computing capabilities, but managing these distributed systems shouldn’t be complex for data scientists and machine learning (ML) practitioners. The newly released command line interface (CLI) and software development kit (SDK) for Amazon SageMaker HyperPod simplify how you can use the service’sContinue Reading

Deploy Amazon Bedrock Knowledge Bases using Terraform for RAG-based generative AI applications | Amazon Web Services

In: Artificial Intelligence

Retrieval Augmented Generation (RAG) is a powerful approach for building generative AI applications by providing foundation models (FMs) access to additional, relevant data. This approach improves response accuracy and transparency while avoiding the potential cost and complexity of FM training or fine-tuning. Many customers use Amazon Bedrock Knowledge Bases toContinue Reading

Train and deploy AI models at trillion-parameter scale with Amazon SageMaker HyperPod support for P6e-GB200 UltraServers | Amazon Web Services

In: Artificial Intelligence

Imagine harnessing the power of 72 cutting-edge NVIDIA Blackwell GPUs in a single system for the next wave of AI innovation, unlocking 360 petaflops of dense 8-bit floating point (FP8) compute and 1.4 exaflops of sparse 4-bit floating point (FP4) compute. Today, that’s exactly what Amazon SageMaker HyperPod delivers withContinue Reading

Fine-tune and deploy Meta Llama 3.2 Vision for generative AI-powered web automation using AWS DLCs, Amazon EKS, and Amazon Bedrock | Amazon Web Services

In: Artificial Intelligence

Fine-tuning of large language models (LLMs) has emerged as a crucial technique for organizations seeking to adapt powerful foundation models (FMs) to their specific needs. Rather than training models from scratch—a process that can cost millions of dollars and require extensive computational resources—companies can customize existing models with domain-specific dataContinue Reading

Deploy a full stack voice AI agent with Amazon Nova Sonic | Amazon Web Services

In: Artificial Intelligence

AI-powered speech solutions are transforming contact centers by enabling natural conversations between customers and AI agents, shortening wait times, and dramatically reducing operational costs—all without sacrificing the human-like interaction customers expect. With the recent launch of Amazon Nova Sonic in Amazon Bedrock, you can now build sophisticated conversational AI agentsContinue Reading

Deploy conversational agents with Vonage and Amazon Nova Sonic | Amazon Web Services

In: Artificial Intelligence

This post is co-written with Mark Berkeland and Oscar Rodriguez from Vonage. Voice-based technologies are transforming the way businesses engage with customers across customer support, virtual assistants, and intelligent agents. However, creating real-time, expressive, and highly responsive voice interfaces still requires navigating a complex stack of communication protocols, AI models,Continue Reading

Build and deploy AI inference workflows with new enhancements to the Amazon SageMaker Python SDK | Amazon Web Services

In: Artificial Intelligence

Amazon SageMaker Inference has been a popular tool for deploying advanced machine learning (ML) and generative AI models at scale. As AI applications become increasingly complex, customers want to deploy multiple models in a coordinated group that collectively process inference requests for an application. In addition, with the evolution ofContinue Reading

Deploy Qwen models with Amazon Bedrock Custom Model Import | Amazon Web Services

In: Artificial Intelligence

We’re excited to announce that Amazon Bedrock Custom Model Import now supports Qwen models. You can now import custom weights for Qwen2, Qwen2_VL, and Qwen2_5_VL architectures, including models like Qwen 2, 2.5 Coder, Qwen 2.5 VL, and QwQ 32B. You can bring your own customized Qwen models into Amazon Bedrock and deploy them in a fullyContinue Reading

Deploy Amazon SageMaker Projects with Terraform Cloud | Amazon Web Services

In: Artificial Intelligence

Amazon SageMaker Projects empower data scientists to self-serve Amazon Web Services (AWS) tooling and infrastructure to organize all entities of the machine learning (ML) lifecycle, and further enable organizations to standardize and constrain the resources available to their data science teams in pre-packaged templates. For AWS customers using Terraform to defineContinue Reading

Deploy DeepSeek-R1 distilled models on Amazon SageMaker using a Large Model Inference container | Amazon Web Services

In: Artificial Intelligence

DeepSeek-R1 is a large language model (LLM) developed by DeepSeek AI that uses reinforcement learning to enhance reasoning capabilities through a multi-stage training process from a DeepSeek-V3-Base foundation. A key distinguishing feature is its reinforcement learning step, which was used to refine the model’s responses beyond the standard pre-training andContinue Reading

Deploy

Train and deploy models on Amazon SageMaker HyperPod using the new HyperPod CLI and SDK | Amazon Web Services

Deploy Amazon Bedrock Knowledge Bases using Terraform for RAG-based generative AI applications | Amazon Web Services

Train and deploy AI models at trillion-parameter scale with Amazon SageMaker HyperPod support for P6e-GB200 UltraServers | Amazon Web Services

Fine-tune and deploy Meta Llama 3.2 Vision for generative AI-powered web automation using AWS DLCs, Amazon EKS, and Amazon Bedrock | Amazon Web Services

Deploy a full stack voice AI agent with Amazon Nova Sonic | Amazon Web Services

Deploy conversational agents with Vonage and Amazon Nova Sonic | Amazon Web Services

Build and deploy AI inference workflows with new enhancements to the Amazon SageMaker Python SDK | Amazon Web Services

Deploy Qwen models with Amazon Bedrock Custom Model Import | Amazon Web Services

Deploy DeepSeek-R1 distilled models on Amazon SageMaker using a Large Model Inference container | Amazon Web Services

Webb reveals the Universe’s first galaxies were a chaotic mess

Scientists discover a way simulate the Universe on a laptop