HyperPod

Streamline machine learning workflows with SkyPilot on Amazon SageMaker HyperPod | Amazon Web Services

This post is co-written with Zhanghao Wu, co-creator of SkyPilot. The rapid advancement of generative AI and foundation models (FMs) has significantly increased computational resource requirements for machine learning (ML) workloads. Modern ML pipelines require efficient systems for distributing workloads across accelerated compute resources, while making sure developer productivity remainsContinue Reading

Amazon SageMaker HyperPod launches model deployments to accelerate the generative AI model development lifecycle | Amazon Web Services

In: Artificial Intelligence

Today, we’re excited to announce that Amazon SageMaker HyperPod now supports deploying foundation models (FMs) from Amazon SageMaker JumpStart, as well as custom or fine-tuned models from Amazon S3 or Amazon FSx. With this launch, you can train, fine-tune, and deploy models on the same HyperPod compute resources, maximizing resourceContinue Reading

Accelerate foundation model development with one-click observability in Amazon SageMaker HyperPod | Amazon Web Services

In: Artificial Intelligence

Amazon SageMaker HyperPod now provides a comprehensive, out-of-the-box dashboard that delivers insights into foundation model (FM) development tasks and cluster resources. This unified observability solution automatically publishes key metrics to Amazon Managed Service for Prometheus and visualizes them in Amazon Managed Grafana dashboards, optimized specifically for FM development with deepContinue Reading

Accelerate foundation model training and inference with Amazon SageMaker HyperPod and Amazon SageMaker Studio | Amazon Web Services

In: Artificial Intelligence

Modern generative AI model providers require unprecedented computational scale, with pre-training often involving thousands of accelerators running continuously for days, and sometimes months. Foundation Models (FMs) demand distributed training clusters — coordinated groups of accelerated compute instances, using frameworks like PyTorch — to parallelize workloads across hundreds of accelerators (likeContinue Reading

Training Llama 3.3 Swallow: A Japanese sovereign LLM on Amazon SageMaker HyperPod | Amazon Web Services

In: Artificial Intelligence

This post is based on a technical report written by Kazuki Fujii, who led the Llama 3.3 Swallow model development. The Institute of Science Tokyo has successfully trained Llama 3.3 Swallow, a 70-billion-parameter large language model (LLM) with enhanced Japanese capabilities, using Amazon SageMaker HyperPod. The model demonstrates superior performanceContinue Reading

Accelerating Articul8’s domain-specific model development with Amazon SageMaker HyperPod | Amazon Web Services

In: Artificial Intelligence

This post was co-written with Renato Nascimento, Felipe Viana, Andre Von Zuben from Articul8. Generative AI is reshaping industries, offering new efficiencies, automation, and innovation. However, generative AI requires powerful, scalable, and resilient infrastructures that optimize large-scale model training, providing rapid iteration and efficient compute utilization with purpose-built infrastructure andContinue Reading

Multi-account support for Amazon SageMaker HyperPod task governance | Amazon Web Services

In: Artificial Intelligence

GPUs are a precious resource; they are both short in supply and much more costly than traditional CPUs. They are also highly adaptable to many different use cases. Organizations building or adopting generative AI use GPUs to run simulations, run inference (both for internal or external usage), build agentic workloads,Continue Reading

How climate tech startups are building foundation models with Amazon SageMaker HyperPod | Amazon Web Services

In: Artificial Intelligence

Climate tech startups are companies that use technology and innovation to address the climate crisis, with a primary focus on either reducing greenhouse gas emissions or helping society adapt to climate change impacts. Their unifying mission is to create scalable solutions that accelerate the transition to a sustainable, low-carbon future.Continue Reading

How Apoidea Group enhances visual information extraction from banking documents with multimodal models using LLaMA-Factory on Amazon SageMaker HyperPod | Amazon Web Services

In: Artificial Intelligence

This post is co-written with Ken Tsui, Edward Tsoi and Mickey Yip from Apoidea Group. The banking industry has long struggled with the inefficiencies associated with repetitive processes such as information extraction, document review, and auditing. These tasks, which require significant human resources, slow down critical operations such as KnowContinue Reading

Customize DeepSeek-R1 671b model using Amazon SageMaker HyperPod recipes – Part 2 | Amazon Web Services

In: Artificial Intelligence

This post is the second part of the DeepSeek series focusing on model customization with Amazon SageMaker HyperPod recipes (or recipes for brevity). In Part 1, we demonstrated the performance and ease of fine-tuning DeepSeek-R1 distilled models using these recipes. In this post, we use the recipes to fine-tune theContinue Reading

HyperPod

Streamline machine learning workflows with SkyPilot on Amazon SageMaker HyperPod | Amazon Web Services

Amazon SageMaker HyperPod launches model deployments to accelerate the generative AI model development lifecycle | Amazon Web Services

Accelerate foundation model development with one-click observability in Amazon SageMaker HyperPod | Amazon Web Services

Accelerate foundation model training and inference with Amazon SageMaker HyperPod and Amazon SageMaker Studio | Amazon Web Services

Training Llama 3.3 Swallow: A Japanese sovereign LLM on Amazon SageMaker HyperPod | Amazon Web Services

Accelerating Articul8’s domain-specific model development with Amazon SageMaker HyperPod | Amazon Web Services

Multi-account support for Amazon SageMaker HyperPod task governance | Amazon Web Services

How climate tech startups are building foundation models with Amazon SageMaker HyperPod | Amazon Web Services

Customize DeepSeek-R1 671b model using Amazon SageMaker HyperPod recipes – Part 2 | Amazon Web Services

Build real-time travel recommendations using AI agents on Amazon Bedrock | Amazon Web Services

Deploy a full stack voice AI agent with Amazon Nova Sonic | Amazon Web Services