Scale (Page 2)

How Amazon trains sequential ensemble models at scale with Amazon SageMaker Pipelines | Amazon Web Services

Amazon SageMaker Pipelines includes features that allow you to streamline and automate machine learning (ML) workflows. This allows scientists and model developers to focus on model development and rapid experimentation rather than infrastructure management Pipelines offers the ability to orchestrate complex ML workflows with a simple Python SDK with theContinue Reading

Scale ML workflows with Amazon SageMaker Studio and Amazon SageMaker HyperPod | Amazon Web Services

In: Artificial Intelligence

Scaling machine learning (ML) workflows from initial prototypes to large-scale production deployment can be daunting task, but the integration of Amazon SageMaker Studio and Amazon SageMaker HyperPod offers a streamlined solution to this challenge. As teams progress from proof of concept to production-ready models, they often struggle with efficiently managingContinue Reading

Unlock cost savings with the new scale down to zero feature in SageMaker Inference | Amazon Web Services

In: Artificial Intelligence

Today at AWS re:Invent 2024, we are excited to announce a new feature for Amazon SageMaker inference endpoints: the ability to scale SageMaker inference endpoints to zero instances. This long-awaited capability is a game changer for our customers using the power of AI and machine learning (ML) inference in theContinue Reading

How Crexi achieved ML models deployment on AWS at scale and boosted efficiency | Amazon Web Services

In: Artificial Intelligence

This post is co-written with Isaac Smothers and James Healy-Mirkovich from Crexi. With the current demand for AI and machine learning (AI/ML) solutions, the processes to train and deploy models and scale inference are crucial to business success. Even though AI/ML and especially generative AI progress is rapid, machine learningContinue Reading

Governing the ML lifecycle at scale, Part 3: Setting up data governance at scale | Amazon Web Services

In: Artificial Intelligence

This post is part of an ongoing series about governing the machine learning (ML) lifecycle at scale. To view this series from the beginning, start with Part 1. This post dives deep into how to set up data governance at scale using Amazon DataZone for the data mesh. The dataContinue Reading

Governing ML lifecycle at scale: Best practices to set up cost and usage visibility of ML workloads in multi-account environments | Amazon Web Services

In: Artificial Intelligence

Cloud costs can significantly impact your business operations. Gaining real-time visibility into infrastructure expenses, usage patterns, and cost drivers is essential. This insight enables agile decision-making, optimized scalability, and maximizes the value derived from cloud investments, providing cost-effective and efficient cloud utilization for your organization’s future growth. What makes costContinue Reading

Unleash the power of generative AI with Amazon Q Business: How CCoEs can scale cloud governance best practices and drive innovation | Amazon Web Services

In: Artificial Intelligence

This post is co-written with Steven Craig from Hearst. To maintain their competitive edge, organizations are constantly seeking ways to accelerate cloud adoption, streamline processes, and drive innovation. However, Cloud Center of Excellence (CCoE) teams often can be perceived as bottlenecks to organizational transformation due to limited resources and overwhelmingContinue Reading

Governing the ML lifecycle at scale: Centralized observability with Amazon SageMaker and Amazon CloudWatch | Amazon Web Services

In: Artificial Intelligence

This post is part of an ongoing series on governing the machine learning (ML) lifecycle at scale. To start from the beginning, refer to Governing the ML lifecycle at scale, Part 1: A framework for architecting ML workloads using Amazon SageMaker. A multi-account strategy is essential not only for improvingContinue Reading

Super charge your LLMs with RAG at scale using AWS Glue for Apache Spark | Amazon Web Services

In: Artificial Intelligence

Large language models (LLMs) are very large deep-learning models that are pre-trained on vast amounts of data. LLMs are incredibly flexible. One model can perform completely different tasks such as answering questions, summarizing documents, translating languages, and completing sentences. LLMs have the potential to revolutionize content creation and the wayContinue Reading

Evaluating prompts at scale with Prompt Management and Prompt Flows for Amazon Bedrock | Amazon Web Services

In: Artificial Intelligence

As generative artificial intelligence (AI) continues to revolutionize every industry, the importance of effective prompt optimization through prompt engineering techniques has become key to efficiently balancing the quality of outputs, response time, and costs. Prompt engineering refers to the practice of crafting and optimizing inputs to the models by selectingContinue Reading

Scale (Page 2)

How Amazon trains sequential ensemble models at scale with Amazon SageMaker Pipelines | Amazon Web Services

Scale ML workflows with Amazon SageMaker Studio and Amazon SageMaker HyperPod | Amazon Web Services

Unlock cost savings with the new scale down to zero feature in SageMaker Inference | Amazon Web Services

How Crexi achieved ML models deployment on AWS at scale and boosted efficiency | Amazon Web Services

Governing the ML lifecycle at scale, Part 3: Setting up data governance at scale | Amazon Web Services

Governing ML lifecycle at scale: Best practices to set up cost and usage visibility of ML workloads in multi-account environments | Amazon Web Services

Governing the ML lifecycle at scale: Centralized observability with Amazon SageMaker and Amazon CloudWatch | Amazon Web Services

Super charge your LLMs with RAG at scale using AWS Glue for Apache Spark | Amazon Web Services

Evaluating prompts at scale with Prompt Management and Prompt Flows for Amazon Bedrock | Amazon Web Services

Build an intelligent eDiscovery solution using Amazon Bedrock Agents | Amazon Web Services

How PerformLine uses prompt engineering on Amazon Bedrock to detect compliance violations | Amazon Web Services