optimization

Customize Amazon Nova in Amazon SageMaker AI using Direct Preference Optimization | Amazon Web Services

At the AWS Summit in New York City, we introduced a comprehensive suite of model customization capabilities for Amazon Nova foundation models. Available as ready-to-use recipes on Amazon SageMaker AI, you can use them to adapt Nova Micro, Nova Lite, and Nova Pro across the model training lifecycle, including pre-training, supervised…

Effective cost optimization strategies for Amazon Bedrock | Amazon Web Services

In: Artificial Intelligence

Customers are increasingly using generative AI to enhance efficiency, personalize experiences, and drive innovation across various industries. For instance, generative AI can be used to perform text summarization, facilitate personalized marketing strategies, create business-critical chat-based assistants, and so on. However, as generative AI adoption grows, associated costs can escalate in…

Improve Amazon Nova migration performance with data-aware prompt optimization | Amazon Web Services

In: Artificial Intelligence

In the era of generative AI, new large language models (LLMs) are continually emerging, each with unique capabilities, architectures, and optimizations. Among these, Amazon Nova foundation models (FMs) deliver frontier intelligence and industry-leading cost-performance, available exclusively on Amazon Bedrock. Since its launch in 2024, generative AI practitioners, including the teams…

Amazon Bedrock Prompt Optimization Drives LLM Applications Innovation for Yuewen Group | Amazon Web Services

In: Artificial Intelligence

Yuewen Group is a global leader in online literature and IP operations. Through its overseas platform WebNovel, it has attracted about 260 million users in over 200 countries and regions, promoting Chinese web literature globally. The company also adapts quality web novels into films and animations for international markets, expanding the…

Optimize reasoning models like DeepSeek with prompt optimization on Amazon Bedrock | Amazon Web Services

In: Artificial Intelligence

DeepSeek-R1 models, now available on Amazon Bedrock Marketplace and Amazon SageMaker JumpStart, as well as a serverless model on Amazon Bedrock, were recently popularized by their long and elaborate thinking style, which, according to DeepSeek’s published results, leads to impressive performance on highly challenging math benchmarks like AIME-2024 and MATH-500, as…

Amazon SageMaker launches the updated inference optimization toolkit for generative AI | Amazon Web Services

In: Artificial Intelligence

Today, Amazon SageMaker is excited to announce updates to the inference optimization toolkit, providing new functionality and enhancements to help you optimize generative AI models even faster. These updates build on the capabilities introduced in the original launch of the inference optimization toolkit (to learn more, see Achieve up to…

Improve the performance of your Generative AI applications with Prompt Optimization on Amazon Bedrock | Amazon Web Services

In: Artificial Intelligence

Prompt engineering refers to the practice of writing instructions to get the desired responses from foundation models (FMs). You might have to spend months experimenting and iterating on your prompts, following the best practices for each model, to achieve your desired output. Furthermore, these prompts are specific to a model…

Use Amazon Bedrock Agents for code scanning, optimization, and remediation | Amazon Web Services

In: Artificial Intelligence

Amazon Bedrock is a fully managed service that makes foundation models (FMs) from leading AI startups and Amazon available through an API, so you can choose from a wide range of FMs to find the model that best suits your use case. With the Amazon Bedrock serverless experience, you can…

Achieve up to ~2x higher throughput while reducing costs by ~50% for generative AI inference on Amazon SageMaker with the new inference optimization toolkit – Part 1 | Amazon Web Services

In: Artificial Intelligence

Today, Amazon SageMaker announced a new inference optimization toolkit that helps you reduce the time it takes to optimize generative artificial intelligence (AI) models from months to hours, to achieve best-in-class performance for your use case. With this new capability, you can choose from a menu of optimization techniques, apply…

Achieve up to ~2x higher throughput while reducing costs by up to ~50% for generative AI inference on Amazon SageMaker with the new inference optimization toolkit – Part 2 | Amazon Web Services

In: Artificial Intelligence

As generative artificial intelligence (AI) inference becomes increasingly critical for businesses, customers are seeking ways to scale their generative AI operations or integrate generative AI models into existing workflows. Model optimization has emerged as a crucial step, allowing organizations to balance cost-effectiveness and responsiveness, improving productivity. However, price-performance requirements vary…

