models (Page 6)

Talk to your slide deck using multimodal foundation models on Amazon Bedrock – Part 3 | Amazon Web Services

In this series, we share two approaches to gain insights on multimodal data like text, images, and charts. In Part 1, we presented an “embed first, infer later” solution that uses the Amazon Titan Multimodal Embeddings foundation model (FM) to convert individual slides from a slide deck into embeddings. We…

Amazon Bedrock Marketplace now includes NVIDIA models: Introducing NVIDIA Nemotron-4 NIM microservices | Amazon Web Services

In: Artificial Intelligence

This post is co-written with Abhishek Sawarkar, Eliuth Triana, Jiahong Liu and Kshitiz Gupta from NVIDIA. At AWS re:Invent 2024, we are excited to introduce Amazon Bedrock Marketplace. This is a revolutionary new capability within Amazon Bedrock that serves as a centralized hub for discovering, testing, and implementing foundation models (FMs).

Use Amazon Bedrock tooling with Amazon SageMaker JumpStart models | Amazon Web Services

In: Artificial Intelligence

Today, we’re excited to announce a new capability that allows you to deploy over 100 open-weight and proprietary models from Amazon SageMaker JumpStart and register them with Amazon Bedrock, allowing you to seamlessly access them through the powerful Amazon Bedrock APIs. You can now use Amazon Bedrock features such as…

Introducing Fast Model Loader in SageMaker Inference: Accelerate autoscaling for your Large Language Models (LLMs) – Part 2 | Amazon Web Services

In: Artificial Intelligence

In Part 1 of this series, we introduced Amazon SageMaker Fast Model Loader, a new capability in Amazon SageMaker that significantly reduces the time required to deploy and scale large language models (LLMs) for inference. We discussed how this innovation addresses one of the major bottlenecks in LLM deployment: the time…

Introducing Fast Model Loader in SageMaker Inference: Accelerate autoscaling for your Large Language Models (LLMs) – part 1 | Amazon Web Services

In: Artificial Intelligence

The generative AI landscape has been rapidly evolving, with large language models (LLMs) at the forefront of this transformation. These models have grown exponentially in size and complexity, with some now containing hundreds of billions of parameters and requiring hundreds of gigabytes of memory. As LLMs continue to expand, AI…

Efficiently train models with large sequence lengths using Amazon SageMaker model parallel | Amazon Web Services

In: Artificial Intelligence

Large language models (LLMs) have witnessed an unprecedented surge in popularity, with customers increasingly using publicly available models such as Llama, Stable Diffusion, and Mistral. Across diverse industries, including healthcare, finance, and marketing, organizations are now engaged in pre-training and fine-tuning these increasingly large LLMs, which often boast billions of parameters and…

Reducing hallucinations in large language models with custom intervention using Amazon Bedrock Agents | Amazon Web Services

In: Artificial Intelligence

Hallucinations in large language models (LLMs) refer to the phenomenon where the LLM generates an output that is plausible but factually incorrect or made-up. This can occur when the model’s training data lacks the necessary information or when the model attempts to generate coherent responses by making logical inferences beyond…

How Crexi achieved ML models deployment on AWS at scale and boosted efficiency | Amazon Web Services

In: Artificial Intelligence

This post is co-written with Isaac Smothers and James Healy-Mirkovich from Crexi. With the current demand for AI and machine learning (AI/ML) solutions, the processes to train and deploy models and scale inference are crucial to business success. Even though AI/ML and especially generative AI progress is rapid, machine learning…

Deploy Meta Llama 3.1 models cost-effectively in Amazon SageMaker JumpStart with AWS Inferentia and AWS Trainium | Amazon Web Services

In: Artificial Intelligence

We’re excited to announce the availability of Meta Llama 3.1 8B and 70B inference support on AWS Trainium and AWS Inferentia instances in Amazon SageMaker JumpStart. Meta Llama 3.1 multilingual large language models (LLMs) are a collection of pre-trained and instruction-tuned generative models. Trainium and Inferentia, enabled by the…

Enhance speech synthesis and video generation models with RLHF using audio and video segmentation in Amazon SageMaker | Amazon Web Services

In: Artificial Intelligence

As generative AI models advance in creating multimedia content, the difference between good and great output often lies in the details that only human feedback can capture. Audio and video segmentation provides a structured way to gather this detailed feedback, allowing models to learn through reinforcement learning from human feedback…
