model (Page 5)

Track LLM model evaluation using Amazon SageMaker managed MLflow and FMEval | Amazon Web Services

Evaluating large language models (LLMs) is crucial as LLM-based systems become increasingly powerful and relevant in our society. Rigorous testing allows us to understand an LLM’s capabilities, limitations, and potential biases, and provide actionable feedback to identify and mitigate risk. Furthermore, evaluation processes are important not only for LLMs, butContinue Reading

Create a SageMaker inference endpoint with custom model & extended container | Amazon Web Services

In: Artificial Intelligence

Amazon SageMaker provides a seamless experience for building, training, and deploying machine learning (ML) models at scale. Although SageMaker offers a wide range of built-in algorithms and pre-trained models through Amazon SageMaker JumpStart, there are scenarios where you might need to bring your own custom model or use specific softwareContinue Reading

Unlock cost-effective AI inference using Amazon Bedrock serverless capabilities with an Amazon SageMaker trained model | Amazon Web Services

In: Artificial Intelligence

In this post, I’ll show you how to use Amazon Bedrock—with its fully managed, on-demand API—with your Amazon SageMaker trained or fine-tuned model. Amazon Bedrock is a fully managed service that offers a choice of high-performing foundation models (FMs) from leading AI companies such as AI21 Labs, Anthropic, Cohere, Meta,Continue Reading

How Fastweb fine-tuned the Mistral model using Amazon SageMaker HyperPod as a first step to build an Italian large language model | Amazon Web Services

In: Artificial Intelligence

This post is co-written with Marta Cavalleri and Giovanni Germani from Fastweb, and Claudia Sacco and Andrea Policarpi from BIP xTech. AI’s transformative impact extends throughout the modern business landscape, with telecommunications emerging as a key area of innovation. Fastweb, one of Italy’s leading telecommunications operators, recognized the immense potentialContinue Reading

A guide to Amazon Bedrock Model Distillation (preview) | Amazon Web Services

In: Artificial Intelligence

When using generative AI, achieving high performance with low latency models that are cost-efficient is often a challenge, because these goals can clash with each other. With the newly launched Amazon Bedrock Model Distillation feature, you can use smaller, faster, and cost-efficient models that deliver use-case specific accuracy that isContinue Reading

Introducing Fast Model Loader in SageMaker Inference: Accelerate autoscaling for your Large Language Models (LLMs) – Part 2 | Amazon Web Services

In: Artificial Intelligence

In Part 1 of this series, we introduced Amazon SageMaker Fast Model Loader, a new capability in Amazon SageMaker that significantly reduces the time required to deploy and scale large language models (LLMs) for inference. We discussed how this innovation addresses one of the major bottlenecks in LLM deployment: the timeContinue Reading

Introducing Fast Model Loader in SageMaker Inference: Accelerate autoscaling for your Large Language Models (LLMs) – part 1 | Amazon Web Services

In: Artificial Intelligence

The generative AI landscape has been rapidly evolving, with large language models (LLMs) at the forefront of this transformation. These models have grown exponentially in size and complexity, with some now containing hundreds of billions of parameters and requiring hundreds of gigabytes of memory. As LLMs continue to expand, AIContinue Reading

Efficiently train models with large sequence lengths using Amazon SageMaker model parallel | Amazon Web Services

In: Artificial Intelligence

Large language models (LLMs) have witnessed an unprecedented surge in popularity, with customers increasingly using publicly available models such as Llama, Stable Diffusion, and Mistral. Across diverse industries—including healthcare, finance, and marketing—organizations are now engaged in pre-training and fine-tuning these increasingly larger LLMs, which often boast billions of parameters andContinue Reading

Cohere Embed multimodal embeddings model is now available on Amazon SageMaker JumpStart | Amazon Web Services

In: Artificial Intelligence

The Cohere Embed multimodal embeddings model is now generally available on Amazon SageMaker JumpStart. This model is the newest Cohere Embed 3 model, which is now multimodal and capable of generating embeddings from both text and images, enabling enterprises to unlock real value from their vast amounts of data thatContinue Reading

Centralize model governance with SageMaker Model Registry Resource Access Manager sharing | Amazon Web Services

In: Artificial Intelligence

We recently announced the general availability of cross-account sharing of Amazon SageMaker Model Registry using AWS Resource Access Manager (AWS RAM), making it easier to securely share and discover machine learning (ML) models across your AWS accounts. Customers find it challenging to share and access ML models across AWS accounts becauseContinue Reading

model (Page 5)

Track LLM model evaluation using Amazon SageMaker managed MLflow and FMEval | Amazon Web Services

Create a SageMaker inference endpoint with custom model & extended container | Amazon Web Services

Unlock cost-effective AI inference using Amazon Bedrock serverless capabilities with an Amazon SageMaker trained model | Amazon Web Services

How Fastweb fine-tuned the Mistral model using Amazon SageMaker HyperPod as a first step to build an Italian large language model | Amazon Web Services

A guide to Amazon Bedrock Model Distillation (preview) | Amazon Web Services

Introducing Fast Model Loader in SageMaker Inference: Accelerate autoscaling for your Large Language Models (LLMs) – Part 2 | Amazon Web Services

Introducing Fast Model Loader in SageMaker Inference: Accelerate autoscaling for your Large Language Models (LLMs) – part 1 | Amazon Web Services

Efficiently train models with large sequence lengths using Amazon SageMaker model parallel | Amazon Web Services

Cohere Embed multimodal embeddings model is now available on Amazon SageMaker JumpStart | Amazon Web Services

Scientists suggest the brain may work best with 7 senses, not just 5

Vxceed builds the perfect sales pitch for sales teams at scale using Amazon Bedrock | Amazon Web Services