AWS (Page 8)

How Tealium built a chatbot evaluation platform with Ragas and Auto-Instruct using AWS generative AI services | Amazon Web Services

This post was co-written with Varun Kumar from Tealium Retrieval Augmented Generation (RAG) pipelines are popular for generating domain-specific outputs based on external data that’s fed in as part of the context. However, there are challenges with evaluating and improving such systems. Two open-source libraries, Ragas (a library for RAGContinue Reading

Accelerating ML experimentation with enhanced security: AWS PrivateLink support for Amazon SageMaker with MLflow | Amazon Web Services

In: Artificial Intelligence

With access to a wide range of generative AI foundation models (FM) and the ability to build and train their own machine learning (ML) models in Amazon SageMaker, users want a seamless and secure way to experiment with and select the models that deliver the most value for their business. In theContinue Reading

Building Generative AI and ML solutions faster with AI apps from AWS partners using Amazon SageMaker | Amazon Web Services

In: Artificial Intelligence

Organizations of every size and across every industry are looking to use generative AI to fundamentally transform the business landscape with reimagined customer experiences, increased employee productivity, new levels of creativity, and optimized business processes. A recent study by Telecom Advisory Services, a globally recognized research and consulting firm that specializesContinue Reading

AWS DeepRacer: How to master physical racing? | Amazon Web Services

In: Artificial Intelligence

As developers gear up for re:Invent 2024, they again face the unique challenges of physical racing. What are the obstacles? Let’s have a look. In this blog post, I will look at what makes physical AWS DeepRacer racing—a real car on a real track—different to racing in the virtual world—aContinue Reading

Deploy Meta Llama 3.1-8B on AWS Inferentia using Amazon EKS and vLLM | Amazon Web Services

In: Artificial Intelligence

With the rise of large language models (LLMs) like Meta Llama 3.1, there is an increasing need for scalable, reliable, and cost-effective solutions to deploy and serve these models. AWS Trainium and AWS Inferentia based instances, combined with Amazon Elastic Kubernetes Service (Amazon EKS), provide a performant and low costContinue Reading

Serving LLMs using vLLM and Amazon EC2 instances with AWS AI chips | Amazon Web Services

In: Artificial Intelligence

The use of large language models (LLMs) and generative AI has exploded over the last year. With the release of powerful publicly available foundation models, tools for training, fine tuning and hosting your own LLM have also become democratized. Using vLLM on AWS Trainium and Inferentia makes it possible toContinue Reading

Enhanced observability for AWS Trainium and AWS Inferentia with Datadog | Amazon Web Services

In: Artificial Intelligence

This post is co-written with Curtis Maher and Anjali Thatte from Datadog. This post walks you through Datadog’s new integration with AWS Neuron, which helps you monitor your AWS Trainium and AWS Inferentia instances by providing deep observability into resource utilization, model execution performance, latency, and real-time infrastructure health, enablingContinue Reading

Apply Amazon SageMaker Studio lifecycle configurations using AWS CDK | Amazon Web Services

In: Artificial Intelligence

This post serves as a step-by-step guide on how to set up lifecycle configurations for your Amazon SageMaker Studio domains. With lifecycle configurations, system administrators can apply automated controls to their SageMaker Studio domains and their users. We cover core concepts of SageMaker Studio and provide code examples of howContinue Reading

How Crexi achieved ML models deployment on AWS at scale and boosted efficiency | Amazon Web Services

In: Artificial Intelligence

This post is co-written with Isaac Smothers and James Healy-Mirkovich from Crexi. With the current demand for AI and machine learning (AI/ML) solutions, the processes to train and deploy models and scale inference are crucial to business success. Even though AI/ML and especially generative AI progress is rapid, machine learningContinue Reading

Deploy Meta Llama 3.1 models cost-effectively in Amazon SageMaker JumpStart with AWS Inferentia and AWS Trainium | Amazon Web Services

In: Artificial Intelligence

We’re excited to announce the availability of Meta Llama 3.1 8B and 70B inference support on AWS Trainium and AWS Inferentia instances in Amazon SageMaker JumpStart. Meta Llama 3.1 multilingual large language models (LLMs) are a collection of pre-trained and instruction tuned generative models. Trainium and Inferentia, enabled by theContinue Reading

AWS (Page 8)

How Tealium built a chatbot evaluation platform with Ragas and Auto-Instruct using AWS generative AI services | Amazon Web Services

Accelerating ML experimentation with enhanced security: AWS PrivateLink support for Amazon SageMaker with MLflow | Amazon Web Services

Building Generative AI and ML solutions faster with AI apps from AWS partners using Amazon SageMaker | Amazon Web Services

AWS DeepRacer: How to master physical racing? | Amazon Web Services

Deploy Meta Llama 3.1-8B on AWS Inferentia using Amazon EKS and vLLM | Amazon Web Services

Serving LLMs using vLLM and Amazon EC2 instances with AWS AI chips | Amazon Web Services

Enhanced observability for AWS Trainium and AWS Inferentia with Datadog | Amazon Web Services

Apply Amazon SageMaker Studio lifecycle configurations using AWS CDK | Amazon Web Services

How Crexi achieved ML models deployment on AWS at scale and boosted efficiency | Amazon Web Services

Deploy Meta Llama 3.1 models cost-effectively in Amazon SageMaker JumpStart with AWS Inferentia and AWS Trainium | Amazon Web Services

You’ve never seen atoms like this before: A hidden motion revealed

Build an intelligent eDiscovery solution using Amazon Bedrock Agents | Amazon Web Services