Serving LLMs using vLLM and Amazon EC2 instances with AWS AI chips | Amazon Web Services
The use of large language models (LLMs) and generative AI has exploded over the past year. With the release of powerful publicly available foundation models, tools for training, fine-tuning, and hosting your own LLM have also become democratized. Using vLLM on AWS Trainium and Inferentia makes it possible to serve LLMs on purpose-built AWS AI chips.