Supercharge your LLM performance with Amazon SageMaker Large Model Inference container v15
Today, we’re excited to announce the launch of Amazon SageMaker Large Model Inference (LMI) container v15, powered by vLLM 0.8.4 with support for the vLLM V1 engine. This version supports the latest open-source models, such as Meta’s Llama 4 models Scout and Maverick, Google’s Gemma 3, Alibaba’s Qwen, and Mistral AI’s models.
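To make the release concrete, the following is a minimal sketch of how a model might be deployed behind a SageMaker endpoint with an LMI container image using the SageMaker Python SDK. The image URI, model ID, endpoint name, instance type, and `OPTION_*` environment variables shown here are illustrative assumptions rather than values taken from this release; consult the LMI v15 documentation for the exact image URI and supported configuration options for your Region.

```python
# Minimal sketch: deploy an open-weight model on a SageMaker endpoint with an LMI
# container image via the SageMaker Python SDK. Image URI, model ID, instance type,
# endpoint name, and OPTION_* settings below are placeholders/assumptions.
import sagemaker
from sagemaker.model import Model
from sagemaker.predictor import Predictor
from sagemaker.serializers import JSONSerializer
from sagemaker.deserializers import JSONDeserializer

session = sagemaker.Session()
role = sagemaker.get_execution_role()  # assumes execution inside SageMaker Studio/notebook

# Placeholder: substitute the LMI v15 (vLLM-backed) image URI for your Region.
lmi_image_uri = "<account>.dkr.ecr.<region>.amazonaws.com/djl-inference:<lmi-v15-tag>"

model = Model(
    image_uri=lmi_image_uri,
    role=role,
    env={
        # Hugging Face model ID to download and serve (illustrative choice).
        "HF_MODEL_ID": "meta-llama/Llama-4-Scout-17B-16E-Instruct",
        # Ask the container to serve with its vLLM engine for continuous (rolling) batching.
        "OPTION_ROLLING_BATCH": "vllm",
    },
    sagemaker_session=session,
)

# Create the endpoint (choose an instance type with enough GPU memory for your model).
model.deploy(
    initial_instance_count=1,
    instance_type="ml.g5.12xlarge",
    endpoint_name="lmi-v15-demo",
)

# Invoke the endpoint with a JSON text-generation request.
predictor = Predictor(
    endpoint_name="lmi-v15-demo",
    sagemaker_session=session,
    serializer=JSONSerializer(),
    deserializer=JSONDeserializer(),
)
print(predictor.predict({"inputs": "What is Amazon SageMaker?", "parameters": {"max_new_tokens": 128}}))
```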