Gen AI News Talk
Build a dynamic, role-based AI agent using Amazon Bedrock inline agents
AI agents continue to gain momentum, as businesses use the power of generative AI to reinvent customer experiences and automate complex workflows. We are...
Use language embeddings for zero-shot classification and semantic search with Amazon Bedrock
In this post, we discuss what embeddings are, show how to practically use language embeddings, and explore how to use them to add functionality...
From concept to reality: Navigating the Journey of RAG from proof of concept to...
Generative AI has emerged as a transformative force, captivating industries with its potential to create, innovate, and solve complex problems. However, the journey from...
LLM-as-a-judge on Amazon Bedrock Model Evaluation
The evaluation of large language model (LLM) performance, particularly in response to a variety of prompts, is crucial for organizations aiming to harness the...
Achieve ~2x speed-up in LLM inference with Medusa-1 on Amazon SageMaker AI
This blog post is co-written with Moran beladev, Manos Stergiadis, and Ilya Gusev from Booking.com.
Large language models (LLMs) have revolutionized the field of natural...
Fine-tune LLMs with synthetic data for context-based Q&A using Amazon Bedrock
There’s a growing demand from customers to incorporate generative AI into their businesses. Many use cases involve using pre-trained large language models (LLMs) through...
Meta SAM 2.1 is now available in Amazon SageMaker JumpStart
This blog post is co-written with George Orlin from Meta.
Today, we are excited to announce that Meta’s Segment Anything Model (SAM) 2.1 vision segmentation...
Falcon 3 models now available in Amazon SageMaker JumpStart
Today, we are excited to announce that the Falcon 3 family of models from TII are available in Amazon SageMaker JumpStart. In this post,...
Building a virtual meteorologist using Amazon Bedrock Agents
The integration of generative AI capabilities is driving transformative changes across many industries. Although weather information is accessible through multiple channels, businesses that heavily...
Faster distributed graph neural network training with GraphStorm v0.4
GraphStorm is a low-code enterprise graph machine learning (ML) framework that provides ML practitioners a simple way of building, training, and deploying graph ML...