Home Authors Posts by GenAI News

GenAI News

GenAI News
730 POSTS 0 COMMENTS

Agentic AI in the Enterprise Part 2: Guidance by Persona

0
This is Part II of a two-part series from the AWS Generative AI Innovation Center. If you missed Part I, refer to Operationalizing Agentic...

Introducing Disaggregated Inference on AWS powered by llm-d

0
We thank Greg Pereira and Robert Shaw from the llm-d team for their support in bringing llm-d to AWS. In the agentic and reasoning era,...

Build an offline feature store using Amazon SageMaker Unified Studio and...

0
Building and managing machine learning (ML) features at scale is one of the most critical and complex challenges in modern data science workflows. Organizations...

How Workhuman built multi-tenant self-service reporting using Amazon Quick Sight embedded...

0
This post is cowritten with Ilija Subanovic and Michael Rice from Workhuman. Workhuman’s customer service and analytics team were drowning in one-time reporting requests from...

P-EAGLE: Faster LLM inference with Parallel Speculative Decoding in vLLM

0
EAGLE is the state-of-the-art method for speculative decoding in large language model (LLM) inference, but its autoregressive drafting creates a hidden bottleneck: the more...

Secure AI agents with Policy in Amazon Bedrock AgentCore

0
Deploying AI agents safely in regulated industries is challenging. Without proper boundaries, agents that access sensitive data or execute transactions can pose significant security...

Improve operational visibility for inference workloads on Amazon Bedrock with new...

0
As organizations scale their generative AI workloads on Amazon Bedrock, operational visibility into inference performance and resource consumption becomes critical. Teams running latency-sensitive applications...

Fine-tuning NVIDIA Nemotron Speech ASR on Amazon EC2 for domain adaptation

0
This post is a collaboration between AWS, NVIDIA and Heidi.  Automatic speech recognition (ASR), often called speech-to-text (STT) is becoming increasingly critical across industries like...

Multimodal embeddings at scale: AI data lake for media and entertainment...

0
This post shows you how to build a scalable multimodal video search system that enables natural language search across large video datasets using Amazon...

Operationalizing Agentic AI Part 1: A Stakeholder’s Guide

0
Agentic AI isn’t a feature you turn on. It’s a shift in how work is defined, who does it, and how decisions get made. Most...