Move Beyond Chain-of-Thought with Chain-of-Draft on Amazon Bedrock

Gen AI News Talk December 22, 2025 0

As organizations scale their generative AI implementations, the critical challenge of balancing quality, cost, and latency becomes increasingly complex. With inference costs dominating 70–90%...

Introducing SOCI indexing for Amazon SageMaker Studio: Faster container startup times for AI/ML workloads

Gen AI News Talk December 19, 2025 0

Today, we are excited to introduce a new feature for SageMaker Studio: SOCI (Seekable Open Container Initiative) indexing. SOCI supports lazy loading of container images,...

Bi-directional streaming for real-time agent interactions now available in Amazon Bedrock AgentCore Runtime

Gen AI News Talk December 18, 2025 0

Building natural voice conversations with AI agents requires complex infrastructure and lots of code from engineering teams. Text-based agent interactions follow a turn-based pattern:...

Build and deploy scalable AI agents with NVIDIA NeMo, Amazon Bedrock AgentCore, and Strands...

Gen AI News Talk December 18, 2025 0

This post is co-written with Ranjit Rajan, Abdullahi Olaoye, and Abhishek Sawarkar from NVIDIA. AI’s next frontier isn’t merely smarter chat-based assistants, it’s autonomous agents...

Tracking and managing assets used in AI development with Amazon SageMaker AI

Gen AI News Talk December 17, 2025 0

Building custom foundation models requires coordinating multiple assets across the development lifecycle such as data assets, compute infrastructure, model architecture and frameworks, lineage, and...

Track machine learning experiments with MLflow on Amazon SageMaker using Snowflake integration

Gen AI News Talk December 17, 2025 0

A user can conduct machine learning (ML) data experiments in data environments, such as Snowflake, using the Snowpark library. However, tracking these experiments across...

Governance by design: The essential guide for successful AI scaling

Gen AI News Talk December 16, 2025 0

Picture this: Your enterprise has just deployed its first generative AI application. The initial results are promising, but as you plan to scale across...

How Tata Power CoE built a scalable AI-powered solar panel inspection solution with Amazon...

Gen AI News Talk December 16, 2025 0

This post is co-written with Vikram Bansal from Tata Power, and Gaurav Kankaria, Omkar Dhavalikar from Oneture. The global adoption of solar energy is rapidly...

Unlocking video understanding with TwelveLabs Marengo on Amazon Bedrock

Gen AI News Talk December 16, 2025 0

Media and entertainment, advertising, education, and enterprise training content combines visual, audio, and motion elements to tell stories and convey information, making it far...

Checkpointless training on Amazon SageMaker HyperPod: Production-scale training with faster fault recovery

Gen AI News Talk December 15, 2025 0

Foundation model training has reached an inflection point where traditional checkpoint-based recovery methods are becoming a bottleneck to efficiency and cost-effectiveness. As models grow...

AWS and NVIDIA deepen strategic collaboration to accelerate AI from pilot to production

Agentic AI in the Enterprise Part 2: Guidance by Persona

Introducing Disaggregated Inference on AWS powered by llm-d

Build an offline feature store using Amazon SageMaker Unified Studio and SageMaker Catalog

How Workhuman built multi-tenant self-service reporting using Amazon Quick Sight embedded dashboards

Gen AI News Talk