Gen AI News Talk
Move Beyond Chain-of-Thought with Chain-of-Draft on Amazon Bedrock
As organizations scale their generative AI implementations, the critical challenge of balancing quality, cost, and latency becomes increasingly complex. With inference costs dominating 70–90%...
Introducing SOCI indexing for Amazon SageMaker Studio: Faster container startup times for AI/ML workloads
Today, we are excited to introduce a new feature for SageMaker Studio: SOCI (Seekable Open Container Initiative) indexing. SOCI supports lazy loading of container images,...
Bi-directional streaming for real-time agent interactions now available in Amazon Bedrock AgentCore Runtime
Building natural voice conversations with AI agents requires complex infrastructure and lots of code from engineering teams. Text-based agent interactions follow a turn-based pattern:...
Build and deploy scalable AI agents with NVIDIA NeMo, Amazon Bedrock AgentCore, and Strands...
This post is co-written with Ranjit Rajan, Abdullahi Olaoye, and Abhishek Sawarkar from NVIDIA.
AI’s next frontier isn’t merely smarter chat-based assistants, it’s autonomous agents...
Tracking and managing assets used in AI development with Amazon SageMaker AI
Building custom foundation models requires coordinating multiple assets across the development lifecycle such as data assets, compute infrastructure, model architecture and frameworks, lineage, and...
Track machine learning experiments with MLflow on Amazon SageMaker using Snowflake integration
A user can conduct machine learning (ML) data experiments in data environments, such as Snowflake, using the Snowpark library. However, tracking these experiments across...
Governance by design: The essential guide for successful AI scaling
Picture this: Your enterprise has just deployed its first generative AI application. The initial results are promising, but as you plan to scale across...
How Tata Power CoE built a scalable AI-powered solar panel inspection solution with Amazon...
This post is co-written with Vikram Bansal from Tata Power, and Gaurav Kankaria, Omkar Dhavalikar from Oneture.
The global adoption of solar energy is rapidly...
Unlocking video understanding with TwelveLabs Marengo on Amazon Bedrock
Media and entertainment, advertising, education, and enterprise training content combines visual, audio, and motion elements to tell stories and convey information, making it far...
Checkpointless training on Amazon SageMaker HyperPod: Production-scale training with faster fault recovery
Foundation model training has reached an inflection point where traditional checkpoint-based recovery methods are becoming a bottleneck to efficiency and cost-effectiveness. As models grow...

















