Gen AI News Talk
Amazon SageMaker launches the updated inference optimization toolkit for generative AI
Today, Amazon SageMaker is excited to announce updates to the inference optimization toolkit, providing new functionality and enhancements to help you optimize generative AI...
Elevate customer experience by using the Amazon Q Business custom plugin for New Relic...
Digital experience interruptions can harm customer satisfaction and business performance across industries. Application failures, slow load times, and service unavailability can lead to user...
Query structured data from Amazon Q Business using Amazon QuickSight integration
Amazon Q Business is a generative AI-powered assistant that can answer questions, provide summaries, generate content, and securely complete tasks based on data and...
Introducing Fast Model Loader in SageMaker Inference: Accelerate autoscaling for your Large Language Models...
In Part 1 of this series, we introduced Amazon SageMaker Fast Model Loader, a new capability in Amazon SageMaker that significantly reduces the time required...
Introducing Fast Model Loader in SageMaker Inference: Accelerate autoscaling for your Large Language Models...
The generative AI landscape has been rapidly evolving, with large language models (LLMs) at the forefront of this transformation. These models have grown exponentially...
Unlock cost savings with the new scale down to zero feature in SageMaker Inference
Today at AWS re:Invent 2024, we are excited to announce a new feature for Amazon SageMaker inference endpoints: the ability to scale SageMaker inference...
Supercharge your auto scaling for generative AI inference – Introducing Container Caching in SageMaker...
Today at AWS re:Invent 2024, we are excited to announce the new Container Caching capability in Amazon SageMaker, which significantly reduces the time required...
Speed up your AI inference workloads with new NVIDIA-powered capabilities in Amazon SageMaker
This post is co-written with Abhishek Sawarkar, Eliuth Triana, Jiahong Liu and Kshitiz Gupta from NVIDIA.
At re:Invent 2024, we are excited to announce new...
How Amazon Finance Automation built a generative AI Q&A chat assistant using Amazon Bedrock
Today, the Accounts Payable (AP) and Accounts Receivable (AR) analysts in Amazon Finance operations receive queries from customers through email, cases, internal tools, or...
Fast and accurate zero-shot forecasting with Chronos-Bolt and AutoGluon
Chronos-Bolt is the newest addition to AutoGluon-TimeSeries, delivering accurate zero-shot forecasting up to 250 times faster than the original Chronos models .
Time series forecasting...

















