Gen AI News Talk
Introducing Fast Model Loader in SageMaker Inference: Accelerate autoscaling for your Large Language Models...
In Part 1 of this series, we introduced Amazon SageMaker Fast Model Loader, a new capability in Amazon SageMaker that significantly reduces the time required...
Introducing Fast Model Loader in SageMaker Inference: Accelerate autoscaling for your Large Language Models...
The generative AI landscape has been rapidly evolving, with large language models (LLMs) at the forefront of this transformation. These models have grown exponentially...
Unlock cost savings with the new scale down to zero feature in SageMaker Inference
Today at AWS re:Invent 2024, we are excited to announce a new feature for Amazon SageMaker inference endpoints: the ability to scale SageMaker inference...
Supercharge your auto scaling for generative AI inference – Introducing Container Caching in SageMaker...
Today at AWS re:Invent 2024, we are excited to announce the new Container Caching capability in Amazon SageMaker, which significantly reduces the time required...
Speed up your AI inference workloads with new NVIDIA-powered capabilities in Amazon SageMaker
This post is co-written with Abhishek Sawarkar, Eliuth Triana, Jiahong Liu and Kshitiz Gupta from NVIDIA.
At re:Invent 2024, we are excited to announce new...
How Amazon Finance Automation built a generative AI Q&A chat assistant using Amazon Bedrock
Today, the Accounts Payable (AP) and Accounts Receivable (AR) analysts in Amazon Finance operations receive queries from customers through email, cases, internal tools, or...
Fast and accurate zero-shot forecasting with Chronos-Bolt and AutoGluon
Chronos-Bolt is the newest addition to AutoGluon-TimeSeries, delivering accurate zero-shot forecasting up to 250 times faster than the original Chronos models .
Time series forecasting...
Create a generative AI assistant with Slack and Amazon Bedrock
Seamless integration of customer experience, collaboration tools, and relevant data is the foundation for delivering knowledge-based productivity gains. In this post, we show you...
Use Amazon Bedrock Agents for code scanning, optimization, and remediation
Amazon Bedrock is a fully managed service that makes foundation models (FMs) from leading AI startups and Amazon available through an API, so you...
Getting started with Amazon Bedrock Agents custom orchestrator
Generative AI agents are designed to interact with their environment to achieve specific objectives, such as automating repetitive tasks and augmenting human capabilities. By...