Gen AI News Talk
Deploy RAG applications on Amazon SageMaker JumpStart using FAISS
Generative AI has empowered customers with their own information in unprecedented ways, reshaping interactions across various industries by enabling intuitive and personalized experiences. This...
Speed up your cluster procurement time with Amazon SageMaker HyperPod training plans
Today, organizations are constantly seeking ways to use advanced large language models (LLMs) for their specific needs. These organizations are engaging in both pre-training...
Amazon Bedrock Marketplace now includes NVIDIA models: Introducing NVIDIA Nemotron-4 NIM microservices
This post is co-written with Abhishek Sawarkar, Eliuth Triana, Jiahong Liu and Kshitiz Gupta from NVIDIA.
At AWS re:Invent 2024, we are excited to introduce...
Scale ML workflows with Amazon SageMaker Studio and Amazon SageMaker HyperPod
Scaling machine learning (ML) workflows from initial prototypes to large-scale production deployment can be daunting task, but the integration of Amazon SageMaker Studio and...
Build generative AI applications quickly with Amazon Bedrock IDE in Amazon SageMaker Unified Studio
Building generative AI applications presents significant challenges for organizations: they require specialized ML expertise, complex infrastructure management, and careful orchestration of multiple services. To...
A guide to Amazon Bedrock Model Distillation (preview)
When using generative AI, achieving high performance with low latency models that are cost-efficient is often a challenge, because these goals can clash with...
Use Amazon Bedrock tooling with Amazon SageMaker JumpStart models
Today, we’re excited to announce a new capability that allows you to deploy over 100 open-weight and proprietary models from Amazon SageMaker JumpStart and...
Real value, real time: Production AI with Amazon SageMaker and Tecton
This post is cowritten with Isaac Cameron and Alex Gnibus from Tecton.
Businesses are under pressure to show return on investment (ROI) from AI use...
Building Generative AI and ML solutions faster with AI apps from AWS partners using Amazon...
Organizations of every size and across every industry are looking to use generative AI to fundamentally transform the business landscape with reimagined customer experiences,...
Introducing Amazon Kendra GenAI Index – Enhanced semantic search and retrieval capabilities
Amazon Kendra is an intelligent enterprise search service that helps you search across different content repositories with built-in connectors. AWS customers use Amazon Kendra...