Tag: Multi-Model Endpoint
Scale foundation model inference to hundreds of models with Amazon SageMaker – Part 1 | Amazon Web Services
As democratization of foundation models (FMs) becomes more prevalent and demand for AI-augmented services increases, software as a service (SaaS) providers are looking to...
Top News
Breaking News
Efficiently train, tune, and deploy custom ensembles using Amazon SageMaker | Amazon Web Services
Artificial intelligence (AI) has become an important and popular topic in the technology community. As AI has evolved, we have seen different types of...
How Forethought saves over 66% in costs for generative AI models using Amazon SageMaker | Amazon Web Services
This post is co-written with Jad Chamoun, Director of Engineering at Forethought Technologies, Inc. and Salina Wu, Senior ML Engineer at Forethought Technologies, Inc....
Host ML models on Amazon SageMaker using Triton: ONNX Models | Amazon Web Services
ONNX (Open Neural Network Exchange) is an open-source standard for representing deep learning models widely supported by many providers. ONNX provides tools for optimizing...
Host ML models on Amazon SageMaker using Triton: CV model with PyTorch backend | Amazon Web Services
PyTorch is a machine learning (ML) framework based on the Torch library, used for applications such as computer vision and natural language processing. One...
Analyze Amazon SageMaker spend and determine cost optimization opportunities based on usage, Part 5: Hosting | Amazon Web Services
In 2021, we launched AWS Support Proactive Services as part of the AWS Enterprise Support plan. Since its introduction, we have helped hundreds of...
Create high-quality images with Stable Diffusion models and deploy them cost-efficiently with Amazon SageMaker | Amazon Web Services
Text-to-image generation is a task in which a machine learning (ML) model generates an image from a textual description. The goal is to generate...
Host ML models on Amazon SageMaker using Triton: Python backend | Amazon Web Services
Amazon SageMaker provides a number of options for users who are looking for a solution to host their machine learning (ML) models. Of these...
Host ML models on Amazon SageMaker using Triton: TensorRT models
Sometimes it can be very beneficial to use tools such as compilers that can modify and compile your models for optimal inference performance. In...
Hosting ML Models on Amazon SageMaker using Triton: XGBoost, LightGBM, and Treelite Models
One of the most popular models available today is XGBoost. With the ability to solve various problems such as classification and regression, XGBoost has...
How Sportradar used the Deep Java Library to build production-scale ML platforms for increased performance and efficiency
This is a guest post co-written with Fred Wu from Sportradar. Sportradar is the world’s leading sports technology company, at the intersection between sports,...
Architect personalized generative AI SaaS applications on Amazon SageMaker
The AI landscape is being reshaped by the rise of generative models capable of synthesizing high-quality data, such as text, images, music, and videos....
Accelerate disaster response with computer vision for satellite imagery using Amazon SageMaker and Amazon Augmented AI
In recent years, advances in computer vision have enabled researchers, first responders, and governments to tackle the challenging problem of processing global satellite...