Generative Data Intelligence

Tag: Multi-Model Endpoint

Scale foundation model inference to hundreds of models with Amazon SageMaker – Part 1 | Amazon Web Services

As democratization of foundation models (FMs) becomes more prevalent and demand for AI-augmented services increases, software as a service (SaaS) providers are looking to...

Top News

Efficiently train, tune, and deploy custom ensembles using Amazon SageMaker | Amazon Web Services

Artificial intelligence (AI) has become an important and popular topic in the technology community. As AI has evolved, we have seen different types of...

How Forethought saves over 66% in costs for generative AI models using Amazon SageMaker | Amazon Web Services

This post is co-written with Jad Chamoun, Director of Engineering at Forethought Technologies, Inc. and Salina Wu, Senior ML Engineer at Forethought Technologies, Inc....

Host ML models on Amazon SageMaker using Triton: ONNX Models | Amazon Web Services

ONNX (Open Neural Network Exchange) is an open-source standard for representing deep learning models widely supported by many providers. ONNX provides tools for optimizing...

Host ML models on Amazon SageMaker using Triton: CV model with PyTorch backend | Amazon Web Services

PyTorch is a machine learning (ML) framework based on the Torch library, used for applications such as computer vision and natural language processing. One...

Analyze Amazon SageMaker spend and determine cost optimization opportunities based on usage, Part 5: Hosting | Amazon Web Services

In 2021, we launched AWS Support Proactive Services as part of the AWS Enterprise Support plan. Since its introduction, we have helped hundreds of...

Create high-quality images with Stable Diffusion models and deploy them cost-efficiently with Amazon SageMaker | Amazon Web Services

Text-to-image generation is a task in which a machine learning (ML) model generates an image from a textual description. The goal is to generate...

Host ML models on Amazon SageMaker using Triton: Python backend | Amazon Web Services

Amazon SageMaker provides a number of options for users who are looking for a solution to host their machine learning (ML) models. Of these...

Host ML models on Amazon SageMaker using Triton: TensorRT models

Sometimes it can be very beneficial to use tools such as compilers that can modify and compile your models for optimal inference performance. In...

Hosting ML Models on Amazon SageMaker using Triton: XGBoost, LightGBM, and Treelite Models

One of the most popular models available today is XGBoost. With the ability to solve various problems such as classification and regression, XGBoost has...

How Sportradar used the Deep Java Library to build production-scale ML platforms for increased performance and efficiency

This is a guest post co-written with Fred Wu from Sportradar. Sportradar is the world’s leading sports technology company, at the intersection between sports,...

Architect personalized generative AI SaaS applications on Amazon SageMaker

The AI landscape is being reshaped by the rise of generative models capable of synthesizing high-quality data, such as text, images, music, and videos....

Accelerate disaster response with computer vision for satellite imagery using Amazon SageMaker and Amazon Augmented AI

In recent years, advances in computer vision have enabled researchers, first responders, and governments to tackle the challenging problem of processing global satellite...

Latest Intelligence

spot_img
spot_img
spot_img

Chat with us

Hi there! How can I help you?