Generative Data Intelligence

Tag: SageMaker Inference

Boost inference performance for Mixtral and Llama 2 models with new Amazon SageMaker containers | Amazon Web Services

In January 2024, Amazon SageMaker launched a new version (0.26.0) of Large Model Inference (LMI) Deep Learning Containers (DLCs). This version offers support for...

Top News

Gemma is now available in Amazon SageMaker JumpStart  | Amazon Web Services

Today, we’re excited to announce that the Gemma model is now available for customers using Amazon SageMaker JumpStart. Gemma is a family of language models based on...

Run ML inference on unplanned and spiky traffic using Amazon SageMaker multi-model endpoints | Amazon Web Services

Amazon SageMaker multi-model endpoints (MMEs) are a fully managed capability of SageMaker inference that allows you to deploy thousands of models on a single...

Code Llama 70B is now available in Amazon SageMaker JumpStart | Amazon Web Services

Today, we are excited to announce that Code Llama foundation models, developed by Meta, are available for customers through Amazon SageMaker JumpStart to deploy...

Train and host a computer vision model for tampering detection on Amazon SageMaker: Part 2 | Amazon Web Services

In the first part of this three-part series, we presented a solution that demonstrates how you can automate detecting document tampering and fraud at...

Talk to your slide deck using multimodal foundation models hosted on Amazon Bedrock and Amazon SageMaker – Part 1 | Amazon Web Services

With the advent of generative AI, today’s foundation models (FMs), such as the large language models (LLMs) Claude 2 and Llama 2, can perform...

Reduce inference time for BERT models using neural architecture search and SageMaker Automated Model Tuning | Amazon Web Services

In this post, we demonstrate how to use neural architecture search (NAS) based structural pruning to compress a fine-tuned BERT model to improve model...

Llama Guard is now available in Amazon SageMaker JumpStart | Amazon Web Services

Today we are excited to announce that the Llama Guard model is now available for customers using Amazon SageMaker JumpStart. Llama Guard provides input...

Identify cybersecurity anomalies in your Amazon Security Lake data using Amazon SageMaker | Amazon Web Services

Customers are faced with increasing security threats and vulnerabilities across infrastructure and application resources as their digital footprint has expanded and the business impact...

Package and deploy classical ML and LLMs easily with Amazon SageMaker, part 2: Interactive User Experiences in SageMaker Studio | Amazon Web Services

Amazon SageMaker is a fully managed service that enables developers and data scientists to quickly and easily build, train, and deploy machine learning (ML)...

Package and deploy classical ML and LLMs easily with Amazon SageMaker, part 1: PySDK Improvements | Amazon Web Services

Amazon SageMaker is a fully managed service that enables developers and data scientists to quickly and effortlessly build, train, and deploy machine learning (ML)...

Scale foundation model inference to hundreds of models with Amazon SageMaker – Part 1 | Amazon Web Services

As democratization of foundation models (FMs) becomes more prevalent and demand for AI-augmented services increases, software as a service (SaaS) providers are looking to...

Reduce model deployment costs by 50% on average using the latest features of Amazon SageMaker | Amazon Web Services

As organizations deploy models to production, they are constantly looking for ways to optimize the performance of their foundation models (FMs) running on the...

Latest Intelligence

spot_img
spot_img
spot_img

Chat with us

Hi there! How can I help you?