Tag: SageMaker Inference

Boost inference performance for Mixtral and Llama 2 models with new Amazon SageMaker containers | Amazon Web Services

AI April 8, 2024

In January 2024, Amazon SageMaker launched a new version (0.26.0) of Large Model Inference (LMI) Deep Learning Containers (DLCs). This version offers support for...

Seamlessly transition between no-code and code-first machine learning with Amazon SageMaker Canvas and Amazon SageMaker Studio | Amazon Web Services

AI April 3, 2024

Solar models from Upstage are now available in Amazon SageMaker JumpStart | Amazon Web Services

AI April 2, 2024

Optimize price-performance of LLM inference on NVIDIA GPUs using the Amazon SageMaker integration with NVIDIA NIM Microservices | Amazon Web Services

AI March 18, 2024

Gemma is now available in Amazon SageMaker JumpStart | Amazon Web Services

AIMarch 13, 2024

Today, we’re excited to announce that the Gemma model is now available for customers using Amazon SageMaker JumpStart. Gemma is a family of language models based on...

Run ML inference on unplanned and spiky traffic using Amazon SageMaker multi-model endpoints | Amazon Web Services

AIFebruary 19, 2024

Amazon SageMaker multi-model endpoints (MMEs) are a fully managed capability of SageMaker inference that allows you to deploy thousands of models on a single...

Code Llama 70B is now available in Amazon SageMaker JumpStart | Amazon Web Services

AIFebruary 16, 2024

Today, we are excited to announce that Code Llama foundation models, developed by Meta, are available for customers through Amazon SageMaker JumpStart to deploy...

Train and host a computer vision model for tampering detection on Amazon SageMaker: Part 2 | Amazon Web Services

AIJanuary 31, 2024

In the first part of this three-part series, we presented a solution that demonstrates how you can automate detecting document tampering and fraud at...

Talk to your slide deck using multimodal foundation models hosted on Amazon Bedrock and Amazon SageMaker – Part 1 | Amazon Web Services

AIJanuary 30, 2024

With the advent of generative AI, today’s foundation models (FMs), such as the large language models (LLMs) Claude 2 and Llama 2, can perform...

Reduce inference time for BERT models using neural architecture search and SageMaker Automated Model Tuning | Amazon Web Services

AIJanuary 19, 2024

In this post, we demonstrate how to use neural architecture search (NAS) based structural pruning to compress a fine-tuned BERT model to improve model...

Llama Guard is now available in Amazon SageMaker JumpStart | Amazon Web Services

AIDecember 20, 2023

Today we are excited to announce that the Llama Guard model is now available for customers using Amazon SageMaker JumpStart. Llama Guard provides input...

Identify cybersecurity anomalies in your Amazon Security Lake data using Amazon SageMaker | Amazon Web Services

AIDecember 20, 2023

Customers are faced with increasing security threats and vulnerabilities across infrastructure and application resources as their digital footprint has expanded and the business impact...

Package and deploy classical ML and LLMs easily with Amazon SageMaker, part 2: Interactive User Experiences in SageMaker Studio | Amazon Web Services

AINovember 30, 2023

Amazon SageMaker is a fully managed service that enables developers and data scientists to quickly and easily build, train, and deploy machine learning (ML)...

12 3...7 Page 1 of 7

Latest Intelligence

Accelerate data preparation for ML in Amazon SageMaker Canvas | Amazon Web Services

AI November 29, 2023

Democratize ML on Salesforce Data Cloud with no-code Amazon SageMaker Canvas | Amazon Web Services

AI November 27, 2023

Build a contextual chatbot for financial services using Amazon SageMaker JumpStart, Llama 2 and Amazon OpenSearch Serverless with Vector Engine | Amazon Web Services

AI November 22, 2023

Generative Data Intelligence

Tag: SageMaker Inference

Boost inference performance for Mixtral and Llama 2 models with new Amazon SageMaker containers | Amazon Web Services

Top News

Seamlessly transition between no-code and code-first machine learning with Amazon SageMaker Canvas and Amazon SageMaker Studio | Amazon Web Services

Solar models from Upstage are now available in Amazon SageMaker JumpStart | Amazon Web Services

Optimize price-performance of LLM inference on NVIDIA GPUs using the Amazon SageMaker integration with NVIDIA NIM Microservices | Amazon Web Services

Latest Intelligence

Accelerate data preparation for ML in Amazon SageMaker Canvas | Amazon Web Services

Democratize ML on Salesforce Data Cloud with no-code Amazon SageMaker Canvas | Amazon Web Services

Build a contextual chatbot for financial services using Amazon SageMaker JumpStart, Llama 2 and Amazon OpenSearch Serverless with Vector Engine | Amazon Web Services

Build a medical imaging AI inference pipeline with MONAI Deploy on AWS | Amazon Web Services

Deploy ML models built in Amazon SageMaker Canvas to Amazon SageMaker real-time endpoints | Amazon Web Services

How Meesho built a generalized feed ranker using Amazon SageMaker inference | Amazon Web Services

Whisper models for automatic speech recognition now available in Amazon SageMaker JumpStart | Amazon Web Services

Deploy ML models built in Amazon SageMaker Canvas to Amazon SageMaker real-time endpoints | Amazon Web Services

How Meesho built a generalized feed ranker using Amazon SageMaker inference | Amazon Web Services

Whisper models for automatic speech recognition now available in Amazon SageMaker JumpStart | Amazon Web Services

Chat with us