Tag: Amazon Elastic Kubernetes Service

Generative artificial intelligence (AI) applications built around large language models (LLMs) have demonstrated the potential to create and accelerate economic value for businesses. Examples...

Build enterprise-ready generative AI solutions with Cohere foundation models in Amazon Bedrock and Weaviate vector database on AWS Marketplace | Amazon Web Services

AIJanuary 24, 2024

Generative AI solutions have the potential to transform businesses by boosting productivity and improving customer experiences, and using large language models (LLMs) with these...

Foundational data protection for enterprise LLM acceleration with Protopia AI | Amazon Web Services

AIDecember 5, 2023

This post is written in collaboration with Balaji Chandrasekaran, Jennifer Cwagenberg and Andrew Sansom and Eiman Ebrahimi from Protopia AI. New and powerful large...

Rundown of Security News from AWS re:Invent 2023

Cyber SecurityNovember 29, 2023

Amazon Web Services has been unveiling a steady stream of announcements during its AWS re:Invent 2023 event in Las Vegas this week. The focus...

Introducing three new NVIDIA GPU-based Amazon EC2 instances | Amazon Web Services

AINovember 27, 2023

Amazon Elastic Compute Cloud (Amazon EC2) accelerated computing portfolio offers the broadest choice of accelerators to power your artificial intelligence (AI), machine learning (ML),...

Amazon EC2 DL2q instance for cost-efficient, high-performance AI inference is now generally available | Amazon Web Services

AINovember 22, 2023

This is a guest post by A.K Roy from Qualcomm AI. Amazon Elastic Compute Cloud (Amazon EC2) DL2q instances, powered by Qualcomm AI 100...

Cloud Service Trends: Positives and Negatives of AWS Cloud

FintechOctober 11, 2023

Cloud computing has emerged as a vital technology fueling innovation and success in today's fast developing corporate landscape. Amazon Web Services (AWS) is a dominant force among...

Enable pod-based GPU metrics in Amazon CloudWatch | Amazon Web Services

AISeptember 7, 2023

In February 2022, Amazon Web Services added support for NVIDIA GPU metrics in Amazon CloudWatch, making it possible to push metrics from the Amazon...

Optimize AWS Inferentia utilization with FastAPI and PyTorch models on Amazon EC2 Inf1 & Inf2 instances | Amazon Web Services

AIJuly 24, 2023

When deploying Deep Learning models at scale, it is crucial to effectively utilize the underlying hardware to maximize performance and cost benefits. For production...

SambaSafety automates custom R workload, improving driver safety with Amazon SageMaker and AWS Step Functions | Amazon Web Services

AIJune 16, 2023

At SambaSafety, their mission is to promote safer communities by reducing risk through data insights. Since 1998, SambaSafety has been the leading North American...

12 3 Page 1 of 3

Analyze Amazon SageMaker spend and determine cost optimization opportunities based on usage, Part 1 | Amazon Web Services

AI May 30, 2023

Accelerate machine learning time to value with Amazon SageMaker JumpStart and PwC’s MLOps accelerator | Amazon Web Services

AI May 23, 2023

How Sportradar used the Deep Java Library to build production-scale ML platforms for increased performance and efficiency

AI April 19, 2023

Deploy pre-trained models on AWS Wavelength with 5G edge using Amazon SageMaker JumpStart

AI April 7, 2023

Accelerate machine learning time to value with Amazon SageMaker JumpStart and PwC’s MLOps accelerator | Amazon Web Services

AI May 23, 2023

How Sportradar used the Deep Java Library to build production-scale ML platforms for increased performance and efficiency

AI April 19, 2023

Deploy pre-trained models on AWS Wavelength with 5G edge using Amazon SageMaker JumpStart

AI April 7, 2023

Generative Data Intelligence

Tag: Amazon Elastic Kubernetes Service

Use Kubernetes Operators for new inference capabilities in Amazon SageMaker that reduce LLM deployment costs by 50% on average | Amazon Web Services

Top News

Scale AI training and inference for drug discovery through Amazon EKS and Karpenter | Amazon Web Services

Open source observability for AWS Inferentia nodes within Amazon EKS clusters | Amazon Web Services

Critical Bugs Put Hugging Face AI Platform in a ‘Pickle’

Latest Intelligence

How Forethought saves over 66% in costs for generative AI models using Amazon SageMaker | Amazon Web Services

AWS Inferentia2 builds on AWS Inferentia1 by delivering 4x higher throughput and 10x lower latency | Amazon Web Services

Build high-performance ML models using PyTorch 2.0 on AWS – Part 1 | Amazon Web Services

Analyze Amazon SageMaker spend and determine cost optimization opportunities based on usage, Part 1 | Amazon Web Services

Accelerate machine learning time to value with Amazon SageMaker JumpStart and PwC’s MLOps accelerator | Amazon Web Services

How Sportradar used the Deep Java Library to build production-scale ML platforms for increased performance and efficiency

Deploy pre-trained models on AWS Wavelength with 5G edge using Amazon SageMaker JumpStart

Accelerate machine learning time to value with Amazon SageMaker JumpStart and PwC’s MLOps accelerator | Amazon Web Services

How Sportradar used the Deep Java Library to build production-scale ML platforms for increased performance and efficiency

Deploy pre-trained models on AWS Wavelength with 5G edge using Amazon SageMaker JumpStart

Chat with us