Generative Data Intelligence

Tag: Amazon Elastic Kubernetes Service

Use Kubernetes Operators for new inference capabilities in Amazon SageMaker that reduce LLM deployment costs by 50% on average | Amazon Web Services

We are excited to announce a new version of the Amazon SageMaker Operators for Kubernetes using the AWS Controllers for Kubernetes (ACK). ACK is...

Top News

Scale LLMs with PyTorch 2.0 FSDP on Amazon EKS – Part 2 | Amazon Web Services

This is a guest post co-written with Meta’s PyTorch team and is a continuation of Part 1 of this series, where we demonstrate the...

Federated learning on AWS using FedML, Amazon EKS, and Amazon SageMaker | Amazon Web Services

This post is co-written with Chaoyang He, Al Nevarez and Salman Avestimehr from FedML. Many organizations are...

Architect defense-in-depth security for generative AI applications using the OWASP Top 10 for LLMs | Amazon Web Services

Generative artificial intelligence (AI) applications built around large language models (LLMs) have demonstrated the potential to create and accelerate economic value for businesses. Examples...

Build enterprise-ready generative AI solutions with Cohere foundation models in Amazon Bedrock and Weaviate vector database on AWS Marketplace | Amazon Web Services

Generative AI solutions have the potential to transform businesses by boosting productivity and improving customer experiences, and using large language models (LLMs) with these...

Foundational data protection for enterprise LLM acceleration with Protopia AI | Amazon Web Services

This post is written in collaboration with Balaji Chandrasekaran, Jennifer Cwagenberg, Andrew Sansom, and Eiman Ebrahimi from Protopia AI. New and powerful large...

Rundown of Security News from AWS re:Invent 2023

Amazon Web Services has been unveiling a steady stream of announcements during its AWS re:Invent 2023 event in Las Vegas this week. The focus...

Introducing three new NVIDIA GPU-based Amazon EC2 instances | Amazon Web Services

Amazon Elastic Compute Cloud (Amazon EC2) accelerated computing portfolio offers the broadest choice of accelerators to power your artificial intelligence (AI), machine learning (ML),...

Amazon EC2 DL2q instance for cost-efficient, high-performance AI inference is now generally available | Amazon Web Services

This is a guest post by A.K Roy from Qualcomm AI. Amazon Elastic Compute Cloud (Amazon EC2) DL2q instances, powered by Qualcomm AI 100...

Cloud Service Trends: Positives and Negatives of AWS Cloud

Cloud computing has emerged as a vital technology fueling innovation and success in today's fast-developing corporate landscape. Amazon Web Services (AWS) is a dominant force among...

Enable pod-based GPU metrics in Amazon CloudWatch | Amazon Web Services

In February 2022, Amazon Web Services added support for NVIDIA GPU metrics in Amazon CloudWatch, making it possible to push metrics from the Amazon...

Optimize AWS Inferentia utilization with FastAPI and PyTorch models on Amazon EC2 Inf1 & Inf2 instances | Amazon Web Services

When deploying Deep Learning models at scale, it is crucial to effectively utilize the underlying hardware to maximize performance and cost benefits. For production...

SambaSafety automates custom R workload, improving driver safety with Amazon SageMaker and AWS Step Functions | Amazon Web Services

SambaSafety's mission is to promote safer communities by reducing risk through data insights. Since 1998, SambaSafety has been the leading North American...
