LLM Inference Engineering: Quantization, KV-Cache Optimization, and High-Throughput Serving: A Production Engineer's Guide to INT4/INT8 ... Speculative Decoding, Cost Optimization

Price from 9,14 €

Description

Amazon LLM Inference Engineering: Quantization, KV-Cache Optimization, and High-Throughput Serving: A Production Engineer's Guide to INT4/INT8 Quantization, ... Speculative Decoding, and Cost Optimization

Compare online shops (2)

Shop    Price     Shipping    Total price
        9,14 €    3,00 €      12,14 €
        9,14 €    3,00 €      12,14 €

Product specifications

Brand: Independently Published
EAN: 9798180985187
