gpu.fm
Intel Gaudi 3 AI Accelerator

Intel Gaudi 3 AI Accelerator

Intel's third-gen AI accelerator optimized for LLM training and inference. Cost-effective alternative with Ethernet-based scaling. Contact us for evaluation units and pricing.

Price Available Upon Request

Contact us for custom pricing and availability

?Request quote for availability

Ships directly from distributor

Price Available Upon Request
This cutting-edge product requires custom configuration and pricing. Contact our team for availability and quotes.

Request Quote & Availability

Or call us at (850) 407-7265

Memory128 GB HBM2e
TDP600W
Form FactorPCIe
Warranty36 months
Questions? Ask our voice agent →

What it's for

Intel Gaudi 3 AI Accelerator delivers exceptional performance for training and inference of large language models and generative AI. Built on TSMC's 5nm process, Gaudi 3 combines 64 tensor processor cores, 8 matrix multiplication engines, and 128GB of HBM2e memory with industry-leading 3.7TB/s bandwidth. With integrated 24x 200GbE networking and competitive pricing, Gaudi 3 offers compelling value for AI infrastructure. Break free from proprietary interconnects. Gaudi 3 scales on standard Ethernet at half the cost. Call (555) 123-4567 for demo units and TCO analysis.

Key Features

  • 64 tensor processor cores (TPCs) with 8 matrix multiplication engines (MMEs)
  • 128GB HBM2e memory with 3.7TB/s bandwidth
  • 1.8 petaFLOPS FP8 and BF16 compute performance
  • Integrated 24x 200 GbE networking for scale-out
  • 96MB on-die SRAM cache for low-latency operations
  • 14 integrated media engines (H.265, H.264, JPEG, VP9)
  • TSMC 5nm process node technology
  • Available in PCIe (600W) and OAM (900W) form factors

Use Cases

  • Large language model training (LLaMA, GPT-style models)
  • Generative AI inference at scale
  • Computer vision and video processing
  • Recommendation systems and embeddings
  • Multi-modal AI workloads
  • Cost-optimized AI infrastructure deployment

Technical Specifications

ArchitectureGaudi 3 (TSMC 5nm)
GPU Memory128 GB HBM2e
Memory Bandwidth3.7 TB/s
Tensor Processor Cores64 TPCs
Matrix Multiply Engines8 MMEs
FP8 Performance1.8 petaFLOPS
BF16 Performance1.8 petaFLOPS
On-die SRAM96 MB
Network Interfaces24x 200 GbE
Media Engines14 (H.265/H.264/JPEG/VP9)
Max TDP (PCIe)600W
Max TDP (OAM)900W
Thermal SolutionAir cooling (PCIe) / Liquid (OAM)
Form FactorPCIe Gen5 AIC
PCIe InterfacePCIe Gen5 x16
Intel Gaudi 3 AI Accelerator | gpu.fm | gpu.fm