Models

Deploy a Model

Choose a preset model or enter any HuggingFace model path to deploy an inference server.

Choose Model

Model Family  Version           Specialized Task  GPU        VRAM   Tags
Qwen3-Coder   30B MoE           Code Generation   H200:1     46 GB  Tool calling, fp8
Qwen3-Coder   30B MoE FP8       Code Generation   H200:1     48 GB  Tool calling, fp8
Qwen3         4B Instruct 2507  Text Generation   RTX5090:1  24 GB  Tool calling, fp8
Qwen3         0.6B              Text Generation   RTX5090:1  16 GB  Tool calling
Qwen3         1.7B              Text Generation   RTX5090:1  16 GB  Tool calling
Qwen3         4B                Text Generation   RTX5090:1  16 GB  Tool calling
Qwen3         8B                Text Generation   RTX5090:1  24 GB  Tool calling
Qwen3         3.5B              Text Generation   RTX5090:1  24 GB  Tool calling

23 models · page 1 of 3

Configuration
