GPU types & models
Compare 85 GPU types tracked across our provider network. Filter by VRAM, brand and pricing to find the right fit.
- GPU types
- 85
- Offerings
- 802
- Up to
- 432GB
- Cheapest
- $0.02/hr
Choose the right GPU
Quick heuristics for common workloads.
Training & research
High-VRAM accelerators like H100, H200 and A100 for large language models and deep learning research.
Inference & production
Optimized GPUs like L40S, RTX 4090 and RTX 5090 for efficient model serving and inference workloads.
Browse by group
Curated GPU groups for faster discovery.
Latest generation GPUs across datacenter and prosumer lines.
Browse groupTop-tier accelerators for demanding training and large-scale inference.
Browse groupCost-efficient GPUs with strong value for common workloads.
Browse groupAvailable GPU types
Showing 85 of 85 models.
NVIDIA Tesla P4
NVIDIA Tesla P4 8 GB - low-power Pascal inference accelerator for video transcoding and edge AI workloads.
NVIDIA Tesla V100
NVIDIA Tesla V100 16 GB Volta - legacy datacenter accelerator still popular for CUDA development, scientific compute and AI research.
NVIDIA GTX 1060
GeForce GTX 1060 6 GB - mid-range Pascal graphics card for 1080p gaming and lightweight AI experimentation.
NVIDIA RTX 3070
GeForce RTX 3070 8 GB - mid-range graphics card for 1440p gaming, creative rendering and AI-assisted workflows.
NVIDIA RTX A2000
NVIDIA RTX A2000 12 GB - compact Ampere professional GPU for CAD, VR development and AI-assisted workflows.
NVIDIA GTX 1080
GeForce GTX 1080 8 GB - high-end Pascal graphics card for 1440p gaming and creative workloads.
NVIDIA RTX 3060
GeForce RTX 3060 12 GB - mid-range graphics card for 1440p gaming, creative rendering and AI-assisted workflows.
NVIDIA GTX 1080 Ti
GeForce GTX 1080 Ti 11 GB - flagship Pascal card still capable for 4K gaming, rendering and entry AI workloads.
NVIDIA GTX 1660 Super
GeForce GTX 1660 Super 6 GB - refreshed Turing card with faster GDDR6 memory for 1080p gaming.
NVIDIA Quadro P4000
NVIDIA Quadro P4000 8 GB - single-slot Pascal workstation GPU for VR, rendering and creative production.
NVIDIA GTX 1660 Ti
GeForce GTX 1660 Ti 6 GB - higher-clocked Turing model for 1080p gaming and light creator workloads.
NVIDIA RTX 2080 Ti
GeForce RTX 2080 Ti 11 GB - high-end graphics card for 4K gaming, creative rendering and AI-assisted workflows.
NVIDIA Tesla P100
NVIDIA Tesla P100 16 GB HBM2 - Pascal datacenter accelerator for HPC and early deep-learning training workloads.
NVIDIA RTX 3060 Ti
GeForce RTX 3060 Ti 8 GB - mid-range graphics card for 1440p gaming, creative rendering and AI-assisted workflows.
NVIDIA RTX 3080 Ti
GeForce RTX 3080 Ti 12 GB - enthusiast graphics card for 4K gaming, creative rendering and entry-level AI experimentation.
NVIDIA RTX 4090
GeForce RTX 4090 24 GB - ultimate consumer GPU for 8K gaming, neural rendering and AI model training.
NVIDIA RTX A4000
NVIDIA RTX A4000 16 GB - compact professional GPU for CAD, VR development and AI-assisted workflows.
NVIDIA Tesla P40
NVIDIA Tesla P40 24 GB - Pascal inference accelerator with high VRAM, popular for self-hosted LLM inference on a budget.
NVIDIA RTX 2060
GeForce RTX 2060 6 GB - entry-level Turing RTX card bringing real-time ray-tracing to mainstream gaming.
NVIDIA T4
NVIDIA T4 16 GB Tensor Core GPU (Turing) widely used for cost-efficient AI inference, video transcoding and lightweight training workloads.
NVIDIA GTX 1070
GeForce GTX 1070 8 GB - mid-range graphics card for 1440p gaming, creative rendering and AI-assisted workflows.
NVIDIA RTX 2070 Super
GeForce RTX 2070 Super 8 GB - refreshed Turing model with more cores for creators and deep-learning hobbyists.
NVIDIA RTX 5060 Ti
GeForce RTX 5060 Ti 16 GB Blackwell - rumoured mid-range card targeting efficient 1440p gaming and entry AI workloads.
NVIDIA RTX 3090
GeForce RTX 3090 24 GB - powerhouse card for gaming, 3D rendering and AI research on a budget.
NVIDIA RTX 5060
GeForce RTX 5060 8 GB Blackwell - rumoured mid-range card targeting efficient 1440p gaming and entry AI workloads.
NVIDIA RTX 4060
GeForce RTX 4060 8 GB - efficient GPU for 1440p gaming, DLSS 3 frame generation and lightweight AI projects.
NVIDIA RTX 3060 Laptop GPU
GeForce RTX 3060 Laptop GPU 6 GB - mobile Ampere card for gaming notebooks and portable AI workloads. Distinct from the desktop RTX 3060 (12 GB).
NVIDIA RTX 5080
GeForce RTX 5080 16 GB Blackwell - anticipated high-end successor delivering faster ray-tracing and AI inference.
NVIDIA RTX 3080
GeForce RTX 3080 10 GB - high-end graphics card for 4K gaming, creative rendering and AI-assisted workflows.
NVIDIA RTX 2000 Ada Generation
NVIDIA RTX 2000 Ada Generation 16 GB - compact professional GPU for CAD, VR development and AI-assisted workflows.
NVIDIA RTX A4500
NVIDIA RTX A4500 20 GB - Ampere-based workstation card for real-time ray-tracing, simulation and AI inference.
NVIDIA RTX 4000 Ada Generation
NVIDIA RTX 4000 Ada 20 GB - single-slot Ada Lovelace GPU optimised for compact workstations, AI inferencing and real-time graphics.
NVIDIA L4
The NVIDIA L4 Tensor Core GPU powered by the NVIDIA Ada Lovelace architecture delivers universal, energy-efficient acceleration for video, AI, visual computing, graphics, virtualization, and more.
NVIDIA RTX 4080 Super
GeForce RTX 4080 Super 16 GB - refreshed Ada model with more cores for creators and deep-learning hobbyists.
NVIDIA RTX 5070 Ti
Leaked GeForce RTX 5070 Ti 16 GB - projected sweet-spot Blackwell GPU for creators, gamers and small-scale machine-learning.
NVIDIA RTX A5000
NVIDIA RTX A5000 24 GB - versatile pro GPU for large-scene rendering, VR and deep-learning acceleration.
NVIDIA GTX 1650
GeForce GTX 1650 4 GB - entry-level Turing graphics card for 1080p gaming and basic content creation.
NVIDIA GTX 1050 Ti
GeForce GTX 1050 Ti 4 GB - entry-level graphics card for lightweight AI inference and basic content creation.
NVIDIA RTX 5090
GeForce RTX 5090 32 GB Blackwell flagship - tipped to exceed RTX 4090 in gaming, rendering and generative-AI.
NVIDIA RTX 5000 Ada Generation
NVIDIA RTX 5000 Ada 32 GB - single-slot Ada Lovelace GPU optimised for compact workstations, AI inferencing and real-time graphics.
NVIDIA RTX A6000
NVIDIA RTX A6000 48 GB - flagship Ampere workstation GPU powering complex VFX, digital twins and AI pipelines.
NVIDIA A30
NVIDIA A30 24 GB Ampere GPU designed for mainstream enterprise AI inference and mixed workloads.
NVIDIA A40
NVIDIA A40 48 GB workstation GPU balancing real-time ray-tracing, AI acceleration and professional graphics rendering.
NVIDIA RTX 4000 SFF Ada Generation
The NVIDIA RTX™ 4000 SFF Ada Generation packs a punch with the features, capabilities, and performance demanded by professionals—all in a compact GPU design.
NVIDIA RTX 3090 Ti
GeForce RTX 3090 Ti 24 GB - high-end graphics card for gaming, creative rendering and AI-assisted workflows.
NVIDIA RTX 4060 Ti
GeForce RTX 4060 Ti 8 GB - efficient GPU for 1440p gaming, DLSS 3 frame generation and lightweight AI projects.
NVIDIA A16
NVIDIA A16 16 GB multi-instance GPU tailored for virtual desktops, AI inference and mixed graphics workloads in the cloud.
NVIDIA RTX PRO 6000 MIG 24GB
NVIDIA RTX PRO 6000 Blackwell Server Edition MIG 1g.24gb slice - a partitioned instance providing 24 GB of memory and ~1/4 of the full GPU compute for isolated AI inference and lightweight training workloads.
NVIDIA RTX 4070 Ti
GeForce RTX 4070 Ti 12 GB - higher-clocked Ada GPU ideal for 4K esports, Blender rendering and local generative-AI tasks.
NVIDIA RTX 4080
GeForce RTX 4080 16 GB - high-end Ada Lovelace GPU delivering excellent 4K ray-tracing performance and generative-AI acceleration.
NVIDIA RTX PRO 5000
NVIDIA RTX PRO 5000 Blackwell 48 GB GDDR7 ECC - workstation GPU for advanced visualization, simulation and AI development.
NVIDIA RTX PRO 6000
NVIDIA RTX PRO 6000 96 GB - universal AI and visual computing performance for the data center.
NVIDIA L40S
NVIDIA L40S 48 GB Ada GPU, tuned for faster generative-AI inference and graphics workloads in datacenter environments.
NVIDIA RTX PRO 4000
NVIDIA RTX PRO 4000 Blackwell 24 GB GDDR7 ECC - workstation GPU for professional content creation and AI workloads.
NVIDIA A100 80GB
NVIDIA A100 80 GB Tensor Core GPU - the proven workhorse for deep-learning training, data analytics and accelerated HPC.
NVIDIA Quadro RTX 8000
NVIDIA Quadro RTX 8000 48 GB - ultimate AI and visual computing performance for the data center.
NVIDIA GTX 1650 Super
GeForce GTX 1650 Super 4 GB - refreshed Turing entry-level card with faster memory for 1080p gaming.
NVIDIA GTX 1660
GeForce GTX 1660 6 GB - mid-range Turing graphics card for 1080p gaming and creative workloads.
NVIDIA RTX 6000
NVIDIA RTX 6000 24 GB
NVIDIA RTX 4070
GeForce RTX 4070 12 GB Ada - efficient GPU for 1440p gaming, DLSS 3 frame generation and lightweight AI projects.
NVIDIA RTX 5070
Rumoured GeForce RTX 5070 12 GB Blackwell - expected mid-range card targeting efficient 1440p gaming and entry AI workloads.
NVIDIA RTX PRO 4500
NVIDIA RTX PRO 4500 Blackwell 32GB GDDR7 with error-correcting code (ECC)
NVIDIA RTX 6000 Ada Generation
NVIDIA RTX 6000 Ada 48 GB - top-tier Ada workstation card delivering 91 TFLOPS for advanced visualization and AI training.
NVIDIA RTX 2060 Super
GeForce RTX 2060 Super 8 GB - refreshed Turing model with more VRAM for 1440p gaming and AI experimentation.
NVIDIA L40
NVIDIA L40 48 GB Ada Lovelace GPU delivering advanced ray-tracing and AI performance for enterprise visualization and generative content.
NVIDIA RTX 3050
GeForce RTX 3050 8 GB - mid-range graphics card for 1440p gaming, creative rendering and AI-assisted workflows.
NVIDIA RTX PRO 6000 MIG 48GB
NVIDIA RTX PRO 6000 Blackwell Server Edition MIG 2g.48gb slice - a partitioned instance providing 48 GB of memory and ~1/2 of the full GPU compute for isolated AI inference and medium training workloads.
Intel Gaudi 2
Intel Gaudi 2 AI accelerator - High performance acceleration for GenAI and LLMs.
NVIDIA A10
NVIDIA A10 24 GB.
NVIDIA A100 40GB
NVIDIA A100 40 GB offers affordable access to Ampere Tensor Cores for research AI training, inference and scientific computing.
NVIDIA H100
Flagship NVIDIA H100 Tensor Core GPU with 80 GB HBM3 for exascale HPC, large-language-model training and low-latency AI inference.
NVIDIA Titan RTX
NVIDIA Titan RTX 24 GB - flagship Turing prosumer card with Tensor Cores for AI research, 8K video editing and rendering.
AMD Instinct MI300X
AMD Instinct MI300X 192 GB HBM3 - CDNA 3 accelerator targeting large-language-model training and high-performance computing.
RTX PRO 6000 CC 96GB
NVIDIA RTX PRO 6000 CC 96GB GPU from Verda
NVIDIA H200
NVIDIA H200 Tensor Core GPU with 141 GB HBM3e, optimized for generative AI, transformer models and data-intensive HPC simulations.
AMD Instinct MI325X
AMD Instinctâ„¢ MI325X accelerators are designed to deliver leadership performance for Generative AI workloads and HPC applications.
NVIDIA H100 NVL
NVIDIA H100 NVL dual-GPU solution offering 2x94 GB HBM3 for ultra-large LLM inference and memory-bound AI workloads.
NVIDIA GH200
NVIDIA GH200 96 GB - high-performance GPU for large-scale AI training and scientific computing.
NVIDIA Tesla V100 SXM2 32GB
NVIDIA Tesla V100 SXM2 32GB - legacy datacenter accelerator still popular for CUDA development, scientific compute and AI research.
AMD Instinct MI355X
Built on the 4th Gen AMD CDNAâ„¢ architecture, AMD Instinctâ„¢ MI355X GPUs deliver leadership AI and HPC performance.
NVIDIA B200
Next-generation NVIDIA Blackwell B200 with 180 GB HBM3e delivers breakthrough performance for frontier AI training and scientific computing.
NVIDIA H200 NVL
Dual-board NVIDIA H200 NVL (2x141 GB) accelerator designed for trillion-parameter model inference and massive memory bandwidth.
NVIDIA B300
Next-generation NVIDIA Blackwell B300 with 288 GB HBM3e delivers breakthrough performance for AI training and scientific computing.
NVIDIA GB300
NVIDIA GB300 Grace Blackwell Ultra Superchip pairs an Arm-based Grace CPU with a B300 GPU featuring 288 GB HBM3e, designed for trillion-parameter AI training and reasoning workloads.
Quadro P5000
GPU offering from Vast.ai: Quadro P5000