AI Cloud Infrastructure
AI Cloud Infrastructure provides scalable cloud-based computing resources, including GPUs, TPUs, and high-performance computing optimized for AI and machine learning workloads. It enables developers and enterprises to train, deploy, and scale AI models without managing underlying hardware.
Showing 12 of 12 services
Banana.dev
Banana.dev was a cloud platform that enabled developers to deploy and scale machine learning models on serverless GPU infrastructure with minimal configuration. It provided a simple API-based interface for running inference workloads, allowing teams to avoid managing their own GPU servers. The service shut down in 2023 as the team pivoted or wound down operations.
Baseten
Baseten is a machine learning infrastructure platform that enables developers and ML engineers to deploy, serve, and scale AI models in production. It provides tools for building model pipelines, creating model-backed applications, and managing inference workloads with support for popular frameworks like PyTorch, TensorFlow, and Hugging Face. Baseten focuses on simplifying the MLOps workflow by offering features such as autoscaling, GPU support, and a Python-native SDK called Truss for packaging and deploying models.
CoreWeave
CoreWeave is a specialized cloud provider focused on GPU-accelerated computing, offering large-scale infrastructure optimized for AI/ML workloads, visual effects rendering, and high-performance computing. The company operates one of the largest fleets of NVIDIA GPUs in the cloud, providing on-demand access to compute resources through Kubernetes-based orchestration. CoreWeave went public on the Nasdaq in March 2025 and serves major AI companies, enterprises, and research institutions requiring massive parallel compute capacity.
Crusoe
Crusoe is an AI cloud infrastructure company that provides purpose-built cloud computing services optimized for AI workloads, including GPU clusters for training and inference. Originally founded as Crusoe Energy Systems, the company pivoted to focus on sustainable AI cloud computing, leveraging stranded and flared natural gas to power data centers, reducing carbon emissions compared to traditional grid-powered facilities. Crusoe offers high-performance computing resources tailored for machine learning, generative AI, and large-scale model training, positioning itself as an environmentally conscious alternative to hyperscale cloud providers.
FluidStack
FluidStack is a cloud GPU infrastructure provider that aggregates underutilized GPU capacity from data centers worldwide to offer on-demand and reserved GPU compute at competitive prices. The platform enables AI companies, researchers, and developers to access large-scale GPU clusters for training and inference workloads, including support for high-performance interconnects like InfiniBand. FluidStack differentiates itself by sourcing capacity from a distributed network of partner data centers, providing cost-effective alternatives to hyperscale cloud providers for AI/ML workloads.
Lambda Labs
Lambda Labs (also known as Lambda) is a cloud computing and hardware company specializing in GPU-based infrastructure for AI and machine learning workloads. The company offers on-demand and reserved GPU cloud instances, as well as on-premise GPU servers and workstations, designed for training and deploying deep learning models. Lambda serves researchers, startups, and enterprises seeking high-performance compute at competitive pricing compared to hyperscale cloud providers.
Modal
Modal is a cloud infrastructure platform that allows developers and data scientists to run code in the cloud without managing servers or infrastructure. It provides a Python-native interface for running serverless functions, training machine learning models, and deploying AI applications with on-demand GPU and CPU compute. Modal handles scaling, containerization, and dependency management automatically, enabling teams to go from local code to production cloud workloads with minimal configuration.
Paperspace
Paperspace is a cloud computing platform specializing in GPU-accelerated virtual machines and machine learning infrastructure, enabling developers and data scientists to build, train, and deploy AI/ML models at scale. It offers products including Gradient, a MLOps platform for running Jupyter notebooks and ML pipelines, and Core, which provides on-demand GPU cloud instances. Paperspace was acquired by DigitalOcean in 2023, integrating its GPU cloud capabilities into DigitalOcean's broader cloud services portfolio.
Replicate
Replicate is a cloud platform that allows developers to run open-source machine learning models via a simple API without requiring deep ML infrastructure expertise. It hosts thousands of community-contributed and official models spanning image generation, language processing, video, and audio tasks. Replicate also enables users to fine-tune models and deploy their own custom models at scale using its managed infrastructure.
Runpod
RunPod is a cloud computing platform that provides on-demand GPU instances for AI, machine learning, and deep learning workloads at competitive prices. The platform offers both serverless GPU computing and dedicated pod deployments, enabling developers and researchers to run inference, fine-tuning, and training jobs without managing infrastructure. RunPod also features a marketplace where GPU owners can rent out their hardware, creating a distributed network of compute resources.
Together AI
Together AI is a cloud platform that enables developers and enterprises to run, fine-tune, and deploy open-source large language models (LLMs) at scale with high performance and cost efficiency. The platform provides access to a wide range of open-source models including LLaMA, Mistral, and others through a unified API, along with tools for custom model fine-tuning and inference optimization. Together AI also conducts AI research and has developed its own inference infrastructure designed to deliver fast and affordable generative AI capabilities.
Vast.ai
Vast.ai is a decentralized cloud GPU marketplace that connects individuals and businesses who need GPU compute resources with hosts who have idle GPU hardware available for rent. The platform allows users to rent GPU instances at significantly lower prices than traditional cloud providers by aggregating consumer and data center GPUs from around the world. Vast.ai supports a wide range of use cases including machine learning training, inference, rendering, and other compute-intensive workloads.