Accelerate AI Anywhere with Oracle Cloud and NVIDIA

The Oracle and NVIDIA partnership accelerates AI innovation, helping you develop, customize, and deploy advanced AI, including agentic AI, anywhere with cutting-edge AI infrastructure. Maximize performance, scale, and cost efficiency with the broadest set of deployment options—only with Oracle Cloud Infrastructure (OCI) and NVIDIA.

NVIDIA AI Enterprise is available on OCI

NVIDIA AI Enterprise on OCI Marketplace

  • Data preparation

    Accelerate data science workflows using NVIDIA RAPIDS Accelerator on OCI Data Flow.

  • Model training and customization

    Build and customize generative AI models using NVIDIA DGX™ Cloud on OCI.

  • Inference (RAG)

    Improve the accuracy and relevance of LLM outputs by building retrieval-augmented generation (RAG) pipelines using NVIDIA NeMo Retriever, Oracle Database 23ai, and Oracle Cloud Infrastructure Kubernetes Engine (OKE).

  • Inference

    Accelerate and scale model deployment with NVIDIA NIM microservices and OKE.

Now generally available: The largest, fastest AI supercomputer in the cloud

We’re excited to announce the general availability of Oracle Cloud Infrastructure Supercluster with NVIDIA H200 Tensor Core GPUs.

Enabling customers to streamline AI innovation

  • Simplified operations for agentic AI

    Deliver a full-stack AI platform that powers agentic AI—intelligent systems that autonomously perceive, reason, and act. With NVIDIA AI Enterprise available natively through the OCI Console, enterprises can quickly and easily access more than 160 AI tools for training and inference while getting direct billing and customer support.

    For rapid AI inference, NVIDIA NIM inference endpoints in OCI Marketplace offer a scalable, low-complexity solution for deploying AI-powered assistants, copilots, and real-time applications.

  • Industry-leading AI infrastructure

    Stay at the forefront of generative AI innovation with OCI Superclusters that deliver up to 260 exaFLOPS of performance with Hopper GPUs and up to 2.4 zettaFLOPS with Blackwell GPUs, so you can train trillion-parameter models faster and deploy them at scale.

    Take advantage of OCI Compute with bare metal instances and no virtualization overhead, ultrafast RDMA cluster networking, petabyte-scale file storage, and orchestration tools such as OCI Kubernetes Engine to accelerate AI workloads at any scale.

  • Powering AI anywhere

    Optimize inferencing and run AI anywhere with NVIDIA technologies on OCI’s distributed cloud. Deploy NVIDIA L4 GPUs on an edge appliance and scale up to the largest supercomputing infrastructure in the public cloud, or expand AI infrastructure in your data center.

    Governments and regulated industries can choose flexible deployment models to address strict data sovereignty and compliance needs. NVIDIA AI Enterprise on OCI accelerates the development and deployment of production-ready AI and is available anywhere in OCI’s distributed cloud.
