Creating a Private Endpoint
Create a private endpoint for a custom or pretrained model on a hosting dedicated AI cluster in OCI Generative AI.
Before you create a private endpoint, perform the Prerequisites for Private Endpoints and have the following details:
- The name of the Virtual Cloud Network (VCN)
- The name of the private subnet in the VCN
By default, a tenancy has a maximum limit of 5 private endpoints. For more Generative AI private endpoints, request an increase in the limit, private-endpoint-count for the Generative AI service.
On the Private Endpoints list page, select Create private endpoint. If you need help finding the list page, see Listing Private Endpoints.
General Information
VCN and Subnet
Select the following information:
- VCN compartment
- VCN
- Private subnet compartment
- Private subnet
DNS and Network Security Groups
Use this Endpoint for On-Demand Models
By default, this private endpoint is available for models hosted on dedicated AI clusters. If you want this endpoint to also be available for on-demand models that are offered in Generative AI service, then perform this step: - Select Allow Usage In On-Demand Mode.
- See the Tip at the end of this section on how to reach the on-demand model.
Important
To access a Generative AI model through this private endpoint, see Adding a Model to a Private Endpoint.Create the Endpoint
- (Optional) Select Add tag and assign tags to this private endpoint. See Resource Tags.
- Select Create.
Tip
To use this private endpoint to reach an on-demand model, create a Compute instance in the private subnet allocated for the private endpoint, add your code to the Compute instance, use the FQDN for the private endpoint, and access the model from that Compute instance.Use the generative-ai-private-endpoint create command and required parameters to create a private endpoint:
oci generative-ai generative-ai-private-endpoint create [OPTIONS] [OPTIONS]For a complete list of parameters and values for CLI commands, see the CLI Command Reference.
Note
For pretrained models, instead of an OCID, you can use the model name exactly as listed in the Console's playground. You can also find this OCI model name, in the model's detail page in Offered Pretrained Foundational Models in Generative AI.Run the CreateGenerativeAiPrivateEndpoint operation to create a private endpoint.