Foundational models can be consumed on demand, where you pay per character based on the length of the prompt and the response from the model (except for the embedding models, where the response from the model isn’t accounted for). In the table below, a transaction = a character and 10,000 transactions = 10,000 characters.
Additionally, you can host private replicas of foundational models and create fine-tuned models on dedicated AI clusters. Dedicated AI clusters come in two types: hosting and fine-tuning. You create a hosting cluster by assigning AI units to it based on the model you want to host and the expected call volume to the model. Fine-tuning clusters require two AI units of the specific model you want to fine-tune. Once you create a fine-tuned model in a fine-tuning cluster, you can host it on your hosting cluster.
Dedicated AI clusters require a minimum commitment of 744 unit-hours (per cluster) for hosting models. Fine-tuning clusters require a minimum of 1 unit-hour.
Product |
Comparison Price (/vCPU) * |
Unit price |
Unit |
Oracle Cloud Infrastructure Generative AI - Large Cohere - 10,000 Transactions |
|||
Oracle Cloud Infrastructure Generative AI - Small Cohere - 10,000 Transactions |
|||
Oracle Cloud Infrastructure Generative AI - Embed Cohere - 10,000 Transactions |
|||
Oracle Cloud Infrastructure Generative AI - Large Meta |
10,000 Transactions |
||
Oracle Cloud Infrastructure Generative AI - Meta Llama 3.1 405B |
10,000 Transactions |
||
Oracle Cloud Infrastructure Generative AI - Large Cohere - Dedicated - AI Unit Per Hour |
|||
Oracle Cloud Infrastructure Generative AI - Small Cohere - Dedicated - AI Unit Per Hour |
|||
Oracle Cloud Infrastructure Generative AI - Embed Cohere - Dedicated - AI Unit Per Hour |
|||
Oracle Cloud Infrastructure Generative AI- Large Meta - Dedicated |
AI Unit Per Hour |
Service |
Comparison Price (/vCPU) * |
Unit price |
Unit |
Oracle Cloud Infrastructure Generative AI Agents - Retrieval-Augmented Generation (RAG) |
10,000 Transactions |
||
Oracle Cloud Infrastructure Generative AI Agents - Knowledge Base Storage |
Gigabyte Storage Per Hour |
||
Oracle Cloud Infrastructure Generative AI Agents - Data Ingestion |
10,000 Transactions |