In today’s technology landscape, we can tap into research and statistics, pulling in data feeds for analysis and drawing insights to make decisions in real time. However, new information can be hard to parse and contextualize, even for the most robust analytics solutions. This is where retrieval-augmented generation (RAG) is useful, allowing you to augment the knowledge of a large language model without retraining it when new information is available. This updates your model with more recent data, making it more capable, with minimal effort.
Oracle Cloud Infrastructure (OCI) Generative AI Agents allows you to do just that. In this example, we’ll upload our documents, process this data, put it into a vector store (via OCI Search with OpenSearch), create a Redis cluster for caching purposes, and provide you with a way to consume the data through a chatbot.
For the infrastructure, we’ll have the following OCI services present: