Big Data platform

Seamlessly scale and run Apache Spark, Hive, Trino, Flink, and more. Discover the power of easy development and visualization through Data Science notebooks, leveraging familiar open source tools, all at an exceptional price-to-performance ratio.

Discover OCI Big Data platform capabilities

  • Open source upstream services

    Comprehensive portfolio of open source components, such as Hadoop and Spark.

    Explore OCI Big Data

  • Fully managed, autoscaling, and elastic

    Focus on your data and your code; we take care of the rest.

    Explore OCI Data Flow

  • Migrate easily and modernize

    Open source projects are easy to spin up, and we keep you up to date with the latest innovations.

    Explore the guides

  • Integrated natively in OCI

    Connect to other Oracle Cloud Infrastructure (OCI) services with little effort, and expand as your needs grow.

    Explore OCI Data Catalog

  • Enterprise-grade security

    More than 30 compliance certifications help ensure your data is protected.

    Explore OCI compliance

  • Pay as you go

    Pay only for what you need.

    Explore OCI pricing

Machine learning ebook

Data is the raw material for machine learning. Find out how to use machine learning in the cloud with the data you already have.

Migrate to OCI

Big data clusters can easily be migrated to OCI. Discover the guidelines on our migration hub.

Bring all your data together with a data lake

  • Complete, integrated solution

    Deploy a complete, integrated solution, including data management, data integration, and data science, so analytics teams can maximize the value of enterprise data. Customers ingest any data via batch, streaming, or real-time processes and store it in data warehouses or data lakes as needed. Teams then catalog and apply governance to the data so they can use it for analyses, visualizations, and machine learning models. IT teams leverage consistent security policies across data warehouses and data lakes.

  • Easy to manage and operate

    Increase developer productivity with a fully managed, serverless Apache Spark cluster that is accessible via APIs. Each cluster is automatically provisioned, secured, and shut down to reduce developer workloads. Customers can deploy fully managed Hadoop clusters of any size or shape, then add security and high availability with a single click.

  • Deploy in Oracle Cloud data centers or customer data centers

    Deploy Oracle big data services wherever needed to satisfy customer data residency and latency requirements. Big data services, along with all other Oracle Cloud Infrastructure services, can be utilized by customers in the Oracle public cloud, or deployed in customer data centers as part of an Oracle Dedicated Region Cloud@Customer environment.

Smoothly transition to the cloud with OCI Big Data services. Our comprehensive, proven approach supports a hassle-free migration, whether you're using existing data lakes, Spark, Hadoop, Flink, Hive, or other Hadoop components. Migrate to OCI without the need for extensive configuration or integration and with minimal impact on your current environment. Benefit from detailed, step-by-step resources and expert guidance from Oracle's dedicated engineers and partners, ensuring a seamless shift to the cloud.

Move your environment to OCI Big Data with Apache Hadoop

  • Fully managed: We support and manage Apache Hadoop and Spark ecosystems so that you can run and scale your big data workload in the cloud.
  • Easy migration: Migrate your workload to OCI and keep your familiar open source tools at an exceptional price-to-performance ratio.
  • Seamless integration: Get a unified experience for all your data applications in OCI and leverage Oracle Modern Data Platform.

Learn more about our Big Data migration

Oracle data platform unlocks the full potential of your data

  • Combine transactional and analytical data—avoid silos.
  • Leverage Oracle IaaS to Oracle SaaS, or anything in between—select the amount of control desired.
  • Bring any kind of data to the platform—we break the barrier between structured and unstructured data.
  • Explore the power of OCI and its openness to other cloud service providers—we meet you where you are.
  • Use leading Oracle Analytics Cloud reporting or any third-party analytical application—OCI is open.

[Diagram: Oracle data platform overview. It shows data sources, data movement services such as integration services, the core of the Oracle modern data platform, and possible outcome and application development services.]

See how our customers are using Big Data

Explore more customer stories
Experian moves critical data and tools to OCI for a 40% performance lift and a 60% drop in costs.

Big Data services from Oracle

Data motion and integration

Connect and extend analytical applications with real-time consistent transactional data, efficient batch loads, and streaming data.

Big Data and Data Lake

Meet your big data needs with an open source platform.

AI and machine learning

Gain insights from data with prebuilt AI models, or create your own.

Big data architectures on OCI

See all reference architectures
  • Reference Architecture

    Leverage a cloud data lakehouse that combines the abilities of a data lake and a data warehouse to process a broad range of enterprise and streaming data for business analysis and machine learning.

  • Reference Architecture

    Combine, correlate, and analyze your lakehouse data with federated data regardless of location (third-party cloud stores, cloud databases, and on-premises databases) without having to duplicate data.

  • Solution Playbook

    Generate demand forecasts to enable business users in the retail industry to overcome the technical limitations of legacy data analytics solutions that undermine forecasting accuracy.

  • Solution Playbook

    Optimize refractory usage and expose operational data to develop a predictive maintenance system and improve on-site customer warehouse management.

September 19, 2023

Oracle expands cloud services with new open source data management solutions

Carter Shanklin, Senior Director, Product Management

Oracle Cloud Infrastructure (OCI) is excited to announce three new managed open source data management cloud services and three significant enhancements to existing ones. These services offer a wide range of capabilities to help organizations of all sizes enhance their data operations.

Get started with OCI Big Data platform

Try Always Free cloud services and get a 30-day trial

Oracle offers a Free Tier with no time limits on a selection of services, including Autonomous Data Warehouse, OCI Compute, and Oracle Storage products, as well as US$300 in free credits to try additional cloud services. Get the details and sign up for your free account today.

  • What's included with Oracle Cloud Free Tier?

    • Always Free
    • 2 autonomous databases, 20 GB each
    • Compute VMs
    • 100 GB block volume
    • 10 GB object storage

Learn with a hands-on lab

The best way to learn is to try it yourself. Try this free data lake workshop, which demonstrates a typical usage scenario and highlights some of the tools you can use to build a data lake.

  • Access the Data Lake using Autonomous Database and Data Catalog

    The labs in this workshop walk you through the steps you need to access a data lake created with Oracle Object Storage buckets by using Oracle Autonomous Database and OCI Data Catalog.

    Start data lake access lab
  • Get Started with Oracle Big Data Service

    Learn how to create and monitor a highly available Hadoop cluster using Big Data Service and OCI. You’ll also add Oracle Cloud SQL to the cluster, access the utility and master nodes, and learn how to use Cloudera Manager and Hue to access the cluster directly in a web browser.

    Start the data lake lab
  • Learn analytics and machine learning with Red Bull Racing

    Use analytics and machine learning to analyze 70 years of racing data. Find out what makes some races so exciting you can’t look away while others are more predictable.

    Start the data analytics lab
  • Get started with Oracle Cloud Infrastructure Anomaly Detection

    Discover how to use OCI Anomaly Detection to create customized machine learning models. You’ll take data uploaded by users, use a specialized algorithm to train a model, and deploy the model into the cloud environment to detect anomalies.

    Start the anomaly detection lab now
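
The train-then-detect workflow that the anomaly detection lab describes can be illustrated with a minimal sketch. This is not OCI Anomaly Detection and does not use its SDK: it swaps in a simple z-score threshold for the service's specialized algorithm, and the function names, readings, and threshold are all hypothetical.

```python
from statistics import mean, stdev


def train(baseline):
    """Fit a trivial 'model': the mean and sample standard deviation of normal readings."""
    return {"mean": mean(baseline), "stdev": stdev(baseline)}


def detect(model, readings, threshold=3.0):
    """Return the (index, value) pairs whose absolute z-score exceeds the threshold."""
    anomalies = []
    for i, value in enumerate(readings):
        z = (value - model["mean"]) / model["stdev"]
        if abs(z) > threshold:
            anomalies.append((i, value))
    return anomalies


# Hypothetical sensor readings: a clean baseline for training, then new data to score.
baseline = [20.1, 19.8, 20.3, 20.0, 19.9, 20.2, 20.1, 19.7]
model = train(baseline)

new_readings = [20.0, 20.2, 35.5, 19.9]
print(detect(model, new_readings))  # [(2, 35.5)]
```

The sketch assumes the baseline contains only normal readings and is not constant (a standard deviation of zero would make the z-score undefined). A managed service would typically handle multivariate signals and model deployment, which this example leaves out.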

Contact sales

Interested in learning more about a data lake? Let one of our experts help.

  • They can answer questions such as

    • How do I get started with a data lake on Oracle?
    • What can I do with a data lake that I can’t do with a data warehouse?
    • How can my business benefit from a data lake?