
classicvalues / spark-oracle


This project forked from oracle/spark-oracle


On-the-fly translation of Spark programs to run natively on your Oracle DB. Your Spark programs require no changes.

License: Other

Scala 99.32% Shell 0.68%

spark-oracle's Introduction

Spark on Oracle

  • Currently, data lakes comprising Oracle Data Warehouse and Apache Spark have these characteristics:
    • They have separate data catalogs, even if they access the same data in an object store.
    • Applications built entirely on Spark have to compensate for gaps in data management.
    • Applications that federate across Spark and Oracle usually suffer from inefficient data movement.
    • Operating Spark clusters is expensive because they lack administration tooling and have gaps in data management. The price-performance advantages of Spark are therefore overstated.

[Figure: current deployments]

This project fixes those issues:

  • It provides a single catalog: Oracle Data Dictionary.
  • Oracle is responsible for data management, including:
    • Consistency
    • Isolation
    • Security
    • Storage layout
    • Data lifecycle
    • Data in an object store managed by Oracle as external tables
  • It supports the full Spark programming model.
  • Spark on Oracle has these characteristics:
    • Full pushdown on SQL workloads: Query, DML on all tables, DDL for external tables.
    • Pushdown of SQL operations in other workloads.
    • Surface Oracle capabilities like machine learning and streaming in the Spark programming model.
    • Co-processor on Oracle instances to run certain kinds of Scala code. Co-processors are isolated and limited and therefore are easy to manage.
  • It enables simpler, smaller Spark clusters.
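
To make the idea of pushdown concrete, here is a minimal, self-contained sketch in plain Scala. It is not this project's translator and does not use Spark's Catalyst classes: `Plan`, `Scan`, `Filter`, `Project`, and `PushdownSketch` are hypothetical names invented for illustration, and conditions are passed as raw SQL text for brevity. It only shows the general principle that a tree of relational operators can be rendered as a single SQL statement for the database to execute, instead of pulling rows into Spark and processing them there.

```scala
// Toy logical plan. These are illustrative types, NOT Spark's Catalyst
// classes and NOT the classes used by Spark on Oracle.
sealed trait Plan
final case class Scan(table: String) extends Plan
final case class Filter(condition: String, child: Plan) extends Plan
final case class Project(columns: Seq[String], child: Plan) extends Plan

object PushdownSketch {
  // Render the whole plan as one SQL string, nesting each operator
  // as a subquery around its child.
  def toSql(plan: Plan): String = plan match {
    case Scan(table) =>
      s"SELECT * FROM $table"
    case Filter(cond, child) =>
      s"SELECT * FROM (${toSql(child)}) WHERE $cond"
    case Project(cols, child) =>
      s"SELECT ${cols.mkString(", ")} FROM (${toSql(child)})"
  }
}
```

For example, `PushdownSketch.toSql(Project(Seq("name"), Filter("amount > 100", Scan("sales"))))` produces one nested `SELECT` statement covering the scan, the filter, and the projection. A real translator would also have to handle expressions, joins, aggregates, data types, and SQL dialect details, none of which this sketch attempts.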

[Figure: Spark on Oracle]

Feature summary:

See Project Wiki for complete documentation.

Installation

Spark on Oracle can be deployed on any Spark 3.1 or above environment. See the Quick Start Guide.
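
As a rough aid for the version requirement above, the following sketch checks whether a Spark version string meets the 3.1 minimum. `SparkVersionCheck` and `isSupported` are hypothetical helper names, not part of this project. The sketch assumes a conventional `major.minor[.patch][-suffix]` version string, such as the value returned by `spark.version` on a `SparkSession`.

```scala
object SparkVersionCheck {
  // Returns true when `version` (for example "3.1.2") is at least 3.1.
  // Only the major and minor components are inspected; anything that
  // cannot be parsed is treated as unsupported.
  def isSupported(version: String): Boolean = {
    val parts = version.split("\\.").take(2).map(_.takeWhile(_.isDigit))
    parts match {
      case Array(major, minor) if major.nonEmpty && minor.nonEmpty =>
        val (ma, mi) = (major.toInt, minor.toInt)
        ma > 3 || (ma == 3 && mi >= 1)
      case _ => false
    }
  }
}
```

Calling `SparkVersionCheck.isSupported("3.0.3")` returns `false`, while `"3.1.2"` and `"3.2.0-SNAPSHOT"` both return `true`. The Quick Start Guide remains the authoritative source for supported environments.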

Documentation

See the wiki.

Examples

The demo script walks you through the features of the library.

Help

Please file GitHub issues.

Contributing

This project welcomes contributions from the community. Before submitting a pull request, please review our contribution guide.

Security

Please consult the security guide for our responsible security vulnerability disclosure process.

License

Copyright (c) 2022 Oracle and/or its affiliates.

Released under the Universal Permissive License v1.0 as shown at https://oss.oracle.com/licenses/upl/.

Contributors

hbutani
