Optimized Analytics Package for Spark* Platform (OAP)
Soft Reminder
Maintenance releases for OAP 0.8.x and 1.0.x will continue host here.
OAP 1.1+ development and release have migrated to https://github.com/oap-project
LEGAL
* LEGAL NOTICE: Your use of this software and any required dependent software (the "Software Package") is subject to the terms and conditions of the software license agreements for the Software Package, which may also include notices, disclaimers, or license terms for third party or open source software included in or with the Software Package, and your use indicates your acceptance of all such terms. Please refer to the "TPP.txt" or other similarly-named text file included with the Software Package for additional details.
* Optimized Analytics Package for Spark* Platform is under Apache 2.0 (https://www.apache.org/licenses/LICENSE-2.0).
OAP is a project to optimize Spark by providing optimized implementation of packages for various aspects including cache, shuffle, native SQL engine, Mllib and so on. In this version, OAP contains the optimized implementations of SQL Index and Data Source Cache supporting DRAM and PMem, RDD Cache PMem Extension, Shuffle Remote PMem Extension, Remote Shuffle, Intel MLlib, Unified Arrow Data Source and Native SQL Engine.
Installation Guide
Please follow the link below for the guide to compile and install OAP to your system.
User Guide
Please refer to the corresponding documents below for the introductions on how to use the features.
- SQL Index and Data Source Cache
- RDD Cache PMem Extension
- Shuffle Remote PMem Extension
- Remote Shuffle
- Intel MLlib
- Unified Arrow Data Source
- Native SQL Engine
Developer Guide
Please follow the link below for the guide for developers.