DLR is a stand-alone, light-weight and portable runtime for CNN and decicion-tree models. Built on top of TVM and Treelite runtime, DLR provides simple and unified Python/C++ APIs for loading and running TVM/Treelite compiled models on a wide range of devices, including X86, TRT-enabled GPU and Arm devices.
For more details about using DLR and SageMaker NEO service, please refer to AWS News Blog
See DEVELOPMENT.md
This library is licensed under the Apache 2.0 License.