GithubHelp home page GithubHelp logo

toolkits's Introduction

CDP Toolkits

CDP Educational Toolkit

Summary

This toolkit consists of administrative scripts, data, ddl, notebooks, and tutorials used in classes taught by Cloudera Educational Services. This toolkit is intended for educational purposes only.

CDP Upgrade Toolkit

Summary

This toolkit can be used to migrate an existing CDH cluster to CDP. There are multiple directories each of which represent a separate artifact that make up the toolkit. Each individual subdirectory contains a ReadME with directions on execution.

Disclaimer

The CDP Upgrade toolkit is offered as a free utility from Cloudera, is open sourced under the Apache License version 2.0, is not warranted, and does not fall under the purview of Cloudera support. For any questions/issues in implementation, Cloudera recommends you contact your account team and/or engage professional services.

CDP Upgrade Flow

  1. CDH Cluster Inventory
  2. CDP Version Check
  3. Backup Playbooks
  4. CDP Upgrade
  5. Rollback (if necessary)
  6. CDP Upgrade
  7. CDP Configuration Push
  8. CDP Smoke Test

CDH Cluster Inventory

This script will generate an excel file with the following sheets:

  1. Host Information
  2. Cloudera Version Information

A sheet will be created for every cluster that is managed within the CM env specified that contains:

  1. Service Type
  2. Service Name

Please see the ReadME in the cdh-cluster-inventory directory for more information

CDP Version Check

This script should be run prior to the CDP Upgrade to determine if the versions of critical components present in the cluster will pose any risks to the upgrade. The script will compare versions installed against the CDP Support matrix that can be found at: https://supportmatrix.cloudera.com/

This script will generate an excel file with the following sheets:

  1. Status Summary
  2. Incompatible Versions Error Log

Please see the ReadME in the cdp-version-check directory for more information.

Ansible Hostfile Generation

The nodes.py script will generate an ansible formatted hostfile for the cluster given as an input.

Please see the ReadME in the utilities directory for more information.

Backup Playbooks

These playbooks will collect backups of all services and databases prior to a CDP Upgrade. You may have to edit some paths in the playbooks to point to your specific configuration.

Please see the ReadME in the backup-playbooks directory for more information.

CDP In-Place Upgrade

Utilize the Cloudera Manager wizard to complete the CDP Upgrade

Rollback

This set of playbooks and scripts can utilized to rollback a CDP Upgrade back to CDH. The directions to complete a full rollback are detailed in the ReadME file found in the rollback-playbooks directory.

CDP Configuration Push

Utilize the apply_properties.py script and json objects to push CM Configurations for the new services added after the CDP Upgrade as well as configurations for existing key services. Sample JSON templates have been provided for the following services:

  • Ranger
  • Ranger RMS
  • Ranger KMS KTS
  • Atlas
  • Hive
  • Hive on Tez
  • HDFS
  • Kafka
  • CDP Infra SOLR

Please see the ReadMe in the cdp-configuration-push directory for more information.

CDP Smoke Test

This script should be run after the CDP Upgrade to ensure functionality of all services on the newly upgraded CDP Cluster. This script will generate an output displaying the status of each service test.

Please see the ReadMe in the CDP Smoke Test directory for more information.

CDP PvC DS Pre-Req Check

This script should be run prior to a Data Services Installation to verify that all nodes have the necessary packages and utilities installed.

Please see the ReadMe in the data_services-toolkit directory for more information.

CDH-Discovery-Tool

This Discovery Tool is a lightweight automation package can run against a CDH or CDP cluster to produce a "Discovery Bundle" that is useful for CDP migration planning.

Please see the ReadMe in the CDH-Discovery-Tool directory for more information.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.