GithubHelp home page GithubHelp logo

redactkitdoc's Introduction

RedactKit Documentation

Overview

RedactKit is a CLI tool to redact and un-redact sensitive data from multiple log files on Government Standard Image Build (GSIB) devices.

What is it about?

GovTech GIG, as the Infrastructure Engineer Capability Center, provides functional leadership to WOG. As part of this initiative, there are tools and processes that Agencies can leverage for their technical operations work. This tool addresses the common issue GIG has faced during the Co-sourcing model with vendors and product principals.

Scenario:

  • When we seek support from product principals, there may be instances where we will need to send logs with sensitive internal IP addresses, URLs, email addresses, SOE IDs etc. to them.
    • Engineers will then need to manually eyeball and redact such data which could be time-consuming and prone to errors.
  • This tool enables engineers to automate this process and save time, thereby, reducing operation overheads and errors.

Why use a tool?

  • To redact sensitive data like internal IP addresses, emails, domain names, hostnames and SOE-IDs before sending them to product principles for troubleshooting.
  • Sure, you can use sed and grep to redact sensitive data. But the original data is lost.
  • RedactKit CLI tokenizes the sensitive data for later un-redaction if you need to deep dive into certain parts of the log file during troubleshooting.

Features

demo

A python-based command line tool that helps you automate the redaction of common sensitive data from the log files. The tool can be used on GSIB via Powershell. Engineers can redact / un-redact sensitive log data using the tool.

The core redaction engine redacts the following list of data types from your log files. (Extensible to other types of data based on user-defined regular expressions). ๐Ÿ“„ โœ๏ธ

  • SG NRIC ๐Ÿ†” (M Series not included yet)
  • Credit cards ๐Ÿง
  • URLs ๐ŸŒ
  • Emails โœ‰๏ธ
  • Ipv4 ๐Ÿ“Ÿ
  • Ipv6 ๐Ÿ“Ÿ
  • Base64 ๐Ÿ…ฑ๏ธ
  • SOE-ID ๐Ÿ†”

Benefits

Saves time โณ. Focus on what matters. Here is a sample redaction run on a log file with over 10k lines. If an engineer were to manually go through this it could take about ~6 hours.

[+] Redacted 10072 targets...
[+] Redacted results saved to ./redacted_test.txt
[+] Estimated total words : 29052
[+] Estimated total minutes saved : 388
[+] Estimated total man hours saved : 6

Getting Started

The tool is available on software center as a Government Standard Software Package (GSSP) immediately. (GSSP_Python310 RedactKit_0.1.2)

Agency IT reps can opt to list it in WOG App Library for their respective agency's use.

Team profile and other details

  • Original Ideation by Benjamin Quek
    • Senior Infrastructure Engineer - LinkedIn
  • Improved and expanded more features by Oaker Min

The RedactKit CLI also has an upstream open source version on GitHub. You can get involved here: PyRedactKit

redactkitdoc's People

Contributors

brootware avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.