Cutlass_EX

0. Introduction

  • Goal: develop 4-bit primitive kernels using CUTLASS

1. Example List

example_1) custom code with CUTLASS

example_2) cutlass::uint4b_t
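
As a rough illustration of what a 4-bit unsigned type implies for storage, the sketch below packs two 4-bit values into each byte using plain C++. It does not use CUTLASS. `PackedUint4Array` is a hypothetical helper written for this note, and the nibble order (even indices in the low nibble) is an assumption of the sketch, not a statement about how `cutlass::uint4b_t` is laid out.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical helper (not part of CUTLASS): stores unsigned 4-bit values,
// two per byte. Assumption: even indices use the low nibble, odd the high.
class PackedUint4Array {
public:
    explicit PackedUint4Array(std::size_t count)
        : count_(count), bytes_((count + 1) / 2, 0) {}

    // Write a value in [0, 15] without disturbing the neighbouring nibble.
    void set(std::size_t index, std::uint8_t value) {
        assert(index < count_);
        assert(value < 16);  // only 4 bits are available per element
        std::uint8_t& byte = bytes_[index / 2];
        if (index % 2 == 0) {
            byte = static_cast<std::uint8_t>((byte & 0xF0) | value);
        } else {
            byte = static_cast<std::uint8_t>((byte & 0x0F) | (value << 4));
        }
    }

    // Read back the 4-bit value stored at the given element index.
    std::uint8_t get(std::size_t index) const {
        assert(index < count_);
        const std::uint8_t byte = bytes_[index / 2];
        return static_cast<std::uint8_t>(index % 2 == 0 ? (byte & 0x0F)
                                                        : (byte >> 4));
    }

    std::size_t size() const { return count_; }
    std::size_t size_in_bytes() const { return bytes_.size(); }

private:
    std::size_t count_;
    std::vector<std::uint8_t> bytes_;
};
```

Three elements fit in two bytes here, which is the memory saving that motivates 4-bit kernels in the first place.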

example_3) single-precision gemm template

  • 00_basic_gemm
  • This kernel computes the general matrix product (GEMM) using single-precision floating-point arithmetic and assumes all matrices have column-major layout.
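
For orientation, here is a minimal host-side sketch of the computation described above, C = alpha * A * B + beta * C with column-major storage. It is a plain C++ reference loop, not the CUTLASS kernel. The function name `reference_sgemm` and its parameter list are assumptions made for this illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical host reference (not CUTLASS code):
//   C = alpha * A * B + beta * C
// All matrices are column-major, so element (row, col) of a matrix with
// leading dimension ld lives at index row + col * ld.
// A is M x K (lda), B is K x N (ldb), C is M x N (ldc).
void reference_sgemm(int M, int N, int K, float alpha,
                     const std::vector<float>& A, int lda,
                     const std::vector<float>& B, int ldb,
                     float beta, std::vector<float>& C, int ldc) {
    assert(lda >= M && ldb >= K && ldc >= M);
    for (int j = 0; j < N; ++j) {
        for (int i = 0; i < M; ++i) {
            float acc = 0.0f;
            for (int k = 0; k < K; ++k) {
                acc += A[static_cast<std::size_t>(i + k * lda)] *
                       B[static_cast<std::size_t>(k + j * ldb)];
            }
            float& c = C[static_cast<std::size_t>(i + j * ldc)];
            c = alpha * acc + beta * c;
        }
    }
}
```

A loop like this is useful mainly as a correctness check: run it on small inputs and compare against the GPU result.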

example_4) mixed-precision gemm template with cutlass utilities

  • 01_cutlass_utilities
  • These utilities are supporting components for managing tensor and matrix memory allocations, initializing and comparing results, and computing reference output.

example_5) CUTLASS debugging tool

  • 02_dump_reg_shmem
  • Demonstrates the CUTLASS debugging tool for dumping fragments and shared memory.
  • Dumping: recording the state of memory at a specific point in time.

example_6) CUTLASS layout visualization example

example_7) CUTLASS example to compute a batched strided gemm in two different ways

  • 05_batched_gemm
  • Strided batched GEMM: the batch is described by pointers to the first matrices of the batch and the stride between consecutive matrices.
  • Array GEMM: the batch is described by an array of pointers to every matrix in the batch, which is copied to device memory.
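
The difference between the two addressing schemes can be sketched on the host in plain C++. This is not the CUTLASS example itself: `gemm_one`, `batched_gemm_strided`, and `batched_gemm_array` are hypothetical names, everything runs on the CPU, and column-major layout is assumed.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical single GEMM on column-major data: C = alpha * A * B + beta * C.
void gemm_one(int M, int N, int K, float alpha,
              const float* A, int lda, const float* B, int ldb,
              float beta, float* C, int ldc) {
    for (int j = 0; j < N; ++j) {
        for (int i = 0; i < M; ++i) {
            float acc = 0.0f;
            for (int k = 0; k < K; ++k) {
                acc += A[i + k * lda] * B[k + j * ldb];
            }
            C[i + j * ldc] = alpha * acc + beta * C[i + j * ldc];
        }
    }
}

// Strided form: one base pointer per operand plus a fixed element stride
// between consecutive matrices of the batch.
void batched_gemm_strided(int M, int N, int K, int batch, float alpha,
                          const float* A, int lda, std::size_t strideA,
                          const float* B, int ldb, std::size_t strideB,
                          float beta, float* C, int ldc, std::size_t strideC) {
    for (int b = 0; b < batch; ++b) {
        gemm_one(M, N, K, alpha, A + b * strideA, lda, B + b * strideB, ldb,
                 beta, C + b * strideC, ldc);
    }
}

// Array form: one pointer per matrix, so the matrices may live anywhere.
void batched_gemm_array(int M, int N, int K, int batch, float alpha,
                        const float* const* A, int lda,
                        const float* const* B, int ldb,
                        float beta, float* const* C, int ldc) {
    for (int b = 0; b < batch; ++b) {
        gemm_one(M, N, K, alpha, A[b], lda, B[b], ldb, beta, C[b], ldc);
    }
}
```

The strided form needs only a few scalars per operand, while the array form trades that compactness for the freedom to place each matrix at an arbitrary address.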

example_8) CUTLASS Turing GEMM using Tensor Cores

example_9) CUTLASS Turing convolution using Tensor Cores

example_10) CUTLASS Ampere convolution using Tensor Cores

example_11) Handling CUTLASS tensors

example_12) Simple CUTLASS convolution using Tensor Cores

2. Guide

    cd example_{number}
    mkdir build
    cd build
    cmake ..
    make
    ./main

3. Reference

Contributors

yester31
