
Accelerator's Projects

bladedisc

BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.

byteps

A high performance and generic framework for distributed DNN training

caffe

Caffe: a fast open framework for deep learning.

caffe-int8-convert-tools

This conversion tool is based on the TensorRT 2.0 Int8 calibration tools, which use the KL algorithm to find a suitable threshold for quantizing activations from Float32 to Int8 (-128 to 127).

convolution_kernel

Accelerating CNN's convolution operation on GPUs by using memory-efficient data access patterns.

cpufp

A CPU tool for benchmarking peak floating-point performance

cuda-samples

Samples for CUDA developers that demonstrate features in the CUDA Toolkit

cutlass

CUDA Templates for Linear Algebra Subroutines

dali

A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

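darknet

Source code analysis of the darknet deep learning framework: detailed Chinese comments covering the framework's principles and an analysis of its implementation syntax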

decuda

Decuda and cudasm, the CUDA binary utilities package. Low-level tools for NVIDIA G80 GPUs.

deepctr

An easy-to-use, modular, and extendible package of deep-learning-based CTR models.

dlrm

An implementation of a deep learning recommendation model (DLRM)

faiss

A library for efficient similarity search and clustering of dense vectors.

fucking-algorithm

Solving algorithm problems is all about patterns, and labuladong is all you need! English version supported! Crack LeetCode, not only how, but also why.

gpgpu-sim_distribution

GPGPU-Sim provides a detailed simulation model of a contemporary GPU running CUDA and/or OpenCL workloads and now includes an integrated (and validated) energy model, GPUWattch.

hugectr

HugeCTR is a high-efficiency GPU framework designed for training Click-Through-Rate (CTR) estimation models

incubator-tvm

Open deep learning compiler stack for CPUs, GPUs, and specialized accelerators

leetcode

LeetCode Solutions: A Record of My Problem Solving Journey.
