Comments (11)

feifeibear commented on August 24, 2024

It shouldn't be slower than TF. That said, nobody benchmarks an embedding layer on its own.

from turbotransformers.

hadoop2xu commented on August 24, 2024

> It shouldn't be slower than TF. That said, nobody benchmarks an embedding layer on its own.

How should I configure it to run on the GPU?
if (device_type() == kDLCPU) {
.......
} else if (device_type() == kDLGPU) {
.......
}
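
The snippet above branches on the tensor's device type, which suggests the operator picks its CPU or GPU path from where the tensor lives rather than from a separate flag. A minimal sketch of that pattern follows. `ToyTensor`, `Device`, `lookup_path`, and `copy_to` are hypothetical stand-ins for illustration only, not TurboTransformers' actual `core::Tensor` API, and the real branch bodies remain elided in the original.

```cpp
#include <stdexcept>
#include <string>

// Hypothetical device tag, loosely modeled on the kDLCPU / kDLGPU check above.
enum class Device { kCPU, kGPU };

// Hypothetical tensor that only records where its data lives.
struct ToyTensor {
    Device device;
    Device device_type() const { return device; }
};

// Returns the name of the code path a device-dispatching operator would take.
// Both inputs must live on the same device.
std::string lookup_path(const ToyTensor& ids, const ToyTensor& table) {
    if (ids.device_type() != table.device_type()) {
        throw std::invalid_argument("inputs live on different devices");
    }
    if (ids.device_type() == Device::kCPU) {
        return "cpu";
    }
    return "gpu";
}

// Hypothetical explicit transfer: returns a copy tagged with the target device.
ToyTensor copy_to(const ToyTensor& src, Device target) {
    ToyTensor dst = src;
    dst.device = target;
    return dst;
}
```

Under these assumptions, an operator called on tensors created on the CPU always takes the CPU branch; it takes the GPU branch only after every input has been moved explicitly.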

feifeibear commented on August 24, 2024

Python or C++? For Python, see the example.

hadoop2xu commented on August 24, 2024

> Python or C++? For Python, see the example.

The example uses from_torch. What if there is no pretrained model, and I convert a TF Tensor directly into a core::Tensor and call the low-level operators?

feifeibear commented on August 24, 2024

If you have already moved the CPU model to the GPU and then call the from_torch interface, the computation runs on the GPU automatically.

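hadoop2xu commented on August 24, 2024

> If you have already moved the CPU model to the GPU and then call the from_torch interface, the computation runs on the GPU automatically.

I want to test Turbo's embedding_lookup operator in isolation. I wrote my own interface that calls Turbo's LookupEmbedding function directly, but it runs on the CPU every time, and I could not find an interface in the code for setting device_type when creating a core::Tensor.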

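feifeibear commented on August 24, 2024

First, you need to have all the tensors on the CPU; you can call the Copy function to transfer the data explicitly.
There is really no need to speed up the embedding on its own. Even if you cut its latency to zero, it would not make your BERT noticeably faster.
See Amdahl's law.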
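
For reference, Amdahl's law says that if a fraction p of the runtime is sped up by a factor s, the overall speedup is 1 / ((1 - p) + p / s). A small sketch of the arithmetic follows; the function names are made up for illustration, and the 2% embedding share used in the note below is a hypothetical number, not a measurement of BERT.

```cpp
#include <cassert>
#include <cmath>

// Overall speedup when a fraction `p` (0..1) of the runtime
// is accelerated by a factor `s` (> 0).
double amdahl_speedup(double p, double s) {
    assert(p >= 0.0 && p <= 1.0);
    assert(s > 0.0);
    return 1.0 / ((1.0 - p) + p / s);
}

// Upper bound on the overall speedup as s grows without limit,
// i.e. the accelerated part takes zero time. Requires p < 1.
double amdahl_limit(double p) {
    assert(p >= 0.0 && p < 1.0);
    return 1.0 / (1.0 - p);
}
```

With a hypothetical p = 0.02, `amdahl_limit(0.02)` is about 1.02, so even a zero-latency embedding would gain roughly 2% end to end.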

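hadoop2xu commented on August 24, 2024

> First, you need to have all the tensors on the CPU; you can call the Copy function to transfer the data explicitly.
> There is really no need to speed up the embedding on its own. Even if you cut its latency to zero, it would not make your BERT noticeably faster.
> See Amdahl's law.

Solved. embedding_lookup is indeed not a hotspot. I want to build up some experience by swapping out the embedding operator, and later see whether it can grow into an operator library where implementations can be chosen flexibly. Going forward, I hope to work closer to the hardware and compiler side.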

feifeibear commented on August 24, 2024

TurboTransformers also needs people to contribute to its operator library. For example, operators like conv1d are not implemented yet. Would you like to get involved?

hadoop2xu commented on August 24, 2024

> TurboTransformers also needs people to contribute to its operator library. For example, operators like conv1d are not implemented yet. Would you like to get involved?

Sure. How do I join?

feifeibear commented on August 24, 2024

The contact details are at the bottom of the README. Add me on WeCom (企业微信).
