GithubHelp home page GithubHelp logo

Comments (3)

lancelet avatar lancelet commented on May 29, 2024 1

OK; PR raised here: #1679

I'm completely new to burn, so please let me know what extra tests, examples, docs, etc. should be added.

I saw a mention in another PR of improving the docs for matmul. Should I put that into this PR as well?

from burn.

lancelet avatar lancelet commented on May 29, 2024

EDIT: I have a WIP branch here: https://github.com/lancelet/burn/tree/matmul-broadcasting
In that branch, I use strides for the non-matrix dimensions (ie. the batch, depth, channel, or whatever) to handle arbitrary broadcasting.

It's getting the correct result for the above example, but it's failing on some autoregressive test cases with an error about an incompatible memory layout. This occurs when I try to flatten / reshape the arrays, in the same way that the original code does. Still investigating.

from burn.

laggui avatar laggui commented on May 29, 2024

Hi @lancelet, thanks for looking into this! 🙂 I'll take a quick look at your PR for this issue specifically.

We do want to improve our documentation regarding the general broadcasting semantics in burn, and also matmul. If you want to take a stab at it you could open another PR!

from burn.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.