Comments (7)

martindevans commented on July 24, 2024

That's correct, it requires the master branch at the moment. We'll probably be releasing a new preview version soon (once #90 and #65 have been reviewed and merged)

from llamasharp.

ogre76 commented on July 24, 2024

Yes, it's running. Great work! Thx

martindevans commented on July 24, 2024

I haven't tried it, but I believe 70B models should be supported on the 0.4.2 version at the moment.

I have now tried it and it doesn't work, sorry about that. Definitely something that needs looking into!

martindevans commented on July 24, 2024

I did some more investigation into this to see what was required. Turns out the model I was testing with before was corrupt!

If you set GroupedQueryAttention = 8 in the model params you can load llama2 70B right now 🥳
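
As a rough sketch of where the 8 comes from, assuming GroupedQueryAttention maps to llama.cpp's n_gqa setting: Llama 2 70B uses grouped-query attention with 64 query heads sharing 8 key/value heads, and the grouping factor is the ratio of the two. The variable names below are illustrative only and are not part of the LLamaSharp API.

```csharp
using System;

// Published attention configuration for Llama 2 70B.
const int queryHeads = 64;
const int keyValueHeads = 8;

// Number of query heads that share a single key/value head.
int groupedQueryAttention = queryHeads / keyValueHeads;

Console.WriteLine(groupedQueryAttention); // prints 8
```

The smaller Llama 2 models (7B and 13B) have as many key/value heads as query heads, so the same ratio works out to 1 for them.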

ogre76 commented on July 24, 2024

Thx!
Which model did you use for testing? I tested it with the TheBloke/Llama-2-70B-Chat-GGML model, but it doesn't work.

var mp = new ModelParams(modelPath, contextSize: 1024, seed: 1337, gpuLayerCount: 128);
mp.GroupedQueryAttention = 8;
var interactiveExecutor = new InteractiveExecutor(new LLamaModel(mp));

martindevans commented on July 24, 2024

I used the q3_K_S version from TheBloke.

I just tested it again. Using the master branch I modified the SaveAndLoadSession demo to load the model like this:

var @params = new ModelParams(modelPath, contextSize: 1024, seed: 1337, gpuLayerCount: 5)
{
    GroupedQueryAttention = 8,
};

InteractiveExecutor ex = new(new LLamaModel(@params));

And it works for me.

KSemenenko commented on July 24, 2024

GroupedQueryAttention = 8 is not available on NuGet yet, right? Not in 0.4.2?
