GithubHelp home page GithubHelp logo

Comments (14)

zap8600 avatar zap8600 commented on May 20, 2024 3

Well, I'll test it once I implement it 😄

One thing I find strange is that there isn't a way to say who is speaking? Is it just assumed by the alternating order of conversations?

No need to test it. It does seem to work. My code has been uploaded to my repo, and you can find it here.

from transformers.js.

zap8600 avatar zap8600 commented on May 20, 2024 3

Yay!

from transformers.js.

zap8600 avatar zap8600 commented on May 20, 2024 2

Right, but if you look at the vocabulary of the model, <s> is not one of the tokens.

My bad. I've taken a look at the vocabulary file, and the tokens appear to be __start__ and __end__. I haven't tested this yet, but that is what I can tell.

from transformers.js.

fakerybakery avatar fakerybakery commented on May 20, 2024 1

Thank you so much!!!

from transformers.js.

zap8600 avatar zap8600 commented on May 20, 2024 1

Thank you! I hope my testing code helped some!

from transformers.js.

xenova avatar xenova commented on May 20, 2024

Hi! As stated in a similar issue from 2 days ago (#29) I haven't yet implemented BlenderBot (no listed support in the README or on the demo website). I just uploaded the files since I plan to add it (which, I probably shouldn't have done haha). However, I do plan to add support for it in the next few days!

It will need a way to keep track of conversations, but I will look to HF's implementation for inspiration :)

(The error message you got is because I haven't uploaded the tokenizer yet ;) )

from transformers.js.

zap8600 avatar zap8600 commented on May 20, 2024

Hi! As stated in a similar issue from 2 days ago (#29) I haven't yet implemented BlenderBot (no listed support in the README or on the demo website). I just uploaded the files since I plan to add it (which, I probably shouldn't have done haha). However, I do plan to add support for it in the next few days!

It will need a way to keep track of conversations, but I will look to HF's implementation for inspiration :)

(The error message you got is because I haven't uploaded the tokenizer yet ;) )

I believe that the Blenderbot model keeps track of a conversation using certain tokens to tell when a speaker stops and starts talking, like this:
<s>Hello! How are you?</s> <s>I'm good. How are you?</s> <s>I'm doing pretty good.</s>
From what I can tell, this is implemented by the user. My really bad code (in Python) for this is here. Even though this code is for Blenderbot, I believe the same goes for Blenderbot Small. However, there are way better examples than my code. This is just what I've seen in multiple Spaces on Huggingface. I hope this helps!

from transformers.js.

xenova avatar xenova commented on May 20, 2024

Thanks @zap8600! I've seen similar examples for blenderbot, but the documentation for blenderbot small does not seem to be as clear. I raised an issue (huggingface/transformers#22301), but it looks like it's been hidden already by other issues.

If possible, do you think you could look into how blenderbot small handles it? (Which is apparently different to normal blenderbot)

from transformers.js.

zap8600 avatar zap8600 commented on May 20, 2024

Thanks @zap8600! I've seen similar examples for blenderbot, but the documentation for blenderbot small does not seem to be as clear. I raised an issue (huggingface/transformers#22301), but it looks like it's been hidden already by other issues.

If possible, do you think you could look into how blenderbot small handles it? (Which is apparently different to normal blenderbot)

I've look at the Transformers doc (which is where I based my code on) and it appears to be the same way.

from transformers.js.

xenova avatar xenova commented on May 20, 2024

Right, but if you look at the vocabulary of the model, <s> is not one of the tokens.

from transformers.js.

xenova avatar xenova commented on May 20, 2024

Well, I'll test it once I implement it 😄

One thing I find strange is that there isn't a way to say who is speaking? Is it just assumed by the alternating order of conversations?

from transformers.js.

xenova avatar xenova commented on May 20, 2024

A little late since it wasn't as simple as some of the other models (combined with the fact that the python tokenizer is broken), but it's here now! 🤗 I'll make a release soon, so you'll be able to test it.

from transformers.js.

xenova avatar xenova commented on May 20, 2024

It did! Thanks 🤗

from transformers.js.

zap8600 avatar zap8600 commented on May 20, 2024

Np! I can't wait to use the BlenderBot model!

from transformers.js.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.