GithubHelp home page GithubHelp logo

clue-ai / promptclue Goto Github PK

View Code? Open in Web Editor NEW
651.0 651.0 68.0 16.29 MB

PromptCLUE, 全中文任务支持零样本学习模型

Home Page: https://www.clueai.cn

License: Other

Jupyter Notebook 100.00%
bert chinese few-shot-learning gpt-3 multitask-learning pretrained-models prompt-tuning roberta t5-model transfer-learning zero-shot-learning

promptclue's People

Contributors

brightmart avatar joytianya avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

promptclue's Issues

针对t5-large模型的训练问题

大神们好。我看该项目说是“在t5-large版基础上,使用数百G中文语料,训练了100万步,累积训练了1.5万亿个中文字词级别token

我想问下,这里是采用t5-large模型作为预训练模型,在中文数据上进行微调训练的嘛?

ClueAI/PromptCLUE-base-v1-5 分类任务

请问调用本地ClueAI/PromptCLUE-base-v1-5进行分类任务,如何实现clueai API进行分类的输出效果呢?(给出prediction、每个label的Confidence)

fine-tune out of memory

请问使用这个参数fine-tune,显存占用大概多少?我这边16G的v100显示out of memory

image

请问如果单卡fine-tune显存不够用,有多卡的fine-tune代码吗

使用 pCLUE-main 项目里面的datasets里面的120万数据训练后,效果不佳

使用pCLUE-main项目里的数据训练后, 加载模型 , 同样的代码 , 使用本地训练的和示例代码出来的结果完全不同 , 请问需要怎么改进
示例代码
tokenizer = T5Tokenizer.from_pretrained("ClueAI/PromptCLUE")
model = T5ForConditionalGeneration.from_pretrained("ClueAI/PromptCLUE")
print(answer('''信息抽取:
今天我向大家介绍一下一个人。他是张丰毅1956年9月1日出生于河南省南阳市唐河县,1982年毕业于北京电影学院,是**电影协会理事。1993年,与其他演员主演电影《霸王别姬》
问题:主角,嘉宾,演员,改编自,面积,出生地,学校,成员,出生时间
答案:''',sample=False))

输出结果:
地址:河南省南阳市唐河县
组织:北京电影学院,**电影协会
名字:张丰毅
职位:理事

本地模型
tokenizer = T5Tokenizer.from_pretrained("ClueAI/PromptCLUE")
model = T5ForConditionalGeneration.from_pretrained("outputs/model_files/")
#或者以下方式引用
tokenizer = AutoTokenizer.from_pretrained("ClueAI/PromptCLUE")
model = AutoModelForSeq2SeqLM.from_pretrained("outputs/model_files/")
print(answer('''信息抽取:
今天我向大家介绍一下一个人。他是张丰毅1956年9月1日出生于河南省南阳市唐河县,1982年毕业于北京电影学院,是**电影协会理事。1993年,与其他演员主演电影《霸王别姬》
问题:主角,嘉宾,演员,改编自,面积,出生地,学校,成员,出生时间
答案:''',sample=False))
输出结果:
演员:张丰毅1956年9月1日出生于河南省南阳市唐河县

基于大模型的优化

不知道官方是否有基于LLM+NLP任务prompt数据的模型计划~~~
因为现在的模型效果确实不太好

能否开源各个任务的数据格式(demo)?

感谢能开源如此好用的模型,
直接索要数据这肯定是不可能的,但能否放出相应的数据格式提供用户拿自己数据进行FineTune参考?
比如v1.5新增的改写、纠错任务,thx

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.