https://huggingface.co/bigscience/bloomz-7b1-p3#model-summary
More details are in the paper: https://arxiv.org/abs/2211.01786\n
More details are in the paper: https://arxiv.org/abs/2211.01786\n
In short: the fine-tuning dataset is different.
\nLet me know if there's still questions :)
\n","updatedAt":"2023-03-22T10:46:30.945Z","author":{"_id":"5f1eb362eec0ad2a071ad6e2","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/5f1eb362eec0ad2a071ad6e2/IXMYkYKuTwn6kBdWnQeeY.png","fullname":"Niklas Muennighoff","name":"Muennighoff","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":154}},"numEdits":1,"editors":["Muennighoff"],"editorAvatarUrls":["https://cdn-avatars.huggingface.co/v1/production/uploads/5f1eb362eec0ad2a071ad6e2/IXMYkYKuTwn6kBdWnQeeY.png"],"reactions":[],"isReport":false}}],"pinned":false,"locked":false,"collection":"discussions","isPullRequest":false,"isReport":false},"repo":{"name":"bigscience/bloomz-7b1-p3","type":"model"},"activeTab":"discussion","discussionRole":0,"watched":false,"muted":false,"repoDiscussionsLocked":false}">what is the difference between this model and bigscience / bloomz-7b1-mt?
#2
by
muziyongshixin
- opened
https://huggingface.co/bigscience/bloomz-7b1-p3#model-summary
More details are in the paper: https://arxiv.org/abs/2211.01786\n
More details are in the paper: https://arxiv.org/abs/2211.01786\n
In short: the fine-tuning dataset is different.
\nLet me know if there's still questions :)
\n","updatedAt":"2023-03-22T10:46:30.945Z","author":{"_id":"5f1eb362eec0ad2a071ad6e2","avatarUrl":"https://cdn-avatars.huggingface.co/v1/production/uploads/5f1eb362eec0ad2a071ad6e2/IXMYkYKuTwn6kBdWnQeeY.png","fullname":"Niklas Muennighoff","name":"Muennighoff","type":"user","isPro":false,"isHf":false,"isHfAdmin":false,"isMod":false,"followerCount":154}},"numEdits":1,"editors":["Muennighoff"],"editorAvatarUrls":["https://cdn-avatars.huggingface.co/v1/production/uploads/5f1eb362eec0ad2a071ad6e2/IXMYkYKuTwn6kBdWnQeeY.png"],"reactions":[],"isReport":false}}],"pinned":false,"locked":false,"collection":"discussions","isPullRequest":false,"isReport":false},"primaryEmailConfirmed":false,"repo":{"name":"bigscience/bloomz-7b1-p3","type":"model"},"discussionRole":0,"acceptLanguages":["*"],"hideComments":true,"repoDiscussionsLocked":false,"isDiscussionAuthor":false}">I am courious about the difference bewteen bigscience/bloomz-7b1-mt and bigscience/bloomz-7b1-p3, it seems that they have the same summary and parameters.
It's explained in the Table on the model page: https://huggingface.co/bigscience/bloomz-7b1-p3#model-summary
More details are in the paper: https://arxiv.org/abs/2211.01786
In short: the fine-tuning dataset is different.
Let me know if there's still questions :)