
The attention mask is not set and cannot be inferred from input because pad token is same as eos token. As a consequence, you may observe unexpected behavior. Please pass your input's attention_mask to obtain reliable results. Traceback (most recent call last): #2

Open
yukePeng0815 opened this issue Dec 10, 2024 · 3 comments

Comments

@yukePeng0815

[image] I downloaded your pre-trained model directly, but running the test code reports the error above.

@taishan1994
Owner

This shouldn't be a real error; check the screenshot further down.
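
The warning in the issue title arises because, when the pad token id equals the EOS token id, the library cannot tell trailing padding apart from a genuine end-of-sequence token, so the attention mask cannot be inferred safely. A minimal pure-Python sketch of that ambiguity (the helper name is illustrative, not a transformers API):

```python
def infer_attention_mask(input_ids, pad_id):
    """Mark trailing pad tokens as 0, everything else as 1.
    This is ambiguous when pad_id == eos_id: a genuine trailing EOS
    token is indistinguishable from padding and gets masked out too."""
    mask = [1] * len(input_ids)
    i = len(input_ids) - 1
    while i >= 0 and input_ids[i] == pad_id:
        mask[i] = 0
        i -= 1
    return mask

# Distinct pad (0) and eos (2) ids: the mask is unambiguous.
print(infer_attention_mask([5, 6, 2, 0, 0], pad_id=0))  # [1, 1, 1, 0, 0]
# pad_id == eos_id == 2: the real EOS at index 2 is wrongly masked out.
print(infer_attention_mask([5, 6, 2, 2, 2], pad_id=2))  # [1, 1, 0, 0, 0]
```

This is why the warning asks you to pass `attention_mask` explicitly (e.g. the one returned by the tokenizer) rather than letting it be guessed.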

@yukePeng0815
Author

Thanks, solved. With device = "cuda:0", adding these two lines fixed it:
torch.backends.cuda.enable_mem_efficient_sdp(False)
torch.backends.cuda.enable_flash_sdp(False)
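
For reference, the reporter's workaround assembled into one snippet (a sketch, assuming PyTorch 2.x, where these scaled-dot-product-attention backend toggles exist):

```python
import torch

device = "cuda:0"

# Disable the memory-efficient and FlashAttention SDPA kernels;
# attention then falls back to the math implementation, which avoids
# the crash the reporter hit on this setup.
torch.backends.cuda.enable_mem_efficient_sdp(False)
torch.backends.cuda.enable_flash_sdp(False)
```

Note this only works around the kernel-related crash; it is unrelated to the attention-mask warning itself, which is silenced by passing the tokenizer's `attention_mask` to `generate`.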

@yukePeng0815
Author

Hello, I'd like to ask: if I use a large language model to extract domain-specific triples (information extraction), what is the minimum amount of data needed for fine-tuning? And if fine-tuning isn't an option, can prompting alone achieve good extraction results?
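
If prompting alone is used, the model's free-text output still has to be parsed into structured triples. A minimal sketch, assuming a hypothetical output format of one "(subject, relation, object)" tuple per line (the format and names are illustrative, not from this repository):

```python
import re

# Matches "(subject, relation, object)"; fields may not contain
# commas or parentheses in this simplified format.
TRIPLE_RE = re.compile(r"\(([^,()]+),\s*([^,()]+),\s*([^,()]+)\)")

def parse_triples(text):
    """Extract (subject, relation, object) triples from model output."""
    return [tuple(part.strip() for part in m.groups())
            for m in TRIPLE_RE.finditer(text)]

output = "(Marie Curie, won, Nobel Prize)\n(Marie Curie, born_in, Warsaw)"
print(parse_triples(output))
# [('Marie Curie', 'won', 'Nobel Prize'), ('Marie Curie', 'born_in', 'Warsaw')]
```

In practice the prompt would show a few in-domain examples in exactly this format, so the parser can rely on it; lines that do not match are simply ignored.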
