
[Feature] Enable GPU on M-series Macs #152

Closed
kingzeus opened this issue Mar 19, 2023 · 8 comments

Comments

@kingzeus

Is your feature request related to a problem? Please describe.

Running on CPU under macOS is very slow.
PyTorch supports GPU acceleration on M-series (Apple Silicon) Macs.

Solutions

Availability can be checked with the following function:
torch.backends.mps.is_available()

Change .half().cuda() to .float().to("mps")
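As a hedged sketch of how that check could drive device selection (the helper name `pick_device` is mine, not the repo's; in a real script the flags come from `torch.cuda.is_available()` and `torch.backends.mps.is_available()`):

```python
def pick_device(cuda_available: bool, mps_available: bool) -> str:
    """Prefer CUDA, then Apple's MPS backend, then fall back to CPU."""
    if cuda_available:
        return "cuda"
    if mps_available:
        return "mps"
    return "cpu"

# In a real script:
#   import torch
#   device = pick_device(torch.cuda.is_available(), torch.backends.mps.is_available())
print(pick_device(False, True))  # → mps
```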

Running it returns an error (screenshot omitted).

Is there a way to fix this?

Additional context

No response

@chaucerling

#6 (comment)

int64 is supported on macOS 13.3 Beta, and you should also use the nightly build of pytorch.

I tried to use mps backend to run on gpu, but it seems to have a bug when calling the generate function.

@kingzeus
Author

kingzeus commented Mar 19, 2023

#6 (comment)

int64 is supported on macOS 13.3 Beta, and you should also use the nightly build of pytorch.

I tried to use mps backend to run on gpu, but it seems to have a bug when calling the generate function.

It seems to work!

Steps:

  1. Change .half().cuda() to .float().to("mps")
  2. Comment out modeling_chatglm.py lines 33-37:
# flags required to enable jit fusion kernels
# torch._C._jit_set_profiling_mode(False)
# torch._C._jit_set_profiling_executor(False)
# torch._C._jit_override_can_fuse_on_cpu(True)
# torch._C._jit_override_can_fuse_on_gpu(True)
  3. Change modeling_chatglm.py line 268 to:
 dtype = attention_scores.dtype
  4. Run

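The file edit in step 2 can be scripted. The helper below is my sketch, not part of the repo: it comments out the jit fusion flag lines in the text of a local modeling_chatglm.py; verify the line content against your downloaded copy before patching.

```python
# The flag lines exactly as quoted in step 2 above.
JIT_FLAGS = (
    "torch._C._jit_set_profiling_mode(False)",
    "torch._C._jit_set_profiling_executor(False)",
    "torch._C._jit_override_can_fuse_on_cpu(True)",
    "torch._C._jit_override_can_fuse_on_gpu(True)",
)

def comment_out_jit_flags(source: str) -> str:
    """Prefix each jit fusion flag line with '# ' so the MPS run skips them."""
    patched = []
    for line in source.splitlines():
        if line.strip() in JIT_FLAGS:
            patched.append("# " + line)
        else:
            patched.append(line)
    return "\n".join(patched)

print(comment_out_jit_flags("torch._C._jit_set_profiling_mode(False)"))
# → # torch._C._jit_set_profiling_mode(False)
```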

There are some warnings at the moment, but they don't seem to affect usage.
In CPU mode a response takes roughly 300 s; after the changes above it takes only about 5-8 s. Each additional round of conversation adds roughly 2 GB of memory.

Room for improvement:
The model has to be downloaded locally before modeling_chatglm.py can be edited. Under the current code structure there doesn't seem to be a good way around this.

@imClumsyPanda
Contributor

@kingzeus Hi, could you share the PyTorch and macOS versions of your test environment?

@kingzeus
Author

@kingzeus Hi, could you share the PyTorch and macOS versions of your test environment?

macOS 13.2.1
torch 2.0.0
torchaudio 2.0.0.dev20230313
torchvision 0.15.1

@LeeeSe

LeeeSe commented Mar 21, 2023

@kingzeus How do you get around cpm_kernels' RuntimeError: Unknown platform: darwin?

@kingzeus
Author

@kingzeus How do you get around cpm_kernels' RuntimeError: Unknown platform: darwin?

For now, the simplest workaround is to not call the quantization function, i.e. don't use the int4 model.
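A minimal sketch of that workaround: gate the quantize call on the platform. The helper name is mine; `model.quantize(bits)` follows the usual ChatGLM-6B usage, so treat the exact call as an assumption and check your model's README.

```python
import sys

def maybe_quantize(model, bits: int, platform: str = sys.platform):
    """Quantize the model except on macOS, where cpm_kernels raises
    RuntimeError: Unknown platform: darwin."""
    if platform == "darwin":
        return model  # run the unquantized model instead of int4/int8
    return model.quantize(bits)
```

On a Mac this simply returns the full-precision model, trading memory for compatibility.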

@duzx16
Member

duzx16 commented Mar 23, 2023

(quoting kingzeus's steps and timing results above)

@kingzeus Thanks for the method. We have updated modeling_chatglm.py on the HF hub, so it now runs out of the box. Also, changing .float() to .half() saves memory.
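The rough arithmetic behind that memory saving, assuming ~6.2B parameters for ChatGLM-6B (my illustrative figure, not from this thread): float32 weights take 4 bytes per parameter, float16 only 2.

```python
PARAMS = 6_200_000_000  # approximate ChatGLM-6B parameter count (assumption)

def weight_gib(params: int, bytes_per_param: int) -> float:
    """Memory taken by the weights alone, in GiB."""
    return params * bytes_per_param / 2**30

print(f"float32: {weight_gib(PARAMS, 4):.1f} GiB")  # → float32: 23.1 GiB
print(f"float16: {weight_gib(PARAMS, 2):.1f} GiB")  # → float16: 11.5 GiB
```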

@tedyyu

tedyyu commented Mar 27, 2023

But running python web_demo.py directly hits this error. How can it be avoided?

@kingzeus How do you get around cpm_kernels' RuntimeError: Unknown platform: darwin?

For now, the simplest workaround is to not call the quantization function, i.e. don't use the int4 model.

6 participants