
[xpu] support fp16 data pricision #9080

Merged: 34 commits into PaddlePaddle:develop on Jun 28, 2022

Conversation

xiuxin121 (Contributor)

No description provided.

@paddle-bot-old

Thanks for your contribution!

}
}

void XPUStaticKernelPickPass::GetScore(PrecisionType precision,
Collaborator

Should the case where the data type is LOD_TENSOR_ARRAY be considered separately?
When the type is LOD_TENSOR_ARRAY, the precision is kUnk, while the registered kernels declare concrete precisions.
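The mismatch the reviewer describes can be sketched in a minimal, self-contained form. The enum values and the `PrecisionCompatible` helper below are illustrative stand-ins, not Paddle-Lite's actual types: a LOD_TENSOR_ARRAY variable reports kUnk, so a plain equality check against a kernel's concrete precision would never match, and the pick logic would need a special case.

```cpp
#include <cassert>

// Hypothetical sketch (not Paddle-Lite's real types) of the case raised
// above: a LOD_TENSOR_ARRAY variable reports precision kUnk, while
// registered kernels declare a concrete precision, so a naive equality
// match would never accept it.
enum class PrecisionType { kUnk, kFloat, kFP16, kAny };
enum class DataType { kLoDTensor, kLoDTensorArray };

// Returns true when the variable's declared precision should be treated
// as compatible with the kernel's declared precision.
bool PrecisionCompatible(DataType var_type, PrecisionType var_precision,
                         PrecisionType kernel_precision) {
  // LOD_TENSOR_ARRAY carries kUnk, so fall back to trusting the
  // kernel's concrete declaration instead of requiring equality.
  if (var_type == DataType::kLoDTensorArray &&
      var_precision == PrecisionType::kUnk) {
    return true;
  }
  return var_precision == kernel_precision ||
         kernel_precision == PrecisionType::kAny;
}
```

As the follow-up comments note, this case is deferred here because the FP32 path would need the same handling and the relevant op has not been merged yet.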

Contributor Author

Could you give a concrete model as an example?

Contributor Author

The modified pick algorithm currently targets FP16, whereas the LOD_TENSOR_ARRAY case also needs to be handled under FP32. In addition, the op reverse, which uses the LOD_TENSOR_ARRAY data type, has not yet been merged into lite; it will be added in a follow-up, so it is not covered in this change.

size_t score_tmp = 0;
if (kernel.GetInputDeclType(tmp)->precision() == PrecisionType::kAny) {
GetScore(PrecisionType::kAny, &score_tmp);
VLOG(6) << "match input data presion:kAny";
Collaborator

Typo: "presion" should be "precision".

kernel.GetInputDeclType(tmp)->precision() ||
xpu_output_type_[in_names[i]] == PrecisionType::kAny) {
GetScore(xpu_output_type_[in_names[i]], &score_tmp);
VLOG(6) << "match input data presion";
Collaborator

Same as above: "presion" should be "precision".
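The scoring logic under review can be sketched as follows. The weight values and the exact signature are assumptions for illustration, not Paddle-Lite's real implementation; the point is that an exact FP16 match should outscore a kAny wildcard so FP16 kernels win the pick when the data really is FP16.

```cpp
#include <cassert>
#include <cstddef>

// Hedged sketch of the kernel-pick scoring discussed above; the weights
// are illustrative assumptions, not Paddle-Lite's actual values.
enum class PrecisionType { kUnk, kFloat, kFP16, kAny };

// Award a higher score for an exact FP16 match than for a kAny
// wildcard, so FP16 kernels are preferred when the data is FP16.
void GetScore(PrecisionType precision, size_t* score) {
  if (precision == PrecisionType::kFP16) {
    *score += 4;  // exact fp16 match: highest weight
  } else if (precision == PrecisionType::kAny) {
    *score += 1;  // wildcard match: lowest weight
  } else {
    *score += 2;  // other concrete precision match
  }
}
```

A kernel whose input declaration is kAny still accumulates some score (as in the snippet above), but less than one declaring the exact precision of the incoming data.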

@xiuxin121 xiuxin121 requested a review from shentanyue June 21, 2022 06:50
@@ -107,6 +107,9 @@ USE_MIR_PASS(__xpu__bigru_fuse_pass);
USE_MIR_PASS(__xpu__dynamic_lstm_fuse_pass);
USE_MIR_PASS(__xpu__multi_softmax_fuse_pass);
USE_MIR_PASS(__xpu__max_pooling_pad_zero_detect_fuse_pass);
#ifdef LITE_WITH_XPU
Collaborator

__xpu__static_kernel_pick_pass.cc is itself already guarded by this macro, so the extra macro guard should not be needed here.

Contributor Author

OK, will fix it in the next commit.

Collaborator

Suggest removing it.

Contributor Author

OK, done.

@@ -93,6 +93,11 @@ class SSAGraph : GraphBase {

std::string dump();

#ifdef LITE_WITH_XPU
void CopyScope(const Scope *scope) { scope_ = scope; }
Collaborator

Would SetScope be a better name for this function?

Contributor Author

It only copies the scope; it does not set the scope.

Collaborator

The graph can obtain the scope in a similar way; adding a new interface is not recommended. In principle, make the minimal change, for example:

for (auto& any_op_node : graph->StmtTopologicalOrder()) {
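The reviewer's suggestion, recovering the scope from an existing statement node rather than adding a new CopyScope/SetScope accessor to SSAGraph, can be sketched with simplified stand-in types. All structs below are hypothetical placeholders for Paddle-Lite's Scope, SSAGraph, and Node:

```cpp
#include <cassert>
#include <string>
#include <vector>

// Simplified stand-ins for Paddle-Lite's Scope / SSAGraph / Node types,
// used only to illustrate the reviewer's suggestion.
struct Scope { std::string name; };
struct Op { const Scope* scope; };
struct Node { Op stmt; };
struct Graph {
  std::vector<Node> nodes;
  const std::vector<Node>& StmtTopologicalOrder() const { return nodes; }
};

// Grab the scope from the first statement node; all ops built from the
// same program share one scope, so no new graph interface is needed.
const Scope* ScopeFromGraph(const Graph& graph) {
  for (const auto& node : graph.StmtTopologicalOrder()) {
    return node.stmt.scope;  // first statement's scope suffices
  }
  return nullptr;  // empty graph: no scope available
}
```

This keeps the SSAGraph interface unchanged, which matches the "minimal change" principle the reviewer cites.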

Contributor Author

Done.



@@ -47,9 +50,13 @@ std::unique_ptr<RuntimeProgram> Optimizer::Run(Program&& program) {
graph.reset(new mir::SSAGraph);
graph->Build(program, valid_places_, block_idx);
graph->SetValidPlaces(valid_places_);

#ifdef LITE_WITH_XPU
Collaborator

This modification is not recommended.

@hong19860320 (Collaborator) left a comment

LGTM

@zhupengyang zhupengyang merged commit 9f8f7a6 into PaddlePaddle:develop Jun 28, 2022
newway pushed a commit to newway/Paddle-Lite that referenced this pull request Aug 23, 2022
newway added a commit that referenced this pull request Aug 29, 2022
* [XPU] fixed the bug of tile op in large input and add XPU implementation. (#9102)

* [XPU] Fixed the bug of reuse of reshape2's output in xpu_memory_optimize_pass (#9178)

* [x86][XPU] Add the support tensorarray of slice on x86 and xpu (#9134)

* [XPU] Fixed the error on stack op binding on float. (#9204)

* [XPU] Stop supporting xpu conv autotune config with paddlelite c api. (#9316)

* [XPU] Support pre-LN encoder (#9159)

* [xpu] support fp16 data pricision (#9080)

* [xpu] delete kernel.precision()==float (#9189)

* [XPU] support fp16 data pression (#9228)

* [XPU] support fc per channel quant (#9323)

Co-authored-by: wbn <[email protected]>
Co-authored-by: Jinchen Han <[email protected]>
Co-authored-by: TingShenXD <[email protected]>
Co-authored-by: quwei03 <[email protected]>
5 participants