feat: fix mulmat #2

chraac · 2024-09-09T06:22:02Z

Summary

In latest dev-refactoring, the test-backend-ops runs well on ADD operator, but failed on MUL_MAT.
After investigation, we found that the input tensor should be transposed (see: 63dc587).

Here's how we fix it:

Add tensor types INTERMEDIATE and PARAMETER. These types are graph-private and invisible to other graphs.
1. The INTERMEDIATE tensor is used for connecting multiple operations within a single graph.
2. The PARAMETER tensor is used for setting the parameters of certain operators, such as transpose.

Add ggml_qnn_matmul_op_config class, for creating necessary nodes to matmul graph

At function ggml_qnn_matmul_op_config::create_tensors, we will add several nodes, and finally make a graph like this:

 graph TD;
      i1>input_tensor_a] --src0--> mat_mul0;
      i2>input_tensor_b] --src1--> transpose0;
      transpose0 --intermediate0--> mat_mul0;
      mat_mul0 --intermediate1--> transpose1;
      transpose1 --dst0--> o1>output_tensor_c];

Log

Backend 4/4: CPU
  Skipping CPU backend
4/4 backends passed
�[1;32mOK�[0m

test-backend-ops_MUL_MAT.log

Changes

Modify the ggml_qnn_op_config, to manage the rensors, and support multi-op
Add transpose op before mulmat

Self-check

I have read the contributing guidelines
Self-reported review complexity:
- Low
- Medium
- High

next will set the prameter of transpose

* vulkan : do not use tensor->extra This patch allows using the Vulkan backend with the RPC backend as tensor->extra is no longer used. Ref: ggerganov#8536 * Adapt GGML_VULKAN_CHECK_RESULTS to extra removal (#2) --------- Co-authored-by: 0cc4m <[email protected]>

chraac · 2024-10-06T15:28:14Z

ggml/src/ggml-qnn/op-config.cpp

+     * src1 -> | transpose0 | -> intermediate0 -> | mat_mul0 |
+     */
+
+    const auto tensor_rank = get_rank(tensor_inputs, tensor_outputs);


Here we add nodes to the mat_mul graph, after this function ,the graph will be like:

graph TD; i1>input_tensor_a] --src0--> mat_mul0; i2>input_tensor_b] --src1--> transpose0; transpose0 --intermediate0--> mat_mul0; mat_mul0 --intermediate1--> transpose1; transpose1 --dst0--> o1>output_tensor_c];

Loading

https://docs.qualcomm.com/bundle/publicresource/topics/80-63442-50/cpu_backend.html#supported-operations

chraac added enhancement New feature or request wip work in progress labels Sep 9, 2024

chraac self-assigned this Sep 9, 2024

chraac marked this pull request as draft September 9, 2024 06:22

chraac force-pushed the dev-multi-op-in-one-graph branch from c34eecb to d55b067 Compare September 10, 2024 15:28

chraac added 15 commits September 18, 2024 13:02

ggml_qnn_op_config now manager the construction of ggml_qnn_tensor

832c29b

wip

0c428d9

add interface ggml_qnn_op_config

ef763f4

add ggml_qnn_list_op_config

94499ad

add create_tensor and move tensor bind to execute

2f661df

wip

d8caaec

rename: ggml_qnn_list_op_config -> ggml_qnn_matmul_op_config

cff20aa

add tensortype to allow native tensor

bff365a

remove ggml_tensor param at ggml_qnn_tensor::create_tensor

b47929c

postpone the tensor id allocation to add_node

50a88fa

add ggml_qnn_op_config_base

5da526a

trival change to reduct the param of function

6e6bfbe

split bind_tensors into bind_input_tensors and bind_output_tensors

6ade608

implement ggml_qnn_single_op_config::create_tensors

c1bd94c

next will set the prameter of transpose

tensor: add bind buffer

74d5016

chraac force-pushed the dev-multi-op-in-one-graph branch from 14283fa to 74d5016 Compare September 18, 2024 05:31

chraac added 3 commits September 19, 2024 10:11

add parameter tensor type

f53c016

implement add_tensor_param

3b69e71

set qnn_instance only at constructor

ed181a1

chraac force-pushed the dev-multi-op-in-one-graph branch from 7933c79 to ed181a1 Compare September 19, 2024 14:03

chraac added 4 commits September 19, 2024 22:19

set transpose tensor param

8f22c15

move create_op_constructor into op-config module

189325f

create QNN_OP_MAT_MUL from ggml_qnn_matmul_op_config

222f9a1

try fix crash

378d2ba

chraac mentioned this pull request Sep 26, 2024

Refactoring: add helper class to bind qnn tensor -> ggml tensor zhouwg/llama.cpp#2

Open

4 tasks

chraac added 3 commits October 1, 2024 14:53

fix parameter tensor name

2dc0bbd

update tensor dimension assignment and add TODO

07fc1e6

fix mat_mul graph creating

fc8b521

fix MUL_MAT_256x16x10x1_256x1x10x1_16x1x10x1

a6deb22

chraac force-pushed the dev-multi-op-in-one-graph branch from 8e55942 to a6deb22 Compare October 3, 2024 16:27

chraac commented Oct 6, 2024

View reviewed changes

chraac added 15 commits October 11, 2024 10:50

Merge branch 'dev-refactoring' into dev-multi-op-in-one-graph

938075c

append type to graph cache key

82fcd12

wip

b2281a0

fix supported op

d923257

update comment

42a7c41

disable op other than add and mat_mul

4071999

add convert op to adapt multi input/output format

328369a

disable f16 for cpu backend according to official doc

7c798c0

https://docs.qualcomm.com/bundle/publicresource/topics/80-63442-50/cpu_backend.html#supported-operations

add supported data types flags in each backend

4688211

remove unused functions

c560733

append output type to graph key

173371c

Merge branch 'dev-refactoring' into dev-multi-op-in-one-graph

baca4cb

fix gpu backend by disable the different data type op

2547eca

fix cpu backend support ops

657db64

fix duplicated tensor name

327b3db

chraac force-pushed the dev-multi-op-in-one-graph branch from 14f2205 to 327b3db Compare October 28, 2024 03:59

chraac added 3 commits October 28, 2024 12:13

append op name

1c7f136

suppress warning

452197c

remove unused code

8448acd

chraac marked this pull request as ready for review October 28, 2024 04:47

chraac changed the title ~~[WIP] feat: fix mulmat~~ feat: fix mulmat Oct 28, 2024

chraac merged commit 4abaf7d into dev-refactoring Oct 28, 2024

chraac deleted the dev-multi-op-in-one-graph branch October 28, 2024 04:48

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: fix mulmat #2

feat: fix mulmat #2

chraac commented Sep 9, 2024 •

edited

Loading

chraac Oct 6, 2024 •

edited

Loading

feat: fix mulmat #2

feat: fix mulmat #2

Conversation

chraac commented Sep 9, 2024 • edited Loading

Summary

Changes

Self-check

chraac Oct 6, 2024 • edited Loading

Choose a reason for hiding this comment

chraac commented Sep 9, 2024 •

edited

Loading

chraac Oct 6, 2024 •

edited

Loading