
How to specify fusion rules with quantization? #21251

Open
f2013519 opened this issue Jul 4, 2024 · 2 comments
Labels
quantization issues related to quantization

Comments


f2013519 commented Jul 4, 2024

How do we specify fused operator patterns like Conv+ReLU in the quantization config? I see such options are available in PyTorch, but not in ONNX Runtime's quantize_static.

Right now I see different scales at the outputs of Conv and ReLU, which is not suitable for us, as it requires an additional requantize step.
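As a minimal NumPy sketch of the problem (the scale/zero-point values below are hypothetical, not taken from any real calibration): when Conv and ReLU outputs get different quantization parameters, the quantized ReLU needs a requantize step, whereas sharing the ReLU's parameters lets the ReLU collapse into a clip at the zero point.

```python
import numpy as np

# Hypothetical affine quantization helpers: q = round(x / scale) + zero_point.
def quantize(x, scale, zp):
    return np.clip(np.round(x / scale) + zp, 0, 255).astype(np.uint8)

def dequantize(q, scale, zp):
    return (q.astype(np.float32) - zp) * scale

conv_out = np.array([-1.5, 0.0, 2.0, 6.0], dtype=np.float32)  # stand-in Conv output

# Separate calibration assigns Conv and ReLU different (scale, zero_point) pairs:
s1, zp1 = 7.5 / 255, 51   # Conv output covers [-1.5, 6.0]
s2, zp2 = 6.0 / 255, 0    # ReLU output covers [0.0, 6.0]

q_conv = quantize(conv_out, s1, zp1)

# Because s1 != s2, the quantized ReLU must rescale (an extra requantize step):
q_relu = np.clip(np.round((q_conv.astype(np.int32) - zp1) * (s1 / s2)), 0, 255).astype(np.uint8)

# If Conv simply shared the ReLU's (s2, zp2), ReLU reduces to a clip at the zero point:
q_shared = np.maximum(quantize(conv_out, s2, zp2), zp2)
```

Both paths produce the same quantized values here, but the shared-scale version avoids the integer rescale on every element.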

Thanks!

github-actions bot added the quantization label Jul 4, 2024
Member

xadupre commented Jul 5, 2024

If you need to fuse operators in a custom way, you can use this tool: https://onnxscript.ai/tutorial/rewriter/rewrite_patterns.html (you should install the development version).

Author

I do not necessarily need a custom op, but rather a way to specify that the convolution and ReLU should not have different scales, as this can introduce noise.

Although, it would be good to have a standard fused op like Conv+ReLU. Is there any reason this is not supported yet?
