Skip to content

NEGEMMLowpMatrixMultiplyCore: per-channel quantization support #1153

Closed
@alvoron

Description

@alvoron

NEGEMMLowpMatrixMultiplyCore supports per-tensor quantization only.
Feature request is to support per-channel quantization.

Reproducer: eshoguli@49a91f3

Issue is reproduced if per-channel quantization is changed to per-tensor quantization in the provided example:

   // per-tensor quantization:
   const QuantizationInfo src2_qinfo(0.2f);

   // per-channel quantization:
   const auto scales2 = generate_quantization_scales(6, 0.2f);
   const QuantizationInfo src2_qinfo(scales2);
   std::cout << "scales2: " << scales2 << std::endl;

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions