[CODEGEN] ARM Popcount lowering rule and codegen updates #1235

cowanmeg · 2018-06-05T23:42:42Z

TVM compiler changes for low precision operators

ARM popcount lowering rule
Codegen updates to support reinterpreting vectors, and accessing upper/lower halves separately.

Thanks for contributing to TVM! Please refer to guideline http://docs.tvm.ai/contribute/ for useful information and tips. After the pull request is submitted, please request code reviews from others in the community.

…ing and accessing vectors

tqchen · 2018-06-06T15:48:59Z

There is an unintended change that reverts the submodule to an older version. Please update the submodule (HalideIR) to the latest version. You can do it by git pull under the HalideIR folder

tqchen · 2018-06-06T16:27:48Z

src/codegen/llvm/codegen_llvm.cc

@@ -366,7 +366,7 @@ llvm::Value* CodeGenLLVM::CreateBroadcast(llvm::Value* value, int lanes) {
 llvm::Value* CodeGenLLVM::CreateVecSlice(llvm::Value* vec, int begin, int extent) {
  int num_elems = static_cast<int>(vec->getType()->getVectorNumElements());
  if (extent == num_elems && begin == 0) return vec;
-  CHECK_LT(begin + extent, num_elems);
+  CHECK_LT(begin + extent, num_elems+1);


CHECK_LT-> CHECK_LE

tqchen · 2018-06-06T16:33:50Z

src/codegen/llvm/codegen_arm.cc

+  return CodeGenCPU::CreateIntrinsic(op);
+}
+
+Expr CodeGenARM::ARMPopcount(const Call *call) {


We will need a regression test for this rule. please add a test case to arm popcount, to a new file tests/python/unittest/test_codegen_arm.py .

Since we don't have ARM device to verify, what we can do is to dump out the asm file(Maybe we can patch GetSource in llvm module to support get_source("asm") ) and verify the neons sequence is as expected.

tqchen · 2018-06-06T16:35:24Z

src/codegen/llvm/codegen_arm.cc

+  ::llvm::Intrinsic::ID vpaddu_id = ::llvm::Intrinsic::arm_neon_vpaddlu;
+
+
+  Type uint8_type = Type(e.type().code(), 8, e.type().bits() * e.type().lanes() / 8);


move the typedef after the fallback guard, add comment that the division is always dividable.

Add a comment about what this specific pattern of neon sequence is

tqchen · 2018-06-12T20:45:42Z

Thanks, this is merged!

ajtulloch · 2018-06-13T03:24:21Z

Nice!

ARM Popcount lowering rule and codegen updates to support reinterpret…

0329772

…ing and accessing vectors

tqchen changed the title ~~ARM Popcount lowering rule and codegen updates~~ [CODEGEN] ARM Popcount lowering rule and codegen updates Jun 6, 2018

tqchen requested changes Jun 6, 2018

View reviewed changes

tqchen added the status: review in progress label Jun 6, 2018

Meghan added 3 commits June 6, 2018 16:05

Fixes and test case for arm popcount

777f9ea

white space fixes

2e56f09

Merge branch 'master' of https://github.com/dmlc/tvm into low-precision

2220a36

tqchen added the status: need update need update based on feedbacks label Jun 12, 2018

unit test fixes and arm codegentest

1d55a8e

tqchen approved these changes Jun 12, 2018

View reviewed changes

tqchen added status: accepted and removed status: need update need update based on feedbacks status: review in progress labels Jun 12, 2018

tqchen merged commit be29ac7 into apache:master Jun 12, 2018

tqchen pushed a commit to tqchen/tvm that referenced this pull request Jul 6, 2018

[CODEGEN] ARM Popcount lowering rule and codegen updates (apache#1235)

afcef43

mnuyens pushed a commit to mnuyens/tvm that referenced this pull request Jul 10, 2018

[CODEGEN] ARM Popcount lowering rule and codegen updates (apache#1235)

f3f406a

sergei-mironov pushed a commit to sergei-mironov/tvm that referenced this pull request Aug 8, 2018

[CODEGEN] ARM Popcount lowering rule and codegen updates (apache#1235)

53d65da

cowanmeg deleted the low-precision branch April 5, 2019 18:09

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[CODEGEN] ARM Popcount lowering rule and codegen updates #1235

[CODEGEN] ARM Popcount lowering rule and codegen updates #1235

cowanmeg commented Jun 5, 2018

tqchen commented Jun 6, 2018 •

edited

Loading

tqchen Jun 6, 2018

tqchen Jun 6, 2018

tqchen Jun 6, 2018

tqchen Jun 6, 2018

tqchen commented Jun 12, 2018

ajtulloch commented Jun 13, 2018

		::llvm::Intrinsic::ID vpaddu_id = ::llvm::Intrinsic::arm_neon_vpaddlu;


		Type uint8_type = Type(e.type().code(), 8, e.type().bits() * e.type().lanes() / 8);

[CODEGEN] ARM Popcount lowering rule and codegen updates #1235

[CODEGEN] ARM Popcount lowering rule and codegen updates #1235

Conversation

cowanmeg commented Jun 5, 2018

tqchen commented Jun 6, 2018 • edited Loading

tqchen Jun 6, 2018

Choose a reason for hiding this comment

tqchen Jun 6, 2018

Choose a reason for hiding this comment

tqchen Jun 6, 2018

Choose a reason for hiding this comment

tqchen Jun 6, 2018

Choose a reason for hiding this comment

tqchen commented Jun 12, 2018

ajtulloch commented Jun 13, 2018

tqchen commented Jun 6, 2018 •

edited

Loading