Skip to content

About parsing assembly & normalization #2

@oalieno

Description

@oalieno

I found the difference between radare2 and xed-interface while doing some experiment

  • radare2 will output mov byte [rax], 0xb2
  • xed-interface will output mov byte ptr [rax], 0xb2

This might be a problem.
If you do not use bin2asm.py to generate the data, the assembly code you get elsewhere may not be noramlized and may have tiny difference.

Parse the assembly code and normalize them in asm2vec library may be a better solution?
Maybe use keystone and capstone to assemble and then disassemble to obtain a unified representation.

Metadata

Metadata

Assignees

Labels

enhancementNew feature or request

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions