University of California, Santa Cruz
Sequence alignment algorithms have recently found a use in detecting code clones, software plagiarism, code theft, and polymorphic malware. This approach involves extracting birthmarks, in this case sequences, from programs and comparing them using sequence alignment, a procedure which has been intensively studied in the field of bio-informatics. This idea seems promising. However, the authors have shown that an attacker can evade detection by considering the positions of inserted dummy code and/or the frequency of function calls. Moreover, they found that randomly inserting and deleting symbols in the sequence was ineffective.