Esc

Byte Sequence Emulation

Definition

Analyzing sequences of bytes and determining if they likely represent malicious shellcode.

Synonyms: Shellcode Transmission Detection .

How it works

Bytes are analyzed as if they are machine code instructions, and such instructions that are a common component of known shellcode are noted, such as stack pivots, reads from a Memory Address Table, and system calls for functions that disable protections or execute code. For example, the x86 instruction b0 0b: mov $11, %ax, with no further alterations to the %ax register, followed by cd 80: syscall executes the system call execve() in the Linux kernel, which replaces the current process with another one specified -- this is a common action in shellcode, so this sequence would be flagged.

This technique detects shellcode despite whether or not it would cause a buffer overflow in the target binary.

If the sequence of bytes contains a sequence similar to that used in malicious shellcode, the entire byte sequence is flagged and a follow-on technique may be invoked.

Considerations

False Negatives

If the shellcode instructions are far apart, simple implementations might not detect the shellcode.

Due to the nature of assembly instructions not having a defined start or end, implementations which do not process all start sequences (for example, when they a find byte sequence of interest, continue scanning forwards from the end of it) might not detect the shellcode.

This technique might not detect more complex or obfuscated instructions. For that purpose, Dynamic Analysis or Emulated File Analysis could assist by analyzing the actual instruction function.

This technique may not detect self-modifying code. To make it harder for a process to modify itself, Process Segment Execution Prevention should be used, while noting its considerations.

This technique might not detect malicious shellcode which reuses instructions in the target binary for malicious effect, as memory references in the presumed assembly code are not dereferenced. Dynamic Analysis and Emulated File Analysis, when set up properly to fork from the running target binary, might detect this. Process Segment Execution Prevention combined with Segment Address Offset Randomization frequently makes introduction of shellcode through overwriting a saved return pointer more difficult. Call stack depth analysis might detect excessive reuse of instructions in the target binary. Shadow Stack Frames might detect that a stack frame's return address has changed and Stack Frame Canary Verification might detect that the stack frame's return address was overwritten. Other heuristic methods might detect jump-oriented programming shellcode.

With inserting code directly, that it is not a buffer overflow, and just some place where code is executed either to a file or a write-what-where, the buffer overflow mitigations do not help. Behavioral analysis could detect this, or proper access control could mitigate this.

False Positives

Byte sequences containing code that is never used as machine code are still analyzed and flagged for anomalies, and eventually, it is likely that an attack sequence will arise from the sheer volume of bytes transmitted.

json

References

All

Academic Paper

The following references were used to develop the Byte Sequence Emulation knowledge-base article.

(Note: the consideration of references does not imply specific functionality exists in an offering.)

Network-Based Buffer Overflow Detection by Exploit Code Analysis

Reference Type: Academic Paper Organization: Information Security Research Centre Author: Stig Andersson, Andrew Clark, and George Mohay

Source:

https://eprints.qut.edu.au/21172/1/21172.pdf

Network-level polymorphic shellcode detection using emulation

Reference Type: Academic Paper Author: Michalis Polychronakis

Source:

https://www.cs.unc.edu/~fabian/course_papers/polymorphic-detect.pdf

D3FEND^™

A knowledge graph of cybersecurity countermeasures