You ran the submissions through your department's plagiarism checker. The similarity scores came back low, some under 10%. You breathed a sigh of relief. Another semester, another batch of original work.
You were wrong.
In upper-level computer science courses—Data Structures, Algorithms, Systems Programming—the game has changed. The students aren't just copying and pasting. They are engineering their submissions to exploit the blind spots of automated similarity detection. They are building plagiarism that looks, to a tool, like unique work. This isn't about lazy cheating. It's a technical arms race, and if you're only looking at token-level similarity, you're losing.
"The most dangerous plagiarism is the kind your tools tell you isn't there. It trains students that integrity is a system to be gamed, not a principle to uphold." – Dr. Anya Sharma, CS Department Chair, University of Toronto.
This guide is a tactical manual for detecting what I call Structural Deception Plagiarism. We'll move beyond simple string matching and into the realm of code semantics, control flow, and programmer intent. I'll show you the three most common techniques and provide a step-by-step forensic workflow you can implement before your next grading cycle.
The Three Pillars of Modern Code Obfuscation
Students (and sometimes, regrettably, online "tutoring" services) have gotten sophisticated. Their methods aren't random; they're designed to attack specific phases of common detection algorithms like those in MOSS or JPlag.
1. Dead Code & Control Flow Obfuscation
This is the most common technique in algorithm-heavy assignments. The core logic is plagiarized, but it's wrapped in irrelevant code that never executes or is designed to drastically alter the syntactic fingerprint.
Example: The Infinite Loop Decoy
An assignment asks for an implementation of Dijkstra's shortest path algorithm. The stolen core is intact, but it is prefixed with a loop that contributes nothing at runtime yet drastically changes the token stream.
```java
// Legitimate, plagiarized core
public void dijkstra(Graph graph, Node source) {
    // ... standard initialization (stolen)
    Map<Node, Integer> distances = new HashMap<>();
    Set<Node> unvisitedNodes = new HashSet<>(graph.getNodes());
    for (Node node : graph.getNodes()) {
        distances.put(node, Integer.MAX_VALUE);
    }
    distances.put(source, 0);

    // DECOY: an O(n^2) loop that never modifies the algorithm's state
    for (Node n1 : graph.getNodes()) {
        for (Node n2 : graph.getNodes()) {
            int temp = n1.getId() + n2.getId(); // dead calculation
            temp = temp * 0;                    // always zero, never read
        }
    }

    // ... continuation of stolen algorithm
    while (!unvisitedNodes.isEmpty()) {
        Node current = getNodeWithMinDistance(unvisitedNodes);
        unvisitedNodes.remove(current);
        // ... standard relaxation loop (stolen)
    }
}
```
To a token-based checker, this code looks vastly different from the original. The double loop injects dozens of unique tokens. But to a compiler or runtime, the decoy is irrelevant. The semantic core is identical.
2. Comment & Identifier Spoofing
This method attacks the comment-removal and normalization phase. By filling the code with unique, verbose comments and renaming identifiers in a systematic but meaningless way, the student alters the text a naive checker compares beyond recognition.
```python
# Original stolen function
def quicksort(arr):
    if len(arr) <= 1:
        return arr
    pivot = arr[len(arr)//2]
    left = [x for x in arr if x < pivot]
    middle = [x for x in arr if x == pivot]
    right = [x for x in arr if x > pivot]
    return quicksort(left) + middle + quicksort(right)

# Obfuscated version
def sort_quickly(sequence_list):  # Main sorting entry point
    # Base case handler: lists of size 0 or 1 are already sorted.
    if len(sequence_list) <= 1:
        return sequence_list  # Simply return the trivial case.
    # Select the median element as the partitioning pivot.
    pivot_element = sequence_list[len(sequence_list)//2]
    # Partition step: create sublists for elements less than,
    # equal to, and greater than the pivot.
    left_partition = [element for element in sequence_list if element < pivot_element]
    middle_partition = [element for element in sequence_list if element == pivot_element]
    right_partition = [element for element in sequence_list if element > pivot_element]
    # Recursively apply the same process and concatenate results.
    return sort_quickly(left_partition) + middle_partition + sort_quickly(right_partition)
```
The algorithm is verbatim. The logic, structure, and even the list comprehensions are identical. But a naive tool that strips comments yet matches raw identifiers will see completely different token streams: `quicksort` vs `sort_quickly`, `arr` vs `sequence_list`, `pivot` vs `pivot_element`. It's a simple find-and-replace with a thesaurus, and it's devastatingly effective against such checkers.
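The spoofing above can be demonstrated, and defeated, in a few lines. The sketch below uses only Python's standard `ast` module to rename every identifier in each file to a canonical sequence and then compares the resulting tree dumps; the helper names and the fingerprinting approach are my own illustration, not any particular tool's method:

```python
import ast

ORIGINAL = """
def quicksort(arr):
    if len(arr) <= 1:
        return arr
    pivot = arr[len(arr)//2]
    left = [x for x in arr if x < pivot]
    middle = [x for x in arr if x == pivot]
    right = [x for x in arr if x > pivot]
    return quicksort(left) + middle + quicksort(right)
"""

OBFUSCATED = """
def sort_quickly(sequence_list):  # Main sorting entry point
    if len(sequence_list) <= 1:
        return sequence_list
    pivot_element = sequence_list[len(sequence_list)//2]
    left_partition = [element for element in sequence_list if element < pivot_element]
    middle_partition = [element for element in sequence_list if element == pivot_element]
    right_partition = [element for element in sequence_list if element > pivot_element]
    return sort_quickly(left_partition) + middle_partition + sort_quickly(right_partition)
"""

def normalized_fingerprint(source: str) -> str:
    """Parse, canonically rename every identifier, and dump the AST.
    Comments vanish during parsing; names vanish during renaming."""
    tree = ast.parse(source)
    mapping = {}

    def canon(name: str) -> str:
        if name not in mapping:
            mapping[name] = f"id{len(mapping)}"
        return mapping[name]

    for node in ast.walk(tree):
        if isinstance(node, ast.Name):
            node.id = canon(node.id)
        elif isinstance(node, ast.FunctionDef):
            node.name = canon(node.name)
        elif isinstance(node, ast.arg):
            node.arg = canon(node.arg)
    return ast.dump(tree)

print(normalized_fingerprint(ORIGINAL) == normalized_fingerprint(OBFUSCATED))  # True
```

Because the renaming is applied in traversal order within each file, any consistent find-and-replace of names collapses back to the same fingerprint.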
3. False Refactoring & Semantic Equivalence
This is the highest tier of deception. The student understands the algorithm well enough to perform semantics-preserving transformations. They change data structures, loop types, or conditional logic without changing the fundamental algorithm's time/space complexity or output.
Example: Rewriting Recursion as Iteration (and back)
If the source solution uses recursion for a depth-first search, the plagiarized version might use an explicit stack. The control flow graph looks different, but the order of operations and the result are the same.
```cpp
// Original (likely stolen from a textbook solution)
void dfsRecursive(Node* node, std::unordered_set<Node*>& visited) {
    visited.insert(node);
    for (Node* neighbor : node->neighbors) {
        if (visited.find(neighbor) == visited.end()) {
            dfsRecursive(neighbor, visited);
        }
    }
}

// Plagiarized via False Refactoring
void dfsIterative(Node* start, std::unordered_set<Node*>& visited) {
    std::stack<Node*> stack;
    stack.push(start);
    visited.insert(start);
    while (!stack.empty()) {
        Node* current = stack.top();
        stack.pop(); // Note: visit order differs from the recursive version, but the final visited set is identical
        for (Node* neighbor : current->neighbors) {
            if (visited.find(neighbor) == visited.end()) {
                visited.insert(neighbor);
                stack.push(neighbor);
            }
        }
    }
}
```
This is hard to detect because it requires understanding the code's purpose. A token matcher sees zero similarity. An Abstract Syntax Tree (AST) comparison sees different structures. Only a tool or an examiner looking for semantic equivalence—same inputs yielding same outputs via a known set of transforms—will catch it.
The Forensic Workflow: A 5-Step Detection Process
You need a multi-layered approach. Relying on a single similarity score is like using a sieve to stop a flood.
Step 1: Run Standard Detection, But Read the Output Differently
Don't discard low-similarity matches. Use them as a baseline, not a verdict. In Codequiry or similar platforms, a 10-25% match on a complex algorithm is a red flag, not an all-clear. It suggests obfuscation is present. Export these low-match pairs for manual review in Step 4.
Step 2: Apply Semantic-Preserving Normalization (The Clean Slate)
Before any deep analysis, strip the code down to its semantic bones. Write a simple script (Python works well) to perform aggressive, intelligent normalization:
- Remove all comments and docstrings.
- Standardize all user-defined identifiers. Rename every variable, function, and class to a generic sequence (`var1`, `func2`, `Class3`). This defeats identifier spoofing. Use a consistent mapping per file so `sort_quickly` and `quicksort` both become `func1`.
- Collapse whitespace and standardize formatting. Use a code formatter like `clang-format` or `Prettier` with a single, strict profile.
- Inline trivial helper functions. If a function is only called once and is fewer than 5 lines, inline it. This removes a common decoy method.
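For Python submissions, the comment-stripping and identifier-renaming passes fit in one short function. This is a minimal sketch using only the standard `tokenize` module; the `normalize` helper is my own name, and a production script would treat builtins and attribute names more carefully. Note that the output is a flat token stream for diffing, not runnable code:

```python
import io
import keyword
import tokenize

def normalize(source: str) -> str:
    """Strip comments, collapse whitespace, and rename every non-keyword
    identifier to var1, var2, ... in order of first appearance."""
    mapping = {}
    out = []
    for tok in tokenize.generate_tokens(io.StringIO(source).readline):
        if tok.type == tokenize.COMMENT:
            continue  # comments never reach the fingerprint
        if tok.type == tokenize.NAME and not keyword.iskeyword(tok.string):
            if tok.string not in mapping:
                mapping[tok.string] = f"var{len(mapping) + 1}"
            out.append(mapping[tok.string])
        else:
            out.append(tok.string)
    # Drop whitespace-only tokens (NEWLINE, INDENT, ...) to flatten formatting.
    return " ".join(t for t in out if t.strip())

# Two trivially "spoofed" variants collapse to the same fingerprint:
a = normalize("def quicksort(arr):\n    return arr\n")
b = normalize("def sort_quickly(seq):  # renamed to fool the checker\n    return seq\n")
print(a == b)  # True
```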
Run your plagiarism checker on these normalized files. Matches that appear now were hidden by surface-level noise.
Step 3: Control Flow Graph (CFG) Comparison
This is where you catch dead code obfuscation and some false refactoring. The goal is to compare the shape of execution, not the text.
Actionable Tactic: Use a static analysis library for your assignment's language (e.g., tree-sitter for multi-language parsing, javaparser for Java, libclang for C/C++). Extract a simplified CFG:
- Nodes represent basic blocks (straight-line code).
- Edges represent control flow (ifs, loops, jumps).
- Ignore the content of the nodes. Only count them and their connections.
Compare the CFG of two submissions using graph isomorphism algorithms (networkx in Python has these). If two solutions for the same problem have radically different CFGs (one has 50 nodes due to dead loops, another has 15), dig deeper. If they have isomorphic or near-isomorphic CFGs, they likely share the same algorithmic core, regardless of comments or variables.
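As a toy illustration of that comparison step, suppose you have already extracted simplified CFGs as edge lists (the three graphs below are hypothetical, not taken from real submissions); `networkx` then does the heavy lifting:

```python
import networkx as nx

def cfg_from_edges(edges):
    """Build a control-flow graph; nodes are basic-block ids, edges are jumps."""
    g = nx.DiGraph()
    g.add_edges_from(edges)
    return g

# Submission A: entry -> loop header -> body -> header (back edge) -> exit
cfg_a = cfg_from_edges([(0, 1), (1, 2), (2, 1), (1, 3)])
# Submission B: same shape, different block numbering. Renamed identifiers
# and comments do not change the graph at all.
cfg_b = cfg_from_edges([(10, 20), (20, 30), (30, 20), (20, 40)])
# Submission C: an extra dead-code loop bolted on before the real one.
cfg_c = cfg_from_edges([(0, 5), (5, 6), (6, 5), (5, 1), (1, 2), (2, 1), (1, 3)])

print(nx.is_isomorphic(cfg_a, cfg_b))  # True: same execution shape
print(nx.is_isomorphic(cfg_a, cfg_c))  # False: the decoy altered the graph
```

In practice you would also strip the dead-code blocks (Step 2's normalization) before this comparison, so that submission C's decoy loop does not mask an otherwise isomorphic core.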
Step 4: Manual Spot-Check with a "Focal Point" Method
Automation gets you 80% there. Your expertise closes the gap. For submissions flagged in Steps 1-3, don't try to read every line.
- Identify the "Focal Point." In every non-trivial assignment, there's a critical, non-obvious line or logic block. In Dijkstra's, it's the node relaxation loop. In a parser, it's the recursive descent function. Find it in both submissions.
- Compare Logic, Not Syntax. Look at the conditional checks, the update operations, the error conditions. Are they logically identical? Do they make the same subtle mistake? A unique bug is the fingerprint of plagiarism.
- Look for "Algorithmic Accents." Does one solution use a 3-way partition in quicksort while another uses a 2-way? These are meaningful differences. Do both use a min-heap for the frontier in A*? That's a meaningful similarity, especially if the textbook example uses a list.
Step 5: Runtime Behavior Profiling (The Silver Bullet)
This is the most definitive test. If two pieces of code are semantically equivalent, their runtime behavior on a battery of inputs will be identical—not just in output, but often in internal state progression.
How to implement it for a Java assignment:
```java
// A simple profiling wrapper you can provide to students or run secretly
import java.util.ArrayList;
import java.util.List;

public class AlgorithmProfiler {
    public static List<Integer> profileDijkstra(Graph g, Node src) {
        List<Integer> distanceSnapshot = new ArrayList<>();
        // ... Instrument the algorithm to record the 'distances' map
        // after each node is processed (removed from the unvisited set).
        // This records the *order* and *state* of the algorithm.
        return distanceSnapshot; // A fingerprint of execution
    }
}
```
Run both submissions through the same set of 20-30 carefully crafted test graphs (corner cases, random edges). Collect the state snapshots. If the sequence of internal states matches exactly across all tests, you have near-certain proof of shared derivation, even if the code looks different. This catches false refactoring dead-on.
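The same idea can be sketched in Python against the earlier quicksort pair: instrument both submissions to record a `(pivot, partition sizes)` tuple at every call, then compare the traces across a battery of random inputs. The trace format and helper names here are my own illustration, under the assumption that you can insert instrumentation into each submission:

```python
import random

def traced_quicksort(arr, trace):
    """The 'original' submission, recording internal state at every call."""
    if len(arr) <= 1:
        return arr
    pivot = arr[len(arr) // 2]
    left = [x for x in arr if x < pivot]
    middle = [x for x in arr if x == pivot]
    right = [x for x in arr if x > pivot]
    trace.append((pivot, len(left), len(middle), len(right)))
    return traced_quicksort(left, trace) + middle + traced_quicksort(right, trace)

def traced_sort_quickly(seq, trace):
    """The 'obfuscated' submission, instrumented identically."""
    if len(seq) <= 1:
        return seq
    p = seq[len(seq) // 2]
    lo = [e for e in seq if e < p]
    eq = [e for e in seq if e == p]
    hi = [e for e in seq if e > p]
    trace.append((p, len(lo), len(eq), len(hi)))
    return traced_sort_quickly(lo, trace) + eq + traced_sort_quickly(hi, trace)

def fingerprint(fn, inputs):
    """One tuple of state snapshots per test input."""
    traces = []
    for data in inputs:
        t = []
        fn(list(data), t)
        traces.append(tuple(t))
    return tuple(traces)

random.seed(42)
tests = [[random.randrange(100) for _ in range(20)] for _ in range(30)]
print(fingerprint(traced_quicksort, tests) == fingerprint(traced_sort_quickly, tests))  # True
```

Independently written quicksorts tend to diverge somewhere in the trace (pivot choice, tie handling); a byte-for-byte identical state progression across thirty random inputs is a far stronger signal than any textual match.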
Building a Culture That Prevents the Need for Forensics
This technical arms race is exhausting. The better strategy is to design it out of existence.
- Assign Unique Parameters: Don't assign "implement Dijkstra's algorithm." Assign "implement a route planner for the attached London Tube map, optimizing for morning congestion weights." The data is unique, making direct copying useless.
- Require Submissions to Include a "Design Narrative": A 300-word write-up explaining one key design choice and one debugging challenge they faced. Plagiarized code rarely comes with a plausible, specific narrative.
- Use Two-Stage Assignments: Stage 1 is a code review of a provided (buggy) solution. Stage 2 is implementing their own. They engage with code structure before writing, reducing the temptation to outsource.
The goal isn't to catch more cheaters. It's to make cheating a futile, high-effort endeavor so that genuine learning becomes the only rational path. By understanding their methods and deploying this layered forensic workflow, you shift the advantage back to where it belongs: with the educator who values integrity as much as innovation.
Your tools are a starting point. Your judgment is the final line of defense. Use both.