New: both AI detection models just got a major upgrade. They now catch ChatGPT, Claude, and the newest AI hiding in student code. See what's new
Live search across 47M+ indexed files

Search the Codequiry Libraries

A small window into the corpus every Codequiry scan checks against. Public code from GitHub, snippets from Stack Overflow, and a private database of anonymized academic submissions, all searchable here in under a second.

Try:
This is a public demo, limited to 10 queries per hour per IP. A real scan checks far more: every submission is run against 2B+ sources on the web plus our private academic database.
Example match for quicksort. Run a search above to query the live corpus.
github.com / TheAlgorithms / Python / sorts/quick_sort.py
Python 16 lines
1
def quicksort(arr):
2
"""Sort a list using the quicksort algorithm."""
3
if len(arr) <= 1:
4
return arr
5
6
pivot = arr[len(arr) // 2]
7
left = [x for x in arr if x < pivot]
8
mid = [x for x in arr if x == pivot]
9
right = [x for x in arr if x > pivot]
10
11
return quicksort(left) + mid + quicksort(right)
12
13
14
if __name__ == "__main__":
15
data = [3, 6, 8, 10, 1, 2, 1]
16
print(quicksort(data))
2B+
Web sources per scan
47M+
Files in this demo index
65+
Languages covered
< 1s
Median query latency
What's in the corpus

Code your students copy from, indexed before they submit.

Web search misses code in Stack Overflow gists, archived repos, and reposted snippets. The corpus catches what Google doesn't.

Public GitHub repositories

Continuously ingested from active public projects. Code that lives in starred repos, course solutions, and homework helpers shows up in your match report.

Stack Overflow answers

Top-voted snippets and accepted answers, indexed at the chunk level. Paraphrased copies still match because the corpus is searched by token, not by URL.

Private academic database

Anonymized submissions from academic institutions across the Codequiry network. A private corpus that catches contract-cheating rings and recycled coursework that never appears on the public web.

How it works

Trigram-indexed search built for code, not prose.

Sub-second matching across tens of millions of documents. Every scan you run hits the same corpus you can query right above, plus 2B+ web sources and our private academic database.

Continuously indexed

New repositories and answers ingested daily. The corpus grows by hundreds of thousands of documents a week, so your scans get smarter with no work on your end.

Sub-second matching

Trigram-indexed search returns top matches in under a second even across tens of millions of documents. Your submission gets graded faster.

Noise-filtered

Boilerplate like Supabase clients, framework templates, and common stdlib idioms is filtered out automatically. You see real matches, not "this file imports React."

Built to be trusted

We know code looks alike. Codequiry flags plagiarism with one of the lowest false-positive rates on the market.

Boilerplate, framework templates, common stdlib patterns, and standard imports are filtered before a match is ever surfaced. When Codequiry flags a pair of submissions, you can take it to the academic-integrity committee with confidence.

Every scan checks all of it.

2B+ web sources, every public GitHub repo, Stack Overflow, Chegg, CourseHero, and every prior submission you’ve uploaded, plus our private academic database, all cross-checked in a single pass. Once students learn their code gets compared against everything, the copying tends to stop.