Technical Documentation

Inside the Forensic Engine.

Discover the multi-layered linguistic analysis we use to distinguish human intent from algorithmic probability.

The Multi-Step Forensic Process

Step 1: Text Tokenization

The system breaks down your input into discrete linguistic units (tokens). We analyze not just words, but the semantic relationship between them.

Step 2: Predictive Modeling

We compare your word sequences against probability maps of known LLMs. If your text perfectly mirrors a machine’s next-token prediction, it’s flagged.

Step 3: Metric Calculation

This is the heart of the engine. We calculate Perplexity (randomness) and Burstiness (structural pulse) to create a unique forensic profile.

Step 4: Cross-Model Verification

Finally, we validate the results across multiple AI signatures. This cross-referencing is crucial for reducing false positives and ensuring institutional accuracy.

The Mathematics of Detection

Perplexity

Perplexity measures the randomness of word choice. AI models aim for low perplexity (high predictability) to ensure clarity. Humans naturally introduce “linguistic noise” that machines find unpredictable.

AI Pattern (Low)Predictability: 98%
“The capital of France is Paris.”
Human Pattern (High)Predictability: 12%
“France’s beating heart, Paris, remains…”

Burstiness

Burstiness measures the variance in sentence structure. Human writers alternate between long explanatory clauses and short, punchy statements. AI output is typically uniform and “flat.”

Human (High Variance) AI (Monotonous)

Why Detection is Probabilistic

It is crucial to understand that AI detection is a **statistical estimation**, not a definitive proof. There is no biological DNA in text; only mathematical patterns. We provide an educated forensic guess based on entropy models, which should be used as a conversation starter in academic settings, never as the sole proof of misconduct.

Ethical Data Analysis
Forensic Proofing
Model Calibration

Continuously Retrained on Millions of Tokens

GPT Claude Gemini Llama

Our engine is updated weekly as new model signatures are released, ensuring the highest possible accuracy for 2026 standards.

Test Your Own Text

Launch Forensic Scanner