Slop Watch
To better understand AI-generated content, I built a web app that computes statistics about a piece of text to estimate how much of it is AI-generated. Call it Slop Watch. Its simple interface surfaces several signals for judging AI writing. But beware: I've only tested it sparingly.
How It Works
The Binoculars method (the backbone) runs the input text through two related LLMs, an observer and a performer (here the Mistral-7B base and Instruct models), and computes:
- Perplexity from the observer model (how surprised it is by the text)
- Cross-perplexity between the performer and observer (how much their predictions agree)
- Score = perplexity / cross-perplexity — AI text scores below a threshold, human text above
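As a rough illustration of the three quantities above, here is a minimal sketch in plain Python. It is not the app's actual code: the function names and the list-of-logits input format are hypothetical, and in practice the logits would come from the two Mistral-7B models. It also assumes both quantities are per-token averages in log space (mean negative log-likelihood for perplexity, mean cross-entropy for cross-perplexity), which is my reading of the Binoculars formulation.

```python
import math


def log_softmax(logits):
    """Convert raw logits to log-probabilities in a numerically stable way."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_z for x in logits]


def binoculars_score(observer_logits, performer_logits, token_ids):
    """Hypothetical helper: ratio of log-perplexity to cross-perplexity.

    observer_logits[i] and performer_logits[i] are each model's logits
    for the position that predicts token_ids[i].
    """
    nll = 0.0    # observer's negative log-likelihood of the actual tokens
    cross = 0.0  # cross-entropy between observer and performer predictions
    for obs, perf, tok in zip(observer_logits, performer_logits, token_ids):
        obs_lp = log_softmax(obs)
        perf_lp = log_softmax(perf)
        nll -= obs_lp[tok]
        # Weight the performer's log-probs by the observer's probabilities.
        cross -= sum(math.exp(o) * p for o, p in zip(obs_lp, perf_lp))
    n = len(token_ids)
    return (nll / n) / (cross / n)


# Toy example with a 4-token vocabulary (made-up logits, not model output).
uniform = [[0.0, 0.0, 0.0, 0.0]] * 3
confident = [[5.0, 0.0, 0.0, 0.0]] * 3
score_unsure = binoculars_score(uniform, uniform, [0, 1, 2])
score_sure = binoculars_score(confident, uniform, [0, 0, 0])
```

In this toy setup, text the observer finds highly predictable (`score_sure`) scores lower than text it finds surprising (`score_unsure`), which matches the idea that AI text falls below the threshold.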
The web UI provides four visualizations:
- Score gauge — overall verdict with confidence
- Per-sentence heatmap — color-coded sentences (green=human, red=AI)
- Perplexity distribution — per-token perplexity chart (AI=flat, human=spiky)
- Burstiness plot — sentence length variation (AI=uniform, human=varied)
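The burstiness idea in the last item can be sketched as follows. This is only one plausible way to quantify sentence length variation, not necessarily what the app does: it assumes a naive regex sentence splitter, whitespace word counts, and the coefficient of variation (standard deviation divided by mean) as the measure. The function names are hypothetical.

```python
import re
import statistics


def sentence_lengths(text):
    """Split on sentence-ending punctuation and count words per sentence."""
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]
    return [len(s.split()) for s in sentences]


def burstiness(text):
    """Coefficient of variation of sentence lengths (0.0 means uniform)."""
    lengths = sentence_lengths(text)
    if len(lengths) < 2:
        return 0.0  # not enough sentences to measure variation
    mean = statistics.mean(lengths)
    return statistics.pstdev(lengths) / mean if mean else 0.0


uniform_text = "One two three. Four five six. Seven eight nine."
varied_text = "Hi. This is a much longer sentence with many words in it. Ok."
uniform_burst = burstiness(uniform_text)
varied_burst = burstiness(varied_text)
```

Under these assumptions, `uniform_text` yields a burstiness of zero because every sentence has the same length, while `varied_text` yields a clearly positive value.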
