Detection Engine

Multi-wave detection with explicit escalation

StyloBot does not market a single magic detector. It combines cheap protocol checks, behavioral evidence, cross-request correlation, and optional deeper analysis into one runtime decision.

The open-source stack is enough to evaluate real production traffic. Enterprise layers add more control, persistence, and operational reach.

Self-hosted
runs in your VPC, your data stays there

Full decision trace
signals, deltas, action, policy

49 detectors
layered protocol + behavior signals

Privacy-aware
HMACed IDs + stripped UAs

How the engine sees you, right now

Live · Your Detection Bot

18:08:19 · 0ms

Identified as ClaudeBot

Wave 1

Protocol and signature checks

<1ms

User-agent and known tool matching

Catch obvious automation, scanners, and commodity scraping tools before the runtime spends time on subtler questions.

Pattern Match

Recon Tools

<1ms

Header and browser fingerprint validation

Compare what the client claims to be with the headers and browser behavior it presents. Spoofing usually leaks somewhere.

Header Logic

Client Shape

<1ms

Infrastructure signals

Datacenter IP ranges, stale versions, and hostile-source indicators help separate likely automation from ordinary consumer traffic.

Datacenter

Version Age

Wave 2

Behavior and consistency

1-5ms

Request sequence analysis

Examine cadence, transitions, and per-session flow. Real users browse with friction and variation; bots tend to reveal a program.

Cadence

Transitions

1-5ms

Cross-signal inconsistency

Catch impossible combinations such as mismatched OS, browser, protocol, or client capability claims. Bots often forge one layer and forget the rest.

Correlation

Sanity Check

Client-Side

Browser execution proof

Optional client-side checks help distinguish a real browser from a headless impersonator when the application can support that signal.

JavaScript

Headless Gaps

Wave 3

Aggregation and escalation

<1ms

Heuristic aggregation

The main runtime combines detector output into bot probability, confidence, and risk band. This is the decision core that keeps the hot path fast and explainable.

Detector contributions stay visible.
Confidence is separate from probability.
Reputation can promote repeat offenders into the fast path.

Escalation Only

Deep analysis for borderline cases

Optional LLM-backed analysis exists for requests that justify slower reasoning. It is an escalation path, not the identity of the product and not something every request should pay for.

Use for ambiguous spoofing and novel patterns.
Keep the main request path bounded.
Prefer local or controlled model deployment where possible.

Wave 4

Cross-request and cluster intelligence

Background

Bot cluster detection

Group confirmed bad signatures to expose product families, shared infrastructure, and coordinated campaigns. This sharpens later decisions on related traffic.

Similarity Graphs

Campaign View

Real-Time

Country and infrastructure reputation

Reputation adds supporting context for borderline requests and decays over time so old conditions do not poison new traffic forever.

Time Decay

Context Signal

Background

Community affinity

When a request shares traits with known hostile clusters, the runtime can raise scrutiny without treating that single overlap as a final verdict.

Shared Traits

Confidence Lift

Enterprise

Advanced enterprise layers

Deeper fingerprinting and SQLite or Postgres + pgvector persistence

Enterprise builds extend the runtime with stronger persistence, richer fingerprint layers, and operational tooling for teams managing multiple gateways.

Controlled model integrations

When deeper model analysis is useful, enterprise deployments can plug in approved providers without turning the product into a generic model-marketing page.

External Intel

Optional threat intelligence

~100ms

Project Honeypot

External IP reputation can add another signal for known hostile sources. Treat it as one input in the graph, not a substitute for local evidence.

IP Reputation

External Feed

Recent additions

Friendly-bot throttle-status policy: legitimate crawlers (Googlebot, Bingbot) routed through a rate-limit lane instead of blocked.
Deceptive-bot (!) marker: bots claiming to be browsers but failing protocol checks get an explicit deception flag in the dashboard.
Drift-gated naming: bot display names only update when behavior drifts, preventing flicker in the dashboard.
Ambiguity-persistence: repeat boundary-probing requests are tracked as a signal in their own right.
Slow-path coordinator: expensive identity verification is admission-controlled so it cannot DoS the fast path.

What these detectors catch

Googlebot

verified-bot

Google's web crawler. Honest user-agent, datacenter origin, no Sec-Fetch headers, predictable timing. We route it through the friendly-bot throttle-status policy; never blocked.

googlebot

Headless Chrome

headless

Puppeteer, Playwright, and chrome --headless sessions. Looks like Chrome at the UA level but diverges from a real browser's protocol fingerprint. Watch-level by default.

headless-chrome

curl

tool

Command-line HTTP clients (curl, wget, http). Honest Accept: */* and tiny header set. Useful in scripts and expected in many CI pipelines, so it stays info-level until the request shape suggests scanning.

curl-tool

Pipeline order

Cheap checks first. Context second. Escalation last.

Fast checks cut obvious traffic

Known tools, malformed clients, and hostile infrastructure get caught early.

Behavioral and sequence analysis refine the call

Session cadence and cross-signal consistency determine whether suspicion hardens or falls away.

Aggregation outputs risk, confidence, and action

The system produces a traceable decision: signals, detector deltas, aggregation, and policy action.

Escalation handles the hard residue

Only the tricky traffic earns slower, deeper analysis.

Confirmed patterns become cheaper to stop next time

Reputation and cluster context make repeat offenders faster to classify.

Product Family

Common runtime, different surface area

StyloBot and StyloWall share the same runtime mindset: evidence-first traffic decisions, local control, and low-latency enforcement.

StyloBot

Focused on HTTP and application-layer bot traffic. Use it to protect login, checkout, API, and content routes where browser behavior matters.

Request Early Access

StyloWall

Extends the same operator mindset toward broader network services and protocol surfaces beyond the web stack.

Inspect the runtime, then decide how hard to enforce

The detector stack is useful because it makes the decision path visible.

Request Early Access View Enterprise

Name	Hits	Seen
Bingbot	450	3m
Serpstatbot Crawler serpstatbot.com	218	15h 59m
Googlebot	210	3m
Applebot	168	19m
ClaudeBot	151	18s

Multi-wave detection with explicit escalation

How the engine sees you, right now

Top Bots

Protocol and signature checks

User-agent and known tool matching

Header and browser fingerprint validation

Infrastructure signals

Behavior and consistency

Request sequence analysis

Cross-signal inconsistency

Browser execution proof

Aggregation and escalation

Heuristic aggregation

Deep analysis for borderline cases

Cross-request and cluster intelligence

Bot cluster detection

Country and infrastructure reputation

Community affinity

Advanced enterprise layers

Deeper fingerprinting and SQLite or Postgres + pgvector persistence

Controlled model integrations

Optional threat intelligence

Project Honeypot

Recent additions

What these detectors catch

Pipeline order

Fast checks cut obvious traffic

Behavioral and sequence analysis refine the call

Aggregation outputs risk, confidence, and action

Escalation handles the hard residue

Confirmed patterns become cheaper to stop next time

Common runtime, different surface area

StyloBot

StyloWall

Inspect the runtime, then decide how hard to enforce