PDFy

Analysis Pipeline

Two-Phase Scan Model

PDFy uses a hybrid scan model:

Fast Scan Responsibilities

The fast scan should produce the first usable verdict and include:

Advanced Scan Responsibilities

Advanced analysis runs after the fast verdict and may include:

Scan Lifecycle

  1. Receive upload request and retention preference.
  2. Persist a scan record.
  3. Store the uploaded file temporarily for processing.
  4. Run fast analysis and publish initial findings.
  5. Queue advanced tasks when applicable.
  6. Append advanced findings to the same scan record.
  7. Generate a structured report payload.
  8. Delete or expire the source file according to retention policy.

Failure Behavior