src/dazzle_back/email/normalizer.py

Generated: 2026-03-30T04:38:45.955625+00:00

Commit: 615f1354a103

Language: python

Scope: dazzle_back

Purpose

Email normalizer for DAZZLE.

Converts raw email content into normalized, queryable events. Extracts structured fields, business references, and safe excerpts.
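As a rough illustration of what "business references" and "safe excerpts" could mean in practice, here is a minimal sketch. The reference pattern (uppercase prefix, dash, digits), the `FoundReference` class, and the helper names `find_references` and `safe_excerpt` are all assumptions made for this example; the actual `BusinessReference` model and the module's extraction rules are not shown on this page.

```python
from __future__ import annotations

import re
from dataclasses import dataclass

# Hypothetical pattern: a short uppercase prefix, a dash, then digits (e.g. INV-12345).
_REF_PATTERN = re.compile(r"\b([A-Z]{2,5})-(\d{3,10})\b")


@dataclass(frozen=True)
class FoundReference:
    """Illustrative stand-in; not the real BusinessReference model."""

    kind: str
    value: str


def find_references(text: str) -> list[FoundReference]:
    """Return unique references in order of first appearance."""
    seen: set[tuple[str, str]] = set()
    found: list[FoundReference] = []
    for match in _REF_PATTERN.finditer(text):
        key = (match.group(1), match.group(2))
        if key not in seen:
            seen.add(key)
            found.append(FoundReference(kind=key[0], value=key[1]))
    return found


def safe_excerpt(text: str, limit: int = 80) -> str:
    """Collapse whitespace and truncate to at most `limit` characters."""
    collapsed = " ".join(text.split())
    if len(collapsed) <= limit:
        return collapsed
    return collapsed[: limit - 3].rstrip() + "..."
```

Capping the excerpt length and collapsing whitespace keeps stored snippets small and single-line, which is one plausible reading of "safe" here.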

The normalizer is a consumer that:

  1. Reads from office.mail.raw stream
  2. Fetches raw content from blob store
  3. Parses and extracts struc...
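The parsing step of such a consumer can be sketched with the standard-library calls listed under Imports (`message_from_bytes`, `decode_header`, `parseaddr`). The function names `decode_mime_header` and `extract_fields` and the returned dictionary shape are hypothetical; the real normalizer produces a `NormalizedMailEvent`, whose fields are not documented on this page.

```python
from __future__ import annotations

from email import message_from_bytes
from email.header import decode_header
from email.message import Message
from email.utils import parseaddr


def decode_mime_header(value: str | None) -> str:
    """Decode an RFC 2047 encoded header value into a plain string."""
    if not value:
        return ""
    parts: list[str] = []
    for chunk, charset in decode_header(value):
        if isinstance(chunk, bytes):
            try:
                parts.append(chunk.decode(charset or "utf-8", errors="replace"))
            except LookupError:
                # Unknown charset label: fall back to UTF-8 with replacement.
                parts.append(chunk.decode("utf-8", errors="replace"))
        else:
            parts.append(chunk)
    return "".join(parts)


def extract_fields(raw: bytes) -> dict[str, str]:
    """Parse raw message bytes and pull out a few structured fields."""
    msg: Message = message_from_bytes(raw)
    name, address = parseaddr(msg.get("From", ""))
    return {
        "from_name": decode_mime_header(name),
        "from_address": address.lower(),
        "subject": decode_mime_header(msg.get("Subject")),
    }
```

Lower-casing the address is a choice made for this sketch so that lookups are case-insensitive; whether the module does the same is not stated here.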

Key Exports

  • NormalizationResult
  • EmailNormalizer
  • normalize
  • _TextExtractor
  • handle_starttag
  • handle_endtag
  • handle_data
  • get_text
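The exports `handle_starttag`, `handle_endtag`, `handle_data`, and `get_text` suggest that `_TextExtractor` subclasses `html.parser.HTMLParser` to turn HTML bodies into plain text. The class below is a guess at that general shape, not the module's code: the name `TextExtractorSketch`, the skipped tags, and the block-level tag set are all assumptions.

```python
from __future__ import annotations

from html.parser import HTMLParser


class TextExtractorSketch(HTMLParser):
    """Hypothetical HTML-to-text extractor; the real _TextExtractor may differ."""

    _SKIP = {"script", "style"}  # content of these tags is dropped
    _BLOCK = {"p", "div", "br", "li", "tr"}  # these tags introduce line breaks

    def __init__(self) -> None:
        # convert_charrefs=True makes the parser unescape entities such as &amp;.
        super().__init__(convert_charrefs=True)
        self._chunks: list[str] = []
        self._skip_depth = 0

    def handle_starttag(self, tag: str, attrs: list[tuple[str, str | None]]) -> None:
        if tag in self._SKIP:
            self._skip_depth += 1
        elif tag in self._BLOCK:
            self._chunks.append("\n")

    def handle_endtag(self, tag: str) -> None:
        if tag in self._SKIP and self._skip_depth > 0:
            self._skip_depth -= 1
        elif tag in self._BLOCK:
            self._chunks.append("\n")

    def handle_data(self, data: str) -> None:
        if self._skip_depth == 0:
            self._chunks.append(data)

    def get_text(self) -> str:
        """Join collected text, collapse spaces per line, drop empty lines."""
        joined = "".join(self._chunks)
        lines = (" ".join(line.split()) for line in joined.splitlines())
        return "\n".join(line for line in lines if line)
```

Typical use is to call `feed()` with the HTML string, then `close()`, then `get_text()`.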

Dependencies

Imports

  • __future__.annotations
  • logging
  • re
  • dataclasses.dataclass
  • dataclasses.field
  • datetime.UTC
  • datetime.datetime
  • email.message_from_bytes
  • email.header.decode_header
  • email.message.Message
  • email.utils.parseaddr
  • typing.TYPE_CHECKING
  • typing.Any
  • dazzle.core.ir.email.BusinessReference
  • dazzle.core.ir.email.EmailAttachmentRef
  • dazzle.core.ir.email.NormalizedMailEvent
  • dazzle.core.ir.email.RawMailEvent
  • blob_store.BlobStore
  • html
  • html.parser.HTMLParser

Event Interactions

None detected.

Invariants and Assumptions

  • Validates that raw content exists at raw_event.raw_pointer; otherwise reports "Raw content not found: {raw_event.raw_pointer}"
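A minimal sketch of such a check follows. The `BlobReader` protocol, its `get` method, the `InMemoryBlobs` stand-in, and the choice to raise `ValueError` are assumptions for illustration only; the real `BlobStore` interface is not shown on this page, and the module may report the failure through `NormalizationResult` instead of raising.

```python
from __future__ import annotations

from typing import Protocol


class BlobReader(Protocol):
    """Hypothetical read interface; the real BlobStore API may differ."""

    def get(self, pointer: str) -> bytes | None: ...


class InMemoryBlobs:
    """Tiny in-memory stand-in used only for this example."""

    def __init__(self, blobs: dict[str, bytes]) -> None:
        self._blobs = blobs

    def get(self, pointer: str) -> bytes | None:
        return self._blobs.get(pointer)


def load_raw(store: BlobReader, raw_pointer: str) -> bytes:
    """Fetch raw content, failing loudly when the pointer resolves to nothing."""
    content = store.get(raw_pointer)
    if content is None:
        raise ValueError(f"Raw content not found: {raw_pointer}")
    return content
```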

Tests

No related tests found.

Notes

This page was automatically generated by tools/gen_reference_docs.py.