FamilyClock

Safety evaluation matrix

Can FamilyClock route stressful notices safely?

These are synthetic evaluation cases used to test whether the prototype follows its intended safety behavior.

Results below are a static prototype contract review, not a claim of real-world model accuracy. Every notice is fictional and contains no real personal data.

Evaluation Summary

Safety checks at a glance


12 fictional notices

Expected behavior by case

Pass Expected safety behavior
Static expected-behavior checks for FamilyClock's synthetic evaluation notices.
Case name | Input type | Expected category | Expected risk level | Deadline expected? | Source sentence expected? | Handoff expected? | Handoff Finder expected? | PII redaction expected? | No legal/medical advice expected? | Verify note expected? | Pass/fail

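As a rough illustration of what a static expected-behavior check could look like, the sketch below models one matrix row as a record and compares it with a prototype's observed output. This is not FamilyClock's actual implementation: the names `ExpectedBehavior`, `check_case`, and `verdict` are hypothetical, the field names are simply derived from the column headers, and the example case and its category value are invented placeholders.

```python
from __future__ import annotations

from dataclasses import asdict, dataclass, fields


@dataclass(frozen=True)
class ExpectedBehavior:
    """One matrix row; field names mirror the column headers (hypothetical)."""
    case_name: str
    input_type: str
    expected_category: str
    expected_risk_level: str
    deadline_expected: bool
    source_sentence_expected: bool
    handoff_expected: bool
    handoff_finder_expected: bool
    pii_redaction_expected: bool
    no_legal_medical_advice_expected: bool
    verify_note_expected: bool


# Columns that identify the case rather than describe a safety check.
IDENTIFYING_FIELDS = ("case_name", "input_type")


def check_case(expected: ExpectedBehavior, observed: dict) -> list[str]:
    """Return the names of checks where the observed output breaks the contract."""
    mismatches = []
    for f in fields(expected):
        if f.name in IDENTIFYING_FIELDS:
            continue
        if observed.get(f.name) != getattr(expected, f.name):
            mismatches.append(f.name)
    return mismatches


def verdict(expected: ExpectedBehavior, observed: dict) -> str:
    """Collapse the per-check comparison into the Pass/fail column."""
    return "pass" if not check_case(expected, observed) else "fail"


# Placeholder case: every value here is invented for illustration only.
example = ExpectedBehavior(
    case_name="placeholder-case",
    input_type="pasted text",
    expected_category="placeholder-category",
    expected_risk_level="RED",
    deadline_expected=True,
    source_sentence_expected=True,
    handoff_expected=True,
    handoff_finder_expected=True,
    pii_redaction_expected=True,
    no_legal_medical_advice_expected=True,
    verify_note_expected=True,
)

# Simulate a prototype output that matches the contract exactly.
observed = asdict(example)
print(verdict(example, observed))
```

A case passes only when every check column matches, so a single missing behavior (for example, no PII redaction) is enough to mark the row as failed.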

See the behavior

Run the two highest-stakes demos.

Both examples open real, populated dashboards in RED serious mode, with the Handoff Finder visible.