Review steps
- List each tool on the server card with its name, description, input schema, and output schema.
- When a bearer token is available, compare the server card against the live tools/list response.
- Flag unexpected tools, missing schemas, and mismatched destructive hints.
- Store the baseline so later drift can be reviewed quickly.
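
The steps above can be sketched in Python. This is a minimal illustration, not a definitive implementation. It assumes each tool is a dict with `name`, `inputSchema`, `outputSchema`, and an optional `annotations.destructiveHint` field, and that the card and live tool lists have already been fetched and parsed. It treats a "mismatched" hint as one that differs between the card and the live response, which is only one possible reading. The function names `review_tools` and `baseline_json` are hypothetical.

```python
import json
from typing import Any

Tool = dict[str, Any]


def destructive_hint(tool: Tool) -> Any:
    # Assumed location of the hint: annotations.destructiveHint.
    return (tool.get("annotations") or {}).get("destructiveHint")


def review_tools(card_tools: list[Tool], live_tools: list[Tool]) -> list[str]:
    """Compare server-card tools with live tools and return findings."""
    findings: list[str] = []
    card = {t["name"]: t for t in card_tools}
    live = {t["name"]: t for t in live_tools}

    # Tools the live server exposes that the card does not declare.
    for name in sorted(live.keys() - card.keys()):
        findings.append(f"unexpected tool: {name}")

    for name in sorted(card):
        tool = card[name]
        # An absent or empty schema is reported as missing.
        for key in ("inputSchema", "outputSchema"):
            if not tool.get(key):
                findings.append(f"missing {key}: {name}")
        # Compare hints only when the tool appears on both sides.
        if name in live and destructive_hint(tool) != destructive_hint(live[name]):
            findings.append(f"destructive hint mismatch: {name}")
    return findings


def baseline_json(tools: list[Tool]) -> str:
    """Serialize tools deterministically so a later diff shows only real drift."""
    ordered = sorted(tools, key=lambda t: t["name"])
    return json.dumps(ordered, sort_keys=True, indent=2)
```

Writing the output of `baseline_json` to a file under version control is one way to keep the baseline: sorted keys and a stable tool order mean a later text diff shows only real changes.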