Cong — Restaurant Data Dashboard

Restaurants by Type

Click a bar to filter map + source. Ctrl+click (Cmd on Mac) to select multiple types.

Restaurants by Source

Auto-filters when types are selected above.

Restaurant Distribution by State

Ctrl+click tabs to combine multiple types on the map.

Restaurant Types by State

Duplicate Names Within Sources

Restaurants with the same normalized name within a single dataset (likely chains)

Cross-Source Duplicate Names

Same restaurant name appearing in multiple datasets

Top Chain Restaurants (by location count)

Restaurant Types by Source Dataset

Top 50 Canonical Dishes

Click a row to drill into a dish. Click a column header to sort.

Canonicals by Dish Type

Canonicals by Menu Section

Cuisine × Dish-type Heatmap

Canonical-dish counts at each cuisine × dish-type intersection.

All Dishes by State

Distinct canonical dishes attested in each state.

Conversations

No v3 conversations yet. Run the pipeline: python scripts/seed_v3_demo.py for a stub demo, or kick off docker run … cong/pipeline-v3 --restaurant 2 --n 10 for the real Shokudo end-to-end.

Top groups by flag rate

Groups with flag_rate are sorted DESC. Excludes utterances without a T1 audit (run scripts/backfill_rfc013_t1.py --apply to populate).

Closure metric

Latest application per (utterance, stt_model) — re-runs and rollback-then-reapply paths don't double-count. Bo's signal: auto_standing_pct trending up means recurring patterns are becoming automatic.

Pending proposals

Trigger the wer-proposal-audit skill in a Claude Code session to refresh. Sample column shows up to 20 representatives (full cluster_member_fks live in the DB).

Snapshot

Recompute via scripts/compute_per_term_wer.py --stt-model <m> --corpus-run-id <tag> after each generation/STT run.

Top terms by per_term_wer

Filter: n_gt ≥ min_occurrences (default 5) to drop noisy single-shot terms. Click a row to drill into the utterances containing that term.

Export

Writes data/finetune_manifests/candidates_<UTC-date>.jsonl — copy-paste the command below, adjust threshold / cap, run from the repo root.

Entity hit-rate

hit = match in {exact, equivalent}; miss = match in {partial, missing}. Click a row to drill into the misses for that type. Run scripts/llm_entity_extract.py against the cloud DB to populate.

聪 Cong — Restaurant Data Dashboard