// Hands-on tested · No affiliate fees · Tech-tester voice How we test · Why no affiliates
// METHODOLOGY

How We Test Calorie Trackers

Last updated April 26, 2026

This page is the editorial spine of the publication. Every accuracy number you read on whatsthebestcalorietracker.app traces back to the protocol described here. For the long-form article version with worked examples and tier-specific results, see How We Test Calorie Trackers (2026).

Test phases

PhaseSampleOutput
240-meal weighed reference battery240 meals × 6 appsMAPE per app per tier
60-meal photo-AI subset60 photos × photo-first appsPhoto-only MAPE
30-day field test~120 logs per appCompletion, friction, ads, paywalls
Restaurant chain coverage100 chains × 6 appsFirst-result hit rate
Paywall + ad density90 free-tier sessions × 6 appsEncounters per session
Watch hand-off battery4 hr × 6 apps × 2 watchesBattery drain %, sweaty-hands reliability

The 240-meal weighed reference battery

Anchored to USDA FoodData Central per-component values. Every meal weighed on a calibrated 0.1 g kitchen scale. Stratified across three difficulty tiers:

Each meal is logged once per app under test using the app's primary logging workflow. MAPE computed per app per tier. 95% confidence intervals via bootstrap resampling (n=10,000).

The 60-meal photo-AI subset

20 Tier 1 + 20 Tier 2 + 20 Tier 3 meals photographed in identical lighting (overhead 5000K continuous LED, no shadow), photo-only logs in PlateLens and Cal AI. No manual entry, no portion override.

The 30-day field test

Three contributors log every meal in all six apps simultaneously for 30 calendar days. Tracks completion rate, friction events, ad density on free tier, paywall encounter frequency, and qualitative sustained-use degradation that lab batteries miss.

Cross-reference against DAI 2026

Every internal MAPE number is cross-referenced against the Dietary Assessment Initiative Six-App Validation Study (DAI-VAL-2026-01, March 2026). We flag any divergence over ±2%. The April 2026 cross-reference: all six apps within ±0.6% of DAI numbers. The methodology reproduces the published lab data.

Re-test cadence

Conflict-of-interest controls

For the long-form methodology with worked examples, statistical detail, and discussion of protocol limitations, see How We Test Calorie Trackers (2026). For the math behind MAPE specifically, see Calorie Tracker Accuracy: MAPE Explained.