Methodology

Data sources

Specialty taxonomy mapping

NPPES encodes provider specialty using NUCC (National Uniform Claim Committee) taxonomy codes. The index tracks 15 core specialties selected to align with the HRSA HPSA shortage categories. Each specialty is defined as a set of NUCC codes that includes relevant subspecialties; the complete mapping is published in the taxonomy.yaml file in the public repository. The taxonomy is intentionally inclusive, meaning more codes are mapped per specialty than a strict definition would require. This approach produces a slight overcount of available physicians, which keeps gap claims conservative rather than inflated.

Why 30 miles

Specialty coverage is measured within a 30-mile radius of each hospital. That threshold is not arbitrary: HRSA applies 30 miles as its standard definition of "reasonable access" in non-metropolitan areas when designating Health Professional Shortage Areas, the federal mechanism that governs loan forgiveness placements, J-1 visa waivers, and related workforce interventions.

An earlier design used 60 miles. At that radius, the catchment areas of Jackson, Memphis, Mobile, and New Orleans begin to overlap with rural Mississippi CAHs, masking the access gaps that rural patients actually face. At 30 miles, the index produces a picture that is consistent with what rural CMOs report from operational experience.

Geographic distance

Hospital coordinates are resolved to precise latitude and longitude via Nominatim (OpenStreetMap). Approximately half of Mississippi CAHs carry rural highway-style addresses (for example, "25117 HIGHWAY 51") that Nominatim cannot geocode; those hospitals fall back to their ZIP code centroid. Within a 30-mile radius, this fallback introduces less than one mile of positional error. CAHs using the centroid fallback are marked with an outlined map symbol so users can identify which positions are approximate. Physician locations are geocoded using the Census 2020 ZCTA gazetteer. All distances are computed using the haversine great-circle formula.

Gap scoring

Each (CAH, specialty) pair receives one of four severity classifications based on the count of active physicians of that specialty within 30 miles of the hospital:

Map severity buckets

Each CAH is assigned a gap count equal to the number of its 15 tracked specialties classified HIGH or CRITICAL. County shading on the map reflects the highest gap count among all CAHs within that county, using the maximum rather than the mean, so that a county containing one severely gapped hospital is represented accurately even if other facilities in the county are better resourced:

Verification

Prior to publication of each dashboard build, five hospital-specialty pairs are selected at random and manually reviewed against external sources to confirm or refute the underlying gap claim. Confirmations are logged in an internal verification record keyed to a build identifier derived from the gap matrix's claim columns. Signoffs issued against a prior build are rejected by the render gate, ensuring that verification is specific to the data version being published.

Known limitations