Users can add trial sites to SAGE by entering latitude/longitude coordinates, searching for an address, or uploading a batch .csv template. For each site, SAGE generates a travel-time isochrone using the OpenRouteService (ORS) API and overlays it on U.S. census tracts. Demographic data from the tracts that each isochrone intersects are compiled into summary statistics for the population within driving distance of the site. Results can be viewed for individual sites and in aggregate across all sites.
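The site-to-summary workflow above can be sketched in a few lines of Python. This is a minimal illustration, not SAGE's actual implementation. The function names, the 30-minute default, and the `driving-car` routing profile are assumptions. The sketch also assumes that every intersected tract is counted in full, with no area weighting, and that a tract shared by several catchments is counted once in the combined total.

```python
# Public ORS isochrone endpoint; the "driving-car" profile is an assumption.
ORS_ISOCHRONE_URL = "https://api.openrouteservice.org/v2/isochrones/driving-car"


def build_isochrone_request(lat, lon, minutes=30):
    """Build the JSON body for one ORS isochrone request (hypothetical helper).

    ORS expects coordinates as [longitude, latitude] and time ranges in seconds.
    """
    return {
        "locations": [[lon, lat]],
        "range": [minutes * 60],
        "range_type": "time",
    }


def summarize_catchments(site_tracts, tract_population):
    """Sum tract populations per site and across all sites.

    site_tracts: maps a site name to the GEOIDs of the tracts its isochrone intersects.
    tract_population: maps a tract GEOID to its total population.
    """
    per_site = {
        site: sum(tract_population[geoid] for geoid in set(geoids))
        for site, geoids in site_tracts.items()
    }
    # Deduplicate tracts so overlapping catchments are not double counted.
    unique_tracts = set().union(*site_tracts.values())
    combined = sum(tract_population[geoid] for geoid in unique_tracts)
    return per_site, combined
```

The request body would be sent as a POST to the endpoint with an ORS API key in the Authorization header. The tract lists passed to `summarize_catchments` would come from a spatial intersection of the returned polygon with the tract boundaries, which is omitted here.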
The prototype cohort slicer module allows users to apply demographic filters (e.g., race, income, age, sex, insurance status) to the combined catchment population. Estimates are derived by multiplying the catchment population by the share of the population that falls in the selected category of each dimension. This assumes the dimensions are statistically independent, which is not strictly true (for example, insurance status and income are strongly correlated). As a result, slicer outputs should be interpreted as directional and exploratory, showing how different filters change cohort size in aggregate, rather than as precise headcounts.
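The multiplication described above can be illustrated with a short Python sketch. The function name and the example shares are hypothetical and do not come from SAGE. The sketch only shows the arithmetic under the independence assumption.

```python
def slice_cohort(total_population, selected_shares):
    """Estimate cohort size by multiplying marginal shares (hypothetical helper).

    total_population: combined catchment population.
    selected_shares: maps each filtered dimension to the fraction of the
        population in the selected category, e.g. {"sex": 0.5, "age": 0.2}.
    Treats the dimensions as independent, so the result is a rough estimate.
    """
    estimate = float(total_population)
    for dimension, share in selected_shares.items():
        if not 0.0 <= share <= 1.0:
            raise ValueError(f"share for {dimension!r} must be between 0 and 1")
        estimate *= share
    return estimate


# Illustrative numbers only: 100,000 people, 50% female, 20% aged 65 or older.
estimate = slice_cohort(100_000, {"sex": 0.5, "age": 0.2})
```

With these illustrative shares the estimate is about 10,000 people. If the two filters were correlated, as insurance status and income are, the true count could differ substantially from this product.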
Data Sources
SAGE integrates multiple publicly available datasets to support transparency and reproducibility. Demographic and socioeconomic variables are drawn from the U.S. Census Bureau’s American Community Survey (ACS) 5-Year Estimates (latest release) and the 2020 Decennial Census (P.L. 94-171 Redistricting Data). Geographic boundaries (states, counties, and census tracts) are sourced from the 2023 TIGER/Line shapefiles. Travel-time isochrones are generated via the open-source OpenRouteService (ORS) API. Together, these sources provide the foundation for analyzing equity, access, and representation in clinical trial site planning.
Development State
SAGE is currently in a prototype phase, and users may occasionally encounter bugs or errors. Planned development includes more robust reporting and visualization features designed to directly support Diversity Action Plan requirements. Potential future enhancements include integrating disease prevalence data to estimate patient counts within catchment areas and adding optimization tools to assist with trial site planning (e.g., identifying the minimal set of sites needed to achieve demographic recruitment goals).