Bus AID Hotspot Analysis
Documentation Status: FINAL
Purpose
The purpose of this repository is to document the geospatial analyses being conducted as part of the Bus Accelerated Infrastructure Delivery (BusAID) program.
Description
BusAID geospatial analyses are being carried out collaboratively between the DataViz and Regional Network Management (RNM) teams. The RNM team, leading the BusAID program, is working to inventory transit operator-identified “hotspot” locations throughout the Bay Area with transit speed and reliability issues, score these hotspot locations, and fund quick-build projects at the highest scoring locations. To score projects, the team is using criteria related to transit ridership, potential delay reduction, equity considerations, and presence of Priority Development Areas (PDAs).
The goal of this project is to develop an automated, repeatable geospatial analysis process related to the equity and PDA criteria mentioned above.
Parameters
- For each hotspot, provide the number of Equity Priority Area (EPC) census tracts served by all transit routes passing through the hotspot as a share of total census tracts served by all transit routes passing through the hotspot. Break out EPC census tracts served into “high”, “higher”, and “highest” EPC classifications.
- For each hotspot, provide the number of transit routes that serve PDA(s) as a share of total transit routes passing through the hotspot.
- For each hotspot, summarize the following Equity Priority Community (EPC) population demographic factors for all census tracts served by transit routes passing through hotspots:
- People of Color
- Low Income (< 200% Federal Poverty Level -FPL)
- Limited English Proficiency
- Zero-Vehicle Household
- Seniors 75 Years and Over
- People with Disability
- Single Parent Families
- Rent-Burdened
Additionally, summarize the following:
- Total population
- Total population over 5 years old
- Total households
- Total families
Remove the following hotspots from the analysis:
- Remove hotspots #108, 109, 110 (WestCAT-identified hotspots) and any routes that only pass through these hotspots. These are being addressed via a DPD Forwards project.
- Remove hotspots #82, 84, 85 (SFMTA-identified hotspots) and any routes that only pass through these hotspots. SFMTA has decided to withdraw these from consideration for the BusAID program.
Project Resources
Purpose | Resource | Description | Location | Link (if any) | Access | Notes |
---|---|---|---|---|---|---|
Input | Source Data | Hotspot geospatial data (KMZ points/lines) | Box | Spatial Data | Internal Only | |
Input | Source Data | Hotspot tabular data (Excel spreadsheet) | Box | BusAID Hotspot Master List_112823.xlsx | Internal Only | |
Input | Source Data | Individual operator hotspot data (Excel spreadsheets) | Box | Individual Operator Spreadsheets | Internal Only | |
Input | Source Data | Equity Priority Communities - Plan Bay Area 2050 (EPCs) | ArcGIS Online | EPCs | Public | |
Input | Source Data | Priority Development Areas - Plan Bay Area 2050 (PDAs) | ArcGIS Online | PDAs | Public | |
Input | Source Data | 511 GTFS Data - November 2023 | Online | 511 GTFS Data | Public | Data pulled from the historic feed for November 2023 |
Data Catalog | Published Datasets | ArcGIS Online working roup for managing working and draft content | ArcGIS Online | Bus AID (Private - Working) | Project Collaborators | |
Output | Data Products | BusAID Hotspots Summary Data | Box | BusAID Hotspot Master List_112823.xlsx | Internal Only | Data summaries were added to the tabular excel workbook as the following tabs: hotspot_epc_summary; hotspot_pda_summary; hotspot_demographic_summary |
Methodology
Build Busaid Datasets (Jupyter Notebook)
- Read input datasets
- Pre-process hotspot datasets
- Drop hotspot records with missing ids from hotspot master list
- Remove hotspots no longer under consideration
- Create point and line datasets from KML data
- Extract hotspot ids from hotspot name column to prepare for join of spatial and tabular data
- Merge master list tabular data to spatial data (points and lines)
- Remove whitespace from transit routes column in master list
- Split comma separated values into a list of transit routes for each hotspot in master list
- Add transit agency ids that match GTFS data in master list
- Explode transit routes into individual rows (one row per transit route) in master list
- Manually correct transit route ids that do not match GTFS data in master list
- Pre-process GTFS transit datasets
- Check that all agencies represented in the hotspot master list are included in the GTFS data
- Join GTFS agency, route, trip, and shape data into a single dataset
- Filter to only include bus and Tram, Streetcar, Light Rail transit modes
- Merge hotspot tabular data with spatial data
- Publish datasets to ArcGIS Online for review
- Perform spatial overlays
- Overlay hotspots with EPCs
- Overlay hotspots with PDAs
- Summarize data
- Summarize tracts by hotspot, epc classification
- Summarize routes by hotspot, pda
- Summarize demographic data by hotspot