Note—this dashboard is still in development. Please excuse any building work as we develop in the open.
Data and Methods
Overview
This page documents the data sources, processing methods, and analytical approaches used in the Safer Parks project. Our analysis combines multiple geospatial datasets to provide comprehensive insights into perception of park safety in the Bradford Local Authority District area.
Project Background
The Safer Parks project develops geospatial analysis tools and methodologies to assess park safety, focusing on factors that contribute to the perceived and actual safety of public green spaces. The project provides both analytical tools and data visualizations to support evidence-based decision making for park management and safety improvements.
Description: Official administrative boundaries for Bradford District
Processing: Clipped and simplified for web visualization
Usage: Geographic filtering and contextual mapping
Ward Boundaries - Source: ONS Geography Portal - License: Open Government Licence v3.0 - Description: Electoral ward boundaries within Bradford District - Processing: Simplified geometry for performance optimization - Usage: Local area analysis and demographic contextualization
Description: Recorded crime incidents with spatial and temporal attributes
Processing Methods:
[Processing details to be completed when crime data methodology is finalized]
Spatial aggregation using H3 hexagonal grid system
Temporal analysis and trend identification
Categories: All crime types
Usage: Crime density mapping and risk assessment
Note: Detailed crime data processing methodology will be documented separately as this component is processed independently.
Data Processing Methods
Geospatial Processing Pipeline
The project uses a comprehensive Python-based processing pipeline built on the safer_parks library. Key processing steps include:
1. Data Acquisition and Standardization
Automated download and caching of external datasets
Projection standardization to British National Grid (EPSG:27700)
Data format conversion to GeoJSON for web compatibility
2. Geographic Subsetting
# Example using safer_parks library functionsfrom safer_parks import subset_to_LAD# Subset data to Bradford Districtbradford_parks = subset_to_LAD( LAD_gdf=local_authority_boundaries, LAD_column_name='LAD_name', LAD_name='Bradford', data_to_subset=parks_data)
3. Geometry Processing and Validation
Boundary Merging: Adjacent or overlapping park polygons are merged using merge_touching_or_intersecting_polygons() function
Topology Validation: Geometry validation and repair for invalid polygons
Simplification: Coordinate reduction for web performance while maintaining accuracy
4. Attribute Data Enhancement
Data Cleaning: Standardization of text fields and removal of duplicates using clean_and_deduplicate() function
Facility Categorization: Standardized classification of park amenities and facilities
Accessibility Assessment: Analysis of accessibility features and ratings
5. Spatial Analysis Operations
Buffer Analysis: Creation of analysis zones around parks
Intersection Analysis: Calculation of park-facility relationships
Network Analysis: Assessment of connectivity and access routes
Point-in-Polygon: Assignment of crime incidents and facilities to park catchments
Quality Assurance
Data Validation Procedures
Geometric Validation: Automated topology checking and repair
Attribute Validation: Range checking and categorical validation
Cross-Dataset Consistency: Comparison between OS and OSM sources
Manual Verification: Spot-checking of key locations and attributes
Error Handling and Documentation
Comprehensive logging of processing steps and data quality issues
Version control for all data processing scripts
Documentation of known limitations and caveats
Analysis Methods
Park Safety Assessment Framework
Multi-Criteria Analysis
The dashboard employs a multi-criteria approach considering: - Physical Safety: Crime statistics and incident patterns - Perceived Safety: Environmental design factors and visibility - Accessibility: Physical access and inclusive design features - Activity Levels: Presence of facilities and natural surveillance
Spatial Analysis Techniques
Hot Spot Analysis: Identification of crime concentration areas using density estimation
Accessibility Modeling: Network analysis for pedestrian access routes
Visibility Analysis: Assessment of sight lines and natural surveillance opportunities
Buffer Analysis: Creation of influence zones for facilities and safety features
Statistical Methods
Aggregation Techniques
H3 Hexagonal Binning: Spatial aggregation of crime data using Uber’s H3 system for consistent spatial units
Temporal Aggregation: Monthly and seasonal crime pattern analysis
Categorical Aggregation: Grouping of facilities and amenities by safety relevance
Density Estimation
Kernel Density Estimation: Smooth interpolation of point-based crime data
Adaptive Bandwidth: Variable smoothing parameters based on local data density
Edge Correction: Adjustment for boundary effects in density calculations
Technical Implementation
Processing Environment
Language: Python 3.8+
Key Libraries:
GeoPandas for spatial data manipulation
Shapely for geometric operations
Folium/Leaflet for web mapping
H3-py for hexagonal spatial indexing
Web Visualization
Mapping: Leaflet.js with custom styling and interaction
Data Format: GeoJSON for client-side rendering
Performance Optimization: Geometry simplification and data tiling