AR1/docs/analysis-tools/hotspot.md

79 lines
2.1 KiB
Markdown

# Hot Spot Analysis
Identify statistically significant clusters of high and low values using Getis-Ord Gi* statistics.
## Overview
Hot spot analysis uses the Getis-Ord Gi* statistic to identify statistically significant spatial clusters. Features are classified as:
- **99% Hot Spot**: Very high values, 99% confidence
- **95% Hot Spot**: High values, 95% confidence
- **90% Hot Spot**: High values, 90% confidence
- **Not Significant**: No significant clustering
- **90% Cold Spot**: Low values, 90% confidence
- **95% Cold Spot**: Low values, 95% confidence
- **99% Cold Spot**: Very low values, 99% confidence
## Inputs
- **Dataset**: Point or polygon dataset
- **Value Field**: Numeric field to analyze
- **Neighbor Type**: Distance-based or K-nearest neighbors
- **Distance** (if distance-based): Maximum neighbor distance
- **K Neighbors** (if KNN): Number of nearest neighbors
## Outputs
New dataset containing:
- Original geometry
- **Gi* Z-Score**: Standardized z-score
- **P-Value**: Statistical significance
- **Hot Spot Class**: Categorized class
- Original attributes
## Algorithm
1. Calculate spatial weights matrix based on neighbor configuration
2. Compute Getis-Ord Gi* statistic for each feature
3. Calculate z-scores and p-values
4. Categorize into hot spot classes
5. Store results in output dataset
## Example
```json
{
"dataset_id": 123,
"value_field": "population",
"neighbor_type": "distance",
"distance": 1000
}
```
## Background Jobs
This analysis runs as a background job. See [Hot Spot Analysis Worker](../workers/hotspot_analysis.md) for details.
## Use Cases
- Crime analysis
- Disease clustering
- Economic activity patterns
- Environmental monitoring
- Social phenomena analysis
## Notes
- Requires numeric field with sufficient variation
- Distance should be appropriate for data scale
- KNN method is generally faster for large datasets
- Results depend on neighbor configuration
## Related Documentation
- [Hot Spot Analysis Worker](../workers/hotspot_analysis.md)
- [Analysis API](../api/analysis.md)
- [Live Hot Spot Analysis](hotspot-live.md)