High-Resolution Remote Sensing Image Captioning Based on Structured Attention and SAM Network