Once basics improve: Put a fixed threshold on predicted masks and visualize results epoch-by-epoch Visualize false positives and false negatives Add confidence heatmaps Test multiple input resolutions (256 vs 512)