I am off-work today, but I am taking some time reflecting on somewhat boring (annoying?) dilemmas from work.
1) I don't like something about U-Net shaped neural networks, which are great starting points for #segmentation in #medicalimaging . In general, I think that the encoder should do most of the semantic work (see SAM from Meta AI). These networks usually have 1-2x parameters/computations on the decoder hand. Basically they operate on downsampled images then need upsampled outputs (U shape, resolution goes down, then up, and stages are "parallel").
What do?
(Context: I process 3D brain scans, much numbers and compute!)
2) I come back to similarity, proximity and distance measures, metric and semi-metric. Who's farthest from data point A? Data point B with ~-1 correlation, or data point C with ~0 correlation? If you choose to put C the farthest, how do you maintain the distinction of sign? I want point D (~1 correlation) and point B close to A, but in "different" ways ???
(Context: I cluster health records, sometimes negative correlations do not mean much, sometimes they do)
#deeplearning #unet #semanticsegmentation #machinelearning