I once again need to mention how much I love regular expressions. Just wrote one that takes a VTT file and gives the first and last timestamps of groups of consecutive segments with the same speaker (e.g., Alice spoke from 02:31 to 17:03, then Bob spoke from 18:24 to 26:59).