also offered as (Old) Lecture 16 | Connectionist Temporal Classification
https://www.youtube.com/watch?v=A8IhGQCurPc&list=PLp-0K3kfddPzNdZPX4p0lVi6AcDXBofuf&index=16
image.png
pretend that the output shows up for more than 1 time, and divgergence is everywhere
however, this assumption may not hold in question answering: answer will not be reached before the question is completed
two assumptions here: output is order-synchronous with input, and output number is smaller than input number
just look at the row we are interested in
(unfinished)