Pascal Fua, Bugra Tekin, Weizhe Liu
State-of-the-art methods for self-supervised sequential action alignment rely on deep networks that find correspon- dences across videos in time. They either learn frame-to- frame mapping across sequences, which does not leverage temporal information, or a ...
IEEE2022