Multi-modal Temporal Relation Network for Video Understanding | Synapse