
Paper Notes | HiT: Hierarchical Transformer with Momentum Contrast for Video-Text Retrieval

Notes on the paper "HiT: Hierarchical Transformer with Momentum Contrast for Video-Text Retrieval"

Background and Problem


Overview and Summary


Model and Method

The pipeline of our method.


Hierarchical Transformer


Video Encoders


Text Encoders


Momentum Cross-modal Contrast


Matching


Experiments
