Bilingual video captioning model for enhanced video retrieval
Abstract Many video platforms rely on the descriptions that uploaders provide for video retrieval. However, this reliance may cause inaccuracies. Although deep learning-based video captioning can resolve this problem, it has some limitations: (1) traditional keyframe extraction techniques do not consider video length/content, resulting in low accurac