Name:
Alcazar_APES_Audiovisual_Person_Search_in_Untrimmed_Video_CVPRW_2021_paper.pdf
Size:
2.328Mb
Format:
PDF
Description:
Accepted Manuscript
Type
Conference PaperAuthors
Alcazar, Juan LeonHeilbron, Fabian Caba
Mai, Long
Perazzi, Federico
Lee, Joon-Young
Arbelaez, Pablo
Ghanem, Bernard

KAUST Department
Computer, Electrical and Mathematical Science and Engineering (CEMSE) DivisionElectrical and Computer Engineering Program
VCC Analytics Research Group
Date
2021-06Preprint Posting Date
2021-06-03Permanent link to this record
http://hdl.handle.net/10754/669426
Metadata
Show full item recordAbstract
Humans are arguably one of the most important subjects in video streams, many real-world applications such as video summarization or video editing workflows often require the automatic search and retrieval of a person of interest. Despite tremendous efforts in the person re-identification and retrieval domains, few works have developed audiovisual search strategies. In this paper, we present the Audiovisual Person Search dataset (APES), a new dataset composed of untrimmed videos whose audio (voices) and visual (faces) streams are densely annotated. APES contains over 1.9K identities labeled along 36 hours of video, making it the largest dataset available for untrimmed audiovisual person search. A key property of APES is that it includes dense temporal annotations that link faces to speech segments of the same identity. To showcase the potential of our new dataset, we propose an audiovisual baseline and benchmark for person retrieval. Our study shows that modeling audiovisual cues benefits the recognition of people’s identities.Citation
Alcazar, J. L., Heilbron, F. C., Mai, L., Perazzi, F., Lee, J.-Y., Arbelaez, P., & Ghanem, B. (2021). APES: Audiovisual Person Search in Untrimmed Video. 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). doi:10.1109/cvprw53098.2021.00188Publisher
IEEEConference/Event name
2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)arXiv
2106.01667Additional Links
https://ieeexplore.ieee.org/document/9523077/https://openaccess.thecvf.com/content/CVPR2021W/MULA/html/Alcazar_APES_Audiovisual_Person_Search_in_Untrimmed_Video_CVPRW_2021_paper.html
ae974a485f413a2113503eed53cd6c53
10.1109/cvprw53098.2021.00188