Home » PhD Courses » PhD Course in Computer Science, Mathematics and Physics » PhD Students » Alex Falcon
Supervisor: Oswald Lanz / Giuseppe Serra
+39 0432 558446
Stanza / Room: Lab AI2
falcon.alex@spes.uniud.it
Image Question Answering and Video Question Answering are two tasks involving the realization of models able to analyze the visual content of an image or a video, and produce a meaningful answer to visual content-related questions. These tasks both involve spatial, frame-level reasoning. Moreover, Video Question Answering also requires temporal, video-level reasoning which further raises the difficulty of the task. Solving these tasks would represent the ability to train models able to jointly analyze and reason on visual contents and textual contents at a human-level: the obtained models would be able to learn to isolate and pinpoint objects of interest in video (or image), and to identify and reason about their interactions in both the spatial and temporal domains. Image and Video Question Answering thus represent a challenging, but fundamental task in both Computer Vision and Natural Language Processing communities.
During my Ph.D. I will work on the Video Question Answering task focusing on videos recorded from an Egocentric perspective. In addition to the temporal and spatial reasoning aspects, such task also requires the analysis of several egocentric cues. Finally, Egocentric Video Question Answering will be useful in several fields, such as a visual support to help a worker develop new skills and improve the existing ones.
Università degli Studi di Udine
Dipartimento di Scienze Matematiche, Informatiche e Fisiche (DMIF)
via delle Scienze 206, 33100 Udine, Italy
Tel: +39 0432 558400
Fax: +39 0432 558499
PEC: dmif@postacert.uniud.it
p.iva 01071600306 | c.f. 80014550307
30 km from Slovenia border
80 km from Austria border
120 km from Croatia border
160 km South West of Klagenfurt (Austria)
160 km West of Lubiana (Slovenia)
120 km North East of Venezia (Italy)