MERL is looking for an intern to work on fundamental research in the area of audiovisual semantic understanding for scene-aware dialog technologies by combining end-to-end dialog and video scene understanding technologies. The intern will collaborate with MERL researchers to derive and implement new models, conduct experiments, and prepare results for high impact publication. The ideal candidate would be a senior Ph.D. student with experience in one or more of video captioning/description, end-to-end conversation modeling and natural language processing including practical machine learning algorithms with related programming skills. The duration of the internship is expected to be 3-6 months.
- Research Areas: Speech & Audio
- Host: Chiori Hori
- Apply Now