Login
Talks & Seminars
Title: Cross-Language Information Retrieval in the MATERIAL Program
Prof. Douglas W. Oard and Mr. Suraj Nair, University of Maryland, USA
Date & Time: December 6, 2018 11:30
Venue: Department of Computer Science and Engineering, Conference Room, 01st Floor, 'C Block, Kanwal Rekhi (KReSIT) Building
Abstract:
In this talk I will describe a research program called MAchine Translation for English Retrieval of Information in Any Language (MATERIAL) that includes a substantial focus on Cross Language Information Retrieval (CLIR). Over four years, this program expects to build new CLIR test collections for ten new languages, in each case with English queries. Novel aspects of these test collections include (1) domain-limited, sense-specific, and morphology-specific queries, and (2) mixed collections including both text and speech. Two novel aspects of the evaluation design are a focus on set-based rather than ranked retrieval, and the use of a linear utility measure for evaluating result set selection. The MATERIAL program also includes an interactive CLIR evaluation in which assessors use system-generated English summaries in an effort to identify the truly relevant documents in the result set. In this talk I will start by walking through these evaluation design issues, I will then describe the three test collections that have been built to date (for Somali, Swahili, and Tagalog) and systems we are building, and I will conclude with a summary of some open research questions. Additional information on the MATERIAL program, and the Open CLIR evaluation that is being conducted as a part of the program, is available at https://www.iarpa.gov/index.php/research-programs/material.
Speaker Profile:
Douglas Oard is a Professor at the University of Maryland, College Park (USA), with joint appointments there in the College of Information Studies (Maryland’s iSchool) and the University of Maryland Institute for Advanced Computer Studies (UMIACS). His research interests center around the use of emerging technologies to support information seeking by end users. Additional information is available at http://terpconnect.umd.edu/~oard/. Suraj Nair is a Ph. D. student in Computer Science at the University of Maryland, College Park, where he is a member of the Computational Linguistics and Information Processing (CLIP) lab. His research interests lie in cross-lingual information retrieval for low resource languages.
List of Talks

Webmail

Username:
Password:
Faculty CSE IT
Forgot Password
    [+] Sitemap     Feedback