Talks & Seminars
Title: Hyracks: A Flexible and Extensible Foundation for Data-Intensive Computing
Prof. Vinayak Borkar, Computer Science Dept., Univ. of California, Irvine
Date & Time: February 4, 2011 15:30
Venue: SIC 201, Second Floor, C Block, Kanwal Rekhi Building
Hyracks is a new partitioned-parallel software platform designed to run data-intensive computations on large shared-nothing clusters of computers. Hyracks allows users to express a computation as a DAG of data operators and connectors. Operators operate on partitions of input data and produce partitions of output data, while connectors repartition operators’ outputs to make the newly produced partitions available at the consuming operators. We describe the Hyracks end user model, for authors of dataflow jobs, and the extension model for users who wish to augment Hyracks’ built-in library with new operator and/or connector types. We also describe our initial Hyracks implementation. Since Hyracks is in roughly the same space as the open source Hadoop platform, we compare Hyracks with Hadoop experimentally for several different kinds of use cases. The initial results demonstrate that Hyracks has significant promise as a next-generation platform for data-intensive applications. Joint work with: Michael Carey, Raman Grover, Nicola Onose, Rares Vernica To appear in ICDE 2011.
Speaker Profile:
List of Talks


Faculty CSE IT
Forgot Password
    [+] Sitemap     Feedback