Didier Parigot

Zenith INRIA Team

INRIA Sophia Antipolis
Batiment Fermat, F109
2004 Route des Lucioles
BP 93
06902 Sophia Antipolis
Cedex France

Didier.Parigot@inria.fr
Tel : (33-4) 4 92 38 50 01
Fax : (33-4) 4 92 38 76 44



Alvaro:EECS-2009-113

Summary

BOOM: Data-Centric Programming in the Datacenter. Alvaro, Peter, Condie, Tyson, Conway, Neil, Elmeleegy, Khaled, Hellerstein, Joseph M. and Sears, Russell C. Technical report UCB/EECS-2009-113, EECS Department, University of California, Berkeley, Aug 2009. (URL)

Abstract

Cloud computing makes clusters a commodity, creating the potential for a wide range of programmers to develop new scalable services. However, current cloud platforms do little to simplify truly distributed systems development. In this paper, we explore the use of a declarative, data-centric programming model to achieve this simplicity. We describe our experience using Overlog and Java to implement a "Big Data" analytics stack that is API-compatible with Hadoop and HDFS, with equivalent performance. We extended the system with complex features not yet available in Hadoop, including availability, scalability, and unique monitoring and debugging facilities. We present our experience to validate the enhanced programmer productivity afforded by declarative programming, and to inform the design of new development environments for distributed programming.

Bibtex entry

@TECHREPORT { Alvaro:EECS-2009-113,
    AUTHOR = { Alvaro, Peter and Condie, Tyson and Conway, Neil and Elmeleegy, Khaled and Hellerstein, Joseph M. and Sears, Russell C },
    TITLE = { BOOM: Data-Centric Programming in the Datacenter },
    INSTITUTION = { EECS Department, University of California, Berkeley },
    YEAR = { 2009 },
    MONTH = { Aug },
    URL = { http://www.eecs.berkeley.edu/Pubs/TechRpts/2009/EECS-2009-113.html },
    NUMBER = { UCB/EECS-2009-113 },
    ABSTRACT = { Cloud computing makes clusters a commodity, creating the potential for a wide range of programmers to develop new scalable services. However, current cloud platforms do little to simplify truly distributed systems development. In this paper, we explore the use of a declarative, data-centric programming model to achieve this simplicity. We describe our experience using Overlog and Java to implement a "Big Data" analytics stack that is API-compatible with Hadoop and HDFS, with equivalent performance. We extended the system with complex features not yet available in Hadoop, including availability, scalability, and unique monitoring and debugging facilities. We present our experience to validate the enhanced programmer productivity afforded by declarative programming, and to inform the design of new development environments for distributed programming. },
}


INRIA main page LIRMM main page