Research Internships 2006-2007

A Fault-tolerance Mechanism for Future Updates 

General context

The OASIS INRIA Sophia Antipolis project has been designing and implementing  ProActive, a Java library for the GRID. In the framework of an active object model, the library features Asynchronous Typed Messages, Future Synchronizations (Wait-by-necessity), Group Communications, Mobility, Security, and a generic component model. At the infrastructure level, XML Deployment Descriptors provide the capacity to deploy on many kind of GRIDs.

A fault tolerance protocol has been designed and implemented in ProActive by Christian Delbe during his thesis. More precisely, several protocols are proposed, the objective of this internship is to focus on the Communication Induced Checkpointing protocol that has been designed specifically for the ASP calculus, and its Java implementation: ProActive.

Objectives

Futures are placeholders for an awaited reply of an asynchronous method call.

The main goal of this internship is to design a mechanism that allows  the existing fault-tolerance protocol to deal with first class futures. Indeed, first class futures come with very intreresting properties that ease the programming of distributed application: whatever the order of future updates, whatever the set of processes to which this future is to be updated, the result of the computation is the same! Unfortunately, if those properties ease programming, they make the design of fault-tolerance protocols a little more difficult.

The fault-tolerance protocol that has been implemented in ProActive does not support first class futures: the value of a future can only be re-sent to the sender of the associated request upon recovery. In other words, if the future has been automatically forwarded to a third process, this process will not receive the future value upon recovery.

This internship can be addressed both from a purely theoretical approach (focusing on the link between ASP calculus and events model, see bibliography below); and from a much more applied point of view that could consist, for example in designing and implementing an independent protocol for addressing future updates.

Some of the following (independent) steps could be followed:
Of course, the objective of the internship is not necessarily to achieve all these goals.

Possible area of work/aspects:
 


Advisors :  Ludovic Henrio  
Phone : 04 92 38 71 64 Email :    Ludovic.Henrio@inria.fr
Team :  OASIS  (INRIA Sophia Antipolis -- I3S -- CNRS)



Prerequisite : Some knowledge in Object-oriented languages, distributed programming.

Internship location:
       Sophia Antipolis, between Nice and Cannes, France



Bibliography:

   A Hybrid Message Logging-CIC Protocol for Constrained Checkpointability
   Francoise Baude, Denis Caromel, Christian Delbe, Ludovic Henrio
   proceedings of Euro-Par 2005,  Springer-Verlag     [PDF]

   Un protocole de tolérance aux pannes pour objets actifs non préemptifs
   Francoise Baude, Denis Caromel, Christian Delbé and Ludovic Henrio
   Technique et Science Informatiques - 2006  

   Promised Consistency for Rollback Recovery
  Denis Caromel, Christian Delbé, Ludovic Henrio
  Research Report, INRIA Sophia Antipolis,  RR-5902 

ProActive Home page

Asynchronous and Deterministic Objects
Denis Caromel, Ludovic Henrio, Bernard Serpette,
POPL'04, Proceedings of the 31st ACM Symposium on Principles of Programming Languages, 2004, Venice, Italy. bibtex

A Theory of Distributed Objects  Asynchrony - Mobility - Groups - Components  
Denis Caromel - Ludovic Henrio  Springer, 2005, XXXII, 346 p., Hardcover, ISBN: 3-540-20866-6