[dcmf] 32-bit ROMIO: What is the best solution?

Rajeev Thakur thakur at mcs.anl.gov
Thu Feb 28 13:26:35 CST 2008


Perhaps we could use mpich2-bg for private communication between the IBM
folks and Argonne MPICH2 developers, and mpich2-dev for stuff meant for
others. mpich2-dev has many subscribers.

Rajeev
  

> -----Original Message-----
> From: dcmf-bounces at lists.anl-external.org 
> [mailto:dcmf-bounces at lists.anl-external.org] On Behalf Of Rob Ross
> Sent: Thursday, February 28, 2008 12:56 PM
> To: Pavan Balaji
> Cc: dcmf at lists.anl-external.org; dcmf-bounces at lists.anl-external.org
> Subject: Re: [dcmf] 32-bit ROMIO: What is the best solution?
> 
> That's fine with me too. Michael, please provide us a list of  
> interested IBM parties, and we'll get this set up.
> 
> Pavan, can you put RobL, Kevin, Sam, and I on the mpich2-bg list  
> (along with Rajeev and others on the MPICH team as appropriate)?
> 
> Thanks,
> 
> Rob
> 
> On Feb 28, 2008, at 12:42 PM, Pavan Balaji wrote:
> 
> >
> > There's an mpich2-bg list that I just created if you want to move  
> > there.
> >
> > -- Pavan
> >
> > On 02/28/2008 12:39 PM, Rob Ross wrote:
> >> Rajeev had suggested this too. Let's go with mpich2-dev. Can you  
> >> give us a list of people that should be added to the list?
> >> Thanks,
> >> Rob
> >> On Feb 28, 2008, at 12:29 PM, Michael Blocksome wrote:
> >>>
> >>> I agree .. we need to get over the discussion about the 
> discussion  
> >>> as fast as possible. :)
> >>>
> >>> The dcmf list has worked well, but I think we started using it  
> >>> because there were no alternatives to the issue at hand.  Given  
> >>> that the mpich2 and global arrays communities have different  
> >>> licenses, our long term direction for the dcmf community 
> should be  
> >>> to support the dcmf library and move discussion of other  
> >>> "external" libraries (like mpich2 and global arrays) into thier  
> >>> communities.
> >>>
> >>> I just noticed that there is a mpich2-dev at mcs.anl.gov 
> mailing list  
> >>> already set up.  Could we use this list to collaborate on the  
> >>> mpich2 implementation?
> >>>
> >>> Michael Blocksome
> >>> Blue Gene Messaging Team Lead
> >>> Advanced Systems SW Development
> >>> blocksom at us.ibm.com
> >>>
> >>>
> >>> Rob Ross <rross at mcs.anl.gov> wrote on 02/28/2008 12:17:06 PM:
> >>>
> >>> > Hi,
> >>> >
> >>> > We've been using the DCMF list for all sorts of things 
> that aren't
> >>> > directly related to DCMF without any issues (as far as 
> I know);  
> >>> it has
> >>> > become the successful technical communication path between IBM  
> >>> and ANL
> >>> > so far.
> >>> >
> >>> > However, if you'd like to create a separate list for 
> discussing  
> >>> this,
> >>> > we can do that. Is that what you would like to do? If so, can  
> >>> you give
> >>> > us a list of people on the IBM side that should be on the new  
> >>> list?
> >>> > I'd like to be done discussing where we're going to talk about  
> >>> this
> >>> > issue ASAP and concentrate on fixing it.
> >>> >
> >>> > Thanks,
> >>> >
> >>> > Rob
> >>> >
> >>> > On Feb 28, 2008, at 11:18 AM, Michael Blocksome wrote:
> >>> >
> >>> > >
> >>> > > Unfortunately, this MPI_Aint discussion really has 
> nothing to do
> >>> > > with DCMF.  The entrire problem and solution is with MPICH2.
> >>> > >
> >>> > > Is there another, less broad, mpich mailing list that 
> we could  
> >>> use?
> >>> > > Maybe a "mpich-development" or something like that?
> >>> > >
> >>> > > Michael Blocksome
> >>> > > Blue Gene Messaging Team Lead
> >>> > > Advanced Systems SW Development
> >>> > > blocksom at us.ibm.com
> >>> > >
> >>> > >
> >>> > > dcmf-bounces at lists.anl-external.org wrote on 02/28/2008  
> >>> 11:01:09 AM:
> >>> > >
> >>> > > > I'd say this mailing list is better for it right now.  
> >>> Posting on the
> >>> > > > broader mpich-discuss might just distract us.
> >>> > > >
> >>> > > > Rajeev
> >>> > > >
> >>> > > > From: dcmf-bounces at lists.anl-external.org [mailto:dcmf-
> >>> > > > bounces at lists.anl-external.org] On Behalf Of Bob Cernohous
> >>> > > > Sent: Wednesday, February 27, 2008 4:19 PM
> >>> > > > To: dcmf at lists.anl-external.org
> >>> > > > Subject: Re: [dcmf] 32-bit ROMIO: What is the best solution?
> >>> > >
> >>> > > >
> >>> > > > bobc at us.ibm.com wrote on 02/08/2008 02:41:40 PM:
> >>> > > >
> >>> > > > >We all realize there are issues with MPI_Aint's in 32 bit
> >>> > > implementations.
> >>> > > > >Using signed 32 bit addresses along with 64 bit offsets can
> >>> > > result in
> >>> > > > >some pretty broken code.  I've reproduced several 
> problems on
> >>> > > > >BGL/BGP/linux.   The problems are most obvious with romio  
> >>> files >
> >>> > > 2G or
> >>> > > > >virtual addresses > 2G.
> >>> > > >
> >>> > > > Earlier this month I posted some comments to this mailing  
> >>> list about
> >>> > > > 32-bit MPICH/ROMIO/MPI_Aint issues (see above and the  
> >>> archive).
> >>> > > >
> >>> > > > We've started to work on the problem.   We're (optionally)  
> >>> making
> >>> > > > MPI_Aint a 64 bit signed value even when integers/pointers  
> >>> are 32
> >>> > > bit
> >>> > > > values.
> >>> > > >
> >>> > > > We'd like to start sending fix patches to the 
> community for  
> >>> comment.
> >>> > > >
> >>> > > > However, since we are (in parallel) contributing our BGP  
> >>> code back
> >>> > > > to MPICH and trying to converge on a common code base of  
> >>> 1.0.7,
> >>> > > > and because this problem applies to any 32-bit platform,
> >>> > > > we thought we should move the discussion to 
> mpich-discuss to  
> >>> get
> >>> > > > the larger community's comments.
> >>> > > >
> >>> > > > It's unlikely, at this point, that our patches 
> would apply to
> >>> > > MPICH2,
> >>> > > > but that is our goal after the 1.0.7/BGP patch settles.    
> >>> For now,
> >>> > > we'd
> >>> > > > be sending these as RFC (request for comment) patches.
> >>> > > >
> >>> > > > Please let us know if you agree with discussing 
> this on mpich-
> >>> > > discuss,
> >>> > > > or if there is a better place.
> >>> > > >
> >>> > > > Bob Cernohous:  (T/L 553) 507-253-6093
> >>> > > >
> >>> > > > BobC at us.ibm.com
> >>> > > > IBM Rochester, Building 030-2(C335), Department 61L
> >>> > > > 3605 Hwy 52 North, Rochester,  MN 55901-7829
> >>> > > >
> >>> > > > > Chaos reigns within.
> >>> > > > > Reflect, repent, and reboot.
> >>> > > > > Order shall  
> >>> return._______________________________________________
> >>> > > > dcmf mailing list
> >>> > > > dcmf at lists.anl-external.org
> >>> > > > http://lists.anl-external.org/cgi-bin/mailman/listinfo/dcmf
> >>> > > > http://dcmf.anl-external.
> >>> > org/wiki_______________________________________________
> >>> > > dcmf mailing list
> >>> > > dcmf at lists.anl-external.org
> >>> > > http://lists.anl-external.org/cgi-bin/mailman/listinfo/dcmf
> >>> > > http://dcmf.anl-external.org/wiki
> >>> >
> >> _______________________________________________
> >> dcmf mailing list
> >> dcmf at lists.anl-external.org
> >> http://lists.anl-external.org/cgi-bin/mailman/listinfo/dcmf
> >> http://dcmf.anl-external.org/wiki
> >
> > -- 
> > Pavan Balaji
> > http://www.mcs.anl.gov/~balaji
> >
> 
> _______________________________________________
> dcmf mailing list
> dcmf at lists.anl-external.org
> http://lists.anl-external.org/cgi-bin/mailman/listinfo/dcmf
> http://dcmf.anl-external.org/wiki
> 
> 




More information about the dcmf mailing list