[dcmf] 32-bit ROMIO: What is the best solution?

Pavan Balaji balaji at mcs.anl.gov
Thu Feb 28 13:52:13 CST 2008


Rajeev just gave me the list password. I'm adding the relevant Argonne 
folks and everyone from IBM in the below list to mpich2-bg and mpich2-dev.

  -- Pavan

On 02/28/2008 01:45 PM, Rob Ross wrote:
> Michael, can you give Pavan this list? Thanks. -- Rob
> 
> On Feb 28, 2008, at 1:40 PM, Pavan Balaji wrote:
> 
>>
>> Hmm.. I don't know what the authorization password is. Can someone 
>> send me the list of folks to be added from IBM?
>>
>> -- Pavan
>>
>> On 02/28/2008 01:18 PM, Michael Blocksome wrote:
>>> Log in with your email address and password at the bottom of this 
>>> page to see the "roster"
>>>        http://lists.anl-external.org/cgi-bin/mailman/listinfo/dcmf
>>> Michael Blocksome
>>> Blue Gene Messaging Team Lead
>>> Advanced Systems SW Development
>>> blocksom at us.ibm.com
>>> *Rob Ross <rross at mcs.anl.gov>*
>>> 02/28/2008 01:11 PM
>>>     
>>> To
>>>     Michael Blocksome/Rochester/IBM at IBMUS
>>> cc
>>>     dcmf at lists.anl-external.org, dcmf-bounces at lists.anl-external.org, 
>>> "Rajeev Thakur" <thakur at mcs.anl.gov>
>>> Subject
>>>     Re: [dcmf] 32-bit ROMIO: What is the best solution?
>>>     
>>> Hi,
>>> Is someone on this list that can get that list of email addresses?
>>> Rob
>>> On Feb 28, 2008, at 12:59 PM, Michael Blocksome wrote:
>>> >
>>> > ok .. mpich2-dev it is.  Let's just add all of the ...
>>> >
>>> > *@*.ibm.com
>>> >
>>> > ... email addresses that are subscribed to the dcmf mailing list.   
>>> > If someone is not interested they could always remove themselves   
>>> > from the mpich2-dev list later.
>>> >
>>> > Thanks
>>> >
>>> > Michael Blocksome
>>> > Blue Gene Messaging Team Lead
>>> > Advanced Systems SW Development
>>> > blocksom at us.ibm.com
>>> >
>>> >
>>> >
>>> > Rob Ross <rross at mcs.anl.gov>
>>> > 02/28/2008 12:39 PM
>>> >
>>> > To
>>> > Michael Blocksome/Rochester/IBM at IBMUS
>>> > cc
>>> > dcmf at lists.anl-external.org, dcmf-bounces at lists.anl-external.org,   
>>> > "Rajeev Thakur" <thakur at mcs.anl.gov>
>>> > Subject
>>> > Re: [dcmf] 32-bit ROMIO: What is the best solution?
>>> >
>>> >
>>> >
>>> >
>>> >
>>> > Rajeev had suggested this too. Let's go with mpich2-dev. Can you give
>>> > us a list of people that should be added to the list?
>>> >
>>> > Thanks,
>>> >
>>> > Rob
>>> >
>>> > On Feb 28, 2008, at 12:29 PM, Michael Blocksome wrote:
>>> >
>>> > >
>>> > > I agree .. we need to get over the discussion about the discussion
>>> > > as fast as possible. :)
>>> > >
>>> > > The dcmf list has worked well, but I think we started using it
>>> > > because there were no alternatives to the issue at hand.  Given that
>>> > > the mpich2 and global arrays communities have different licenses,
>>> > > our long term direction for the dcmf community should be to support
>>> > > the dcmf library and move discussion of other "external" libraries
>>> > > (like mpich2 and global arrays) into thier communities.
>>> > >
>>> > > I just noticed that there is a mpich2-dev at mcs.anl.gov mailing list
>>> > > already set up.  Could we use this list to collaborate on the mpich2
>>> > > implementation?
>>> > >
>>> > > Michael Blocksome
>>> > > Blue Gene Messaging Team Lead
>>> > > Advanced Systems SW Development
>>> > > blocksom at us.ibm.com
>>> > >
>>> > >
>>> > > Rob Ross <rross at mcs.anl.gov> wrote on 02/28/2008 12:17:06 PM:
>>> > >
>>> > > > Hi,
>>> > > >
>>> > > > We've been using the DCMF list for all sorts of things that aren't
>>> > > > directly related to DCMF without any issues (as far as I know); it
>>> > > has
>>> > > > become the successful technical communication path between IBM and
>>> > > ANL
>>> > > > so far.
>>> > > >
>>> > > > However, if you'd like to create a separate list for discussing
>>> > > this,
>>> > > > we can do that. Is that what you would like to do? If so, can you
>>> > > give
>>> > > > us a list of people on the IBM side that should be on the new   
>>> > list?
>>> > > > I'd like to be done discussing where we're going to talk 
>>> about   > this
>>> > > > issue ASAP and concentrate on fixing it.
>>> > > >
>>> > > > Thanks,
>>> > > >
>>> > > > Rob
>>> > > >
>>> > > > On Feb 28, 2008, at 11:18 AM, Michael Blocksome wrote:
>>> > > >
>>> > > > >
>>> > > > > Unfortunately, this MPI_Aint discussion really has nothing to do
>>> > > > > with DCMF.  The entrire problem and solution is with MPICH2.
>>> > > > >
>>> > > > > Is there another, less broad, mpich mailing list that we could
>>> > > use?
>>> > > > > Maybe a "mpich-development" or something like that?
>>> > > > >
>>> > > > > Michael Blocksome
>>> > > > > Blue Gene Messaging Team Lead
>>> > > > > Advanced Systems SW Development
>>> > > > > blocksom at us.ibm.com
>>> > > > >
>>> > > > >
>>> > > > > dcmf-bounces at lists.anl-external.org wrote on 02/28/2008 11:01:09
>>> > > AM:
>>> > > > >
>>> > > > > > I'd say this mailing list is better for it right now. Posting
>>> > > on the
>>> > > > > > broader mpich-discuss might just distract us.
>>> > > > > >
>>> > > > > > Rajeev
>>> > > > > >
>>> > > > > > From: dcmf-bounces at lists.anl-external.org [mailto:dcmf-
>>> > > > > > bounces at lists.anl-external.org] On Behalf Of Bob Cernohous
>>> > > > > > Sent: Wednesday, February 27, 2008 4:19 PM
>>> > > > > > To: dcmf at lists.anl-external.org
>>> > > > > > Subject: Re: [dcmf] 32-bit ROMIO: What is the best solution?
>>> > > > >
>>> > > > > >
>>> > > > > > bobc at us.ibm.com wrote on 02/08/2008 02:41:40 PM:
>>> > > > > >
>>> > > > > > >We all realize there are issues with MPI_Aint's in 32 bit
>>> > > > > implementations.
>>> > > > > > >Using signed 32 bit addresses along with 64 bit offsets can
>>> > > > > result in
>>> > > > > > >some pretty broken code.  I've reproduced several problems on
>>> > > > > > >BGL/BGP/linux.   The problems are most obvious with romio
>>> > > files >
>>> > > > > 2G or
>>> > > > > > >virtual addresses > 2G.
>>> > > > > >
>>> > > > > > Earlier this month I posted some comments to this mailing list
>>> > > about
>>> > > > > > 32-bit MPICH/ROMIO/MPI_Aint issues (see above and the   > 
>>> archive).
>>> > > > > >
>>> > > > > > We've started to work on the problem.   We're (optionally)
>>> > > making
>>> > > > > > MPI_Aint a 64 bit signed value even when integers/pointers are
>>> > > 32
>>> > > > > bit
>>> > > > > > values.
>>> > > > > >
>>> > > > > > We'd like to start sending fix patches to the community for
>>> > > comment.
>>> > > > > >
>>> > > > > > However, since we are (in parallel) contributing our BGP code
>>> > > back
>>> > > > > > to MPICH and trying to converge on a common code base of   
>>> > 1.0.7,
>>> > > > > > and because this problem applies to any 32-bit platform,
>>> > > > > > we thought we should move the discussion to mpich-discuss 
>>> to   > get
>>> > > > > > the larger community's comments.
>>> > > > > >
>>> > > > > > It's unlikely, at this point, that our patches would apply to
>>> > > > > MPICH2,
>>> > > > > > but that is our goal after the 1.0.7/BGP patch settles.   For
>>> > > now,
>>> > > > > we'd
>>> > > > > > be sending these as RFC (request for comment) patches.
>>> > > > > >
>>> > > > > > Please let us know if you agree with discussing this on mpich-
>>> > > > > discuss,
>>> > > > > > or if there is a better place.
>>> > > > > >
>>> > > > > > Bob Cernohous:  (T/L 553) 507-253-6093
>>> > > > > >
>>> > > > > > BobC at us.ibm.com
>>> > > > > > IBM Rochester, Building 030-2(C335), Department 61L
>>> > > > > > 3605 Hwy 52 North, Rochester,  MN 55901-7829
>>> > > > > >
>>> > > > > > > Chaos reigns within.
>>> > > > > > > Reflect, repent, and reboot.
>>> > > > > > > Order shall
>>> > > return._______________________________________________
>>> > > > > > dcmf mailing list
>>> > > > > > dcmf at lists.anl-external.org
>>> > > > > > http://lists.anl-external.org/cgi-bin/mailman/listinfo/dcmf
>>> > > > > > http://dcmf.anl-external.
>>> > > > org/wiki_______________________________________________
>>> > > > > dcmf mailing list
>>> > > > > dcmf at lists.anl-external.org
>>> > > > > http://lists.anl-external.org/cgi-bin/mailman/listinfo/dcmf
>>> > > > > http://dcmf.anl-external.org/wiki
>>> > > >
>>> >
>>> >
>>> ------------------------------------------------------------------------
>>> _______________________________________________
>>> dcmf mailing list
>>> dcmf at lists.anl-external.org
>>> http://lists.anl-external.org/cgi-bin/mailman/listinfo/dcmf
>>> http://dcmf.anl-external.org/wiki
>>
>> -- 
>> Pavan Balaji
>> http://www.mcs.anl.gov/~balaji
>>
> 

-- 
Pavan Balaji
http://www.mcs.anl.gov/~balaji



More information about the dcmf mailing list