[dcmf] 32-bit ROMIO: What is the best solution?

Pavan Balaji balaji at mcs.anl.gov
Thu Feb 28 14:01:06 CST 2008


Added the following folks to both lists (mpich2-bg and mpich2-dev):

Argonne -- balaji at mcs.anl.gov, buntinas at mcs.anl.gov, thakur at mcs.anl.gov, 
goodell at mcs.anl.gov, jayesh at mcs.anl.gov, wgropp at uiuc.edu, 
lusk at mcs.anl.gov, rross at mcs.anl.gov, robl at mcs.anl.gov, 
slang at mcs.anl.gov, harms at alcf.anl.gov

IBM -- archerc at us.ibm.com, blocksom at us.ibm.com, bobc at us.ibm.com, 
dougmill at us.ibm.com, faraja at us.ibm.com, gdozsa at us.ibm.com, 
jjparker at us.ibm.com, jratt at us.ibm.com, sameerk at us.ibm.com, 
smithbr at us.ibm.com, tgooding at us.ibm.com, tmusta at us.ibm.com

Let me know if someone else needs to be added on.

  -- Pavan

On 02/28/2008 01:52 PM, Pavan Balaji wrote:
> 
> Rajeev just gave me the list password. I'm adding the relevant Argonne 
> folks and everyone from IBM in the below list to mpich2-bg and mpich2-dev.
> 
>  -- Pavan
> 
> On 02/28/2008 01:45 PM, Rob Ross wrote:
>> Michael, can you give Pavan this list? Thanks. -- Rob
>>
>> On Feb 28, 2008, at 1:40 PM, Pavan Balaji wrote:
>>
>>>
>>> Hmm.. I don't know what the authorization password is. Can someone 
>>> send me the list of folks to be added from IBM?
>>>
>>> -- Pavan
>>>
>>> On 02/28/2008 01:18 PM, Michael Blocksome wrote:
>>>> Log in with your email address and password at the bottom of this 
>>>> page to see the "roster"
>>>>        http://lists.anl-external.org/cgi-bin/mailman/listinfo/dcmf
>>>> Michael Blocksome
>>>> Blue Gene Messaging Team Lead
>>>> Advanced Systems SW Development
>>>> blocksom at us.ibm.com
>>>> *Rob Ross <rross at mcs.anl.gov>*
>>>> 02/28/2008 01:11 PM
>>>> To: Michael Blocksome/Rochester/IBM at IBMUS
>>>> cc: dcmf at lists.anl-external.org,
>>>>     dcmf-bounces at lists.anl-external.org,
>>>>     "Rajeev Thakur" <thakur at mcs.anl.gov>
>>>> Subject: Re: [dcmf] 32-bit ROMIO: What is the best solution?
>>>>
>>>> Hi,
>>>> Is there someone on this list who can get that list of email addresses?
>>>> Rob
>>>> On Feb 28, 2008, at 12:59 PM, Michael Blocksome wrote:
>>>> >
>>>> > ok .. mpich2-dev it is.  Let's just add all of the ...
>>>> >
>>>> > *@*.ibm.com
>>>> >
>>>> > ... email addresses that are subscribed to the dcmf mailing list.
>>>> > If someone is not interested they could always remove themselves
>>>> > from the mpich2-dev list later.
>>>> >
>>>> > Thanks
>>>> >
>>>> > Michael Blocksome
>>>> > Blue Gene Messaging Team Lead
>>>> > Advanced Systems SW Development
>>>> > blocksom at us.ibm.com
>>>> >
>>>> >
>>>> >
>>>> > Rob Ross <rross at mcs.anl.gov>
>>>> > 02/28/2008 12:39 PM
>>>> >
>>>> > To
>>>> > Michael Blocksome/Rochester/IBM at IBMUS
>>>> > cc
>>>> > dcmf at lists.anl-external.org,
>>>> > dcmf-bounces at lists.anl-external.org,
>>>> > "Rajeev Thakur" <thakur at mcs.anl.gov>
>>>> > Subject
>>>> > Re: [dcmf] 32-bit ROMIO: What is the best solution?
>>>> >
>>>> >
>>>> >
>>>> >
>>>> >
>>>> > Rajeev had suggested this too. Let's go with mpich2-dev. Can you give
>>>> > us a list of people that should be added to the list?
>>>> >
>>>> > Thanks,
>>>> >
>>>> > Rob
>>>> >
>>>> > On Feb 28, 2008, at 12:29 PM, Michael Blocksome wrote:
>>>> >
>>>> > >
>>>> > > I agree .. we need to get over the discussion about the discussion
>>>> > > as fast as possible. :)
>>>> > >
>>>> > > The dcmf list has worked well, but I think we started using it
>>>> > > because there were no alternatives for the issue at hand.  Given that
>>>> > > the mpich2 and global arrays communities have different licenses,
>>>> > > our long term direction for the dcmf community should be to support
>>>> > > the dcmf library and move discussion of other "external" libraries
>>>> > > (like mpich2 and global arrays) into their communities.
>>>> > >
>>>> > > I just noticed that there is a mpich2-dev at mcs.anl.gov mailing list
>>>> > > already set up.  Could we use this list to collaborate on the mpich2
>>>> > > implementation?
>>>> > >
>>>> > > Michael Blocksome
>>>> > > Blue Gene Messaging Team Lead
>>>> > > Advanced Systems SW Development
>>>> > > blocksom at us.ibm.com
>>>> > >
>>>> > >
>>>> > > Rob Ross <rross at mcs.anl.gov> wrote on 02/28/2008 12:17:06 PM:
>>>> > >
>>>> > > > Hi,
>>>> > > >
>>>> > > > We've been using the DCMF list for all sorts of things that aren't
>>>> > > > directly related to DCMF without any issues (as far as I know); it has
>>>> > > > become the successful technical communication path between IBM and ANL
>>>> > > > so far.
>>>> > > >
>>>> > > > However, if you'd like to create a separate list for discussing this,
>>>> > > > we can do that. Is that what you would like to do? If so, can you give
>>>> > > > us a list of people on the IBM side that should be on the new list?
>>>> > > > I'd like to be done discussing where we're going to talk about this
>>>> > > > issue ASAP and concentrate on fixing it.
>>>> > > >
>>>> > > > Thanks,
>>>> > > >
>>>> > > > Rob
>>>> > > >
>>>> > > > On Feb 28, 2008, at 11:18 AM, Michael Blocksome wrote:
>>>> > > >
>>>> > > > >
>>>> > > > > Unfortunately, this MPI_Aint discussion really has nothing to do
>>>> > > > > with DCMF.  The entire problem and solution is with MPICH2.
>>>> > > > >
>>>> > > > > Is there another, less broad, mpich mailing list that we could use?
>>>> > > > > Maybe a "mpich-development" or something like that?
>>>> > > > >
>>>> > > > > Michael Blocksome
>>>> > > > > Blue Gene Messaging Team Lead
>>>> > > > > Advanced Systems SW Development
>>>> > > > > blocksom at us.ibm.com
>>>> > > > >
>>>> > > > >
>>>> > > > > dcmf-bounces at lists.anl-external.org wrote on 02/28/2008 11:01:09 AM:
>>>> > > > >
>>>> > > > > > I'd say this mailing list is better for it right now. Posting on the
>>>> > > > > > broader mpich-discuss might just distract us.
>>>> > > > > >
>>>> > > > > > Rajeev
>>>> > > > > >
>>>> > > > > > From: dcmf-bounces at lists.anl-external.org
>>>> > > > > > [mailto:dcmf-bounces at lists.anl-external.org] On Behalf Of Bob Cernohous
>>>> > > > > > Sent: Wednesday, February 27, 2008 4:19 PM
>>>> > > > > > To: dcmf at lists.anl-external.org
>>>> > > > > > Subject: Re: [dcmf] 32-bit ROMIO: What is the best solution?
>>>> > > > >
>>>> > > > > >
>>>> > > > > > bobc at us.ibm.com wrote on 02/08/2008 02:41:40 PM:
>>>> > > > > >
>>>> > > > > > >We all realize there are issues with MPI_Aint's in 32 bit implementations.
>>>> > > > > > >Using signed 32 bit addresses along with 64 bit offsets can result in
>>>> > > > > > >some pretty broken code.  I've reproduced several problems on
>>>> > > > > > >BGL/BGP/linux.   The problems are most obvious with romio files > 2G or
>>>> > > > > > >virtual addresses > 2G.
>>>> > > > > >
>>>> > > > > > Earlier this month I posted some comments to this mailing list about
>>>> > > > > > 32-bit MPICH/ROMIO/MPI_Aint issues (see above and the archive).
>>>> > > > > >
>>>> > > > > > We've started to work on the problem.   We're (optionally) making
>>>> > > > > > MPI_Aint a 64 bit signed value even when integers/pointers are 32 bit
>>>> > > > > > values.
>>>> > > > > >
>>>> > > > > > We'd like to start sending fix patches to the community for comment.
>>>> > > > > >
>>>> > > > > > However, since we are (in parallel) contributing our BGP code back
>>>> > > > > > to MPICH and trying to converge on a common code base of 1.0.7,
>>>> > > > > > and because this problem applies to any 32-bit platform,
>>>> > > > > > we thought we should move the discussion to mpich-discuss to get
>>>> > > > > > the larger community's comments.
>>>> > > > > >
>>>> > > > > > It's unlikely, at this point, that our patches would apply to MPICH2,
>>>> > > > > > but that is our goal after the 1.0.7/BGP patch settles.   For now, we'd
>>>> > > > > > be sending these as RFC (request for comment) patches.
>>>> > > > > >
>>>> > > > > > Please let us know if you agree with discussing this on mpich-discuss,
>>>> > > > > > or if there is a better place.
>>>> > > > > >
>>>> > > > > > Bob Cernohous:  (T/L 553) 507-253-6093
>>>> > > > > >
>>>> > > > > > BobC at us.ibm.com
>>>> > > > > > IBM Rochester, Building 030-2(C335), Department 61L
>>>> > > > > > 3605 Hwy 52 North, Rochester,  MN 55901-7829
>>>> > > > > >
>>>> > > > > > > Chaos reigns within.
>>>> > > > > > > Reflect, repent, and reboot.
>>>> > > > > > > Order shall return.
>>>> > > > > > _______________________________________________
>>>> > > > > > dcmf mailing list
>>>> > > > > > dcmf at lists.anl-external.org
>>>> > > > > > http://lists.anl-external.org/cgi-bin/mailman/listinfo/dcmf
>>>> > > > > > http://dcmf.anl-external.org/wiki
>>>> > > >
>>>> >
>>>> >
>>>
>>> -- 
>>> Pavan Balaji
>>> http://www.mcs.anl.gov/~balaji
>>>
>>
> 

-- 
Pavan Balaji
http://www.mcs.anl.gov/~balaji


