[Llvm-bgq-discuss] Details behind MPI wrapper for bgclang++

Hal Finkel hfinkel at anl.gov
Fri Mar 1 15:57:06 CST 2013


----- Original Message -----
> From: "Hal Finkel" <hfinkel at anl.gov>
> To: "Jack Poulson" <jack.poulson at gmail.com>
> Cc: llvm-bgq-discuss at lists.alcf.anl.gov
> Sent: Friday, March 1, 2013 3:39:31 PM
> Subject: Re: [Llvm-bgq-discuss] Details behind MPI wrapper for bgclang++
> 
> ----- Original Message -----
> > From: "Jack Poulson" <jack.poulson at gmail.com>
> > To: "Hal Finkel" <hfinkel at anl.gov>
> > Cc: "Jeff Hammond" <jhammond at alcf.anl.gov>,
> > llvm-bgq-discuss at lists.alcf.anl.gov
> > Sent: Friday, March 1, 2013 3:07:17 PM
> > Subject: Re: [Llvm-bgq-discuss] Details behind MPI wrapper for
> > bgclang++
> > 
> > On Fri, Mar 1, 2013 at 1:03 PM, Jack Poulson <
> > jack.poulson at gmail.com
> > > wrote:
> > 
> > 
> > 
> > 
> > On Fri, Mar 1, 2013 at 12:43 PM, Hal Finkel < hfinkel at anl.gov >
> > wrote:
> > 
> > 
> > 
> > 
> > The lightweight core files are really text files, I looked at the
> > line:
> > While executing instruction at..........0x000000000100c7c4
> > 
> > Then I ran powerpc64-bgq-linux-objdump -C -d Backproj-2d and looked
> > at the assembly around address 100c7c4 (if you search for it in the
> > file, note that objdump may omit the leading 0s in the address).
> > 
> > 
> > 
> > 
> > Thanks!
> > 
> > 
> > 
> > 
> > Can you try compiling/linking with
> > /home/projects/llvm/r175919-20130222/bin/bgclang++ instead of the
> > default one; this is a newer build and I'd like to see if it still
> > has whatever bug is yielding this miscompile.
> > 
> > 
> > Strangely enough, my executable ran correctly with the new version
> > of
> > LLVM (and passed my accuracy tests). I'm rerunning it again right
> > now to help rule out whether or not that was a fluke.
> > 
> > Any ideas as to what might have been the major change in the new
> > release?
> > 
> > 
> > 
> > Sigh. It was a fluke.
> 
> Hrmm... this being something that sometimes works is interesting. Are
> there any sources of non-determinism here? [I suppose that running
> on different partitions could cause PAMI to malloc memory
> differently; Jeff?]
> 
> > 
> > Perhaps we should take this offline to avoid spamming everyone else
> > on the list? I will try linking statically next.
> 
> Let's see if static linking "fixes" it; the list would certainly like
> to know that ;)

Jack, also please try compiling with these extra flags:
-mllvm -post-RA-scheduler=0
[if that fixes the problem I'll have a good idea of where the problem lies, and you'll have a work-around].

 -Hal

> 
>  -Hal
> 
> > 
> > Jack
> > 
> > 
> _______________________________________________
> llvm-bgq-discuss mailing list
> llvm-bgq-discuss at lists.alcf.anl.gov
> https://lists.alcf.anl.gov/mailman/listinfo/llvm-bgq-discuss
> 


More information about the llvm-bgq-discuss mailing list