[dcmf] running my own mpich

Jeff Parker jjparker at us.ibm.com
Tue Feb 12 16:33:15 CST 2008


Rob,

libdcmf.cnk.a needs to link with libSPI.cna.a to access the DMA SPIs.  You
are probably linking with the V1R1M2 version of libSPI.cna.a, which may be
incompatible with the DCMF source we gave you.  Try linking with the
attached libSPI.cna.a, which should be a closer match.

(See attached file: libSPI.cna.a)

We have not posted source to the wiki for building libSPI.cna.a.  The
"runtime source tarball" currently on the wiki contains the DMA SPI source,
but other things are needed to build libSPI.cna.a that were not ready yet.
Shortly, we will post a new runtime source tarball that will build
libSPI.cna.a so you can build it yourself.

Jeff Parker
Blue Gene Messaging
61L/030-2 A407    507-253-4208    TieLine: 553-4208
Notes email: Jeff Parker/Rochester/IBM
INTERNET: jjparker at us.ibm.com     AFS: jeff at rchland


                                                                           
robl at mcs.anl.gov (Robert Latham)
Sent by: dcmf-bounces at lists.anl-external.org
02/12/2008 03:56 PM

To: dcmf at lists.anl-external.org
cc:
Subject: [dcmf] running my own mpich

I've made some changes to ROMIO and would like to test them out.  I've
built an mpich library with the 'make mpich' rule, and that goes
just fine: I've got an install/bin/mpicc which links in
install/lib/libdcmfcoll.cnk.a, install/lib/libdcmf.cnk.a, and
install/lib/libmpich.cnk.a.

So far, everything looks normal.

When I try to run the resulting program, I get a segfault.  Here's the
output after running the stack dump in one of the lightweight core
files through addr2line:

0x010fa338
DMA_InjFifoRgetFifoFullInit
??:0
0x01304834
??
??:0
0x010cd56c
DCMF::DMA::Device::initGroups()
/home/robl/src/bgp.comm/sys/build-dcmf/../messaging/devices/prod/dma/Init.cc:308

0x010cd91c
DCMF::DMA::Device::initDMADevice()
/home/robl/src/bgp.comm/sys/build-dcmf/../messaging/devices/prod/dma/Init.cc:397

0x010bc2dc
BGPMessager
/home/robl/src/bgp.comm/sys/build-dcmf/../messaging/messager/prod/bgp/msgr.h:81

0x010b2cec
DCMF::BGPMessager::generate()
/home/robl/src/bgp.comm/sys/build-dcmf/../messaging/messager/prod/bgp/msgr.h:105

0x0102563c
MPID_Init
/gpfs/home/robl/src/bgp.comm/lib/mpi/mpich2/src/mpid/dcmf/src/misc/mpid_init.c:63

0x0100cf58
MPIR_Init_thread
/gpfs/home/robl/src/bgp.comm/lib/mpi/mpich2/src/mpi/init/initthread.c:236
0x0100cd1c
PMPI_Init
/gpfs/home/robl/src/bgp.comm/lib/mpi/mpich2/src/mpi/init/init.c:93
0x010013a4
main
/home/robl/src/darray-io.c:51
0x011004c0
generic_start_main
../csu/libc-start.c:231
0x01100734
__libc_start_main
../sysdeps/unix/sysv/linux/powerpc/libc-start.c:137
0xfffffffc
??
??:0
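
For readers who want to sift a trace like the one above, here is a minimal
sketch (not part of the original message). It assumes the text has the
three-line-per-frame shape shown above (address, function, file:line) and
that blank lines can be ignored. The names Frame, parse_frames and
unresolved are hypothetical helpers invented for this illustration, and the
sample is copied from the first three frames of the trace.

```python
from typing import List, NamedTuple


class Frame(NamedTuple):
    address: int    # return address taken from the core file
    function: str   # symbol name, or "??" if it could not be resolved
    location: str   # "file:line", or "??:0" if no debug info was found


def parse_frames(text: str) -> List[Frame]:
    """Group non-blank lines into (address, function, location) triples."""
    lines = [ln.strip() for ln in text.splitlines() if ln.strip()]
    if len(lines) % 3 != 0:
        raise ValueError("expected three non-blank lines per frame")
    frames = []
    for i in range(0, len(lines), 3):
        addr, func, loc = lines[i:i + 3]
        if not addr.startswith("0x"):
            raise ValueError(f"not an address line: {addr!r}")
        frames.append(Frame(int(addr, 16), func, loc))
    return frames


def unresolved(frames: List[Frame]) -> List[Frame]:
    """Frames with no file:line information (location starts with '??')."""
    return [f for f in frames if f.location.startswith("??")]


# Sample copied from the first three frames of the trace above.
SAMPLE = """\
0x010fa338
DMA_InjFifoRgetFifoFullInit
??:0
0x01304834
??
??:0
0x010cd56c
DCMF::DMA::Device::initGroups()
/home/robl/src/bgp.comm/sys/build-dcmf/../messaging/devices/prod/dma/Init.cc:308

"""

if __name__ == "__main__":
    for frame in unresolved(parse_frames(SAMPLE)):
        print(hex(frame.address), frame.function)
```

Frames that lack file:line information come from objects without debug
info; in the sample, the innermost such frame is
DMA_InjFifoRgetFifoFullInit.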


If I had to guess, I'd say that the libdcmf in the development tree is
incompatible with Argonne's V1R1M2_500_2007-071213P driver.  What's
the best way to test out my ROMIO changes?

Thanks
==rob

--
Rob Latham
Mathematics and Computer Science Division    A215 0178 EA2D B059 8CDF
Argonne National Lab, IL USA                 B29D F333 664A 4280 315B
_______________________________________________
dcmf mailing list
dcmf at lists.anl-external.org
http://lists.anl-external.org/cgi-bin/mailman/listinfo/dcmf
http://dcmf.anl-external.org/wiki
-------------- next part --------------
A non-text attachment was scrubbed...
Name: libSPI.cna.a
Type: application/octet-stream
Size: 542686 bytes
Desc: not available
URL: <http://lists.alcf.anl.gov/pipermail/dcmf/attachments/20080212/e22867d1/attachment.a>

