Subject: | RE: Bus errors accessing VME with base 7.0.6.1 and latest synApps modules |
From: | Mark Rivers via Tech-talk <tech-talk at aps.anl.gov> |
To: | Michael Davidsaver <mdavidsaver at gmail.com> |
Cc: | "tech-talk at aps.anl.gov" <tech-talk at aps.anl.gov> |
Date: | Sat, 21 May 2022 18:05:24 +0000 |
Hi Michael,

> What specific board is involved? (eg. mvme3100?)

The test crate is an MVME5100. But the production crates that were also failing include several MVME2700 boards as well as some MVME5100.

> Although you haven't mentioned the presence of an EVR card or timing driver.

There is no EVR card or timing driver. The following cards seem to work fine:

- SIS3801 and SIS3820 VME cards. These are scalers/multichannel scalers. I have tested them a lot and it does not crash.
- IpUnidig digital I/O Industry Pack module with interrupts on inputs. I cannot get any failures flipping these bits. It is on the same carrier card as the IP330.

> I still wonder if this isn't somehow resulting from an incomplete rebuild.

I don’t think so. I did a “make clean” at the top of the synApps/support tree, which includes my specific application directory. I then did a “find” for .o files and there were none leftover.

> Since the error seems to come from a CAS-event thread, I would be interested to know which CA client is associated. Running "casr 5" will show this (and more). eg.

I have attached a file which shows the following:

- Ran casr 5 right after IOC finishes startup. It shows a client which is connected to a few records from the iocStats module.
- Opened medm screen for IP330, got VME bus error
- Ran casr 5 again showing connections for that medm screen.

Thanks,
Mark

From: Michael Davidsaver <mdavidsaver at gmail.com>

On 5/21/22 09:42, Mark Rivers via Tech-talk wrote:
In this case...

What specific board is involved? (eg. mvme3100?)

I remain surprised that a CAS-event thread seems to be involved. I don't think this thread would be running driver code. Though it might be calling things like generalTime providers or server side filter hooks, which might be making VME accesses. Although you haven't mentioned the presence of an EVR card or timing driver.

I say "seems to be" because there is a chance that vxWorks is somehow messing up and capturing the wrong thread. eg. a driver makes a VME access, then posts a monitor, which causes a context switch to CAS-event. Then somehow the CPU exception arrives late, with the wrong instruction pointer address. This is probably possible w/ some PowerPC and out of order execution.

I still wonder if this isn't somehow resulting from an incomplete rebuild.

Since the error seems to come from a CAS-event thread, I would be interested to know which CA client is associated. Running "casr 5" will show this (and more). eg.

If you've identified a particular record, try setting TPRO to see if anything else is happening.
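The incomplete-rebuild question raised above can also be checked with a short script instead of a manual "find". The following is only a minimal sketch in Python: the function name find_object_files and the default suffix list are assumptions made for illustration, not part of EPICS or synApps.

```python
import os


def find_object_files(root, suffixes=(".o",)):
    """Return a sorted list of paths under `root` ending in one of `suffixes`.

    Hypothetical helper: an empty result after "make clean" suggests that
    no stale object files were left behind in the tree.
    """
    suffixes = tuple(suffixes)  # str.endswith needs a str or a tuple
    hits = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            if name.endswith(suffixes):
                hits.append(os.path.join(dirpath, name))
    return sorted(hits)
```

Calling find_object_files on the top of the support tree after a "make clean" and getting an empty list would match the result reported in this thread. Static libraries could be included by passing suffixes=(".o", ".a").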
Done executing startup script '/home/epics/devel/CARS/iocBoot/ioc13lab/st.cmd'.
ioc13lab> casr 5
Channel Access Server V4.13
1 client connected:
TCP client at 164.54.160.82:36816 'corvette':
User 'epics', V4.13, Priority = 0, 4 Channels
Task Id = 0x553fdb0, Socket FD = 53
0.40 secs since last send, 1348.17 secs since last receive
Unprocessed request bytes = 0, Undelivered response bytes = 0
State = up
Channel: '13LAB::UPTIME' field_type=DBF_STRING (40 bytes), dbr_type=DBF_STRING, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB::CA_CONN_CNT' field_type=DBF_DOUBLE (8 bytes), dbr_type=DBF_DOUBLE, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB::SYS_CPU_LOAD' field_type=DBF_DOUBLE (8 bytes), dbr_type=DBF_DOUBLE, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB::MEM_FREE' field_type=DBF_DOUBLE (8 bytes), dbr_type=DBF_DOUBLE, 1 element, no filters # on eventq=1, access=rw
1176 bytes allocated
Send Lock:
epicsMutexId 0x54bba70 source ../rsrv/caservertask.c line 1254
Semaphore Id : 0x54ecda0
Semaphore Name : N/A
Semaphore Type : MUTEX
Task Queuing : PRIORITY
Pended Tasks : 0
Owner : NONE
Options : 0xd SEM_Q_PRIORITY SEM_DELETE_SAFE SEM_INVERSION_SAFE
VxWorks Events
--------------
Registered Task : NONE
Event(s) to Send : N/A
Options : N/A
Put Notify Lock:
epicsMutexId 0x54ef960 source ../rsrv/caservertask.c line 1255
Semaphore Id : 0x54ece20
Semaphore Name : N/A
Semaphore Type : MUTEX
Task Queuing : PRIORITY
Pended Tasks : 0
Owner : NONE
Options : 0xd SEM_Q_PRIORITY SEM_DELETE_SAFE SEM_INVERSION_SAFE
VxWorks Events
--------------
Registered Task : NONE
Event(s) to Send : N/A
Options : N/A
Address Queue Lock:
epicsMutexId 0x54c54e0 source ../rsrv/caservertask.c line 1256
Semaphore Id : 0x54edd20
Semaphore Name : N/A
Semaphore Type : MUTEX
Task Queuing : PRIORITY
Pended Tasks : 0
Owner : NONE
Options : 0xd SEM_Q_PRIORITY SEM_DELETE_SAFE SEM_INVERSION_SAFE
VxWorks Events
--------------
Registered Task : NONE
Event(s) to Send : N/A
Options : N/A
Event Queue Lock:
epicsMutexId 0x54ed6e0 source ../rsrv/caservertask.c line 1257
Semaphore Id : 0x54edda0
Semaphore Name : N/A
Semaphore Type : MUTEX
Task Queuing : PRIORITY
Pended Tasks : 0
Owner : NONE
Options : 0xd SEM_Q_PRIORITY SEM_DELETE_SAFE SEM_INVERSION_SAFE
VxWorks Events
--------------
Registered Task : NONE
Event(s) to Send : N/A
Options : N/A
Block Semaphore:
Semaphore Id : 0x54ed330
Semaphore Name : N/A
Semaphore Type : BINARY
Task Queuing : FIFO
Pended Tasks : 0
State : EMPTY
Options : 0x0 SEM_Q_FIFO
VxWorks Events
--------------
Registered Task : NONE
Event(s) to Send : N/A
Options : N/A
CAS-TCP server on 0.0.0.0:5064 with
CAS-UDP name server on 0.0.0.0:5064
Last name requested by 164.54.160.64:53062:
User '', V4.13, Priority = 0, 0 Channels
Task Id = 0x530d590, Socket FD = 40
1351.35 secs since last send, 0.03 secs since last receive
Unprocessed request bytes = 0, Undelivered response bytes = 16
State = up
Sending CAS-beacons to 1 address:
164.54.160.255:5065
Free-lists total 283312 bytes, comprising
6 client(s), 508 channel(s), 508 monitor event(s), 0 putNotify(s)
14 small (16384 byte) buffers, 4294967295 jumbo (500024 byte) buffers
Server resource id table:
Bucket entries in use = 4 bytes in use = 16468
Bucket entries/hash id - mean = 0.000977 std dev = 0.031235 max = 1
value = 0 = 0x0
ioc13lab>
ioc13lab>
ioc13lab>
ioc13lab>
ioc13lab>
ioc13lab>
VME Bus Error accessing A16: 0x327e
machine check
Exception next instruction address: 0x0368ce90
Machine Status Register: 0x0008b032
Condition Register: 0x48000884
Task: 0x55394d0 "CAS-event"
0x55394d0 (CAS-event): task 0x55394d0 has had a failure and has been stopped.
0x55394d0 (CAS-event): The task has been terminated because it triggered an exception that raised the signal 10.
ioc13lab>
ioc13lab> casr 5
Channel Access Server V4.13
1 client connected:
TCP client at 164.54.160.82:36816 'corvette':
User 'epics', V4.13, Priority = 0, 68 Channels
Task Id = 0x553fdb0, Socket FD = 53
13.22 secs since last send, 18.70 secs since last receive
Unprocessed request bytes = 0, Undelivered response bytes = 0
State = up
Channel: '13LAB::UPTIME' field_type=DBF_STRING (40 bytes), dbr_type=DBF_STRING, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB::CA_CONN_CNT' field_type=DBF_DOUBLE (8 bytes), dbr_type=DBF_DOUBLE, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB::SYS_CPU_LOAD' field_type=DBF_DOUBLE (8 bytes), dbr_type=DBF_DOUBLE, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB::MEM_FREE' field_type=DBF_DOUBLE (8 bytes), dbr_type=DBF_DOUBLE, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_1.SCAN' field_type=DBF_MENU (2 bytes), dbr_type=DBF_ENUM, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_1.VAL' field_type=DBF_DOUBLE (8 bytes), dbr_type=DBF_DOUBLE, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_1.PREC' field_type=DBF_SHORT (2 bytes), dbr_type=DBF_SHORT, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_2.SCAN' field_type=DBF_MENU (2 bytes), dbr_type=DBF_ENUM, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_2.VAL' field_type=DBF_DOUBLE (8 bytes), dbr_type=DBF_DOUBLE, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_2.PREC' field_type=DBF_SHORT (2 bytes), dbr_type=DBF_SHORT, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_3.SCAN' field_type=DBF_MENU (2 bytes), dbr_type=DBF_ENUM, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_3.VAL' field_type=DBF_DOUBLE (8 bytes), dbr_type=DBF_DOUBLE, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_3.PREC' field_type=DBF_SHORT (2 bytes), dbr_type=DBF_SHORT, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_4.SCAN' field_type=DBF_MENU (2 bytes), dbr_type=DBF_ENUM, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_4.VAL' field_type=DBF_DOUBLE (8 bytes), dbr_type=DBF_DOUBLE, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_4.PREC' field_type=DBF_SHORT (2 bytes), dbr_type=DBF_SHORT, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_5.SCAN' field_type=DBF_MENU (2 bytes), dbr_type=DBF_ENUM, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_5.VAL' field_type=DBF_DOUBLE (8 bytes), dbr_type=DBF_DOUBLE, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_5.PREC' field_type=DBF_SHORT (2 bytes), dbr_type=DBF_SHORT, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_6.SCAN' field_type=DBF_MENU (2 bytes), dbr_type=DBF_ENUM, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_6.VAL' field_type=DBF_DOUBLE (8 bytes), dbr_type=DBF_DOUBLE, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_6.PREC' field_type=DBF_SHORT (2 bytes), dbr_type=DBF_SHORT, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_7.SCAN' field_type=DBF_MENU (2 bytes), dbr_type=DBF_ENUM, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_7.VAL' field_type=DBF_DOUBLE (8 bytes), dbr_type=DBF_DOUBLE, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_7.PREC' field_type=DBF_SHORT (2 bytes), dbr_type=DBF_SHORT, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_8.SCAN' field_type=DBF_MENU (2 bytes), dbr_type=DBF_ENUM, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_8.VAL' field_type=DBF_DOUBLE (8 bytes), dbr_type=DBF_DOUBLE, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_8.PREC' field_type=DBF_SHORT (2 bytes), dbr_type=DBF_SHORT, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_9.SCAN' field_type=DBF_MENU (2 bytes), dbr_type=DBF_ENUM, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_9.VAL' field_type=DBF_DOUBLE (8 bytes), dbr_type=DBF_DOUBLE, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_9.PREC' field_type=DBF_SHORT (2 bytes), dbr_type=DBF_SHORT, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_10.SCAN' field_type=DBF_MENU (2 bytes), dbr_type=DBF_ENUM, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_10.VAL' field_type=DBF_DOUBLE (8 bytes), dbr_type=DBF_DOUBLE, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_10.PREC' field_type=DBF_SHORT (2 bytes), dbr_type=DBF_SHORT, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_11.SCAN' field_type=DBF_MENU (2 bytes), dbr_type=DBF_ENUM, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_11.VAL' field_type=DBF_DOUBLE (8 bytes), dbr_type=DBF_DOUBLE, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_11.PREC' field_type=DBF_SHORT (2 bytes), dbr_type=DBF_SHORT, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_12.SCAN' field_type=DBF_MENU (2 bytes), dbr_type=DBF_ENUM, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_12.VAL' field_type=DBF_DOUBLE (8 bytes), dbr_type=DBF_DOUBLE, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_12.PREC' field_type=DBF_SHORT (2 bytes), dbr_type=DBF_SHORT, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_13.SCAN' field_type=DBF_MENU (2 bytes), dbr_type=DBF_ENUM, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_13.VAL' field_type=DBF_DOUBLE (8 bytes), dbr_type=DBF_DOUBLE, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_13.PREC' field_type=DBF_SHORT (2 bytes), dbr_type=DBF_SHORT, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_14.SCAN' field_type=DBF_MENU (2 bytes), dbr_type=DBF_ENUM, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_14.VAL' field_type=DBF_DOUBLE (8 bytes), dbr_type=DBF_DOUBLE, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_14.PREC' field_type=DBF_SHORT (2 bytes), dbr_type=DBF_SHORT, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_15.SCAN' field_type=DBF_MENU (2 bytes), dbr_type=DBF_ENUM, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_15.VAL' field_type=DBF_DOUBLE (8 bytes), dbr_type=DBF_DOUBLE, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_15.PREC' field_type=DBF_SHORT (2 bytes), dbr_type=DBF_SHORT, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_16.SCAN' field_type=DBF_MENU (2 bytes), dbr_type=DBF_ENUM, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_16.VAL' field_type=DBF_DOUBLE (8 bytes), dbr_type=DBF_DOUBLE, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_16.PREC' field_type=DBF_SHORT (2 bytes), dbr_type=DBF_SHORT, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_1.DESC' field_type=DBF_STRING (41 bytes), dbr_type=DBF_STRING, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_2.DESC' field_type=DBF_STRING (41 bytes), dbr_type=DBF_STRING, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_3.DESC' field_type=DBF_STRING (41 bytes), dbr_type=DBF_STRING, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_4.DESC' field_type=DBF_STRING (41 bytes), dbr_type=DBF_STRING, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_5.DESC' field_type=DBF_STRING (41 bytes), dbr_type=DBF_STRING, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_6.DESC' field_type=DBF_STRING (41 bytes), dbr_type=DBF_STRING, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_7.DESC' field_type=DBF_STRING (41 bytes), dbr_type=DBF_STRING, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_8.DESC' field_type=DBF_STRING (41 bytes), dbr_type=DBF_STRING, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_9.DESC' field_type=DBF_STRING (41 bytes), dbr_type=DBF_STRING, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_10.DESC' field_type=DBF_STRING (41 bytes), dbr_type=DBF_STRING, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_11.DESC' field_type=DBF_STRING (41 bytes), dbr_type=DBF_STRING, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_12.DESC' field_type=DBF_STRING (41 bytes), dbr_type=DBF_STRING, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_13.DESC' field_type=DBF_STRING (41 bytes), dbr_type=DBF_STRING, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_14.DESC' field_type=DBF_STRING (41 bytes), dbr_type=DBF_STRING, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_15.DESC' field_type=DBF_STRING (41 bytes), dbr_type=DBF_STRING, 1 element, no filters # on eventq=1, access=rw
Channel: '13LAB:IP330_16.DESC' field_type=DBF_STRING (41 bytes), dbr_type=DBF_STRING, 1 element, no filters # on eventq=1, access=rw
17048 bytes allocated
Send Lock:
epicsMutexId 0x54bba70 source ../rsrv/caservertask.c line 1254
Semaphore Id : 0x54ecda0
Semaphore Name : N/A
Semaphore Type : MUTEX
Task Queuing : PRIORITY
Pended Tasks : 0
Owner : NONE
Options : 0xd SEM_Q_PRIORITY SEM_DELETE_SAFE SEM_INVERSION_SAFE
VxWorks Events
--------------
Registered Task : NONE
Event(s) to Send : N/A
Options : N/A
Put Notify Lock:
epicsMutexId 0x54ef960 source ../rsrv/caservertask.c line 1255
Semaphore Id : 0x54ece20
Semaphore Name : N/A
Semaphore Type : MUTEX
Task Queuing : PRIORITY
Pended Tasks : 0
Owner : NONE
Options : 0xd SEM_Q_PRIORITY SEM_DELETE_SAFE SEM_INVERSION_SAFE
VxWorks Events
--------------
Registered Task : NONE
Event(s) to Send : N/A
Options : N/A
Address Queue Lock:
epicsMutexId 0x54c54e0 source ../rsrv/caservertask.c line 1256
Semaphore Id : 0x54edd20
Semaphore Name : N/A
Semaphore Type : MUTEX
Task Queuing : PRIORITY
Pended Tasks : 0
Owner : NONE
Options : 0xd SEM_Q_PRIORITY SEM_DELETE_SAFE SEM_INVERSION_SAFE
VxWorks Events
--------------
Registered Task : NONE
Event(s) to Send : N/A
Options : N/A
Event Queue Lock:
epicsMutexId 0x54ed6e0 source ../rsrv/caservertask.c line 1257
Semaphore Id : 0x54edda0
Semaphore Name : N/A
Semaphore Type : MUTEX
Task Queuing : PRIORITY
Pended Tasks : 0
Owner : NONE
Options : 0xd SEM_Q_PRIORITY SEM_DELETE_SAFE SEM_INVERSION_SAFE
VxWorks Events
--------------
Registered Task : NONE
Event(s) to Send : N/A
Options : N/A
Block Semaphore:
Semaphore Id : 0x54ed330
Semaphore Name : N/A
Semaphore Type : BINARY
Task Queuing : FIFO
Pended Tasks : 0
State : EMPTY
Options : 0x0 SEM_Q_FIFO
VxWorks Events
--------------
Registered Task : NONE
Event(s) to Send : N/A
Options : N/A
CAS-TCP server on 0.0.0.0:5064 with
CAS-UDP name server on 0.0.0.0:5064
Last name requested by 164.54.160.111:54303:
User '', V4.13, Priority = 0, 0 Channels
Task Id = 0x530d590, Socket FD = 40
32.85 secs since last send, 0.22 secs since last receive
Unprocessed request bytes = 0, Undelivered response bytes = 16
State = up
Sending CAS-beacons to 1 address:
164.54.160.255:5065
Free-lists total 276656 bytes, comprising
6 client(s), 444 channel(s), 444 monitor event(s), 0 putNotify(s)
14 small (16384 byte) buffers, 4294967295 jumbo (500024 byte) buffers
Server resource id table:
Bucket entries in use = 68 bytes in use = 17492
Bucket entries/hash id - mean = 0.016602 std dev = 0.127773 max = 1
value = 0 = 0x0
ioc13lab>
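
The two "casr 5" reports in the log above differ mainly in the channels the medm screen added. A short script can list those additions instead of comparing the reports by eye. The following is only a sketch in Python: it assumes each channel appears as Channel: '<name>' exactly as in this log, and the function names are hypothetical.

```python
import re

# Matches the quoted channel name in lines such as:
#   Channel: '13LAB:IP330_1.VAL' field_type=DBF_DOUBLE ...
CHANNEL_RE = re.compile(r"Channel:\s*'([^']+)'")


def channels(casr_text):
    """Return the set of channel names found in one casr report."""
    return set(CHANNEL_RE.findall(casr_text))


def new_channels(before, after):
    """Return channels present in `after` but not in `before`, sorted."""
    return sorted(channels(after) - channels(before))


def records(channel_names):
    """Drop the field suffix (text after the last '.') to get record names."""
    return sorted({name.rsplit(".", 1)[0] for name in channel_names})


# Short excerpts from the two reports, trimmed to the channel names.
before = "Channel: '13LAB::UPTIME' ... Channel: '13LAB::MEM_FREE' ..."
after = before + " Channel: '13LAB:IP330_1.SCAN' ... Channel: '13LAB:IP330_1.VAL' ..."

added = new_channels(before, after)
print(added)
print(records(added))
```

Run against the full reports, this should list only the IP330 record fields, which narrows the set of candidates for setting TPRO.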