Hi Mark,
Thanks for the pointers. I am pretty new to AreaDetector plugin development.
>I think this means you have set callbacksBlock=Enable for this plugin. Is that correct?
Yes, that was the case. I wrote the tests first, then the implementation. The 1 is a leftover arbitrary choice I forgot to change to 0. It is now set to ‘not blocking’.
>You are measuring the elapsed time for execution.
Removed now.
>You are allocating a new inputArray with NDArrayPool->convert(), converting to NDUInt16…
I have now deleted the intermediate ‘inputArray’ step. I guess I was paranoid about mistakenly touching the input array. I am now also release()ing outputArray in the destructor.
>What happens if the new input array has different dimensions from the one that was used when creating this->outputArray?
Dimensions were checked and modified in doTemperatureConversion, but this breaks naming and compartmentalization conventions. Dimension checking is now in processCallbacks.
Here is the revised processCallbacks:
void NDPluginTemperature::processCallbacks(NDArray *pArray){
static const char *functionName = "processCallbacks";
NDArrayInfo_t arrayInfo;
NDPluginDriver::beginProcessCallbacks(pArray);
if(pArray->dataType != NDUInt16){
asynPrint(this->pasynUserSelf, ASYN_TRACE_ERROR,
"%s:%s: Only UInt16 supported.", driverName, functionName);
return;
}
std::string lastCalFileName = this->calibrationFileName;
getStringParam(this->calibrationFileNameIdx, this->calibrationFileName);
if(this->calibrationFileName != lastCalFileName){
this->processCalibrationFile();
}
int arrayCallbacks;
getIntegerParam(NDArrayCallbacks, &arrayCallbacks);
if(arrayCallbacks==1){
if(NULL == this->outputArray){
this->outputArray = this->pNDArrayPool->copy(pArray,this->outputArray,false,true,true);
}
pArray->getInfo(&arrayInfo);
this->outputArray->dims[arrayInfo.xDim].size = pArray->dims[arrayInfo.xDim].size;
this->outputArray->dims[arrayInfo.yDim].size = pArray->dims[arrayInfo.yDim].size;
//unlock while the plug-and-chug happens. No shared resources are accessed at this time.
this->unlock();
this->doTemperatureConversion(pArray, this->outputArray, &arrayInfo);
this->lock();
setIntegerParam(NDArraySizeX, (int)outputArray->dims[arrayInfo.xDim].size);
setIntegerParam(NDArraySizeY, (int)outputArray->dims[arrayInfo.yDim].size);
callParamCallbacks();
}
NDPluginDriver::endProcessCallbacks(outputArray, false, true);
}
The test now runs for six frames and fails on the seventh, in endProcessCallbacks. Regular run, then GDB backtrace at first occurrence of “cantProceed”:
NDArray.uniqueId=1
NDArray.uniqueId=2
NDArray.uniqueId=3
NDArray.uniqueId=4
NDArray.uniqueId=5
NDArray.uniqueId=6
NDArrayPool:reserve ERROR, reference count = 0, should be >= 1, pArray=0x7f3470001880
Thread TEST_PORT_Plugin_1 (0x559fbb49aa50) can't proceed, suspending.
Dumping a stack trace of thread 'TEST_PORT_Plugin_1':
NDArray.uniqueId=7
[ 0x7f348a037ab3]: /lib/x86_64-linux-gnu/libCom.so.3.15.9(epicsStackTrace+0x73)
[ 0x7f348a028216]: /lib/x86_64-linux-gnu/libCom.so.3.15.9(cantProceed+0xc6)
[ 0x7f34897ebb08]: /lib/x86_64-linux-gnu/libADBase.so.3.11(_ZN11NDArrayPool7reserveEP7NDArray+0x78)
[ 0x7f3489f2cdd8]: /lib/x86_64-linux-gnu/libNDPlugin.so.3.11(_ZN14NDPluginDriver14driverCallbackEP8asynUserPv+0x258)
[ 0x7f3489e86155]: /lib/x86_64-linux-gnu/libasyn.so.4.38(_ZN14asynPortDriver25doCallbacksGenericPointerEPvii+0x1f5)
[ 0x7f3489f2d445]: /lib/x86_64-linux-gnu/libNDPlugin.so.3.11(_ZN14NDPluginDriver19endProcessCallbacksEP7NDArraybb+0x275)
[ 0x7f3489d93f6e]: /home/daykin/git/areadetector-temperature/lib/linux-x86_64/libNDPluginTemperature.so(_ZN19NDPluginTemperature16processCallbacksEP7NDArray+0x25e)
[ 0x7f3489f2d8a9]: /lib/x86_64-linux-gnu/libNDPlugin.so.3.11(_ZN14NDPluginDriver11processTaskEv+0x1c9)
[ 0x7f348a02c3fb]: /lib/x86_64-linux-gnu/libCom.so.3.15.9(epicsThreadCallEntryPoint+0x3b)
[ 0x7f348a0320bb]: /lib/x86_64-linux-gnu/libCom.so.3.15.9(epicsSnprintf+0x7bb)
[ 0x7f34899bdea7]: /lib/x86_64-linux-gnu/libpthread.so.0(start_thread+0xd7)
[ 0x7f3489ad4def]: /lib/x86_64-linux-gnu/libc.so.6(clone+0x3f)
Thread 12 "TEST_PORT_Plugi" hit Breakpoint 1, 0x00007ffff7f72150 in cantProceed () from /lib/x86_64-linux-gnu/libCom.so.3.15.9
(gdb) bt
#0 0x00007ffff7f72150 in cantProceed () from /lib/x86_64-linux-gnu/libCom.so.3.15.9
#1 0x00007ffff7735b08 in NDArrayPool::reserve(NDArray*) () from /lib/x86_64-linux-gnu/libADBase.so.3.11
#2 0x00007ffff7e76dd8 in NDPluginDriver::driverCallback(asynUser*, void*) () from /lib/x86_64-linux-gnu/libNDPlugin.so.3.11
#3 0x00007ffff7dd0155 in asynPortDriver::doCallbacksGenericPointer(void*, int, int) () from /lib/x86_64-linux-gnu/libasyn.so.4.38
#4 0x00007ffff7e77445 in NDPluginDriver::endProcessCallbacks(NDArray*, bool, bool) () from /lib/x86_64-linux-gnu/libNDPlugin.so.3.11
#5 0x00007ffff7cddf6e in NDPluginTemperature::processCallbacks (this=0x5555555c8970, pArray=0x7fffd0001e70) at ../NDPluginTemperature.cpp:100
#6 0x00007ffff7e778a9 in NDPluginDriver::processTask() () from /lib/x86_64-linux-gnu/libNDPlugin.so.3.11
#7 0x00007ffff7f763fb in epicsThreadCallEntryPoint () from /lib/x86_64-linux-gnu/libCom.so.3.15.9
#8 0x00007ffff7f7c0bb in ?? () from /lib/x86_64-linux-gnu/libCom.so.3.15.9
#9 0x00007ffff7907ea7 in start_thread (arg=<optimized out>) at pthread_create.c:477
#10 0x00007ffff7a1edef in clone () at ../sysdeps/unix/sysv/linux/x86_64/clone.S:95
From: Mark Rivers <rivers at cars.uchicago.edu>
Sent: Tuesday, May 3, 2022 2:17 PM
To: Daykin, Evan <daykin at frib.msu.edu>; Michael Davidsaver <mdavidsaver at gmail.com>
Cc: tech-talk at aps.anl.gov
Subject: Re: NDArrayPool:reserve ERROR, reference count = 0, should be = 1
[EXTERNAL] This email originated from outside of FRIB
Hi Evan,
> Thanks, that was the problem. Setting copyArray=true makes the issue go away. Are there any adverse side-effects of doing so?
This argument to NDPluginDriver::endProcessCallbacks() is documented here:
Your driver is calling endProcessCallbacks with a new NDArray that processCallbacks() created. Thus you must set copyArray=false, as you were originally doing.
NDPluginDriver::endProcessCallbacks(outputArray, false, true);
I should have looked more closely at your original message. I assumed the problem was in NDPluginDriver::endProcessCallbacks with a call to NDArray::release(). However, the problem is actually in NDPluginDriver::beginProcessCallbacks
with a call to NDArray::reserve().
NDArrayPool:reserve ERROR, reference count = 0, should be >= 1, pArray=0x7f453c001d80
Thread SimDetTask (0x5621bbbb0c80) can't proceed, suspending.
Dumping a stack trace of thread 'SimDetTask':
[ 0x7f45538ffab3]: /lib/x86_64-linux-gnu/libCom.so.3.15.9(epicsStackTrace+0x73)
[ 0x7f45538f0216]: /lib/x86_64-linux-gnu/libCom.so.3.15.9(cantProceed+0xc6)
[ 0x7f4553197b08]: /lib/x86_64-linux-gnu/libADBase.so.3.11(_ZN11NDArrayPool7reserveEP7NDArray+0x78)
[ 0x7f455305db37]: /lib/x86_64-linux-gnu/libNDPlugin.so.3.11(_ZN14NDPluginDriver21beginProcessCallbacksEP7NDArray+0x367)
[ 0x7f455375df1e]: /home/daykin/git/areadetector-temperature/lib/linux-x86_64/libNDPluginTemperature.so(_ZN19NDPluginTemperature16processCallbacksEP7NDArray+0xae)
[ 0x7f455305dd53]: /lib/x86_64-linux-gnu/libNDPlugin.so.3.11(_ZN14NDPluginDriver14driverCallbackEP8asynUserPv+0x1d3)
[ 0x7f4553851155]: /lib/x86_64-linux-gnu/libasyn.so.4.38(_ZN14asynPortDriver25doCallbacksGenericPointerEPvii+0x1f5)
[ 0x7f4553809974]: /usr/lib/epics/lib/linux-x86_64/libsimDetector.so(_ZN11simDetector7simTaskEv+0x4e4)
[ 0x7f45538fa0bb]: /lib/x86_64-linux-gnu/libCom.so.3.15.9(epicsSnprintf+0x7bb)
[ 0x7f4553387ea7]: /lib/x86_64-linux-gnu/libpthread.so.0(start_thread+0xd7)
[ 0x7f455349edef]: /lib/x86_64-linux-gnu/libc.so.6(clone+0x3f)
From this stack trace it looks like beginProcessCallbacks is being called from the simDetTask. I think this means you have set callbacksBlock=Enable for this plugin. Is that correct? Otherwise the task with
the error should be your plugin tasks, not the simDetTask. Is there a reason you set callbacksBlock? That should not be a problem, but it is unusual and I am just curious.
I have a couple of comments on your processCallbacks function.
You are measuring the elapsed time for execution.
epicsTimeStamp after;
epicsTimeGetCurrent(&after);
double delta = epicsTimeDiffInSeconds(&after, &before);
cout<<"Took "<<delta<<" s"<<endl;
setDoubleParam(runTimeIdx, delta);
But the base class already does this for you, so you don't should not need to do this.
You are allocating a new inputArray with NDArrayPool->convert(), converting to NDUInt16. But you already know that the input array (pArray) has type NDUInt16, so why make a new array? You can just pass pArray
to doTemperatureConversion, as long as you don't modify that array.
this->pNDArrayPool->convert(pArray,&(this->inputArray), NDUInt16);
More importantly you have allocated this->inputArray, but you have never released it. This means you have a memory leak. Once you are done with this->inputArray you should call this->inputArray->release().
But as I said above I am not sure you need to create this array at all.
You are only allocating this->outputArray if it is currently NULL. You then pass this->outputArray to doTemperatureConversion(). What happens if the new input array has different dimensions from the one that
was used when creating this->outputArray? If the new array is larger then you will probably get an access violation unless doTemperatureConversion() is checking the dimensions of the output array.
This does not solve your original problem. For some reason the NDArray being passed to NDPluginDriver::beginProcessCallbacks() has a reference count of 0. That should never happen, because if the reference count
is 0 then it should be in the free list and not in active use.
You should switch the flag to endProcessCallbacks back to the correct value of false and then try to track down the problem.
Thanks, that was the problem. Setting copyArray=true makes the issue go away. Are there any adverse side-effects of doing so?
-----Original Message-----
From: Michael Davidsaver <mdavidsaver at gmail.com>
Sent: Tuesday, May 3, 2022 12:10 PM
To: Daykin, Evan <daykin at frib.msu.edu>
Cc: Mark Rivers <rivers at cars.uchicago.edu>;
tech-talk at aps.anl.gov
Subject: Re: NDArrayPool:reserve ERROR, reference count = 0, should be = 1
[EXTERNAL] This email originated from outside of FRIB
On 5/3/22 08:52, Daykin, Evan via Tech-talk wrote:
> Hm… I am not explicitly calling release() anywhere. This is my processCallbacks function- everything else in my plugin, AFAIK, doesn’t access any shared resources.
I have half a memory that NDPluginDriver::endProcessCallbacks() "steals"* a reference
to the NDArray when copyArray=false.
* https://docs.python.org/3/c-api/intro.html#reference-count-details
> void NDPluginTemperature::processCallbacks(NDArray *pArray){
>
> static const char *functionName = "processCallbacks";
>
> NDArrayInfo_t arrayInfo;
>
> if(pArray->dataType != NDUInt16){
>
> asynPrint(this->pasynUserSelf, ASYN_TRACE_ERROR,
>
> "%s:%s: Only UInt16 supported.", driverName, functionName);
>
> return;
>
> }
>
> std::string lastCalFileName = this->calibrationFileName;
>
> getStringParam(this->calibrationFileNameIdx, this->calibrationFileName);
>
> if(this->calibrationFileName != lastCalFileName){
>
> this->processCalibrationFile();
>
> }
>
> int arrayCallbacks;
>
> getIntegerParam(NDArrayCallbacks, &arrayCallbacks);
>
> if(arrayCallbacks==1){
>
> epicsTimeStamp before;
>
> epicsTimeGetCurrent(&before);
>
> NDPluginDriver::beginProcessCallbacks(pArray);
>
> this->pNDArrayPool->convert(pArray,&(this->inputArray), NDUInt16);
>
> if(NULL == this->outputArray){
>
> this->pNDArrayPool->convert(pArray,&(this->outputArray),NDUInt16);
>
> }
>
> //unlock while the plug-and-chug happens. No shared resources are accessed at this time.
>
> inputArray->getInfo(&arrayInfo);
>
> this->unlock();
>
> this->doTemperatureConversion(this->inputArray, this->outputArray, &arrayInfo);
>
> this->lock();
>
> setIntegerParam(NDArraySizeX, (int)outputArray->dims[arrayInfo.xDim].size);
>
> setIntegerParam(NDArraySizeY, (int)outputArray->dims[arrayInfo.yDim].size);
>
> epicsTimeStamp after;
>
> epicsTimeGetCurrent(&after);
>
> double delta = epicsTimeDiffInSeconds(&after, &before);
>
> cout<<"Took "<<delta<<" s"<<endl;
>
> setDoubleParam(runTimeIdx, delta);
>
> NDPluginDriver::endProcessCallbacks(outputArray, false, true);
>
> callParamCallbacks();
>
> }
>
> }
>
> *From:*Mark Rivers <rivers at cars.uchicago.edu>
> *Sent:* Tuesday, May 3, 2022 11:37 AM
> *To:* tech-talk at aps.anl.gov; Daykin, Evan <daykin at frib.msu.edu>
> *Subject:* Re: NDArrayPool:reserve ERROR, reference count = 0, should be = 1
>
> *[EXTERNAL] This email originated from outside of FRIB*
>
> Hi Evan,
>
> That error probably means you have called NDArray::release() on an array whose reference count is already 0.
>
> Most plugins don't need to call release() because it is handled in the base class NDPluginDriver::endProcessCallbacks().
>
> Mark
>
> ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
>
> *From:*Tech-talk <tech-talk-bounces at aps.anl.gov <mailto:tech-talk-bounces at aps.anl.gov>> on behalf of Daykin, Evan via Tech-talk <tech-talk at aps.anl.gov <mailto:tech-talk at aps.anl.gov>>
> *Sent:* Tuesday, May 3, 2022 10:20 AM
> *To:* tech-talk at aps.anl.gov <mailto:tech-talk at aps.anl.gov> <tech-talk at aps.anl.gov <mailto:tech-talk at aps.anl.gov>>
> *Subject:* NDArrayPool:reserve ERROR, reference count = 0, should be = 1
>
> Hi,
>
> I am hoping there’s an AreaDetector plugin maven here…
>
>
> I have written a small AD plugin intended to convert our raw camera images into a temperature heatmap, based on a calibrated gain, emissivity and exposure time. To test it, I set up a SimDetector in “peak” mode. I connect my plugin to this simDetector image.
I can successfully capture and convert 3 frames. On the fourth frame, my test dies with the following error. Are there any common pitfalls that might cause this?
>
> NDArray.uniqueId=1
>
> Do temperature conversion start
>
> Took 0.0937809 s
>
> NDArray.uniqueId=2
>
> Do temperature conversion start
>
> Took 0.0655851 s
>
> NDArray.uniqueId=3
>
> Do temperature conversion start
>
> Took 0.0642867 s
>
> NDArray.uniqueId=4
>
> NDArrayPool:reserve ERROR, reference count = 0, should be >= 1, pArray=0x7f453c001d80
>
> Thread SimDetTask (0x5621bbbb0c80) can't proceed, suspending.
>
> Dumping a stack trace of thread 'SimDetTask':
>
> [ 0x7f45538ffab3]: /lib/x86_64-linux-gnu/libCom.so.3.15.9(epicsStackTrace+0x73)
>
> [ 0x7f45538f0216]: /lib/x86_64-linux-gnu/libCom.so.3.15.9(cantProceed+0xc6)
>
> [ 0x7f4553197b08]: /lib/x86_64-linux-gnu/libADBase.so.3.11(_ZN11NDArrayPool7reserveEP7NDArray+0x78)
>
> [ 0x7f455305db37]: /lib/x86_64-linux-gnu/libNDPlugin.so.3.11(_ZN14NDPluginDriver21beginProcessCallbacksEP7NDArray+0x367)
>
> [ 0x7f455375df1e]: /home/daykin/git/areadetector-temperature/lib/linux-x86_64/libNDPluginTemperature.so(_ZN19NDPluginTemperature16processCallbacksEP7NDArray+0xae)
>
> [ 0x7f455305dd53]: /lib/x86_64-linux-gnu/libNDPlugin.so.3.11(_ZN14NDPluginDriver14driverCallbackEP8asynUserPv+0x1d3)
>
> [ 0x7f4553851155]: /lib/x86_64-linux-gnu/libasyn.so.4.38(_ZN14asynPortDriver25doCallbacksGenericPointerEPvii+0x1f5)
>
> [ 0x7f4553809974]: /usr/lib/epics/lib/linux-x86_64/libsimDetector.so(_ZN11simDetector7simTaskEv+0x4e4)
>
> [ 0x7f45538fa0bb]: /lib/x86_64-linux-gnu/libCom.so.3.15.9(epicsSnprintf+0x7bb)
>
> [ 0x7f4553387ea7]: /lib/x86_64-linux-gnu/libpthread.so.0(start_thread+0xd7)
>
> [ 0x7f455349edef]: /lib/x86_64-linux-gnu/libc.so.6(clone+0x3f)
>
> *Evan Daykin*
>
> Controls Engineer
>
> Facility for Rare Isotope Beams
>
> Michigan State University
>
> 640 South Shaw Lane
>
> East Lansing, MI 48824, USA
>
> Tel. 517-908-7678
>
> Email: daykin at frib.msu.edu <mailto:mccausey at frib.msu.edu>
>
>
>
> *cid:[email protected]*
>