The discussion of these two SSD packages is very brief and gives only the barest indications of functionality. Indeed, the summaries provided here are essentially all they offer. It would be helpful to discuss what options exist, if any, for omitting one or more available models from use in the averaging process. It is entirely possible that one or more models will deviate so far from the data as to render them not just useless but actually detrimental. I have observed this with SSD Master on many occasions and with other packages on occasion. Reliance on the stated AIC criteria alone for down weighing these models may not be sufficient. It would have enhanced the paper to have evaluated this issue for each package.
7 Responses
Comments are closed.
RESOLUTION
Use following response from JT:
We have expanded the table on software tools and rewritten the section to allow more comparisons between tools. Model averaging is only available for SSD Toolbox and (shiny)ssdtools. Ultimately the user can select which models to include however the purpose of model averaging is to downweight detrimental models. We are unaware of an example where the AICc criteria is insufficient.
I’m happy to tackle in a section on ssd software.
Go for it!
I am a little confused why they think there are two packages? DO they think ssdtools and the shiny ap are two separate packages? If so we can just clarify the wording in the MS, and we only need to provide more information for the ssdtools R package. The shiny is really just an interface.
Agreed. Plus I think there are other points of clarification.
It might be worth adding to the Table 1 an indication of the distribution(s) each uses. Obviously for ssdtools this would be “various” – but where there are only a couple they probably should be listed (e.g. BurrliOz). I haven’t used SSD Master but from the text in our paper it looks like you can fit a range of distributions, but it does not use model averaging. I’m not sure what evidence the reviewer bases his statement that “Reliance on the stated AIC criteria alone for down weighing these models may not be sufficient.” So far with all the examples I have tried, AICc seems to be doing a pretty good job. As David says in point 2, the whole point is that it down weights these detrimental models. We could reference Schwarz and Tillmanns (2019) as evidence of the stability of the AICc based model averaging method. Or a pers com from one of the Canadians that have has used the method in their derivations – have any of you guys found the AICc to be insufficient in terms of identifying bad fitting models? My simulation study (too complex to include in this paper) suggests it may start to fail to select the “correct” distribution with small sample sizes, but with real data we do not know the correct distribution in any case – thus a model averaged version is probably safer than arbitrary selection of a single wrong distribution in any case.
As an aside – From what I can gather on their instructions the contents of tables are not included in the word limit, which means expanding Table 1 with more detail would provide a means of expanding our ‘review’ of the available methods without necessarily blowing out the word limit.
I agree with adding the distributions to table 1 (#12) and will do.