Assuming you meant to put 'id7' as the first one, I don't understand what this achieves beyond just:
/events[id2]/data/items/element[id7]/value/magnitude >
/events[id2]/data/items/element[id4]/value/magnitude +
/events[id2]/data/items/element[id5]/value/magnitude +
/events[id2]/data/items/element[id6]/value/magnitude
which says the same thing, for all events in the runtime data that conform to the /events[id2] branch of the structure.
Since the occurrences of events[id2] can be more than one, /events[id2]/data/items/element[id7]/value/magnitude in the execution environment (actual data) maps to a List<Real>. That means you need to define operators such as +, > and = on a list of reals. Or to define somehow that a statement containing only path bindings within a single multiply-occurring structure will mean that it gets evaluated for each occurrence of such a structure. The second case is complicated if you need to include data outside /events[id2] in your expression. A real world use case would be data in /protocol, which can influence the interpretation of event data, but is outside of the event.
So we did the first in Archie, with a bit of tricks to make this work for assignments. For example, how the plus operator is interpreted in Archie:
+(Real r1, Real r2)
return the sum both numbers
+(Real r1, List<Real> r2)
return a list the same size as r2, with every element result[i] = r1 + r2[i]
+(List<Real> r1, r2)
return a list the same size as r1, with every element result[i] = r1[i] + r2
+(List<Real>l r1, List<Real> r2)
if r1 and r2 have different length, throw an exception. Otherwise, return a list the same size as r1, with every element result[i] being r1[i] + r2[i]
And the > operator:
(Real r1, Real r2)
return true if r1 > r2, false otherwise
(Real r1, List<Real> r2)
return a list of Booleans, one for every element in r2, true if r1 > that element
(List<Real> r1, r2)
return a list of Booleans, one for every element in r1, true if that element > r2
(List<Real>l r1, List<Real> r2)
if r1 and r2 have different length, throw an exception. Otherwise, return a list of Booleans, with every element result[i] being r1[i] > r2[i]
This is a simplification, in reality there is null-handling and integer-> real conversion involved, which is also missing in the specification I think.
We defined assertions/checks to only succeed if every Boolean in such a list is true or null/undefined (to not fail on data which is optional). Additionally in Archie every value returned contains every unique path in the data that was used to evaluate it, so you can see exactly for which data an assertion/check failed in the result, to notify the user where the problem occurs. Or to apply an assignment to the correct output if the output path does not match a single field. This last bit is implementation specific of course.
Much of this complexity can be avoided by not defining the operator on lists/sets, and requiring the for_all loop on lists or sets of data in the specification. This comes at a price, because the author of the expressions needs to understand more of the language and data structures. So we chose the second, since the previous draft specification did not specify at all how to handle these cases.
Undefined value handling is another subject, I have not checked yet if the new proposal solves it. We defined some functions to handle this explicitly if the automatic handling doesn’t do it ((input , alternative) -> return input unless input is undefined, then alternative), as well as some rounding functions.
If we were to allow the expression for_all $event in /data/events[id3] then we need to be clear on what it means: it actually refers to an object of type List<EVENT>, but do the members consist of EVENT objects found in data at the moment the expression is evaluated? Or just the statically defined members in the archetype - which can also be multiple, e.g. see the Apgar archetype, it has 1 min, 2 min, 5 min events?
You would need to evaluate on actual data. If you define it on the archetype data, you would need some kind of rules to convert it to an evaluation on actual data with different multiplicities than the rules specify, for example if events[id2] has occurrences > 1. Might be possible, I have not tried to define that. Would probably include some extra for_all loops plus some kind of validation errors for cases that cannot be converted.
So I would say always the data found at the moment which the expression is evaluated. You can still refer to separate statically defined members using their distinct node ids, and even those could have occurrences > 1 (not in the apgar example since those have occurrences {0..1} in the archetype).
Normally we want the processing of 'rules' expressions in archetypes to apply to the data when the archetype is being used in its normal role of creation / semantic validation at data commit time.
Agreed.
So it seems to me that if we want to support expressions like the above, we need to be able to do something like (don't worry about the concrete syntax too much, could easily be TS or java-flavoured):
use_model
org.openehr.rm
data_context
$xxx_events: List<EVENT>
$item_aaa, $item_bbb, $item_ccc, $item_ddd: Real
definition
check for_all event in $xxx_events:
event/$item_aaa > event/$item_bbb + event/$item_ccc + event/$item_ddd
data_bindings -- pseudo-code for now
$xxx_events -> /events[id2]
$item_aaa -> /data/items/element[id7]/value/magnitude
$item_bbb -> /data/items/element[id4]/value/magnitude
$item_ccc -> /data/items/element[id5]/value/magnitude
$item_ddd -> /data/items/element[id6]/value/magnitude
I don't know what this archetype is, so assume that $xxx_events, $item_aaa etc are more meaningful names.
That would leave $item_aaa globally defined as a path-reference, not matching a path in the archetype on its own. Type of variable being some kind of PATH_REFERENCE? That would solve the problem. Why not just inline the assignments, such as:
$events := /events[id2]
For_all event in $events:
$item_aaa := /data/items/element[id7]/value/magnitude
$item_bbb := /data/items/element[id4]/value/magnitude
$item_ccc := /data/items/element[id5]/value/magnitude
$item_ddd := /data/items/element[id6]/value/magnitude
check $item_aaa > $item_bbb + $item_ccc + $item_ddd
end
Or if the end keyword is not desired, perhaps
$events := /events[id2]
For_all event in $events:
($item_aaa := $events/data/items/element[id7]/value/magnitude,
$item_bbb := $events/data/items/element[id4]/value/magnitude,
$item_ccc := $events/data/items/element[id5]/value/magnitude,
$item_ddd := $events/data/items/element[id6]/value/magnitude)
check $item_aaa > $item_bbb + $item_ccc + $item_ddd
although the second syntax is a bit less flexible when multiple statements need to be included within the for loop, you would need to repeat the for statement in that case.
The next problem you mentioned is:
PB: Note that a path that points to a single typed dvquantity in an archetype can still point to many items in the RM if somewhere up the tree there is a list or a set, for example more than one observation
So I think this implies an incorrect interpretation of this kind of code within an archetype. It can't be understood as simultaneously applying to multiple Observations if it is within an Observation archetype, only to one OBSERVATION instance at a time - usually one about to be committed.
You can still have Lists of things internal to the archetype, as shown above with the Events list, but to process the multiplicity, you would need to do as we have done and use for_all, or some other container-aware operator or function.
Of course, the context being a single instance of data – In case of an OBSERVATION a single OBSERVATION with possibly more than one event, or one event with a cluster that occurs twice. But in case of a COMPOSITION archetype it can be more than one OBSERVATION.
Anyway, does this get closer to the sense of what you would like to do? It's more than I had conceived of, so this is a useful challenge...
Much closer! Does not do everything (looks like not turing complete?) but functions will solve the remaining cases.