I think that Scott's message highlights the biggest problem faced with all marker sets: skin movement artifact.

When we compared bone-pin mounted marker clusters with skin mounted marker clusters, using the same anatomical coordinate system for both, the skin mounted markers not only predicted larger motions, they often predicted motions opposite in direction to those of the underlying bones of the tibio-femoral joint during walking (1) and cutting motions (unpublished results).

When we further evaluated the skin marker data and applied various "solidification" and optimisation methods to the marker clusters, we saw no reduction in the error between skin and pin mounted knee joint kinematics (2).

In terms of a bias imposed by the skin movement artifact, although we attempted to find correlations across subjects, the error was subject specific (unpublished results) and we have not yet found a reliable way to reduce this subject-specific bias.

As such, we have attempted to put forward a basic guideline for reporting kinematics along the lines of what Scott was suggesting (3) by describing the standard error associated with predicting tibio-femoral motion from a four marker cluster of skin markers similar to that described by Manal et al. (4). Although this was a conservative way of predicting the error, for translations and for abd-add and internal-external rotations the 'window' of error may exceed the actual measurement. This is particularly significant since skin markers may predict a tibio-femoral motion that differs from the in vivo motion of the underlying bones not only in magnitude but also in direction.
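
To make the 'window' of error concrete, here is a minimal sketch in Python. It assumes the window is simply the skin-marker estimate plus or minus one standard error, which is a simplification made here for illustration and not necessarily how (3) defines it. The function name and the numbers are hypothetical and are not data from any study.

```python
def error_window(estimate_deg, standard_error_deg):
    """Return (low, high, direction_resolved) for a skin-marker estimate.

    Assumption: the window is estimate +/- one standard error.
    direction_resolved is False when the window spans zero, i.e. the
    sign (direction) of the motion cannot be told from the estimate.
    """
    low = estimate_deg - standard_error_deg
    high = estimate_deg + standard_error_deg
    direction_resolved = low > 0 or high < 0
    return low, high, direction_resolved


# Hypothetical values, for illustration only.
flexion = error_window(60.0, 3.0)   # large primary motion, small error
rotation = error_window(3.0, 4.0)   # small secondary motion, larger error
print(flexion, rotation)
```

With these made-up numbers the window around the large motion stays well clear of zero, while the window around the small secondary rotation spans zero, so neither its magnitude nor its direction could be reported with confidence.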

Ironically, based on some of our other data, I believe it may be easier to 'subtract' the error during more vigorous activities (cutting and hopping), because the increased level of muscle activation reduces the 'wobbling mass' effect of the soft tissue of the thigh (we are still looking at this, however).

In summary, I concur with Scott that we must consider the limitations of our technology, or perhaps even more so of our methodologies. Although the technology is there and it is so tempting to look at those graphs displaying the secondary motions of the joints (in particular the knee joint), we have a responsibility to ensure that the information we deliver respects those limitations or, at the very least, communicates those limitations within the paper. This is particularly relevant when publishing in journals that have a more clinical slant as the limitations may not be as apparent to this audience.


Daniel Benoit

Daniel L Benoit, Ph.D.

To all who have contributed so far - I believe that this is a
valuable discussion of an area too often neglected in motion
analysis, and the quality of the responses has been excellent.

I am not going to add to the discussion about the relative merits of
different marker systems, but I would like to comment on how these
systems are being evaluated/compared. In particular, I would like to
emphasize one point (previously made by David Smith): Repeatability
is necessary but not sufficient to establish accuracy. It implies
good precision, but cannot address the issue of measurement bias.
If one considers the nature of errors introduced by skin markers, the
potential for biased measurements is significant. Displacements of
surface markers due to muscle contraction, skin motion due to change
of joint angle or tissue "bounce" due to impact forces (e.g.
heelstrike) are not random - they are highly correlated with movement
patterns. The errors introduced by these displacements can be quite
repeatable within each subject, but can introduce significant bias
that varies from subject to subject (depending on body style, injury/
disease state, etc). We can generally agree that reduced within-
subject variability (across trials/test sessions/investigators) is a
"good thing". However, in the absence of a gold standard for
comparison, we simply do not know how accurate our measurements
really are. This has been an Achilles heel of 3D motion analysis
throughout its history. I remember working with Jim Gage at the
Newington Children's Hospital gait lab back in the early-mid 1980s
(using a marker system that might be considered a predecessor to the
Helen Hayes system). We could generate repeatable knee internal/
external and ab/adduction plots for our patients (mostly children
with cerebral palsy), but ended up removing them from our clinical
printouts because we did not feel confident about their accuracy. I
routinely see these plots now; clearly the motion analysis technology
has improved, but how much more do we know about their accuracy than
we did 20+ years ago?
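
The distinction between repeatability and accuracy can be shown with a
small sketch in Python. The numbers are hypothetical (not real data),
and the reference value stands in for whatever a gold standard might
provide; the function name is made up for this illustration.

```python
import statistics


def precision_and_bias(measured_deg, reference_deg):
    """Return (SD across trials, mean error relative to the reference)."""
    sd = statistics.stdev(measured_deg)
    bias = statistics.mean(measured_deg) - reference_deg
    return sd, bias


# Hypothetical numbers: five trials from one subject, and a reference
# value that a gold standard would supply. Not real data.
trials = [9.1, 8.8, 9.3, 9.0, 8.9]
sd, bias = precision_and_bias(trials, reference_deg=5.0)
print(f"repeatability SD = {sd:.2f} deg, bias = {bias:.2f} deg")
```

Here the trials agree with each other to within a fraction of a degree,
yet the mean is about 4 degrees away from the reference. Without the
reference value, only the first number can be computed at all.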

So, I think the focus on the relative repeatability of different
marker systems misses the point to some extent. It would be far more
valuable to understand the effect of marker sets on the "confidence
interval" of the measurements we use for research and clinical
decision-making, particularly as they apply to the specific motions
and subject populations each of us works with. It is simplistic to
assume that there is one marker set that is the "best", even if we
consider only clinical gait analysis. For example, a functional
calibration approach (such as that described by Richard Baker) could
be the best solution for a relatively healthy population, but might
be poorly suited for subjects with limited range of motion in one or
more joints (as suggested by Ton van den Bogert, e.g. patients with
CP, stroke, arthritis, etc). Landmark-based joint center location
may work better for these populations, but probably has larger errors
with obesity or skeletal deformity. The type of movement task also
changes the nature of errors (consider skin displacement during
running vs. slow walking), and might affect the relative performance
of different systems.

Trying to summarize where I am going with this (I almost forgot by
now), I think the search for the "best" marker set is a bit of a
quest for the holy grail. The obvious weaknesses of skin-based
motion measurement for assessing joint kinematics have been
demonstrated in multiple studies, and no marker placement or analysis
method can completely overcome them. Perhaps the best we could hope
for would be to agree that there are multiple approaches, each with
its own merits. They should be selected based on consideration of
the specific application, rather than strictly by convenience ("came
with the system" or "that is the one I know"). Equally important, we
should consider the inherent limitations of this technology, to avoid
interpreting artifact as science (even if it is repeatable). Many
types of studies may be relatively insensitive to marker set
selection. For others, there may be specific methods that are
clearly advantageous. However, for studies of certain disorders and
subject populations, there simply may be no acceptable surface marker

Then, how do we answer the original question about marker set
selection? I do not think we have the data to make these decisions.
But, I believe that one key step is incorporating true studies of
accuracy (instead of just repeatability) into the decision-making
process. This is an area where emerging technologies (e.g. dynamic
MRI or biplane radiography) have the potential to make significant
contributions to our understanding of the estimation of skeletal
motion from skin markers.

Thanks to Nancy Denniston for initiating this topic, and I look
forward to continued discussion/debate.

Scott Tashman
