Unveiling Trail Making Test: visual and manual trajectories indexing multiple executive processes

Linari, Ignacio; Juantorena, Gustavo E.; Ibáñez, Agustín; Petroni, Agustín; Kamienkowski, Juan E.

doi:10.1038/s41598-022-16431-9

Download PDF

Article
Open access
Published: 22 August 2022

Unveiling Trail Making Test: visual and manual trajectories indexing multiple executive processes

Ignacio Linari¹^na1,
Gustavo E. Juantorena¹^na1,
Agustín Ibáñez^2,3,4,5,
Agustín Petroni^1,6^na2 &
…
Juan E. Kamienkowski^1,7,8^na2

Scientific Reports volume 12, Article number: 14265 (2022) Cite this article

4323 Accesses
13 Citations
24 Altmetric
Metrics details

Subjects

Abstract

The Trail Making Test (TMT) is one of the most popular neuropsychological tests for executive functions (EFs) assessment. It presents several strengths: it is sensitive to executive dysfunction, it is easy to understand, and has a short administration. However, it has important limitations. First, the underlying EFs articulated during the task are not well discriminated, which makes it a test with low specificity. Second, the pen-and-paper version presents one trial per condition which introduces high variability. Third, only the total time is quantified, which does not allow for a detailed analysis. Fourth, it has a fixed spatial configuration per condition. We designed a computerised version of the TMT to overcome its main limitations and evaluated it in a group of neurotypical adults. Eye and hand positions are measured with high resolution over several trials, and spatial configuration is controlled. Our results showed a very similar performance profile compared to the traditional TMT. Moreover, it revealed differences in eye movements between parts A and B. Most importantly, based on hand and eye movements, we found an internal working memory measure that showed an association to a validated working memory task. Additionally, we proposed another internal measure as a potential marker of inhibitory control. Our results showed that EFs can be studied in more detail using traditional tests combined with powerful digital setups. The cTMT showed potential use in older adult populations and patients with EFs disorders.

Microdosing with psilocybin mushrooms: a double-blind placebo-controlled study

Article Open access 02 August 2022

Control of working memory by phase–amplitude coupling of human hippocampal neurons

Article Open access 17 April 2024

The language network as a natural kind within the broader landscape of the human brain

Article 12 April 2024

Introduction

The Trail Making Test (TMT) is perhaps the most popular neuropsychological task used for standard clinical assessment and research^1,2,3,4,5,6. It comprises parts A and B. In part A, the subject uses a pencil to connect a series of 25 encircled numbers in numerical order. In part B, the subject connects 25 encircled numbers and letters in numerical and alphabetical order, alternating between the numbers and letters². It is sensitive to executive function (EF) impairments and has shown consistent results in multiple clinical populations^1,7,8,9. Different executive processes are thought to be associated with performance in the TMT, including inhibitory control, working memory, and attention^5,10,11,12. In addition to its sensitivity to executive dysfunction, the TMT presents several strengths, as it is simple and intuitive, easy to understand for patients, has a short administration, can be used in different cultures, and the existence of adapted versions allows cross-cultural comparisons^13,14,15.

However, the standard version of TMT presents severe limitations. First, its multiple underlying EFs are not well discriminated, which makes it a test with low specificity. Solving the TMT involves the articulation of multiple processes (e.g. motor preparation and execution, visual search, visuomotor planning and coordination, working memory, inhibition, among others). The behavioural scores do not disentangle these processes, and the final performance constitutes a rough summary and undiscriminated assessment¹². This is known as the impurity problem. One possible solution is to apply multiple assessments in order to capture the specific characteristics of the EF¹⁶, while another option is to understand in more detail the subcomponents of the complex tasks and use more fine-grained measures specifically designed for them¹⁷. Second, the results on the TMT include a very limited set of measures, consisting most frequently of the total time for completion. Third, it has high variability, given that only one sample (trial) is measured per condition, and that time is measured with low accuracy (e.g. the time is measured with a hand chronograph by the experimenter). Fourth, the spatial configurations of the targets are fixed, and their effects are largely unexplored; thus, it is currently a confounding factor when comparing part A and part B^18,19. Fifth, the TMT has moderate accuracy for impaired neuropsychological performance²⁰. Taken together, these limitations reveal the necessity of new versions of the task unravelling the underlying EFs process, where time and hand trajectories are measured with more precision, and where spatial configurations are controlled.

Previous lines of research highlighted the importance of hand and eye movements as valuable tools to study EFs. For instance, some studies showed a tight interaction between spatial working memory and the planning of eye movements in several experiments^21,22. Additionally, other lines of work exhibited a link between visual search markers and inhibitory control (reactive and proactive suppression, see^23,24, respectively). Moreover, two studies showed that the central resources involved in response selection are shared by hand or eye movements during a sequential task^25,26. Overall, these and more recent studies (e.g.²⁷) demonstrated that many central cognitive processes are involved in tasks with hand and eye movements.

In recent years, a few studies have attempted to dissect the TMT into smaller subcomponents, in order to scrutinise in more detail which processes are relevant during the task. Digital versions of the TMT that present a more refined measure of time have been developed, some of them measuring hand trajectories^28,29,30,31. Even more scarce are digital versions of the TMT with eye-tracking. To our knowledge, only one eye-tracking study in TMT parsed the task in monitoring and planning measuring the interaction between hand and eye movements³². Monitoring occurred when the eye fixations were close to the hand, whereas planning occurred when the eyes were far from the hand³². Despite the interesting theoretical and methodological contributions, Wölwer and Gaebel measured eye movements with a low resolution and low sampling rate eye-tracking device, which implicated a serious limitation to measure eye-fixations^32,33,34. Moreover, most of the above mentioned limitations still hold for this pioneering report.

Here, we aim to tackle most of the TMT limitations by designing a computerised version (cTMT) with several innovations. Our design measures performance in multiple trials, it has a controlled spatial configuration, and measures hand and eye movements with high temporal and spatial resolution. More importantly, this design allowed us to reveal different underlying processes. We parsed the task into three phases: monitoring, planning, and a new phase called exploration. Exploration consisted of eye movements scrutinising the scene before the first-hand movement in a trial, collecting information of the scene before starting the actual task of concatenating visual targets. We also validated the cTMT by comparing its performance with the classical TMT and by testing its association to executive functions assessed by a standard neuropsychological battery. We reported specific EFs underlying the task but also externally validated. To that end, we investigated internal markers of working memory and inhibitory control.

Based on the antecedents and our design, we present a specific set of hypotheses. We anticipate that (a) cTMT will parallel classic TMT outcomes and will be well validated with external measures of EFs. Given its higher complexity, (b) TMT-B will exhibit differential eye-movement features in relation to TMT-A. For instance, TMT-B will present more eye fixations than TMT-A. (c) Some of these eye-movement features will reflect the higher-order EF demands in TMT-B that are not present in TMT-A (d) A subgroup of features will be associated with individual differences in performance. (e) A novel eye and hand marker of working memory and inhibition will be obtained from TMT and will correlate with external EF measures.

Methods

Participants

Sixty-one participants were evaluated with the computerised version of the Trail Making Test (cTMT). Participants were recruited at the university and through social media. They reported no record of neurological or psychiatric disorders and no consumption of psychiatric drugs when recruited, and asked for consumption of alcohol and recreational drugs in the last twenty four hours before the experiment. From this sample some participants had to be excluded from the analysis: 12 participants due to poor data acquisition or not following the instructions. The final sample consisted of 49 participants (24 women, between 18 and 42 years old, mean ± = 25.7 ± 5.4), except for additional online measurements (see below), which consisted of a final sample of 41 participants (16% of attrition rate ~ 1 year later). This dropout rate does not affect the main results, since a power analysis for a wilcoxon rank signed-rank test on the two main variables (Pc and RT, see below) showed that only 12 participants were needed to reach an empirical power equal or above 0.99 (Monte Carlo simulation, 10,000 iterations; library MKpower in R language³⁵). For this calculation, we estimated the approximate a priori normal distribution of PC (RT) from a sample of five participants with mean and standard deviation of approximately − 15 (1.5 ms) and 10 (1.0 ms) respectively. All subjects were naïve to the objectives of the experiment and had normal or corrected-to-normal vision. All the experiments described in this paper were reviewed and approved by the IRB of CEMIC Medical Centre and qualified by the Department of Health and Human Services (HHS, USA): IRb00001745–IORG 0001315. All participants provided written informed consent in agreement with the Helsinki declaration.

Computerised TMT (C-TMT)

Procedure

The task follows the original design of the TMT (Fig. 1A)². Participants had to connect 20 items in consecutive order. In TMT-A, only numbers are presented (1 to 20). In TMT-B, both numbers (1 to 10) and capital letters (A to J) were presented. Participants had to connect items in alternate order, starting from number 1 (1, A, 2, B, and so on). The complete task was divided into 5 blocks of 20 trials, divided by four breaks for resting. Each trial started when participants pressed the left mouse button (Fig. 1B). As soon as they pressed the button, the stimuli appeared on the screen, and they had to pass over every item without releasing the button. When the mouse button was released, the stimuli disappeared, and a fixation dot appeared. Each trial had a time limit of 25 s. Trials ended when participants released the button or when they reached the time limit.

Every block started with a drift correction for the eye-tracker, in which participants had to fixate in a small circle (20 pixels) and press the spacebar (Fig. 1B). After the drift correction, a small red/blue dot indicated the upcoming trial type (blue and red predicted trials A and B, respectively), and the new trial began with the button press. Participants were instructed to rest between blocks as much as they needed, and to resume the task whenever they were ready. Before resuming the experiment, they performed the drift correction, consisting of a central dot in which they had to fixate. If the program failed to detect the eye or if the drift exceeded 2 degrees (EyeLink default value), the experiment stopped and could only be resumed after the participant called the experimenter and a recalibration was launched (built-in Eyelink toolbox function).

Participants were instructed to correct their trajectories if they realised that they reached an incorrect item. Although the eye and hand movements were monitored during the whole trial, no online feedback was provided.

Participants completed a total of 100 trials, 50 were TMT-A and 50 were TMT-B, strictly alternating between the two trial types. The task took between 40 and 60 min, including eye tracker calibration and re-calibrations. The stimuli were presented using Psychophysics Toolbox Version 3³⁶. Data was collected between October to December 2018 at the University of Buenos Aires.

Stimuli

The spatial distributions of items were the same for all participants, but the order and whether it corresponded to a trial-type A or B was randomised across participants. With regard to the stimuli spatial arrangement design, the item positions were selected one-by-one from a 30-by-30 grid. First, the starting position was selected randomly. Second, horizontal and vertical displacements were selected from a Poisson distribution with the parameter μ = 5. The position was added to the path if the stroke did not cross any previous stroke (straight lines that connected the centre of each item, if they were connected in order). After filling the grid with 20 items, the area of the convex hull of the resulting path was calculated. Target arrangements were accepted only if they presented an area larger than 40% of the total area covered by the grid.

Each position of the grid was separated by 20 pixels, which correspond to 0.44 degrees of visual angle. The grid covered 600 × 600 pixels. Each item was a single-digit/character surrounded by a circle with a radius of 10 pixels, centred in a given position of the grid. Finally, several spatial distributions were generated and 100 of them were selected (some examples are presented in Fig. 1C). The final area covered by the convex hull was (50 ± 7)% of the total area covered by the grid.

Eye-tracking recordings

Participants were seated in front of a 19-inch screen (SyncMaster 997 MB, 1024 × 768 pixels resolution, 100 Hz refresh rate; Samsung, Suwon, Korea) at a viewing distance of 65 cm, subtending an angle of 29.3 degrees horizontally and 22.5 degrees vertically. A chin rest that was aligned with the centre of the screen prevented head movements. An EyeLink 1000 eye-tracker (SR Research Ltd., Ottawa, Ontario, Canada) was used to record gaze locations of both eyes at a sampling rate of 1 kHz. Nominal average accuracy is 0.5 degrees, and spatial resolution is 0.01 degrees root mean squared, as given by the manufacturer. The participant’s gaze was calibrated with a standard 9-point grid for both eyes. Built-in drift correction was performed before every block of 20 trials. Based on the results of the drift measures, the participant moved forward in the experiment or had to call the experimenter to perform a new calibration.

The best-calibrated eye was selected for each participant based on the visual exploration of every trial. All eye movements were labelled as fixations, saccades, and blinks by the eye-tracker software using the default thresholds for cognitive experiments (308/s for velocity, 8,0008/s 2 for acceleration, and 0.18 for motion³⁷).

Hand tracking recordings

Hand movements were collected with a standard mouse device. The sampling rate is up to 1000 Hz, but it is not homogeneous because the mouse position was only saved when it was moving with its corresponding timestamp. This is not a problem as we are only interested in events such as reaching to or departing from an item. Sequences of selected items was extracted from hand movement data, which had a spatial precision of a pixel (see monitor dimensions).

External validation measures

In order to perform an external validation of the cTMT measures, we administered an executive functions battery³⁸, the INECO Frontal Screening (IFS), and a canonical visual working memory test, the Change Detection Task (CDT)³⁹. The CDT was implemented online, using the jsPsych library⁴⁰ in JavaScript language, and deployed in the Cognition platform (www.cognition.run).

INECO Frontal Screening

The INECO Frontal Screening (IFS) was collected as recommended by the validation study³⁸ The IFS evaluates EFs providing high sensitivity to characterise deficits among different clinical populations^38,41,42,43. The IFS includes a Motor Programming task⁴⁴; Interference⁴⁴ and a Go/NoGo⁴⁴ tasks based on motor sequences; a Verbal Inhibitory Control task⁴⁵ in which participants have to complete the final word of a sentence, avoiding its strong constraint; a Verbal⁴⁶ and a Spatial⁴⁷ Working Memory tasks, a Backward Digit Span⁴⁶; and a measure of Abstraction Capacity by reporting proverb interpretations⁴⁶. Each task adds points that sum up to a total between 0 and 30. Using a cutoff of 25 points, sensitivity of the IFS was 96.2%, and specificity 91.5% in differentiating controls from patients, and it correlated with classical executive tests such as the time to complete TMT-B (rho = − 0.75; p < 0.001)³⁸. The IFS has good internal consistency (α = 0.80), sensibility to evaluate frontal-executive dysfunction⁴⁸, and was remarkably similar with increasing age⁴⁹. The IFS was administered through an interview with the experimenter, and overall it took approximately 10 min.

Change detection task

The change detection task is a simple assessment that can reliably estimate visual working memory capacity (VWM) in a very simple way³⁹. An array of 4 or 6 coloured squares were presented for 150 ms and after a 900 ms interval with no stimuli, only one square appeared on the screen. Participants had to respond if that square was part of the original array or not, meaning that it had the same colour as the one presented in the array in that particular position. Subjects responded using two keyboard keys with the index finger of each hand. There were consistent and inconsistent types of trials. In our online experimental design (Fig. 4D) we show 120 trials and response times (RT) and keyboard responses were measured. In order to evaluate VWM capacity, we calculate the number of items stored in working memory on a given trial type (K) (Eq. 1) for the 6 array trials (K₆), 4 array trials (K₄), and the average value between them (K_average).

$$K={N}_{set}(2\frac{correct\,trials}{all\,trials}-1)$$

(1)

where N_set corresponds to the number of squares in the presented array for a specific trial (4 or 6).

Data analysis

Performance analysis

Each trial had a time limit of 25 s for its completion. Given that most of the participants failed to reach 20 items, we decided to use a criterion of 12 correctly concatenated items, starting from the first item, to declare the trial completed and define the response time as the time needed to concatenate the first 12 items. A similar criterion was used for the percentage of completion: the percentage of trials that had been successfully completed until target 12 (PC). It is worth mentioning that increasing the number of items covered throughout the trial significantly reduced the difficulty of the task (even for the first 12 items), due to a benefit in searching the next item among fewer distractors in each step of the task. The selection of the threshold of 12 targets resulted in a good estimation of performance keeping a reasonable amount of data. In fact, the main results did not depend on the threshold (see Fig. S1 for a replication of the results with two other thresholds). Moreover, these criteria generated robust results throughout the task, given that there were no significant learning effects, as revealed by the comparison of the first and last thirds of the trials regarding the ratio (B/A) for PC and RT (Wilcoxon signed-rank test: PC Ratio: p = 0.14; z = − 1.49; RT Ratio: p = 0.12; z = 1.56).

Correct trials were those that fulfilled the completeness criterion and also presented a correctly concatenated sequence of targets. To that end, the drawn trajectory of the mouse should enter all the targets only in the correct order (e.g. 1-A-2-B, etc.). In order to define a path as correctly concatenated, we evaluated the sequence of items produced by the participant. A target was reached when a threshold of 10 pixels from the centre was reached. An additional criterion was that trajectories should not cross. In other words, the trajectory curve should not touch itself, as in the original TMT.

Statistics in eye data

In order to compare the distributions of saccades and fixations of cTMT-A and cTMT-B (see Fig. 2C–F), we filtered the raw data by keeping only the correct trials, discarding fixation durations that were over 1000 ms, and removing saccade durations that were over 100 ms. Finally, to equilibrate the samples, we subsampled by an order of magnitude. Given the large amount of data, the distributions did not change visually after filtering. We applied the Kuiper’s test⁵⁰ (twosamples library in R language⁵¹) to statistically test for differences between cTMT-A and cTMT-B.

Parsing into stages

Following Wölwer and Gaebel³², fixations were classified based on their relationship with the mouse position in three different stages. Fixations were defined as Monitoring fixations if they were located near the cursor (closer than 25 pixels) at any time during the fixation interval. Fixations were defined as Planning fixations if they were located far from the cursor (farther than 25 pxs) during all the fixation intervals^32,33,34. From the planning fixations, we also defined a separate group called Exploration fixations that correspond to the fixations occurring before the first-hand movement.

The mean number of fixations and the median fixation duration were calculated for each participant and condition. Wilcoxon signed-rank tests were used to compare between conditions cTMT-A and cTMT-B. Effect sizes for these tests were estimated as $e.s.=\frac{Z}{\sqrt{n}}$

Internal measure of working memory

A remembered target is one that was seen and not immediately selected with the mouse, i.e., other targets were seen before actually passing with the mouse on top of that target. For instance, in Fig. 4Ai, the target “2” was seen while searching for “1”, and then reached with the mouse directly without fixating on it; in Fig. 4Aii, the participants saw the target “2” again right before they selected it with their hand. Thus, in the former case of this schematic example, the participants remembered the position of the target, and in the latter, they did not. This criterion does not differentiate if there is one or several targets between the last view and the passage with the mouse. This analysis only included correct trials (as defined in section Performance analysis, see Methods).

Regarding the target remembered ratio (TR-B/TR-A) along with the task, we calculated the previously described metric in 5 blocks with 20 trials each. These were the actual blocks of the task, with a pause between them (Fig. S3). The target remembered ratio was calculated for each block separately. For this particular metric, only blocks with at least 3 correct trials for each part were included.

Internal measure of inhibitory control

Hand trajectories are directed towards the fixated targets^52,53, when they are the next in the sequence (Correct Detections). Inhibition occurs during fixations on items that do not follow in the sequence (False Detection), when the hand must keep its trajectory without orienting it towards the item (Fig. 5A). A lack of inhibition will be manifested as a persistent tendency to orient the hand trajectory towards False Detections.

In order to add all the hand trajectories projected into the direction of the fixated item, first, the hand trajectories were segmented between the onset and the offset of the fixations into the items. Second, the position of the hand at the time of the fixation onset was subtracted in both vertical and horizontal directions. Therefore, all the trajectories start at the origin (0,0). Third, they were projected into the direction of the fixated item and normalised by the distance between the initial point and the item. Thus, the fixated item was at the point (1,0). Finally, fixations to the next item in the sequence (Correct Detections) were separated from fixations to other items (False Detections) (see Fig. 5A).

The spatial distribution of the trajectories was estimated as the 2D-histogram of the trajectories (see Fig. S3). The temporal pattern was estimated as the position relative to the fixated item (and projected as described before) as a function of time.

Results

Hand movements: global performance of the cTMT

Participants completed 100 trials of the cTMT, strictly alternating between part A and part B (Fig. 1A, but note that we also replicated the relevant results in a subsample of 30 trials). A trial started when the participant pressed the mouse button, which enabled them to draw on-screen, and finished when the participant released the mouse button or after 25 s. We applied this time limit in order to run the whole experiment in approximately 40 min, avoiding fatigue (the total time for completing 30 trials is less than 12 min). As a consequence, participants did not have enough time to reach all the items in numerous cases. Thus, to characterise the general performance we used the hand movement data and measured both the time needed to concatenate 12 targets in the correct order (RT) and the percentage of trials that had been successfully completed until target number 12 (percentage of completion, PC).

Regarding validation measures using hand movements, the initial mouse button press, and the final button release, we found a significant increase in RT in part B compared to part A (Wilcoxon signed-rank test: p = 1.1*10^–9, z = −6, e.s. = 0.86; Fig. 1B). Also, the PC was lower in B compared to A (Wilcoxon signed-rank test: p = 1.1*10^–9, z = 6, e.s. = 0.86; Fig. 1C). It is important to note that these results hold even considering only the first 30 trials, including both conditions, and excluding the very first ones to discard possible errors (trial 1 from TMT-A and trial 1 from TMT-B) (see Fig. S2A,B). This is consistent with previous results for the pencil and paper TMT task^54,55.

Next, the performance based on hand movements was tested for associations with EFs. We observed a significant correlation between the Completion Ratio (PC-B/PC-A) and the total IFS score (Fig. 1E; Spearman Correlation: rho = 0.437, p = 1.7*10^–3), but not with the RT ratio (Fig. 1D; Spearman Correlation: rho = 0.006, p = 0.97). These results served as a validation of the proposed version of the TMT. It is worth mentioning that the fact that the correlation of the IFS with PC, but not with the RT was significant, might be a direct consequence of the time pressure and time limit of our design, not present in the original TMT.

One relevant aspect of the present version of the TMT is that spatial configurations are extensively explored and, crucially, they were the same for all participants, except that trial type (A or B) was assigned randomly to each spatial configuration before the experiment. As seen in Fig. 1F,G, the initial hand trajectory is similar in both parts, rich in twists in order to not overlap the trail. As the vast majority of participants could not reach the last targets, it is common to find at the end of the B trials a decrease in the density of samples (see Fig. 1G).

In summary, the novel cTMT results resemble the classic TMT, even taking only the first 30 trials. These results are exclusively related to the change of lists (only numbers or numbers and letters) as the same spatial configurations were presented in both types of trials. Finally, the performance correlated with an external screening test of EFs (IFS).

Eye movements

We observe a similar structure in both scanpaths, except that TMT-A has, qualitatively, more colour consistency along the trajectory, revealing that in TMT-A almost all trials reached the last item. Figure 2A,B illustrates the eye scanpaths of two representative trials (TMT-A and TMT-B, respectively) with identical spatial configuration. TMT-B, on the other hand, presented more variability in the number of reached items, also reflected by the larger error bars observed in the PC barplot in Fig. 1C.

Regarding fixations and saccades, TMT-A and TMT-B were indistinguishable in many measures, including saccade and fixation duration as well as saccade amplitude (Fig. 2C–E; Kuiper’s test: V = 0.01, p = 0.77; V = 0.01, p = 0.96; V = 0.02, p = 0.11 respectively). The number of fixations showed a clear difference between both conditions, with a higher number in TMT-B (Kuiper’s test: V = 0.17, p = 2.5*10^–4; median(TMT-A) = 76, median(TMT-B) = 81; Fig. 2F). An identical saccade duration, saccade amplitude, and fixation duration suggest a similar visual mechanism between parts A and B. The observed difference in the number of fixations might originate in a more complex processing of the task, more related to higher-order cognitive processes than visual mechanisms. In other words, to solve both parts of the task, subjects seem to use their visual machinery in a very similar way, except that TMT-B requires a more intensive scanning of the visual scene. To explore in more detail the possible mechanisms involved in the differential performance of A and B, we parsed the task in three phases, and analysed eye movements in each phase.

Parsing the task in three phases using hand and eye movements

The previous section showed that the difference in the time needed to complete the task in both conditions is mainly explained by the number of fixations performed during the trial, and not by fixation duration or saccade duration. In the following section, we will focus on the number of fixations in our analysis. We aimed to understand which aspects of the resolution of the task change between parts A and B, revealed by fixation type.

Previous work classified fixations during the TMT in two phases: planning and monitoring³². Here, we use a similar classification, with the addition of a new initial exploration phase. It corresponds to all fixations occurring at the start of each trial before the movement of the hand and accounts not only for the search for the first item but also for the initial exploration of the scene (Fig. 3A). The monitoring phase consisted of fixations that occurred over the cursor and were more related to the motor execution of the task, while the planning phase consisted of fixations that occurred outside the cursor and were related to more executive aspects of the task. For an illustration of the phase classification, we created a video where fixations are coloured according to the phase where they occur in real-time (See Supplemental Video).

First, we compared the number of fixations and fixation duration between TMT-A and TMT-B at each phase. We observed a higher number of exploratory (Wilcoxon signed-rank test: p = 9*10^–4, z = − 3.3, e.s. = 0.47) and planning fixations in part B (Wilcoxon signed-rank test: p = 1.9*10^–9, z = − 6, e.s. = 0.86), following the trend of the overall task. Conversely, there was a lower number of monitoring fixations in part B (Wilcoxon signed-rank test: p = 1.1*10^–4, z = 3.8, e.s. = 0.54; Fig. 3B). Regarding fixation duration, there was no significant difference between TMT-A and TMT-B in exploration (Wilcoxon signed-rank test: p = 0.98, z = − 0.01) and planning (Wilcoxon signed-rank test: p = 0.7, z = 0.38). However, in the monitoring phase, fixation duration was higher in TMT-A (Wilcoxon signed-rank test: p = 2.4*10^–6, z = 4.7, e.s. = 0.67; Fig. 3D). Figure 3B,D shows that the number of fixations was more informative than fixation duration, explaining the differences between TMT-B and TMT-A, which is consistent with the distribution of eye movements depicted in Fig. 2.

To explore the association between eye movements and performance in the three phases of the task, we calculated the ratio (B/A) of the number of fixations and their corresponding duration, and correlated them with a measure of performance (RT-B/RT-A). We found significant correlations between RT ratio and the number of fixations ratio in exploration (Spearman Correlation: rho = 0.39, p = 5.2*10^–3) and planning phases (Spearman Correlation: rho = 0.29, p = 4.3*10^–2; Fig. 3B,C), but not in monitoring (Spearman Correlation: rho = − 0.22, p = 0.13). This is again consistent with the differences in the distribution of the number of fixations (Fig. 2F). As seen in the distributions in Fig. 2E, fixation duration did not vary between conditions, so it was expected that the fixation duration ratio did not affect the RT ratio (Spearman Correlation in Exploratory: rho = − 0.05, p = 0.71; Planning: rho = − 0.02, p = 0.91; Monitoring: rho = − 0.05, p = 0.74; Fig. 3E).

Summarising, the number of fixations but not fixation or saccade duration/amplitude varied between parts A and B and provided adequate measures of task performance. Splitting the task into three phases unveiled the different aspects of the executive process (exploration, planning, execution, and monitoring). The increase in the number of fixations in B versus A, in both exploration and planning, as well as a decrease in monitoring characterised the different stages. Additionally, a small but significant increase in fixation time in part A versus B was observed only in the monitoring phase. Lastly, the increase in the number of fixations observed in B/A for the exploration and planning phases correlated positively with relative performance B/A (RT Ratio).

Visual working memory

In this section, we derived an internal measure of visual working memory using eye and hand movements in the cTMT. Then, we inspected how this internal measure of visual working memory of the targets affected performance in TMT-B with respect to TMT-A. Finally, we compared the derived measure with the individual performance in a validated visual working memory task. In order to quantify our measure, we estimated the number of Targets Remembered along with the search (TR), i.e. the number of targets that had no fixations right before the hand reached them (see methods section), including only correct trials.

On average, participants remembered more targets in TMT-A than in TMT-B (Fig. 4B; A = 4.60 ± 0.87; B = 4.38 ± 0.98; Wilcoxon signed-rank test: p = 0.028; z = 2.2, e.s. = 0.31). This result is consistent with fewer overall fixations in TMT-A given that a higher target location memory implies less target search around the scene (see Fig. 2). It is also consistent with fewer planning fixations (see Fig. 3). This suggests that participants memorised the location of more targets ahead and had to look again at the same target fewer times in order to correctly complete the trial in TMT-A.

The TR Ratio (TR in B/TR in A) correlated with the PC Ratio (Fig. 4C; Spearman Correlation: rho = 0.48, p = 5.3*10^–4), indicating that the relative improvement in remembering targets in B was associated with the overall performance of the task. Moreover, the TR ratio was tested for associations with an external WM measure, the visual working memory capacity (K_average) estimated from a Change Detection Task (Fig. 4D) (see Methods)³⁹, showing a moderate correlation (Fig. 4E; Spearman Correlation: rho = 0.43, p = 5.3*10^–3, N = 41).

In brief, we extracted a novel internal measure of visual working memory in the cTMT that correlated both with performance (PC Ratio) and a canonical external VWM measure (CDT), suggesting that it is possible to isolate individual EF components within the cTMT.

Inhibitory control

In this section, we derived, from the eye and hand movements' data, a second internal measure of executive functioning, in this case, inhibitory control. When the eyes fixate on a new item, it could be either the next item in the sequence or not, i.e. it could be a Correct or a False Detection of the target. In the latter case, the hand has to avoid following the eye and wait until the correct item is found. This behaviour is evident when aligning all the paths explored by the hand after fixating a new item (Fig. 5B,C, Fig. S3). The spatial distribution of these paths shows that, when a correct item was identified, the hand moved directly towards the target (Fig. 5B). When a false detection occurred, the hand stayed still (Fig. 5C) or moved in other directions (Fig. S3) showing an inhibition of early motor actions. In order to quantify this behaviour, we estimated the displacement in the direction of the new item, for Correct and False detections, and for TMT-A and TMT-B separately.

Consistent with the spatial distributions, the hand displacement for the Correct detections was larger than for False detections, reaching the position of the target (displacement = 1, Fig. 5D) and revealing an inhibitory motor process. Interestingly, the curves in TMT-A and TMT-B were similar, as it was also evident for the difference curves between Correct and False detections (Fig. 5E). The Area under the difference curves was significantly different from zero for both cTMT-A and cTMT-B (Fig. 5F; Signed Rank test: TMT-A: p < 10^–8, TMT-B: p < 10^–8, z = 6.1, e.s. = 0.87), but there was no significant difference between TMT-A and TMT-B (Signed Rank test, p = 0.29, z = 1.1).

The Area under the difference curves could be an interesting estimation of the inhibition, the larger the area, the larger the inhibition to avoid following the eyes after False detections. Nevertheless, the Displacement in both TMT-A and TMT-B did not yield significant correlations with the IFS (Spearman correlation, rho < 0.2, p > 0.25, N = 49) or its subset of verbal inhibitory measures (Spearman correlation, rho < 0.15, p > 0.45, N = 49).

In summary, we extracted a novel internal measure of inhibition in the cTMT that seemed to capture the dynamics of inhibitory control processes within the task, but it did not reflect the difference in performance, and it did not correlate with the external measures (IFS).

Discussion

In the present study, we aimed to design a computerised version of the TMT (cTMT) that could tackle its main limitations. In particular, we aimed to build new measures within the task that could reflect individual EF processes, based on the precise recording of hand and eye movements. Firstly, we validated the cTMT showing that the RTs and performance profiles are consistent with the classic TMT. Moreover, we observed a significant correlation between the Completion Ratio and an independent executive functions battery (IFS). Secondly, we showed that eye movements' features were very similar in TMT-A and TMT-B, and differed only in the number of fixations, implying that the visual mechanisms are similar between conditions, but they differ in higher-level processes. Thirdly, when the task was parsed into three different stages (exploratory, planning, and monitoring), we found a higher number of exploratory and planning fixations in TMT-B, and a lower number of monitoring fixations. This could be interpreted as higher planning and executive (high level) costs in B, and fewer resources devoted to lower-level processes (monitoring hand movements). Fourthly, the mean amount of targets remembered was higher in TMT-A, and the ratio of remembered targets between TMT-B and TMT-A correlates with the Completion Ratio of the whole task. These results imply a lower memory performance in part B given its higher demands, and that the individual memory skills within the task explain, at least in part, overall performance. Strikingly, the amount of remembered targets also correlated with an external measure of visual working memory capacity (K_average in Change Detection Task), which validated our measure as an individual marker of VWM. Finally, we derived a potential internal measure of inhibition that is based on the hand movements towards Correct and False eye detections of items. To our knowledge, this is the first study that uses high-resolution eye and hand movements in TMT. One important aspect of our work is that we were able to dissect the task and extract individual markers of EFs, tackling one of the main limitations of the traditional TMT, making the cTMT a promising tool for research and clinical use.

As our first hypothesis, we replicated the general results of classical TMT: the resolution of type B trials took more time, while the percentage of completion was higher for A type trials. Furthermore, we found a correlation between the Total IFS Score and the Completion Ratio (PC-B/PC-A), while not with the RT Ratio, probably due to the limiting time factor. Previous digital implementations of the TMT expanded the analysis of the classic version by extracting more features^28,29,30,31 but in this work, we also focused on extracting internal measures as markers of EFs.

There were only a few previous experiments on eye movements with the TMT task. One used a high-resolution eye tracker but did not extract any more features other than the number of fixations⁵⁶, and others tried to disentangle the task but used low-resolution eye trackers^32,33,34,57. Thus, we started inspecting eye movements recorded with a high-resolution eye tracker that enables fixation and saccade analysis. We observed that even though almost all fixation and saccade properties were very similar between both TMT parts (saccade and fixation duration, saccade amplitude), the number of fixations was statistically higher in part B. This result is consistent with the previous bibliography⁵⁶ and may be the result of increased cognitive load interfering with the participants’ search strategy. The number of fixations can assess participants’ attention by indicating how many attentional resources are utilised between stimuli⁵⁸.

From the previous work on eye movements in the TMT, a series of works^32,33,34 proposed that the task could be divided into phases, and that the total time spent in each phase changes in different patient populations. Starting from the taxonomy previously proposed³² (monitoring and planning fixations) and adding a new class called exploratory fixations, we explored separately the number of fixations and the fixation duration. Previous work was done using a low-frequency eye tracker (50 Hz), and thus their analysis was limited to total time on each phase³². When we focused on the number of fixations, we found a higher number of exploratory (the first ones, until the cursor moves) and planning (those fixations away from the cursor after the first movement) in part B. This is also consistent with Wölwer and Gaebel³², who showed that the longer planning periods in schizophrenia patients resulted from a higher number of fixations within such a planning period in both test versions. In relation to the fixations’ duration, we only saw statistical differences in the monitoring ones (those after the first movement and over the cursor). This is consistent with the fact that the monitoring phase is more related to the motor execution of the task, but the planning phase is related to more executive aspects of the task (in other words, to the specific executive component needed in TMT-B⁵⁹). We suggest there is an amount of time participants dedicate to monitor the cursor (without limitation of time practically) in part A. But in B, as it is more complex and more cognitive load is involved, subjects sacrifice this time in order to dedicate it to planning (trade-off). Saccade durations are not related to processing costs, and they have a small impact on total time as they are smaller than the fixation durations, and do not change between part A and B.

Then, we investigated the visual working memory performance based on remembered TMT items. We found a higher number of remembered targets in part A that is consistent with less planning fixations in A, since participants might use their memory of target locations, requiring less search in TMT-A. The TR Ratio correlated with the overall performance (PC Ratio) and also correlated with an external measure of visual working memory (CDT)³⁹, validating our cTMT memory measure. It is worth noting that previous work linked the TMT performance with working memory, but results depended on which tests were administered (canonical and complex tests such as the Wechsler Memory Scale and the Wisconsin Card Sorting Test, among others)¹¹. These works focused on correlating results of classical tests in a general way and, to our knowledge, no other reports have attempted to examine the relationship between internal markers of the TMT and specific VWM tests as the CDT.

Based on the cTMT we not only extracted a working memory measure but also a way to assess cognitive inhibition using only the hand and eye trajectories. As Sánchez-Cubillo and collaborators¹² remark, the role of inhibitory control (IC) in TMT is not fully elucidated. In accordance with Arbuthnott and Frank¹⁰, a relationship between TMT-B and inhibitory abilities has been supported on the basis of significant correlations between TMT and the Stroop Interference condition^63,64. However, the use of more specific measures of inhibitory abilities such as Go/No-Go tasks⁶⁵ or negative priming tasks⁶⁶ has provided contradictory evidence about the role of inhibition in TMT scores with both positive and negative results, respectively. In fact, previous work highlighted the complexity of this particular executive function, as it represents a multidimensional construct^67,68 more difficult to disentangle^68,69. In this research, we aimed to use more precise measurements of hand and eye movements within the task to build specific IC estimates. Here, our estimate utilises the Correct and False detections of the next target in order to quantify the inhibitory control of the subjects. Consistent with the spatial distributions, the hand displacement for the Correct detections was larger than for False detections, reaching the position of the target. The Area under the difference curves was significantly different from zero for both types of trial, but not between them. We suggest that the area under the difference curves could be an interesting estimation of the inhibition (i.e. the larger the area, the larger the inhibition to avoid following the eyes after False detections). This was corroborated with our data. Nevertheless, we suffered from the same deficit as previous work, failing to find significant correlations with the external measures of IC drawn from the IFS questionnaire. A possible explanation for this result may be related to the multidimensionality of the IC^67,68,69 and the type of inhibitory tests implemented in the IFS battery^38,68: a simple motor Go/NoGo task that is usually saturated in neurotypical participants and a modified Hayling test, which is a marker of verbal inhibition, while the cTMT, although involves cognitive control, it is a more visuospatial task. Future work should explore this in two possible ways. Firstly, developing other internal measures of IC that capture different subcomponents such as visuospatial inhibition⁶⁸ and, secondly, using other external IC tasks such as the Go/NoGo or the Stop-Signal task for cognitive control, or Spatial Stroop or Flanker Task for visuospatial inhibition, in order to disentangle the different aspects of IC involved in the TMT^68,69.

Moreover, these measures were evaluated only on the individual differences in EFs in a neurotypical population, which is a demanding test due to the lower intersubject variability. Nevertheless, we highlight the importance of finding correlations between our global performance measures and an independent EFs questionnaire, and our WM measure, and an external measure of VWM capacity. These results encourage further research to expand the sample to other populations such as Alzheimer’s Disease or Fronto-Temporal Dementias where the paper-and-pen TMT has already proven to be very useful, and also previous work found effects analysing the task segmentation^32,33,34. The length of the task could be an impediment to evaluate those clinical populations, but in this work, we showed that even in this neurotypical population the effects are significant using only the first trials.

It is worth mentioning that our computerised version of the TMT not only allowed us to record hand and eye movements precisely, but also to overcome some of the limitations of the paper-and-pencil version. For instance, our version balances the spatial configurations for type A and B trials, as the spatial configuration of the targets in the classic version are not the same, implying that part of the results observed might be explained by the particular configurations of TMT-A and TMT-B^18,19. In fact, Gaudino and colleagues showed that using only numbers, significant time differences arose between the spatial configurations of parts A and B¹⁹. So, controlling the spatial configuration allowed us to reduce the sources of variability in the time between both parts. Moreover, it let us make a more accurate conclusion about the task switching, linking these differences in performance with the change of lists: from only numbers to letters and numbers. Additionally, our experimental design had a higher number of trials than previous works^{5,29,30,32,33,34,70}. But, it is worth noting that there were no significant learning effects, as revealed by the comparison of the first and last thirds of the trials regarding the ratio (B/A) for PC and RT (Wilcoxon signed-rank test: PC Ratio: p = 0.14; z = − 1.49; RT Ratio: p = 0.12; z = 1.56).

Conclusions

In recent years computational psychiatry and digital neuropsychology^71,72 have gained traction based on the use of computational approaches to model neuroscience and behaviour variables of interest, and machine learning approaches to predict brain pathologies and syndromes from behavioural measures. One of the limiting factors for using this last type of method to obtain new insights and develop new tools, is the lack of precise enough measures for executive functions and the extension and diversity of actual protocols. In this way, our contribution could help generate new precise features of different EFs based on a single complex task. And, in the future, this task could be even replaced by natural behaviour.

The measures presented here will also allow us to understand the internal dynamics and interplay of EFs during the resolution of a complex task. To summarise, in the present work, we validated the overall performance of the computerised version of the task with external measures and explored the involvement of eye movements in the different phases of the task resolution in both trial types. Moreover, the cTMT surpasses many of the gaps of the standard TMT: (1) it provides multiple fine-grained subscores of the underlying EFs, which are critical for analysing more specific deficits in different pathologies: (2) this version provides multiple behavioural measures that allow a more robust characterization of the participant’s performance and brings multiple features for machine learning multimodal and multi-feature analysis; (3) it provides greater control of spatial configuration bias and more robust results (less variable) by controlling the potential bias of one single configuration. Thus, we propose that the cTMT could become a powerful tool for an improvement in the accuracy of diagnoses of a wide variety of pathologies where the EFs are affected, such as Alzheimer’s Disease or Fronto-Temporal Dementias.

Data availability

The analysis code and the data used in the present study will be available upon publication.

References

Lange, R. T., Iverson, G. L., Zakrzewski, M. J., Ethel-King, P. E. & Franzen, M. D. Interpreting the Trail Making Test following traumatic brain injury: Comparison of traditional time scores and derived indices. J. Clin. Exp. Neuropsychol. 27, 897–906. https://doi.org/10.1080/1380339049091290 (2005).
Article PubMed Google Scholar
Bowie, C. R. & Harvey, P. D. Administration and interpretation of the Trail Making Test. Nat. Protoc. 1, 2277–2281. https://doi.org/10.1038/nprot.2006.390 (2006).
Article CAS PubMed Google Scholar
Rabin, L. A., Burton, L. A. & Barr, W. B. Utilization rates of ecologically oriented instruments among clinical neuropsychologists. Clin. Neuropsychol. 21, 727–743. https://doi.org/10.1080/13854040600888776 (2007).
Article PubMed Google Scholar
Reitan, R. M. Validity of the Trail Making Test as an indicator of organic brain damage. Percept. Mot. Skills https://doi.org/10.2466/pms.1958.8.3.271 (1958).
Article Google Scholar
Salthouse, T. A. What cognitive abilities are involved in trail-making performance?. Intelligence 39, 222–232. https://doi.org/10.1016/j.intell.2011.03.001 (2011).
Article PubMed Central PubMed Google Scholar
Soukup, V. M., Ingram, F., Grady, J. J. & Schiess, M. C. Trail Making Test: issues in normative data selection. Appl. Neuropsychol. 5, 65–73. https://doi.org/10.1207/s15324826an0502_2 (1998).
Article CAS PubMed Google Scholar
Ashendorf, L. et al. Trail Making Test errors in normal aging, mild cognitive impairment, and dementia. Arch. Clin. Neuropsychol. 23, 129–137. https://doi.org/10.1016/j.acn.2007.11.005 (2008).
Article PubMed Google Scholar
Periáñez, J. A. et al. Trail Making Test in traumatic brain injury, schizophrenia, and normal ageing: Sample comparisons and normative data. Arch. Clin. Neuropsychol. 22, 433–447. https://doi.org/10.1016/j.acn.2007.01.022 (2007).
Article PubMed Google Scholar
Giovagnoli, A. R. et al. Trail making test: Normative values from 287 normal adult controls. Ital. J. Neurol. Sci. 17, 305–309. https://doi.org/10.1007/BF01997792 (1996).
Article CAS PubMed Google Scholar
Arbuthnott, K. & Frank, J. Trail Making Test, Part B as a measure of executive control: Validation using a set-switching paradigm. J. Clin. Exp. Neuropsychol. 22, 518–528. https://doi.org/10.1076/1380-3395(200008)22:4;1-0;FT518 (2000).
Article CAS PubMed Google Scholar
Kortte, K. B., Horner, M. D. & Windham, W. K. The Trail Making Test, Part B: Cognitive flexibility or ability to maintain set?. Appl. Neuropsychol. 9, 106–109. https://doi.org/10.1207/S15324826AN0902_5 (2002).
Article PubMed Google Scholar
Sánchez-Cubillo, I. et al. Construct validity of the Trail Making Test: Role of task-switching, working memory, inhibition/interference control, and visuomotor abilities. J. Int. Neuropsychol. Soc. 15, 438–450. https://doi.org/10.1017/S1355617709090626 (2009).
Article PubMed Google Scholar
Kim, H. J., Baek, M. J. & Kim, S. Alternative type of the trail making test in nonnative english-speakers: The Trail Making Test-Black & White. PLoS ONE 9, e89078. https://doi.org/10.1371/journal.pone.0089078 (2014).
Article ADS CAS PubMed Central PubMed Google Scholar
Lee, T. M., Cheung, C. C., Chan, J. K. & Chan, C. C. Trail making across languages. J. Clin. Exp. Neuropsychol. 22, 772–778. https://doi.org/10.1076/jcen.22.6.772.954 (2000).
Article CAS PubMed Google Scholar
Maj, M. et al. Evaluation of two new neuropsychological tests designed to minimize cultural bias in the assessment of HIV-1 seropositive persons: A WHO study. Arch. Clin. Neuropsychol. 8, 123–135 (1993).
Article CAS PubMed Google Scholar
Miyake, A., Emerson, M. J. & Friedman, N. P. Assessment of executive functions in clinical settings: Problems and recommendations. Semin. Speech Lang. 21, 169–183. https://doi.org/10.1055/s-2000-7563 (2000).
Article CAS PubMed Google Scholar
Jurado, M. B. & Rosselli, M. The elusive nature of executive functions: A review of our current understanding. Neuropsychol. Rev. 17, 213–233. https://doi.org/10.1007/s11065-007-9040-z (2007).
Article PubMed Google Scholar
Fossum, B., Holmberg, H. & Reinvang, I. Spatial and symbolic factors in performance on the Trail Making Test. Neuropsychology https://doi.org/10.1037/0894-4105.6.1.71 (1992).
Article Google Scholar
Gaudino, E. A., Geisler, M. W. & Squires, N. K. Construct validity in the trail making test: What makes part B harder?. J. Clin. Exp. Neuropsychol. 17, 529–535. https://doi.org/10.1080/01688639508405143 (1995).
Article CAS PubMed Google Scholar
Chan, E. et al. Limitations of the trail making test part-B in assessing frontal executive dysfunction. J. Int. Neuropsychol. Soc. JINS 21, 169–174. https://doi.org/10.1017/S135561771500003X (2015).
Article PubMed Google Scholar
Theeuwes, J., Belopolsky, A. & Olivers, C. N. L. Interactions between working memory, attention and eye movements. Acta Psychol. (Amst.) 132, 106–114. https://doi.org/10.1016/j.actpsy.2009.01.005 (2009).
Article Google Scholar
Woodman, G. F. & Luck, S. J. Do the contents of visual working memory automatically influence attentional selection during visual search?. J. Exp. Psychol. Hum. Percept. Perform. 33, 363–377. https://doi.org/10.1037/0096-1523.33.2.363 (2007).
Article PubMed Central PubMed Google Scholar
Moher, J. & Egeth, H. E. The ignoring paradox: Cueing distractor features leads first to selection, then to inhibition of to-be-ignored items. Atten. Percept. Psychophys. 74, 1590–1605. https://doi.org/10.3758/s13414-012-0358-0 (2012).
Article PubMed Google Scholar
Huang, C., Vilotijević, A., Theeuwes, J. & Donk, M. Proactive distractor suppression elicited by statistical regularities in visual search. Psychon. Bull. Rev. 28, 918–927. https://doi.org/10.3758/s13423-021-01891-3 (2021).
Article PubMed Central PubMed Google Scholar
Shalom, D. E. & Sigman, M. Freedom and rules in human sequential performance: A refractory period in eye-hand coordination. J. Vis. 13, 4–4. https://doi.org/10.1167/13.3.4 (2013).
Article PubMed Google Scholar
Wu, S.-C. & Remington, R. W. Coordination of component mental operations in a multiple-response task. In Proc. 2004 Symp. Eye Track. Res. Appl. 63–70 (Association for Computing Machinery, 2004). https://doi.org/10.1145/968363.968380.
Eckstein, M. K., Guerra-Carrillo, B., Miller Singley, A. T. & Bunge, S. A. Beyond eye gaze: What else can eyetracking reveal about cognition and cognitive development?. Dev. Cogn. Neurosci. 25, 69–91. https://doi.org/10.1016/j.dcn.2016.11.001 (2017).
Article PubMed Google Scholar
Salthouse, T. A. & Fristoe, N. M. Process analysis of adult age effects on a computer-administered Trail Making Test. Neuropsychology 9, 518–528. https://doi.org/10.1037/0894-4105.9.4.518 (1995).
Article Google Scholar
Woods, D. L., Wyma, J. M., Herron, T. J. & Yund, E. W. The effects of aging, malingering, and traumatic brain injury on computerized trail-making test performance. PLoS ONE 10, e0124345. https://doi.org/10.1371/journal.pone.0124345 (2015).
Article CAS PubMed Central PubMed Google Scholar
Fellows, R. P., Dahmen, J., Cook, D. & Schmitter-Edgecombe, M. Multicomponent analysis of a digital Trail Making Test. Clin. Neuropsychol. 31, 154–167. https://doi.org/10.1080/13854046.2016.1238510 (2017).
Article PubMed Google Scholar
Dahmen, J., Cook, D., Fellows, R. & Schmitter-Edgecombe, M. An analysis of a digital variant of the trail making test using machine learning techniques. Technol. Health Care 25, 251–264. https://doi.org/10.3233/THC-161274 (2017).
Article PubMed Central PubMed Google Scholar
Wölwer, W. & Gaebel, W. Impaired Trail-Making Test-B performance in patients with acute schizophrenia is related to inefficient sequencing of planning and acting. J. Psychiatr. Res. 36, 407–416. https://doi.org/10.1016/s0022-3956(02)00050-x (2002).
Article PubMed Google Scholar
Wölwer, W., Falkai, P., Streit, M. & Gaebel, W. Trait characteristic of impaired visuomotor integration during trail-making test B performance in schizophrenia. Neuropsychobiology 48, 59–67. https://doi.org/10.1159/000072878 (2003).
Article PubMed Google Scholar
Wölwer, W., Stroth, S., Brinkmeyer, J. & Gaebel, W. Electrophysiological correlates of planning and monitoring in first episode schizophrenia. Psychiatry Res. Neuroimaging 203, 83–88. https://doi.org/10.1016/j.pscychresns.2011.11.009 (2012).
Article Google Scholar
Kohl, M. MKpower: Power Analysis and Sample Size Calculation (2020).
Brainard, D. H. The psychophysics toolbox. Spat. Vis. 10, 433–436. https://doi.org/10.1163/156856897X00357 (1997).
Article CAS PubMed Google Scholar
Cornelissen, F. W., Peters, E. M. & Palmer, J. The Eyelink Toolbox: Eye tracking with MATLAB and the psychophysics toolbox. Behav. Res. Methods Instrum. Comput. 34, 613–617. https://doi.org/10.3758/BF03195489 (2002).
Article PubMed Google Scholar
Torralva, T., Roca, M., Gleichgerrcht, E., López, P. & Manes, F. INECO Frontal Screening (IFS): A brief, sensitive, and specific tool to assess executive functions in dementia—corrected version. J. Int. Neuropsychol. Soc. 15, 777–786. https://doi.org/10.1017/S1355617709990415 (2009).
Article PubMed Google Scholar
Luck, S. J. & Vogel, E. K. Visual working memory capacity: From psychophysics and neurobiology to individual differences. Trends Cogn. Sci. 17, 391–400. https://doi.org/10.1016/j.tics.2013.06.006 (2013).
Article PubMed Central PubMed Google Scholar
de Leeuw, J. R. jsPsych: A JavaScript library for creating behavioral experiments in a Web browser. Behav. Res. Methods 47, 1–12. https://doi.org/10.3758/s13428-014-0458-y (2015).
Article ADS PubMed Google Scholar
Bahia, V. S. et al. The accuracy of INECO frontal screening in the diagnosis of executive dysfunction in frontotemporal dementia and Alzheimer disease. Alzheimer Dis. Assoc. Disord. 32, 314–319. https://doi.org/10.1097/WAD.0000000000000255 (2018).
Article PubMed Google Scholar
Broche-Pérez, Y. et al. Clinical utility of the INECO Frontal Screening for detecting Mild Cognitive Impairment in Parkinson’s disease. Dement. Amp Neuropsychol. 13, 394–402. https://doi.org/10.1590/1980-57642018dn13-040005 (2019).
Article Google Scholar
Ihnen, J. et al. Chilean version of the INECO Frontal Screening (IFS-Ch): Psychometric properties and diagnostic accuracy. Dement. Amp Neuropsychol. 7, 40–47. https://doi.org/10.1590/S1980-57642013DN70100007 (2013).
Article Google Scholar
Dubois, B., Slachevsky, A., Litvan, I. & Pillon, B. The FAB: A frontal assessment battery at bedside. Neurology 55, 1621–1626. https://doi.org/10.1212/WNL.55.11.1621 (2000).
Article CAS PubMed Google Scholar
Shallice, T. The relationship between prospective and retrospective. In Cognitive Models of Memory (ed. Conway, M. A.) (MIT Press, 1997).
Google Scholar
Hodges, J. R. Cognitive Assessment for Clinicians (Oxford University Press, 2017).
Book Google Scholar
Wechsler, D. WMS-R: Wechsler Memory Scale-Revised : Manual (Psychological Corporation, 1987).
Google Scholar
Moreira, H. S. et al. Distinguishing mild cognitive impairment from healthy aging and Alzheimer’s Disease: The contribution of the INECO Frontal Screening (IFS). PLoS ONE 14, e0221873. https://doi.org/10.1371/journal.pone.0221873 (2019).
Article CAS PubMed Central PubMed Google Scholar
Sanjurjo, N. S. et al. The IFS (INECO Frontal Screening) and level of education: Normative data. Appl. Neuropsychol. Adult 26, 331–339. https://doi.org/10.1080/23279095.2018.1427096 (2019).
Article Google Scholar
Kuiper, N. H. Tests concerning random points on a circle. Nederl Akad Wetensch Proc. Ser. A 63, 38–47 (1960).
Article Google Scholar
Dowd, C. A New ECDF Two-Sample Test Statistic. ArXiv200701360 Stat (2020).
Bulloch, M. C., Prime, S. L. & Marotta, J. J. Anticipatory gaze strategies when grasping moving objects. Exp. Brain Res. 233, 3413–3423. https://doi.org/10.1007/s00221-015-4413-7 (2015).
Article PubMed Google Scholar
Fisk, J. D. & Goodale, M. A. The organization of eye and limb movements during unrestricted reaching to targets in contralateral and ipsilateral visual space. Exp. Brain Res. 60, 159–178. https://doi.org/10.1007/BF00237028 (1985).
Article CAS PubMed Google Scholar
Margulis, L. E., Louhau, M. R. S. & Ferreres, A. R. Baremo del trail making test para capital federal y Gran Buenos Aires. Rev. Argent. Cienc. Comport. 10, 54–63. https://doi.org/10.32348/1852.4206.v10.n3.19741 (2018).
Article Google Scholar
Tombaugh, T. N. Trail Making Test A and B: Normative data stratified by age and education. Arch. Clin. Neuropsychol. 19, 203–214. https://doi.org/10.1016/S0887-6177(03)00039-8 (2004).
Article PubMed Google Scholar
Hicks, S. L. et al. An eye-tracking version of the trail-making test. PLoS ONE 8, e84061. https://doi.org/10.1371/journal.pone.0084061 (2013).
Article ADS CAS PubMed Central PubMed Google Scholar
Jyotsna, C., Amudha, J., Rao, R. & Nayar, R. Intelligent gaze tracking approach for trail making test. J. Intell. Fuzzy Syst. 38, 6299–6310. https://doi.org/10.3233/JIFS-179711 (2020).
Article Google Scholar
Hyönä, J. The use of eye movements in the study of multimedia learning. Learn. Instr. 20, 172–176. https://doi.org/10.1016/j.learninstruc.2009.02.013 (2010).
Article Google Scholar
Gouveia, P. A. R., Brucki, S. M. D., Malheiros, S. M. F. & Bueno, O. F. A. Disorders in planning and strategy application in frontal lobe lesion patients. Brain Cogn. 63, 240–246. https://doi.org/10.1016/j.bandc.2006.09.001 (2007).
Article PubMed Google Scholar
Mahurin, R. K. et al. Trail making test errors and executive function in schizophrenia and depression. Clin. Neuropsychol. 20, 271–288. https://doi.org/10.1080/13854040590947498 (2006).
Article PubMed Google Scholar
Crowe, S. F. The differential contribution of mental tracking, cognitive flexibility, visual search, and motor speed to performance on parts A and B of the trail making test. J. Clin. Psychol. 54, 585–591. https://doi.org/10.1002/(SICI)1097-4679(199808)54:5%3c585::AID-JCLP4%3e3.0.CO;2-K (1998).
Article CAS PubMed Google Scholar
Larrabee, G. J. & Curtiss, G. Construct validity of various verbal and visual memory tests. J. Clin. Exp. Neuropsychol. 17, 536–547. https://doi.org/10.1080/01688639508405144 (1995).
Article CAS PubMed Google Scholar
Chaytor, N., Schmitter-Edgecombe, M. & Burr, R. Improving the ecological validity of executive functioning assessment. Arch. Clin. Neuropsychol. 21, 217–227. https://doi.org/10.1016/j.acn.2005.12.002 (2006).
Article PubMed Google Scholar
Spikman, J. M., Kiers, H. A., Deelman, B. G. & van Zomeren, A. H. Construct validity of concepts of attention in healthy controls and patients with CHI. Brain Cogn. 47, 446–460. https://doi.org/10.1006/brcg.2001.1320 (2001).
Article CAS PubMed Google Scholar
Langenecker, S. A., Zubieta, J.-K., Young, E. A., Akil, H. & Nielson, K. A. A task to manipulate attentional load, set-shifting, and inhibitory control: Convergent validity and test-retest reliability of the Parametric Go/No-Go Test. J. Clin. Exp. Neuropsychol. 29, 842–853. https://doi.org/10.1080/13803390601147611 (2007).
Article PubMed Google Scholar
Miner, T. & Ferraro, F. R. The role of speed of processing, inhibitory mechanisms, and presentation order in trail-making test performance. Brain Cogn. 38, 246–253. https://doi.org/10.1006/brcg.1998.1034 (1998).
Article CAS PubMed Google Scholar
Sperl, L., Ambrus, G. G., Kaufmann, J. M., Schweinberger, S. R. & Cañal-Bruland, R. Electrophysiological correlates underlying interference control in motor tasks. Biol. Psychol. 163, 108138. https://doi.org/10.1016/j.biopsycho.2021.108138 (2021).
Article CAS PubMed Google Scholar
Diamond, A. Executive functions. Annu. Rev. Psychol. 64, 135–168. https://doi.org/10.1146/annurev-psych-113011-143750 (2013).
Article PubMed Google Scholar
Meyer, H. C. & Bucci, D. J. Neural and behavioral mechanisms of proactive and reactive inhibition. Learn. Mem. 23, 504–514. https://doi.org/10.1101/lm.040501.115 (2016).
Article PubMed Central PubMed Google Scholar
Poreh, A., Miller, A., Dines, P. & Levin, J. Decomposition of the trail making test: Reliability and validity of a computer assisted method for data collection. Arch. Assess. Psychol. 2, 57–72 (2012).
Google Scholar
Germine, L., Reinecke, K. & Chaytor, N. S. Digital neuropsychology: Challenges and opportunities at the intersection of science and software. Clin. Neuropsychol. 33, 271–286. https://doi.org/10.1080/13854046.2018.1535662 (2019).
Article PubMed Google Scholar
Montague, P. R., Dolan, R. J., Friston, K. J. & Dayan, P. Computational psychiatry. Trends Cogn. Sci. 16, 72–80. https://doi.org/10.1016/j.tics.2011.11.018 (2012).
Article PubMed Google Scholar

Download references

Acknowledgements

We thank Sol Fitipaldi and Lucas Sedeño for training IL on the administration and interpretation of the INECO Frontal Screening to the participants. The authors thank Valeria Mussel for her help proofreading the manuscript. The authors were supported by the Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET) and the Universidad de Buenos Aires (UBA). The research was supported by the Agencia Nacional de Promoción Científica y Tecnológica (PICT 2018-2699) and the CONICET (PIP 11220150100787CO). AI is supported by grants from CONICET; ANID/FONDECYT Regular (1170010); FONCYT-PICT 2017-1820; ANID/FONDAP/15150012; Takeda CW2680521; Sistema General de Regalías (BPIN2018000100059), Universidad del Valle (CI 5316); Alzheimer’s Association GBHI ALZ UK-20-639295; and the MULTI-PARTNER CONSORTIUM TO EXPAND DEMENTIA RESEARCH IN LATIN AMERICA [ReDLat, supported by National Institutes of Health, National Institutes of Aging (R01 AG057234), Alzheimer’s Association (SG-20-725707), Rainwater Charitable foundation—Tau Consortium, and Global Brain Health Institute)]. The contents of this publication are solely the responsibility of the authors and do not represent the official views of these Institutions.

Author information

These authors contributed equally: Ignacio Linari and Gustavo E. Juantorena.
These authors jointly supervised this work: Agustin Petroni and Juan E. Kamienkowski.

Authors and Affiliations

Laboratorio de Inteligencia Artificial Aplicada, Instituto de Ciencias de la Computación, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires - CONICET, Buenos Aires, Argentina
Ignacio Linari, Gustavo E. Juantorena, Agustín Petroni & Juan E. Kamienkowski
Cognitive Neuroscience Center (CNC), Universidad de San Andrés, and National Scientific and Technical Research Council (CONICET), Buenos Aires, Argentina
Agustín Ibáñez
Global Brain Health Institute (GBHI), University of California San Francisco (UCSF), San Francisco, USA
Agustín Ibáñez
Trinity College Dublin (TCD), Dublin, Ireland
Agustín Ibáñez
Latin American Brain Health Institute (BrainLat), Universidad Adolfo Ibáñez, Santiago, Chile
Agustín Ibáñez
University of Gothenburg, Gothenburg, Sweden
Agustín Petroni
Maestría de Explotación de Datos y Descubrimiento del Conocimiento, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Buenos Aires, Argentina
Juan E. Kamienkowski
Departamento de Computación, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Pabellón 1, Ciudad Universitaria, (1428) Ciudad Autónoma de Buenos Aires, Buenos Aires, Argentina
Juan E. Kamienkowski

Authors

Ignacio Linari
View author publications
You can also search for this author in PubMed Google Scholar
Gustavo E. Juantorena
View author publications
You can also search for this author in PubMed Google Scholar
Agustín Ibáñez
View author publications
You can also search for this author in PubMed Google Scholar
Agustín Petroni
View author publications
You can also search for this author in PubMed Google Scholar
Juan E. Kamienkowski
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.P. and J.K. designed the study; I.L. and G.J. collected the data; I.L., G.J., A.P., and J.K. analysed the data; I.L., G.J., A.I., A.P., and J.K. wrote the manuscript.

Corresponding author

Correspondence to Juan E. Kamienkowski.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Linari, I., Juantorena, G.E., Ibáñez, A. et al. Unveiling Trail Making Test: visual and manual trajectories indexing multiple executive processes. Sci Rep 12, 14265 (2022). https://doi.org/10.1038/s41598-022-16431-9

Download citation

Received: 21 February 2022
Accepted: 11 July 2022
Published: 22 August 2022
DOI: https://doi.org/10.1038/s41598-022-16431-9

This article is cited by

Diagnostic role of serum brain-derived neurotrophic factor in HCV cirrhotic patients with minimal hepatic encephalopathy with and without schistosomiasis
- Essam S. Bedewy
- Abeer Elhadidi
- Amany N. Abbasy
Egyptian Liver Journal (2024)
Heart failure decouples the precuneus in interaction with social cognition and executive functions
- Matthias L. Schroeter
- Jannis Godulla
- Karsten Mueller
Scientific Reports (2023)
Studying cognitive-motor interactions using a tablet-based application of the Color Trails Test
- Noa Ben Yair
- Meytal Wilf
- Meir Plotnik
Experimental Brain Research (2023)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.