US20260165633A1
SYSTEM AND METHOD FOR TRAINING A SUBJECT TO SELF-REGULATE NEURAL VARIABILITY
Publication
Application
Classifications
IPC Classifications
CPC Classifications
Applicants
CARNEGIE MELLON UNIVERSITY
Inventors
RYAN WILLIAMSON, AKASH UMAKANTHA, CHRIS KI, MATTHEW SMITH, BYRON YU
Abstract
Disclosed herein is a system and method for training a subject to self-regulate neural variability. The system and method implements a prefrontal cortex brain-computer interface (BCI) and a method that allows subjects to use neurofeedback to produce a desired neural activity by regulation of their arousal levels to stabilize their neural activity across a timescale of seconds or minutes. The system comprises a prefrontal cortex brain-computer interface (BCI) to train subjects to use neurofeedback to produce desired neural activity. Subjects used the disclosed system and method, which includes BCI feedback, to self-regulate their arousal levels and successfully stabilize their neural activity.
Figures
Description
RELATED APPLICATIONS
[0001]This application is a national filing under 35 U.S.C. § 371 claiming the benefit and priority to International Patent Application No. PCT/US24/10801, filed Jan. 9, 2024 entitled “SYSTEM AND METHOD FOR TRAINING A SUBJECT TO SELF-REGULATE NEURAL VARIABILITY”, which claims the benefit of U.S. Provisional Patent Application No. 63/439,314 filed Jan. 17, 2023, the contents of which are incorporated herein in their entireties.
GOVERNMENT INTEREST
[0002]This invention was made with the support of the United States government under contracts 1734916 and 1954107 awarded by the National Science Foundation (NSF). The U.S. government has certain rights in the invention.
BACKGROUND OF THE INVENTION
[0003]All activities are associated with a neural pattern of activity. For example, consider shooting free throws on a basketball court. Even if the shooter tries the same shot each time, their internal states, such as arousal and motivation, may change from shot to shot. Such changes in cognitive states over time may manifest as neural variability, in which the brain produces variable activity when presented with the same task conditions multiple times. These internal state fluctuations are captured in neural population activity and indirectly shape performance. For instance, slow fluctuations in the prefrontal cortex areas corresponded to an arousal-related signal that reflected the impulsivity of the subject during a change detection task.
[0004]Deficits in regulating neural variability have been linked to neuropsychiatric disorders. Understanding the degree to which one could control neural variability over time has implications for enhancing and restoring the brain's cognitive capabilities. Therefore, it would be desirable to be able to train a subject to regulate this neural variability.
SUMMARY OF THE INVENTION
[0005]When presented with identical task conditions across trials, the brain produces variable patterns of neural activity, wherein the variability stems from many sources, one of which is fluctuations in internal states (e.g., arousal). Reducing neural variability has implications for improving the brain's cognitive abilities.
[0006]Disclosed herein is a system and method implementing a prefrontal cortex brain-computer interface (BCI) that allows subjects to use neurofeedback to produce a desired neural activity by regulation of their arousal levels to stabilize their neural activity across a timescale of seconds or minutes. The system comprises a prefrontal cortex brain-computer interface (BCI) that trains subjects to use neurofeedback to produce desired neural activity. Subjects used the disclosed system and method, which includes BCI feedback, to self-regulate their arousal levels and successfully stabilize their neural activity.
[0007]Through moment-to-moment neurofeedback, the BCI framework offers subjects precise temporal control of their neural activity. The disclosed brain-computer interface (BCI) task utilizes a metric, termed the neural distance, to characterize how far the current neural population activity is from a baseline state. Using the disclosed system and method, subjects can be trained to reduce their neural distance. Subjects reduce their neural distances by relying on accurate moment-to-moment neurofeedback to self-regulate an internal state, possibly linked to arousal.
BRIEF DESCRIPTION OF THE DRAWINGS
[0008]
[0009]
[0010]
[0011]
[0012]
[0013]
[0014]
[0015]
[0016]
[0017]
[0018]
[0019]
[0020]
[0021]
[0022]
[0023]
[0024]
[0025]
[0026]
DETAILED DESCRIPTION
[0027]Disclosed herein is a system and method for training a subject to moderate their neural responses to stimuli to minimize neural variability over time. The system and method were implemented using rhesus macaque monkeys as subjects but as would be realized, the system and method can be used with any subject, including humans.
[0028]In one embodiment, two adult male rhesus macaque monkeys were surgically implanted with titanium headposts, which were fixed onto the skull of each subject to limit head movement and facilitate eye movement tracking and recording of neural activity. Subjects were then implanted with an electrode array. In one embodiment, the electrode array is a 96-electrode “Utah” array manufactured by Blackrock Microsystems of Salt Lake City, UT, although other electrode arrays could be used. The electrode array was placed in the prefrontal cortex anterior to the arcuate sulcus and dorsal to the medial sulcus in both hemispheres. Other positionings of the array are contemplated to be within the scope of the invention.
[0029]The subjects were head-fixed and positioned 36 cm from a 21-in cathode ray tube monitor with 1024×768 pixel resolution and a refresh rate of 100 Hz. Task feedback was displayed using custom software that utilized psychophysics toolbox extensions. In one embodiment, the custom software may be based on in Matlab from MathWorks of Natick, MA. Eye position was monitored using monocular infrared tracking at a 1000 Hz sample rate. In one embodiment, eye tracking as performed using EyeLink 1000 from RS Research of Mississauga, Ontario, Canada).
[0030]Neural activity was recorded from an implanted microelectrode array in area 8ar of the dorsolateral prefrontal cortex (dLPFC). Signals from the implanted electrodes were band-pass filtered (0.3-7500 Hz) and then digitized at 30,000 Hz before being stored offline for analysis. For each electrode, spiking waveforms were defined as a 52-sample (1.73 ms) window of the filtered voltage signal triggered by the signal crossing a predefined threshold. The threshold was defined as a multiple of the root-mean-square voltage of a brief epoch of the raw signal on each electrode collected at the beginning of the session. All behavioral and neural data was recorded for further offline processing and analysis. In one embodiment, the recording was performed using a Grapevine recording system from Ripple of Salt Lake City, UT.
[0031]To separate waveforms likely to be caused by neural spiking from waveforms caused by other electrical artifacts, a neural network classifier trained to classify spike waveforms as “neural” or “not neural” was used. The classifier was trained using array recordings from multiple subjects in which the waveforms had been hand sorted. In one embodiment, the classifier successfully removes a high percentage of putative noise waveforms in held-out test waveform datasets. Classification required very little computation time, allowing for the classification of hundreds of waveforms in a few milliseconds. This algorithm was applied to the prefrontal cortex recordings online to ensure that activity going into the BCI was of neural origin.
[0032]Because neurons in the prefrontal cortex encode direction-specific sensory representations of visual stimuli, in one embodiment, visual feedback was provided to the subject. In one embodiment, to prevent the BCI from being spatially biased, the visual feedback during trials with neurofeedback comprised the display of an annulus wherein the diameter of the annulus could be controlled by the neural activity of the subject. To represent how far a subject's current neural activity was from a target neural state (
[0033]
[0034]In one embodiment the target state is the reduction of the diameter of the annulus toward the central dot. As would be realized by one of skill in the art, the stimulus is not limited a visual stimulus, nor to the specific visual stimulus described herein, but may be any sensory stimulus, for example, other visual stimuli, audible stimuli, haptic stimuli, etc.
[0035]In the exemplary embodiment described herein, the visual feedback was constrained to lie outside the receptive fields of the recorded neurons by having the annulus size range from 10 pixels to 80 pixels. In all experimental sessions, the subjects performed a calibration task and a brain-computer interface (BCI) task.
[0036]The calibration task comprised exposing the subjects to a set of 60 calibration trials. In a calibration trial, a circle was displayed at the center of the screen against an isoluminant background. A trial was initiated when the subject fixated on the blue circle, at which point an annulus appeared on the screen centered on the fixation circle. The annulus remained fixed in size until 400 ms post-fixation, after which the annulus started shrinking. The diameter of the annulus during these trials was limited to lie between 10 and 80 pixels. Over the next 3.4 s, the subject simply had to maintain fixation while the radius of the annulus shrank at a fixed speed. During this entire time, the annulus remained centered on the fixation dot. The subject was rewarded if it successfully maintained fixation throughout the entire 3.4 s period during which the annulus shrank.
[0037]The calibration task is depicted in
[0038]The calibration task 1800 is depicted in flowchart form
[0039]Taking the recorded prefrontal cortex (PFC) neural activity from the calibration trials, a light sorting was first performed using a neural network sorter. The spike counts were binned into non-overlapping 50 ms bins beginning 400 ms after fixation to the end of the 3.5 s period in which the annulus moved on its own. The binned spike counts were then concatenated across all 60 calibration trials and factor analysis (FA) was applied to the aggregate spike count matrix to identify a low-dimensional BCI subspace that best described shared variance at the population level. Factor analysis is defined by the following model:
- [0040]x∈
p is a vector of spike counts across the p simultaneously-recorded neurons;
- [0041]d∈
p is a vector of mean spike counts;
- [0042]L∈
p×m is the loading matrix relating m latent variables to the neural activity; and
- [0043]Ψ∈
p×p is a diagonal matrix of independent variances for each neuron.
- [0040]x∈
[0044]For the BCI task, the number of latent variables was always set to either 4 or 5 depending on the subject (Subject S: 5, Subject P: 4). The model parameters μ, L and Ψ were estimated using the expectation-maximization (EM) algorithm.
[0047]For a bin at time step t (other than the 1st), the smoothed value wi,t was a weighted average of all observations from calibration trial i up to time step t with the weights exponentially decaying over time. α thus determines the weight of the current latent zi,t for the weighted average wi,t (i.e., a low α corresponds to heavy smoothing/reliance on past neural activity). In all experiments, α=0.1535 was used.
[0048]The exponential smoother was reset at the beginning of each calibration trial such that the smoothed latent for the first bin of each trial was simply its posterior mean. The target baseline state μ is defined as the mean of the smoothed FA projections:
- [0049]n is the number of calibration trials performed; and
- [0050]T=68 (matches the 3.4 s period in which the annulus shrunk on its own) refers to the number of 50 ms time bins in each calibration trial.
[0051]In all sessions, μ, was very close to the origin. This is due to a property of the FA model in which the mean of the projections for the training data (all bins in the calibration trials) is the zero vector:
[0052]This slight deviation of μ from 0 stems from the light smoothing that was applied to the projections before computing the mean.
[0053]The neural distance for bin t of calibration trial i, yi,t, which is the squared Euclidean distance of a smoothed FA latent at time bin t from the target state, was then computed for all time bins in the calibration trials:
[0054]The neural distance is thus a measure of how far the current smoothed neural state, wi,t, is from the mean smoothed activity μ observed at the beginning of a session. To determine the distance reward threshold 308 that would achieve a reward in a BCI trial, all distances were aggregated, and percentiles computed in 0.1 percentile increments. Percentile values were then swept to determine the percentile of the distance that would lead the subject to get 50% of the calibration trials being correct if treated as BCI trials. This distance percentile was then set as the reward threshold for the rest of the session. The value of 50% was selected such that the subject would not find it too difficult to perform the BCI task.
[0055]To provide further intuition into what neural distance measures, neural distance is decomposed into a sum involving a weighted average of the deviation of each neuron's current firing rate from its mean firing rate during the calibration trials. Within a single trial, this weighted average term approximated the neural distance very well over time. In a crude sense, one can view reducing the neural distance as trying to get each recorded neuron's firing rate as close as possible to the same neuron's mean firing rate observed at the start of an experiment.
[0056]After the calibration task, the subject performed a BCI task. In the BCI task, the subject was presented with alternating BCI trials and sham blocks of trials 316. BCI blocks mostly contained BCI trials and had rare but randomly interleaved sham trials (Subject P: 90 BCI, 10 interleaved sham trials; Subject S: 80 BCI, 20 interleaved sham trials). Sham blocks always contained 20 sham trials, as shown in
[0057]Like calibration trials, BCI and sham trials both began with a 400 ms fixation period during which the annulus remained still. Then, the feedback period followed, and based on the trial type, different feedback was shown. In BCI trials, the subject was given visual neurofeedback such that the annulus size correctly reflected how far the subject's current neural activity was from the target state. During these trials, the subject's objective was to have its neural activity be within the reward threshold (this corresponds to keeping the annulus diameter below 30 pixels) for at least 400 ms. If the subject achieved an annulus size below the reward threshold for 400 ms (i.e., 8 consecutive bins), the trial would end with the subject receiving a liquid reward. If the subject never reached the success condition during the 3.4 s feedback period, the trial ended with the subject not receiving a reward.
[0058]The BCI task is shown graphically in
[0059]The BCI task is depicted in the flowchart form in
[0060]In sham trials, the feedback was inconsistent with the subject's current neural state. Unlike BCI trials, in sham trials, the feedback period lasted the full 3.4 s and as long as subjects maintained fixation throughout that time, they were rewarded. Only completed BCI and sham blocks were kept for analysis.
[0061]In a BCI trial, the size of the annulus at time bin t, at, was directly determined by the neural activity observed during the previous bin t−1. Similar to how the neural distance was found for the BCI calibration phase, the spikes from the previous 50 ms were sorted using the neural network sorter, the resulting spike count vector was projected into the calibration-defined factor space and the neural distance was computed in accordance with Eq. (6). The neural distance was mapped to a percentile value using the percentiles defined during the calibration task. For all BCI trials in a session, the μ defined during the session's calibration trials was used to compute the neural distance.
[0062]This percentile value was then mapped to an annulus size using a predefined affine transformation.
[0063]Although the annulus only changed sizes during the feedback period, smoothed neural distances were computed and tracked for bins during the fixation period (first 8 50 ms bins of a trial). This “freeze” period provided enough time for the trial's exponential smoother to stabilize. Thus, when the neurofeedback period started at the 9th bin of a trial, the smoothed distance calculated for bin 8, w8, was used to determine the annulus that was displayed. The annulus was updated every 50 ms to reflect the annulus size computed from the previous 50 ms.
[0064]The description that follows applies to both interleaved and block sham trials. After the initial 400 ms fixation period, subjects viewed replay feedback from a test session's BCI trial that lasted the full 3.4 s of the feedback period. As long as the subjects maintained fixation during the replay feedback, they were given a liquid reward. To ensure the subject did not break the association between shrinking the annulus and receiving a reward, correct BCI trials were specifically selected in which the subject met the success criteria right at the end of the 3.4 s feedback period. At the beginning of each day, a set of 5 BCI trials that met this requirement from a test session were determined. Then, during a sham trial, the visual feedback associated with a BCI trial from this set was randomly selected and shown. The feedback during the sham trials was unaffected by the current state of the subject's neural activity.
[0065]To compare a session's BCI and sham trials on BCI task performance, recorded neural activity during sham trials was passed through that session's BCI decoder in an offline setting. For each sham trial (interleaved and block sham), the recorded neural activity was binned into 50 ms time bins, the spike counts were projected into FA space in accordance with Eq. (2), exponential smoothing was applied in accordance with Eq. (3), and the neural distances were computed in accordance with Eq. (6). If, at any point during the feedback period (8th to 64th time bin in a trial), the success conditions were met (
[0066]As shown in
[0067]All neural analyses discussed for sham (interleaved and Block) and BCI trials are based on the kept bins of neural activity after the recorded neural activity has been passed through our decoder offline. For BCI trials, there is no difference between the remaining neural activity and recorded neural activity as during the experiment, BCI trials ended earlier if the success condition was met. To determine whether a sham trial was correct or not and what bins to consider for offline analysis, neural activity recorded during the sham trials was passed through the BCI decoder offline to match what occurred during the experiment for BCI trials.
[0068]During the experiments, smoothing was used in the neural distance computations to provide informative visual feedback and accommodate the online task demands of holding one's neural activity near the target for an extended period. However, for all the neural distances calculated from, temporal smoothing was excluded from the neural distance computations to focus on how the subject's neural activity changed on a moment-by-moment basis.
[0069]Specifically, the following equation was used to determine the neural distance at time bin t for a trial k:
[0070]The target state is defined to be the mean neural activity observed during the calibration task. In this setting where smoothing is not included to find the neural distances, we thus want our target state to be the mean of the calibration trials' unsmoothed FA projections, which is determined by Eq. (5).
[0071]Because BCI blocks had more trials than sham blocks, every BCI block was truncated such that they were similar in clock time duration to their subsequent sham block (See
[0072]When matching BCI blocks, the BCI blocks were end-aligned to their paired sham blocks and the last set of trials were selected such that the clock time duration of this set of trials matched the duration of the corresponding sham block. For instance, if a sham block lasted ˜90 s, the last n trials of paired BCI block were selected such that the n trials' total time duration was roughly 90 s. Trials were selected from the end of the BCI block as they were the closest in time to the sham block trials that they would be compared to. When fitting the linear regression models, the y-intercept was included in the model such that the model was described by the following relationship:
- [0073]γ, β0, β1, x∈
.
- [0073]γ, β0, β1, x∈
[0074]y represents the bin-averaged distance for a trial that occurred at clock time x. The β1,s found after fitting the model to each block are computed. Then, the block-averaged β1's computed of the two conditions are compared for all sessions.
[0075]
[0076]Using the disclosed system and method, two male adult rhesus macaques were trained to control the size of an annulus on a screen by modulating the activity of ˜30 neural units in the PFC. Every 50 ms, the annulus size was updated to reflect how far the subject's current neural activity was from a baseline state observed at the start of the experiment. Subjects can successfully use the disclosed system and method to suppress fluctuations in their neural activity over time.
[0077]At the beginning of each experimental session, subjects performed a set of calibration trials in which they fixated a central dot while an annulus gradually shrunk towards the center. The neural activity recorded during these trials was used to define a BCI mapping that related neural activity to visual feedback for the subsequent BCI task. To do this, a factor analysis (FA) model (302 in
[0078]After completing the calibration task 310, the subject performed the BCI task 320, which contained alternating blocks of BCI and sham trials. In BCI trials, at time bin t, the computed neural distance 312 directly determined the shown annulus size in the following time bin. Therefore, the annulus size provided truthful moment-to-moment feedback on the subject's current neural state. At time bin t, if the subject's neural distance was small, the annulus would also be small in the following bin. Furthermore, in BCI trials, subjects were able to use neurofeedback to stabilize their neural activity over time. To incentivize them to do so, if the subject's neural activity was within the reward threshold 308 for 400 ms or 8 consecutive time bins, the trial ended and a liquid reward was given. Subjects had a 3.4 s neurofeedback period 402 to achieve this success criterion.
[0079]Sham trials were included as a control condition to determine the effects of genuine and accurate neurofeedback. For all sessions, there were two types of sham control trials: 1) Block Sham trials, which were organized in a homogeneous sham block, and 2) Interleaved Sham trials, which were randomly distributed within a BCI block, as shown in
[0080]To assess the efficacy of neurofeedback from the BCI, the observed neural activity (which determines the task performance given the BCI mapping) was compared using accurate neurofeedback and sham feedback. Because block sham trials presented consecutive trials that always lasted the full 3.4 s of the feedback period, this set of trials was designated as a control group to determine the level at which subjects could suppress neural variability when not encouraged to control their neural activity. To evaluate how sham trials would have fared in our BCI task, the recorded neural activity during the sham trials of a session was played through its BCI decoder. For every sham trial, the trial was labelled as correct or incorrect by checking if there was any time bin at which the neural activity reached the BCI task success criterion. If the sham trial was deemed correct, only the bins from the start to when the success criterion was met for the following were considered for neural analyses, as shown in
[0081]In a session, it was observed that subjects were better able to stabilize their neural activity in BCI blocks than they could in sham blocks (see line 606 in
[0082]Subjects used moment-to-moment visual feedback from the annulus to reduce their neural distances. As shown in
[0083]
[0084]Although BCI and block sham trials indeed differed in the validity of the visual feedback, they also differed in the context in which they appeared. During sham blocks, subjects encountered a set of consecutive trials that always lasted the entire length of the feedback period. The results could potentially be attributable not to an improvement in neural control with the BCI but rather to a gradual shift in PFC activity away from the calibration period. Such a shift could happen because of an increasing realization, and perhaps frustration, that the sham block consisted of numerous consecutive trials in which no control strategy could be successful. Thus, comparing the BCI trials with the block sham trials cannot alone demonstrate that the real-time visual neurofeedback provided by the annulus was successfully used by the subjects. To isolate the effects of presenting accurate moment-to-moment visual neurofeedback, BCI trials were compared with interleaved sham trials that were, unbeknownst to the subjects, randomly distributed in BCI blocks (See
[0085]Subjects self-regulated an arousal-related internal state to help achieve BCI control. As noted earlier, internal states can fluctuate on both fast and slow timescales, which affects how neural population activity evolves. One such example is seen in the PFC, where neural fluctuations covaried with a slowly changing internal state linked to arousal. Under the BCI task paradigm, subjects must subdue these internal changes to help stabilize their neural activity.
[0086]
[0087]
[0088]The relationship between the recorded PFC activity and changes in pupil size, a well-studied physiological measure of many different internal states, such as arousal were explored. Prior work has shown how the activity patterns of the locus coeruleus 1402 in
[0089]It is possible, however, that this positive correlation observed in BCI trials is induced primarily by the size-varying annulus' luminance changes, which can directly affect pupil sizes via the pupillary light reflex. A small annulus would concentrate a large amount of luminance at the fovea, causing the contraction of the pupil. To test whether this hypothesis, in which luminance solely drove pupil size changes, could explain our results, the correlations of the shown and predicted annulus sizes were compared with pupil size changes in sham trials (
[0090]As would be realized by those of skill in the art, specific exemplary embodiments disclosed herein, including specific equipment used, specific algorithms and specific visual stimuli are provided as exemplary embodiments and the invention is not meant to be limited thereby. Modifications and variations of the examples used herein are intended to be within the scope of the invention, which is given by the following claims:
Claims
1. A method comprising:
exposing a subject to a stimulus;
altering the stimulus toward a target state; and
recording neural activity of the subject as the stimulus achieves the target state to establish a baseline neural activity.
2. The method of
monitoring the neural activity of the subject as the subject is exposed to the stimulus during a time series of steps; and
for each time step, altering the stimulus to be closer to or farther away from the target state, based on a comparison of the neural activity observed in an immediately preceding time step to the baseline neural activity;
wherein the stimulus is altered to be closer to the target state if the neural activity observed in an immediately preceding time step moves closer to the baseline neural activity; and
wherein the stimulus is altered to be farther from the target state if the neural activity observed in an immediately preceding time step moves away from the baseline neural activity.
3. The method of
rewarding the subject when the observed neural activity is within a target threshold of the baseline for a predetermined number of time steps.
4. The method of
5. The method of
6. The method of
7. The method of
8. The method of
9. The method of
10. The method of
holding the stimulus in a fixed state for a predetermined period of time prior to initiation of the time series of steps during which the observed neural activity controls the state of the stimulus.
11. A system comprising:
an electrode array implanted in the brain of a subject;
a neural network sorter trained to classify waveforms of neural activity collected by the microelectrode array as being caused by neural spiking or as being noise;
a processor; and
software that, when executed by the processor, performs the steps of:
exposing the subject to a stimulus;
monitoring neural activity of the subject as the subject is exposed to the stimulus during a time series of steps;
for each time step, calculating a neural distance metric for the neural activity observed in an immediately preceding time step;
altering the stimulus in the current time step based on the calculated neural distance.
12. The system of
13. The system of
altering the stimulus to be closer to a target state if the neural distance is smaller than the neural distance in the immediately preceding time step; and
altering the stimulus to be farther from the target state if the neural distance larger than the neural distance in the immediately preceding time step.
14. The system of
determining that the neural distance is within a within a target threshold of the baseline for a predetermined number of time steps; and
rewarding the subject.
15. The system of
holding the stimulus in a fixed state for a predetermined period of time prior to initiation of the time series of steps.
16. The system of
exposing a subject to a stimulus;
altering the stimulus; and
recording neural activity of the subject as the stimulus is altered to establish a baseline neural activity.
17. The system of
receiving signals from the microelectrode array;
performing bandpass filtering of the signals; and
digitizing the filtered signals.
18. The system of
19. The system of
20. The system of
a display;
wherein the visual stimulus comprises altering a diameter of an annulus centered on a target dot, the target and the annulus being displayed on the display.