Michael Demidenko

Programs of Research

I have been fortunate to work on a range of exciting and impactful programs of research over the years. Below is an in-depth overview of the research programs I have led (or contributed to) since 2017, along with the topic areas, statistical analyses, and software used.

For information that precedes 2017, checkout my employment & volunteer history on LinkedIn. For an abbreviated version, see my Resume/CV page.

Current program of Research

My current program of research leverages advanced quantitative methods and data science techniques to address critical measurement issues in brain-behavior research during adolescence. This work was part of a three-year F32 grant funded by the National Institute on Drug Abuse and most recently as part of my Staff Research Scientist role, where I apply sophisticated statistical modeling, predictive modeling and computational approaches to answer key measurement questions about validity and reliability in task-based fMRI. Using advanced analytic techniques in R and Python, I work with massive neuroimaging datasets (e.g., 400,000+ files/40+ TB of data) requiring high-performance computing and distributed storage across Stanford's Sherlock, University of Minnesota's MSI and AWS cloud infrastructure.

Specific details of the grant can be found here and specific projects are described below.

Project 1: Advanced Psychometric Modeling in Neuroimaging

This project applies cutting-edge psychometric and statistical modeling techniques to neuroimaging data. Building on prior work examining how group activity and brain-behavior relationships fluctuate within samples, I have expanded this research using sophisticated statistical frameworks to evaluate measurement invariance across multiple datasets. Measurement invariance ensures that statistical measures perform consistently across samples, enabling valid between-group comparisons (critical to experimentation/hypothesis testing work using biological data).

I employed a comprehensive suite of advanced statistical data-reduction, pattern recognition and modeling building approaches, including Confirmatory Factor Analysis (CFA), Exploratory Structural Equation Modeling (ESEM), Exploratory Factor Analysis (EFA) and Local Structural Equation Modeling (LSEM) to rigorously evaluate the psychometric properties of fMRI tasks. The project integrates three large adolescent samples with nearly identical task designs, requiring sophisticated data harmonization and multi-sample modeling approaches.

This work follows a registered report methodology to ensure reproducible science. The Stage 1 registered report received in-principle acceptance at Developmental Cognitive Neuroscience. All statistical analyses and computational workflows are openly available on GitHub, demonstrating transparent and reproducible quantitative research practices.

Available in Developmental Cognitive Neuroscience

Project 2: Statistical Reliability Framework for Neuroimaging

This project centers on developing and implementing statistical methods for assessing reliability in high-dimensional neuroimaging data in an open-source python package. I developed PyReliMRI, a comprehensive Python package that implements multiple reliability metrics specifically designed for neuroimaging researchers working with complex, high-dimensional brain data.

The package incorporates statistical approaches including intraclass correlation coefficients, simlarity indices, test-retest reliability and novel metrics adapted for neuroimaging data structures. I have applied these tools in systematic evaluations of how different analytical choices impact individual and group-level fMRI estimates, requiring extensive computational modeling and statistical simulation work.

The methodological framework has been rigorously peer-reviewed through a Stage 1 registered report accepted at Peer Community in Registered Reports and the Stage 2 has been published in the flagship journal, Imaging Neuroscience. The package documentation and statistical implementation details are available at ReadTheDocs, showcasing best practices in scientific software development and statistical methodology.

In-kind Project 3: Big Data Processing for the Largest US Neuroimaging Dataset

This project involves developing scalable data science pipelines for the ABCD Study®, which represents the largest neuroimaging dataset collected in the United States. The ABCD study is a massive consortium across 21 sites collecting multimodal neuroimaging data including task-based fMRI, resting-state fMRI, structural and diffusion acquisitions from thousands of participants.

Working with the ABCD-BIDS Community Collection (ABCC), I lead the development of semi-automated, high-throughput preprocessing pipelines using MRIQC and fMRIPrep frameworks. This involves sophisticated workflow management, distributed computing strategies and quality control algorithms to process terabytes of neuroimaging data efficiently. My role ensures that preprocessed derivatives are released to the NIH data archive, enabling the broader scientific community to access analysis-ready data without requiring massive computational resources.

I have also developed advanced data visualization and statistical reporting tools for ABCD behavioral data. These tools leverage modern data science approaches to create interactive dashboards that can aggregate and visualize 10,000+ datapoints efficiently. Using reactive programming frameworks and statistical graphics libraries, these reports enable researchers to make data-driven inclusion/exclusion decisions and perform comprehensive quality assessment across this massive dataset.

Project 4: Large-Scale Statistical Model Optimization

This project represents a comprehensive statistical modeling study where I systematically evaluated 360 different statistical models to optimize decision-making for precision in fMRI analyses. This work required extensive computational resources and advanced statistical programming to assess how various modeling choices impact measurement precision and reliability across different analytical frameworks.

The study employed sophisticated simulation approaches, cross-validation techniques, and meta-analytical methods to provide evidence-based recommendations for optimal statistical approaches in neuroimaging. This quantitative framework directly informs best practices for statistical analysis in functional neuroimaging, with significant implications for reproducibility and statistical power.

The findings demonstrate how different statistical modeling decisions systematically impact fMRI measurement precision, providing the empirical foundation for the reliability metrics implemented in PyReliMRI. This work exemplifies the application of rigorous quantitative methods to improve neuroimaging methodology.

Publication: Available at Imaging Neuroscience

Project 5: Automated Statistical Analysis Pipeline for Open Science

This project involved developing openneuro_glmfitlins, an automated statistical analysis tool for neuroimaging data from OpenNeuro datasets. The tool implements sophisticated General Linear Model (GLM) frameworks with automated model specification, statistical inference, and results reporting.

The pipeline incorporates advanced statistical methods including multi-layed univariate analyses and dynamic statistical reporting. Using modern software engineering practices, the tool provides standardized, reproducible statistical analyses while handling the computational complexity of large-scale neuroimaging datasets.

This work addresses critical needs in computational neuroscience by democratizing access to rigorous statistical analysis methods. The semi-automated pipeline ensures that statistical analyses follow best practices for reproducibility and statistical rigor, while enabling researchers to focus on scientific interpretation rather than technical implementation.

Repository: GitHub - openneuro_glmfitlins

Project 6: High-Performance Computing for Developmental Neuroimaging

Extending my expertise in large-scale data processing, this project focuses on developing computational pipelines for the Human Connectome Project (HCP) youth dataset. This work requires sophisticated data science approaches to handle the unique computational challenges of high-resolution, multi-modal developmental neuroimaging data.

The project involves implementing advanced preprocessing algorithms, developing quality control metrics specific to imaging data and creating scalable computational workflows that can efficiently process massive datasets. The pipeline incorporates pre-preprocessing and post-processing of seven task and resting state time series data.

This work demonstrates advanced skills in high-performance computing, distributed processing and statistical algorithm application. The preprocessing framework ensures that researchers can access analysis-ready developmental neuroimaging data while maintaining the highest standards of data quality and computational efficiency.

Repository: GitHub - hcpya_preprocess

Technical Skills Highlighted Across Projects:

Statistical Methods: Structural Equation Modeling, Factor Analysis, Mixed-Effects Models, Bayesian Statistics, Predictive Modeling, Meta-Analysis, Simulation Studies, Regression, Dimensionality Reduction, Pattern Recognition.

Programming Languages: Advanced R, Python, Shell Scripting, High-Performance Computing

Data Science Tools: Pandas, NumPy, SciPy, Scikit-learn, Matplotlib, Plotly, Jupyter, Git/GitHub

Big Data Technologies: Distributed Computing, Cloud Computing (AWS), Containerization, Workflow Management

Software Development: Package Development, Documentation, Testing, Version Control, Reproducible Research

Measurement & Reproducibility

An evolving interest of mine involves issues of measurement and the reproducibility of model estimates and interpretations. With respect to measurement, a big proportion of what researchers do with data comes down to measurement. The consistency (i.e., reliability) and accuracy (i.e., validity) of measurement in research, like other things, is extremely important. For example, if you have a thermometer that is telling you it's 110F outside, when it's actually 77F, how useful is that thermometer? Sure, if the thermometer is consistently off by 33F, you can adjust the values and still use the thermometer (I suppose this would be quite useful before thermometers were easily accessible, so you'd consider this new measure as a blessing). What if you took five measurements, and you got 110F, 101F, 94F, 115F and 77F. What use is this thermometer and how can you even fix it after the fact? You usually cant -- so you might as well throw away the thermometer. But if you were wrong all along and over looked an important measurement issue that was contributing to this variability in temperatures? Say the different readings came when the thermometer was in direct sun, in a hot car, under a blanket or in a regulated temperature space? You may inadvertently throw away a tool that is useful, simply because you didn't follow the instructions on how to use the tool.

The same problem can arise in psychological, behavioral and/or neurobiological research. Whether you're conducting hypothesis driven work or data-driven prediction models, the dataset of numbers (continuous, ordinal and/or nominal) that are used as inputs into the statistical models will definitely impact the outputs and your takeaways. So if a measure is inconsistent and/or not accurate, you fall victim to the saying, "garbage in, garbage out"... Sadly, more often than not, commonly used measures do not come with very detailed instructions and so it is on the onerous of the research to ensure the interpretation is valid and reliability to the extent that seems appropriate (i.e., reliability and validity may be EXTREMELY important before brain surgery compared to the prediction of who will score high or low on a math test).

Reproducibility is also key to outputs and takeaways for any analysis. If one researcher runs an analysis on a data set and produces some results/conclusions, but another research cannot reproduce the results using the same data and methods. Is this still useful? How do you reconcile these differences? Which result is correct -- the one that supports that narrative?

Similarities and Differences in Alternative Definitions of Brain Activity

Measurement issues are especially important in studying neural activity. In some ways, because soooo much goes into getting the data of brain activity, they may be more critical than self-report and behavioral data. The number of steps removed that the final neuroimaging signal is from what is initially collected is several fold greater than self-report.

In the section on adolescent risk taking, I described how we found no evidence for a key element of a modern theoretical framework between different risk taking groups. In this 2020 study (Demidenko et al., 2020), we made some decisions in testing the hypotheses that some may not agree with or would do entirely different. Which is a cool thing about science -- we can ask things in different ways and hopefully converge on the same results. This convergence of results is a question that I was interested in for second study in my dissertation, because perhaps my choices were... dare I say it... wrong.

The question: How do different definitions (i.e., operationalizations) of the same neural measure relate in their a) neural activity and b) converge in the conclusion about association with some behavioral measure? In this publication (Demidenko et al., 2021), in the data-driven study we evaluated how different definitions used to arrive at the neural activity in the brain impacted the magnitude and direction of brain-behavior associations. In a way, in this study we found converging evidence for what we reported in the trait and state study (Demidenko et al., 2019). Specifically, while there may be verbal agreements and definitions about what set of task contrasts (what is often used in fMRI research to get at specific mental process in a task) may be related, when they are empirically evaluated the brain-brain associations (i.e., covariation of mean signal intensity) are not always consistent. This nuance differences is pretty important, because researchers often use different contrasts in their studies for different reasons (some well justified). So identifying when findings do and do not converge can increase exponentially as the number of parameters that differ between studies increases. Moreover, in the case of this study (Demidenko et al., 2021), the subtle differences in defining a contrast to get at neural activity in the brain may provide a wide range of associations with self-reported behavior that were really difficult to interpret. Posing concerns about the measures being using in research programs using neural activity data and self-report/behavioral data.

Reproducing Results and Impacts of Defining Measures

As mentioned earlier, having a science that is reproducible and/or replicable is important. Furthermore, if there are different ways to get at a numerical representation of a variable (i.e., operationalizations), it is important to have a reasonable explanation when/why the data may not converge.

In neurodevelopment cognitive neuroscience, one important topic is focused on the effects of the family environment on the development brain. The brain is malleable and there are several sensitive periods. Stressful environments, such as harsh parenting, disadvantaged neighborhoods and high crime neighborhoods may alter developmental trajectories and influence future behaviors and health-related outcomes. Large consortium studies have been used more and more to ask these specific questions. One study in 2019 published some findings pubertal developmental explained a significant amount of the associations between the family environment and brain development. Because these data are accessible with specific authorizations, a group of researchers and I wanted to test the reproducibility and extension of these the findings in the 2019 publication.

The question: In the open dataset, can we replicate the results of the initial study and is there converging evidence the alternative definitions of the key self-report variables 'Family Environment' and 'Pubertal Development. In this publication (Demidenko et al, 2022), we found that we could replicate direction (i.e., positive/negative association in initial study and replication study) of nearly all of the reported effects (90%) and the majority of the non-significant/significant (i.e., p > .05 or p < .05) categorization (60%). With respect to the alternative variable in the family variable, we found quite a bit of variable is the key findings, in that some alternative definitions would impact the conclusions. Furthermore, in the context of the definition of pubertal development (i.e., self-reported parent or self-reported child), we found that there was nearly no similarity in the interpretation when using the parent versus child reported pubertal development. This study demonstrate that effects are replicable in large samples, however, the conclusions may differ depending on what is being interpreted as meaningful, the p-value or the magnitude/direction of the associations? Importantly, we demonstrated that when a study can define a specific variable, such as the family environment, using large set of variables from the dataset this can change the conclusions depending on how/what variable used. In the context of the family environment and brain development, this posits point of discussion as we have shown in prior work that we were not able to confirm a hypothesis in a registered report (Demidenko et al., 2021).

Data Analysis & Coding Software

Over the course of the last five years, I have been able to learn and apply different statistical models and use multiple statistical/coding programs

Data Management & Statistical Analyses

Across several projects during my graduate and postdoctoral training, I have been able to apply several statistical analyses.

Demidenko et al. (2019): I handled and prepared the data, ran/reported descriptive statistics (means, standard deviations, counts, min/max), moment product correlations, hierarchical multiple regression (on continuous outcome) and ordinal regression (i.e., ordinal outcome)

Demidenko et al. (2020): I handled and prepared the self-report and neuroimaging data. I ran/reported descriptives statistics for key demographic, self-report and behavioral variables. For the neuroimaging data, I preprocessing and modeled the timeseries data, and ran the group-level, whole brain non-parametric analyses (5000 permutations). In addition to the whole brain analyses, I performed region on interest analyses using multiple regression. I created all visualizations for key models.

Demidenko et al. (2021): I handled and prepared the self-report and neuroimaging data. I ran and reported the descriptive statistics, the whole brain activation GLM contrast analyses, the region of interest moment product correlation matrix. In addition, I extracted the timeseries and with code from colleagues to extract and plot timeseries for select regions. I created the majority of the visualizations for key models.

Demidenko et al. (2021): I handled and prepared the self-report and neuroimaging data. I ran and reported the descriptive statistics and moment product correlations. I created the majority of the visualizations.

Demidenko et al. (2022): I handled and prepared the self-report and neuroimaging data. I ran and reported the descriptive statistics, moment product correlation of key variables and the mediation analyses using structural education modeling. I wrote the code to run the multiverse analyses and tailored the output structure to be compatible with the specr package. I created all of the visualizations.

Demidenko et al. (2022): I handled and prepared the self-report and neuroimaging data. I extracted the timeseries data and ran Group Iterative Multiple Model Estimation (GIMME). I extracted key a priori parameters and used these in the brain-behavior models using logistic regression and multiple regression. I created all of the visualizations.

Beltz et al. (2021), Beltz et al. (2022) & Constante et al. (2022): I handled and prepared the neuromaging data. I extracted the timeseries data and provided it to the lead author for subsequent analyses. I created some visualizations.

Demidenko et al. (2023): I handled and prepared the self-report data. I ran descriptive statistics, moment product correlations and multilevel models. This project includes multiple collaborators across multiple institutes.

Demidenko et al. (2022, Stage 1 Registered Report [received in-prinicple acceptance]): Wrote and piloted simulated models in R and pilots fMRI analyses in Python. Packaged and shared associated code via github.

Demidenko et al. (2023, Stage 1 Registered Report [under review]): Wrote and piloted simulated models in R and pilots fMRI analyses in Python. Led the curation of a python-based library calculating reliability estimates on 3D neuroimaging data. Prepared github and readthedos documentation. Packaged and shared associated code via github.

Coding & Statistical Software

For data analyses, I have worked with R, Python, JASP and MPlus statistical software. As mentioned in my current research programs, I have written a Python library to estimate different types of reliabilities on neuroimaging data.

For neuroimaging analyses, I have worked with Linux, FSL, Nilearn, Python and R.

Open-Science

The bulk of science is funded by taxpayer's money through the National Institutes of Health ($45 billion 2023 budget) or the National Science Foundation ($10.5 billion 2023 budget). This means, the research, products and findings are taxpayer owned (I'm willing to be debated on this...2663 Mission St in SF at 11:37pm. Be there!). In my opinion, this means taxpayer funded research should abide by open science practices. This includes but is not limited to: Sharing Code, Making Data Public, Making Publications Open Access. In addition to making science open, science should also not be overly skewed by a negative incentive structure, such as rewarding only shiny findings/stories.

Registered Reports

When it is possible, it is beneficial for research studies to be submitted as registered reports. In simple terms, for a registered report the researcher describes the justification (introduction), the population and analyses (methods) before the data is accessed/acquired. This servers multiple purposes, two that I highlight here. First, it encourages the researcher to follow the scientific method in hypothesis-driven (but is not limited to this, as it's flexible for exploratory work, too) research by justifying the work and the methods before any research begins. Before this is finalized, reviewers at journals can review and provide feedback on the work before the research is performed. Hence, before the analyses are performed the author and reviewers are already in agreement, so there are no post-hoc what-ifs, redoing analyses or having an unpublishable study before it even begins (given the incentive structure in research, lots of studies go unpublished...). Second, the research reports in the publication all results not just the flashy significant ones. In this case, researchers get the benefit of publications (which helps with applying for academic jobs) without the pressure of having to p-hack (unfortunate reality of the business) to get significant finds and biasing the published literature by discarding non-significant findings.

When it has been possible, I have tried to incorporate registered reports into my research. Specifically, when I am planning to do a study on secondary data that I have not accessed, or I am beginning grant I have not started, registered reports are a great avenue when there are clear plans and hypotheses. Unfortunately, given that I already had access to the data and/or had not yet known about registered reports, I was unable to apply this in my dissertation work. However, I have used registered reports with secondary open-data. We have asked key neurodevelopment question about environmental factors, brain functioning and internalizing symptoms in the large ABCD study (Demidenko et al., 2021). In addition, I have contributed to another registered report that has received stage 1 approval (Ip et al., 2022) and have received stage 1 approval for a project that address one of the research aims proposed in my NIDA F32 grant proposal (Demidenko et al., 2023).

Pre-Registrations

Sometimes a research program cannot use a registered report. Either the data is already accessed or it has been published by a team member. In this case, an alternative method can be used, Preregistration. While preregistrations significantly differ from registered reports, preregistrations allow researchers to document their analyses and provide a time-stamp of the document before the work begins. This permits the researcher to be explicit in what they plan to do and how they plan to do it for that specific research study. Then, when the work is completed and submitted for publication, the researcher can be explicit how/when they deviated from this time-stamped plan. The major differences with preregistrations and registered reports is that a preregistration requires more self-monitoring and so it can be manipulated (e.g., one can preregister after they had run analyses) a lot easier than registered reports.

There have been several scenarios where I had either work with the research data or had access to the research data which prevented me from using registered reports. In these scenarios, I tried to use preregistrations. In my first preregistration (Demidenko et al., 2021), I was performing analyses on fMRI data that I already worked with and had preprocessed (sequence of data cleaning steps for MRI/fMRI). For this project, the co-authors and I met on multiple occasions, outlined the goals and analytic plans. I wrote this plan out in a template and them submitted it on the OSF paltform. After this first preregistration and having worked with registered reports, I learned about the benefits of both techniques. In my subsequent preregistration (Demidenko et al., 2022), I attempted to use the registered report approach in the preregistration framework. Since we couldn't submit the work as a registered report, after we received interested from an editor for our project at a journal we outlined our introduction and methods and drafted the analytic code. Once we agreed on these materials, we submitted the preregistration on OSF platform and what would be the stage 1 draft/code on github. Unlike a traditional registered report, we could not have the stage 1 reviewed by reviewers at a journal, but we could abide by similar sequence of steps internal. Furthermore, it increased our precision in 1) the justification and 2) the methods/analyses.

Adolescent Risk taking/Decision Making

One of my initial academic interests was risk taking (specifically, substance use related-behaviors) during adolescence. Adolescence make-up over 1 billion of the worlds population and this age range (10-25 [definition of adolescence has varied over the 20th and 21st century, see Swayer et al 2018]) is marked by distinct increases in deaths from homicide, suicide and unintentional injuries. One category that contributes to these mortality rates is substance use. Identifying ways to reduce mortality rates as a result of substance use's role in unintentional injuries would benefit society greatly.

Adolescent Trait/State Measures in Context of Substance Use

Modern theoretical framework hypothesize that there are distinct traits and states in adolescents that give rise to risk taking behaviors, such as substance use. These traits/states include being more sensitive to rewards (i.e., positive experiences) and being less likely to self-regulate (i.e., being more impetuous/impulsive). In studies, researchers often have adolescents self-report about the trait(s) (using 1-5 scales) and/or perform well designed experimental tasks that evoke different state(s) (via computerized experimental tasks). The numerical values extracted from the self-report scales and/or experimental tasks are sometimes believed to belong to related processes, such as being more/less sensitive to positive experiences. However, for the interpretations to converge across research teams (in magnitude and/or direction) with a specific theoretical framework, a key assumption needs to be tested. That is, verbally defined trait and state measures may be argued to be related but they should also be empirically related. Otherwise, interpreting findings can be quite challenging. Moreover, if the hypothesis is that they then both [more or less equally] associate with substance use behaviors, this should be apparent in the data.

To answer the question: Are state and trait measures related (i.e., convergent/discriminant validity) and are they similarly related in direction/magnitude with substance use behaviors (i.e., predictive validity)?, I used a sample of 2000+ adolescents to address this in my Masters Thesis. In this publication (Demidenko et al., 2019), the empirical study demonstrated that [in our sample] there was inadequate empirical evidence to suggest that trait and state measures were representing the same psychological process (also referred to as as 'construct'). Furthermore, the trait (self-report) and state (computerized tasks) did not associate with substance use at a similar magnitude. Consistently, self-report measures were related to substance use at greater magnitude than the derived parameters (i.e., numerical representations of a process) from the computerized tasks. Around the time of this publication, others demonstrated a similar problem. In a 2023 APA handbook chapter (Keating et al., 2023), we discussed some of these issues further as they relate to cognitive development during adolescence.

Differences in Neural Activity Between High and Avg/Low Adolescent Risk Takers.

Similar to the above problem, another element of modern theoretical frameworks of risk taking in adolescents is focused on differences in neural activity to rewarding/positive experiences. Specifically, it is hypothesized that brain regions that are sensitive to rewards may, in part, contribute to the influx of risk taking observed during adolescence. This is often investigated across distinct development stages, such as adults, adolescents and children (a definition that has changed a TON over the years!). In these studies, researchers compare these distinct developmental groups (or stages) to see how their brain activity differs in response to specific types of stimuli and behaviors in the MRI scanner. Together with other information, this type of evidence is often used to make conclusions about what does/doesn't increase risk taking during adolescence. While informative, this type of evidence lacks specificity. For example, if the assertion is that neural activity is the different between those who do and don't engage in substance use, why not limit the scope to adolescents that do and do not engage in these behaviors and see how their brain activity differs?

To answer the question: How does neural activity in the brain differ between high and avg/low risk taking adolescents?, I used 108 adolescent fMRI scans as part of my dissertation. In this publication (Demidenko et al., 2020), the empirical study demonstrated that there were not significant differences between high and avg/low risk taking adolescents (17-21) in key brain regions when engaged in a monetary reward computerized task during fMRI. In fact, when extracting the average neural activity from specific brain regions believed to be important to a popular neurodevelopmental framework, we were unable to relate the neural activity to risk taking using a single or multi-wave definition of risk taking (i.e., substance use). This highlight the lack of specificity of the theoretical framework in differentiating risk taking profiles during adolescence. More importantly, it puts into question some of the hyperbolic statements that are made about teenagers such has been shown on Vox's 'The Teenage Brain' on Netflix.

Page updated

Google Sites

Report abuse