Schizophrenia prediction: use of Neuroimages and Artificial Intelligence Models

By Hugo Galván, Agustina Boveda, Pablo Koss, & Sebastian Galván
Published on June 9, 2023 | Last updated on April 23, 2024

"Schizophrenia (SZ) involves significant alterations in perception, thoughts, mood, and behavior. This project aims to develop an AI model using machine learning for complementary SZ diagnosis, utilizing prefrontal cortex connectomics and tractography techniques. It focuses on creating scripts for data separation, comparing classification models, and analyzing the connectome of healthy individuals and those with SZ. Early detection and accurate diagnosis through machine learning will enable targeted interventions, improving outcomes for individuals with SZ."

Schizophrenia prediction: use of Neuroimages and Artificial Intelligence Models

Hugo Galván, Agustina Boveda, Pablo Koss,Sebastian Galván

Tractografia

Summary

Schizophrenia (SZ) involves significant alterations in perception, thoughts, mood, and behavior. This project aims to develop an AI model using machine learning for complementary SZ diagnosis, utilizing prefrontal cortex connectomics and tractography techniques. It focuses on creating scripts for data separation, comparing classification models, and analyzing the connectome of healthy individuals and those with SZ. Early detection and accurate diagnosis through machine learning will enable targeted interventions, improving outcomes for individuals with SZ.

Project definition

Background

Schizophrenia theories: Theory of Disconnection Syndrome

The theory of disconnection syndrome is a theory that explains the symptoms of schizophrenia as the result of disruptions in the normal integration of emotion, perception, and thought.

Previous works and literature: Use of only one method of analyses, Require additional validation afterwards

MRI: Now it’s only use for differential diagnosis and not for prediction

Tools

The “Schizophrenia” project will rely on the following technologies:

Processing of Tractography data is done with DSI Studio
Processing of fMRI data is done with SPM
Machine learning anlayses rely on scikit-learn.
The main summary and results of the project are stored in a Jupyter Notebook
Updating and version control relies on github, through commits and pull requests.

Data

The preprocessed data is publicly available on UCLA Consortium for Neuropsychiatric Phenomics LA5c Study. The data is composed of 2 groups of subjects: 1) 58 patients with schizophrenia and 2) 138 Healthy controls. The data is composed of 2 modalities: 1) fMRI and 2) Tractography. The data is already preprocessed and ready to be used.

Deliverables

Note that the deliverables changed somewhat over the course of the project, and are more extensive than the original conception.

The Github repository that contains all of the relevant code for the project, clearly formatted and commented.

Data, generated from the UCLA Consortium for Neuropsychiatric Phenomics LA5c Study dataset as described above.
Code for the analysis “pipeline” using Python
Jupyter notebook for interactive data visualization
This README.md file describing the project
Examples of visualizations created using the pipeline and notebooks
The slides used for the final Brainhack School presentation

Results

Progress overview

The project was divided into 3 main parts: 1) Data preparation, 2) Machine learning, and 3) Data visualization. The first part was done using DSI Studio and SPM. The second part was done using scikit-learn. The third part was done using Jupyter Notebook.

Tools I learned during this project

DSI Studio for tractography data processing and visualization
SPM for fMRI data processing and visualization
scikit-learn for machine learning analyses
Jupyter Notebook for data visualization and presentation

Results

The results of the project are summarized in the [Github repository](

Deliverable 1: report template

The report template is available in the Github repository- Metrics

Deliverable 2: project gallery

The project gallery is available in the Github repository- Pipeline

Conclusion and acknowledgement

Conclusion

We have managed, among many tasks, to link the worlds of clinical medicine, laboratory neurosciences and supervised learning, looking for the models achieved to support the clinic in a format a) screening, or with a tolerable bias to false positives that, together with other studies, advance in depth for a final diagnosis b) as a tool for minimum tolerance to biases, and pathology is taken as the last study and finalizes the clinical diagnosis process.
Under these criteria, we are putting together modeling strategies, on operational hypotheses that are on the project’s work table and beyond metrics that may be low, highlighting the variances of the important characteristics in the laboratory. We begin to understand the noise between variances that prevent us from visualizing and obtaining objective metrics. Also where they come from and we are putting together another model that verifies these facts and penalizes when detected, which will help highlight the true attributes that are hidden.
We are understand that they are a tool to support the clinic in the diagnosis, and that they are a tool to advance in the understanding of the disease and its treatment.

Acknowledgements

Thanks to all of the instructors, teaching assistants, and participants of the Brainhack School 2023! Thanks to the organizers, HUMAI, Hub Buenos Aires, and the Brainhack School 2023 sponsors for making this possible. Thanks to the UCLA Consortium for Neuropsychiatric Phenomics LA5c Study for making the data publicly available.

Schizophrenia prediction: use of Neuroimages and Artificial Intelligence Models

Schizophrenia prediction: use of Neuroimages and Artificial Intelligence Models

Summary

Project definition

Background

Tools

Data

Deliverables

Results

Progress overview

Tools I learned during this project

Results

Deliverable 1: report template

Deliverable 2: project gallery

Conclusion and acknowledgement

Conclusion

Acknowledgements

See also these similar projects

Introduction to data visualization in Python

Twin normative modelling project

Does rs-fMRI preprocessing matter for prediction performance in machine learning?